A comparision of different classification systems for automatic singer identification

KARAMAN, Emrah

DSpace Home
→
Enstitüler
→
Fen Bilimleri Enstitüsü
→
Fen Bilimleri Enstitü Tezleri
→
Fen Bilimleri Enstitüsü Yüksek Lisans Tezleri
→
View Item

dc.contributor.author	KARAMAN, Emrah
dc.date.accessioned	2015-11-20T13:20:50Z	NULL
dc.date.available	2015-11-20T13:20:50Z	NULL
dc.date.issued	2009
dc.identifier.uri	http://hdl.handle.net/20.500.12397/8290	NULL
dc.description.abstract	Bu projede otomatik şarkıcı tanıma problemi için yöntemler incelenmiş ve 15 şarkıcı için bir otomatik şarkıcı tanıma sistemi oluşturulmuştur. Sistem iki aşamadan oluşmaktadır. Öncelikle sistem girdisi olarak kullanılan şarkı vokal kısım yani şarkıcının sesiyle beraber enstrüman seslerinin olduğu ve vokal olmayan kısım yani sadece enstrüman seslerinin olduğu kısım olarak ikiye ayrılır. Daha sonra şarkıcı tanıma ve sınıflandırma için tanıma aşaması gerçekleştirilir. Her iki aşamada da ses öznitelikleri çıkarma ve sınıflandırma işlemleri uygulanmaktadır. Öznitelik çıkarma işleminden önce şarkıya örnekleme düşürme, normalleştirme, çerçeveleme ve pencereleme gibi ön işlemler uygulanır. Öznitelik olarak enerji, spektral akı, sıfır geçiş oranı, mel frekansı cepstrum katsayıları (MFCC) ve doğrusal öngörülü cepstrum katsayıları (LPCC) kullanılır. Daha sonra bu öznitelikler kullanılarak şarkıcıyı belirlemek için destek vektör makineleri (SVM), çoklu gauss karışım modelleri (GMM) ve çok katmanlı algılayıcılarla (MLP) sınıflandırıcılar oluşturulur. In this project, methods for automatic singer identification problem are investigated, and a singer identification system for 15 singers is implemented. The system consists of two parts. Firstly, the song as the input of the system is segmented into two parts: vocal part, which consists of the singers? voice and instrument sounds, and non-vocal part which consists of only instruments? sounds. Then, the identification step for modeling and classification of singer is applied. Both steps consist of the audio feature extraction and classification methods. In the beginning of feature extraction, preprocessing is applied to data such as down-sampling, normalization, pre-emphasizing, frame blocking and windowing. Energy, spectral flux, zero crossing rate, mel frequency cepstrum coefficients (MFCC) and linear prediction cepstrum coefficients (LPCC) are used for feature extraction. Then, support vector machine (SVM), gaussian mixture model (GMM) and multilayer perceptron (MLP) classifiers are constructed for classification of the singer with using all these extracted features.	en_US
dc.language.iso	en	en_US
dc.publisher	DEÜ Fen Bilimleri Enstitüsü	en_US
dc.subject	Singer identification, audio feature extraction, MFCC, LPCC, SVM,GMM, MLP,Şarkıcı tanıma, ses öznitelikleri, MFCC, LPCC, SVM, GMM,MLP	en_US
dc.title	A comparision of different classification systems for automatic singer identification	en_US
dc.title.alternative	Otomatik şarkıcı tanımada farklı sınıflandırma sistemlerinin karşılaştırılması	en_US
dc.type	Thesis	en_US