Please use this identifier to cite or link to this item: http://hdl.handle.net/11452/32597
Title: Comparison of spectrum estimators in speaker verification: Mismatch conditions induced by vocal effort
Authors: Kinnunen, Tomi
Rajan, Padmanabhan
Pohjalainen, Jouni
Alku, Paavo
Bimbot, F.
Cerisara, C.
Fougeron, C.
Gravier, G.
Lamel, L.
Pellegrino, F.
Perrier, P.
Uludağ University/Faculty of Engineering/Department of Electrical and Electronics Engineering.
Hanilçi, Cemal
Ertaş, Figen
AAH-4188-2021
S-4967-2016
35781455400
24724154500
Keywords: Computer science
Engineering
Speaker recognition
Vocal effort mismatch
Spectrum estimation
Linear Prediction
Models
Poles
Spectrum analysis
Fundamental frequencies
Mel-frequency cepstral coefficients
Recognition accuracy
Speaker recognition evaluations
Speaker verification
Vocal efforts
Speech recognition
Issue Date: 2013
Publisher: Isca-Int Speech Communication Assoc
Citation: Hanilçi, C. et al. (2013). “Comparison of spectrum estimators in speaker verification: Mismatch conditions induced by vocal effort”. Interspeech, 14th Annual Conference of the International Speech Communication Association, 1-5, 2880-2884.
Abstract: We study the problem of vocal effort mismatch in speaker verification. Changes in a speaker's vocal effort induce changes in fundamental frequency (F0) and formant structure, which introduce unwanted intra-speaker variations to the features. We compare seven alternative spectrum estimators in the context of mel-frequency cepstral coefficient (MFCC) extraction for speaker verification. The compared variants include the traditional FFT spectrum and six parametric all-pole models. Experimental results on the NIST 2010 speaker recognition evaluation (SRE) corpus, utilizing both the GMM-UBM and the more recent GMM supervector classifier, indicate that spectrum estimation has a considerable impact on speaker verification accuracy under mismatched vocal effort conditions. The highest recognition accuracy was achieved using a particular variant of the temporally weighted all-pole model, stabilized weighted linear prediction (SWLP).
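As a rough illustration of the contrast the abstract draws between an FFT spectrum and a parametric all-pole spectrum, the following minimal sketch computes both for a single frame. It uses conventional linear prediction (autocorrelation method), not the SWLP variant named in the abstract. The function names, the model order of 20, the 512-point FFT, the 8 kHz sampling rate and the synthetic test frame are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def fft_spectrum(frame, n_fft=512):
    """Power spectrum of a Hamming-windowed frame via the FFT."""
    windowed = frame * np.hamming(len(frame))
    return np.abs(np.fft.rfft(windowed, n_fft)) ** 2


def lp_spectrum(frame, order=20, n_fft=512):
    """All-pole power spectrum from conventional linear prediction.

    Assumption: plain autocorrelation-method LP, used here only as a
    stand-in for the family of all-pole models; it is not SWLP.
    """
    windowed = frame * np.hamming(len(frame))
    n = len(windowed)
    # Autocorrelation at lags 0..order (zero lag sits at index n - 1).
    full = np.correlate(windowed, windowed, mode="full")
    r = full[n - 1 : n + order]
    # Build the Toeplitz autocorrelation matrix and solve the normal equations.
    lags = np.abs(np.arange(order)[:, None] - np.arange(order)[None, :])
    a = np.linalg.solve(r[lags], r[1 : order + 1])
    # Prediction-error filter A(z) = 1 - sum_k a_k z^{-k}.
    inverse_filter = np.concatenate(([1.0], -a))
    # Residual energy serves as the gain of the all-pole model.
    gain = r[0] - np.dot(a, r[1 : order + 1])
    return gain / np.abs(np.fft.rfft(inverse_filter, n_fft)) ** 2


# Hypothetical test frame: two sinusoids plus weak noise, 50 ms at 8 kHz.
rng = np.random.default_rng(0)
t = np.arange(400) / 8000.0
frame = (
    np.sin(2 * np.pi * 500 * t)
    + 0.5 * np.sin(2 * np.pi * 1500 * t)
    + 0.05 * rng.standard_normal(400)
)

fft_spec = fft_spectrum(frame)
lp_spec = lp_spectrum(frame)
```

In an MFCC front end, either spectrum would be passed on to the mel filterbank, logarithm and DCT stages, so swapping the estimator leaves the rest of the pipeline unchanged.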
Description: This work was presented as a paper at the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), held in Lyon, France, on 25-29 August 2013.
URI: http://hdl.handle.net/11452/32597
ISSN: 2308-457X
978-1-62993-443-3
Appears in Collections: Scopus
Web of Science

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.