Comparing spectrum estimators in speaker verification under additive noise degradation

Kinnunen, Tomi H.; Saeidi, Rahim; Pohjalainen, Jouni; Alku, Paavo; Sandberg, Johan; Hansson-Sandsten, Maria

Please use this identifier to cite or link to this item: http://hdl.handle.net/11452/26025

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kinnunen, Tomi H.	-
dc.contributor.author	Saeidi, Rahim	-
dc.contributor.author	Pohjalainen, Jouni	-
dc.contributor.author	Alku, Paavo	-
dc.contributor.author	Sandberg, Johan	-
dc.contributor.author	Hansson-Sandsten, Maria	-
dc.date.accessioned	2022-04-25T07:39:51Z	-
dc.date.available	2022-04-25T07:39:51Z	-
dc.date.issued	2012	-
dc.identifier.citation	Hanilci, C. vd. (2012). "Comparing spectrum estimators in speaker verification under additive noise degradation". International Conference on Acoustics Speech and Signal Processing ICASSP, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4769-4772.	en_US
dc.identifier.isbn	978-1-4673-0046-9	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	https://doi.org/10.1109/ICASSP.2012.6288985	-
dc.identifier.uri	https://ieeexplore.ieee.org/document/6288985	-
dc.identifier.uri	http://hdl.handle.net/11452/26025	-
dc.description	Bu çalışma, 25-30 Mart 2012 tarihleri arasında Kyoto[Japonya]’da düzenlenen IEEE International Conference on Acoustics, Speech and Signal Processing’da bildiri olarak sunulmuştur.	tr_TR
dc.description.abstract	Different short-term spectrum estimators for speaker verification under additive noise are considered. Conventionally, mel-frequency cepstral coefficients (MFCCs) are computed from discrete Fourier transform (DFT) spectra of windowed speech frames. Recently, linear prediction (LP) and its temporally weighted variants have been substituted as the spectrum analysis method in speech and speaker recognition. In this paper, 12 different short-term spectrum estimation methods are compared for speaker verification under additive noise contamination. Experimental results conducted on NIST 2002 SRE show that the spectrum estimation method has a large effect on recognition performance and stabilized weighted LP (SWLP) and minimum variance distortionless response (MVDR) methods yield approximately 7 % and 8 % relative improvements over the standard DFT method at -10 dB SNR level of factory and babble noises, respectively in terms of equal error rate (EER).	en_US
dc.description.sponsorship	Inst Elect & Elect Engineers, Signal Processing Soc	en_US
dc.description.sponsorship	IEEE	en_US
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.rights	Atıf Gayri Ticari Türetilemez 4.0 Uluslararası	tr_TR
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Acoustics	en_US
dc.subject	Engineering	en_US
dc.subject	Spectrum estimation	en_US
dc.subject	Speaker verification	en_US
dc.subject	Weighted linear prediction	en_US
dc.subject	Speech	en_US
dc.subject	Recognition	en_US
dc.subject	Acoustic noise	en_US
dc.subject	Additive noise	en_US
dc.subject	Discrete	en_US
dc.subject	Signal processing	en_US
dc.subject	Spectrum analysis	en_US
dc.subject	Babble noise	en_US
dc.subject	Dft method	en_US
dc.subject	Equal error rate	en_US
dc.subject	Linear prediction	en_US
dc.subject	Mel-frequency cepstral coefficients	en_US
dc.subject	Minimum variance distortionless response	en_US
dc.subject	Noise contamination	en_US
dc.subject	Noise degradations	en_US
dc.subject	Recognition performance	en_US
dc.subject	Speaker recognition	en_US
dc.subject	Fourier transforms	en_US
dc.subject	Spectrum estimators	en_US
dc.subject	Speech frames	en_US
dc.subject	Speech recognition	en_US
dc.title	Comparing spectrum estimators in speaker verification under additive noise degradation	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.wos	000312381404210	tr_TR
dc.identifier.scopus	2-s2.0-84867590081	tr_TR
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası	tr_TR
dc.contributor.department	Uludağ Üniversitesi/Mühendislik Fakültesi/Elektronik Mühendisliği Bölümü.	tr_TR
dc.identifier.startpage	4769	tr_TR
dc.identifier.endpage	4772	tr_TR
dc.relation.journal	International Conference on Acoustics Speech and Signal Processing ICASSP, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	en_US
dc.contributor.buuauthor	Hanilci, Cemal	-
dc.contributor.buuauthor	Ertaş, Figen	-
dc.contributor.researcherid	AAH-4188-2021	tr_TR
dc.contributor.researcherid	S-4967-2016	tr_TR
dc.relation.collaboration	Yurt dışı	tr_TR
dc.subject.wos	Acoustics	en_US
dc.subject.wos	Engineering, electrical & electronic	en_US
dc.indexed.wos	CPCIS	en_US
dc.indexed.scopus	Scopus	en_US
dc.contributor.scopusid	35781455400	tr_TR
dc.contributor.scopusid	24724154500	tr_TR
dc.subject.scopus	Speaker Verification; Language Recognition; Utterance	en_US
Appears in Collections:	Scopus Web of Science

Files in This Item:

File	Description	Size	Format
Hanilci_vd_2012.pdf		306.62 kB	Adobe PDF	View/Open

Show simple item record

This item is licensed under a Creative Commons License

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets