Please use this identifier to cite or link to this item:
http://hdl.handle.net/11452/26025
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kinnunen, Tomi H. | - |
dc.contributor.author | Saeidi, Rahim | - |
dc.contributor.author | Pohjalainen, Jouni | - |
dc.contributor.author | Alku, Paavo | - |
dc.contributor.author | Sandberg, Johan | - |
dc.contributor.author | Hansson-Sandsten, Maria | - |
dc.date.accessioned | 2022-04-25T07:39:51Z | - |
dc.date.available | 2022-04-25T07:39:51Z | - |
dc.date.issued | 2012 | - |
dc.identifier.citation | Hanilci, C. vd. (2012). "Comparing spectrum estimators in speaker verification under additive noise degradation". International Conference on Acoustics Speech and Signal Processing ICASSP, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4769-4772. | en_US |
dc.identifier.isbn | 978-1-4673-0046-9 | - |
dc.identifier.issn | 1520-6149 | - |
dc.identifier.uri | https://doi.org/10.1109/ICASSP.2012.6288985 | - |
dc.identifier.uri | https://ieeexplore.ieee.org/document/6288985 | - |
dc.identifier.uri | http://hdl.handle.net/11452/26025 | - |
dc.description | Bu çalışma, 25-30 Mart 2012 tarihleri arasında Kyoto[Japonya]’da düzenlenen IEEE International Conference on Acoustics, Speech and Signal Processing’da bildiri olarak sunulmuştur. | tr_TR |
dc.description.abstract | Different short-term spectrum estimators for speaker verification under additive noise are considered. Conventionally, mel-frequency cepstral coefficients (MFCCs) are computed from discrete Fourier transform (DFT) spectra of windowed speech frames. Recently, linear prediction (LP) and its temporally weighted variants have been substituted as the spectrum analysis method in speech and speaker recognition. In this paper, 12 different short-term spectrum estimation methods are compared for speaker verification under additive noise contamination. Experimental results conducted on NIST 2002 SRE show that the spectrum estimation method has a large effect on recognition performance and stabilized weighted LP (SWLP) and minimum variance distortionless response (MVDR) methods yield approximately 7 % and 8 % relative improvements over the standard DFT method at -10 dB SNR level of factory and babble noises, respectively in terms of equal error rate (EER). | en_US |
dc.description.sponsorship | Inst Elect & Elect Engineers, Signal Processing Soc | en_US |
dc.description.sponsorship | IEEE | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.rights | Atıf Gayri Ticari Türetilemez 4.0 Uluslararası | tr_TR |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Acoustics | en_US |
dc.subject | Engineering | en_US |
dc.subject | Spectrum estimation | en_US |
dc.subject | Speaker verification | en_US |
dc.subject | Weighted linear prediction | en_US |
dc.subject | Speech | en_US |
dc.subject | Recognition | en_US |
dc.subject | Acoustic noise | en_US |
dc.subject | Additive noise | en_US |
dc.subject | Discrete | en_US |
dc.subject | Signal processing | en_US |
dc.subject | Spectrum analysis | en_US |
dc.subject | Babble noise | en_US |
dc.subject | Dft method | en_US |
dc.subject | Equal error rate | en_US |
dc.subject | Linear prediction | en_US |
dc.subject | Mel-frequency cepstral coefficients | en_US |
dc.subject | Minimum variance distortionless response | en_US |
dc.subject | Noise contamination | en_US |
dc.subject | Noise degradations | en_US |
dc.subject | Recognition performance | en_US |
dc.subject | Speaker recognition | en_US |
dc.subject | Fourier transforms | en_US |
dc.subject | Spectrum estimators | en_US |
dc.subject | Speech frames | en_US |
dc.subject | Speech recognition | en_US |
dc.title | Comparing spectrum estimators in speaker verification under additive noise degradation | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.wos | 000312381404210 | tr_TR |
dc.identifier.scopus | 2-s2.0-84867590081 | tr_TR |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası | tr_TR |
dc.contributor.department | Uludağ Üniversitesi/Mühendislik Fakültesi/Elektronik Mühendisliği Bölümü. | tr_TR |
dc.identifier.startpage | 4769 | tr_TR |
dc.identifier.endpage | 4772 | tr_TR |
dc.relation.journal | International Conference on Acoustics Speech and Signal Processing ICASSP, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | en_US |
dc.contributor.buuauthor | Hanilci, Cemal | - |
dc.contributor.buuauthor | Ertaş, Figen | - |
dc.contributor.researcherid | AAH-4188-2021 | tr_TR |
dc.contributor.researcherid | S-4967-2016 | tr_TR |
dc.relation.collaboration | Yurt dışı | tr_TR |
dc.subject.wos | Acoustics | en_US |
dc.subject.wos | Engineering, electrical & electronic | en_US |
dc.indexed.wos | CPCIS | en_US |
dc.indexed.scopus | Scopus | en_US |
dc.contributor.scopusid | 35781455400 | tr_TR |
dc.contributor.scopusid | 24724154500 | tr_TR |
dc.subject.scopus | Speaker Verification; Language Recognition; Utterance | en_US |
Appears in Collections: | Scopus Web of Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Hanilci_vd_2012.pdf | 306.62 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License