Please use this identifier to cite or link to this item:
http://hdl.handle.net/11452/24884
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Mestre, X. | - |
dc.contributor.author | Hernando, J. | - |
dc.contributor.author | Pardas, M. | - |
dc.date.accessioned | 2022-03-08T06:06:46Z | - |
dc.date.available | 2022-03-08T06:06:46Z | - |
dc.date.issued | 2011 | - |
dc.identifier.citation | Hanilçi, C. and Ertaş, F. (2011). ''VQ-UBM based speaker verification through dimension reduction using local PCA''. ed. X. Mestre et al. 19th European Signal Processing Conference (Eusipco-2011), 1303-1306. | en_US |
dc.identifier.issn | 2076-1465 | - |
dc.identifier.uri | https://ieeexplore.ieee.org/document/7074260 | - |
dc.identifier.uri | http://hdl.handle.net/11452/24884 | - |
dc.description | This work was presented as a paper at the 19th European Signal Processing Conference (Eusipco-2011), held in Barcelona [Spain] from 29 August to 2 September 2011. | tr_TR |
dc.description.abstract | Universal background model (UBM) based classifiers have recently become popular for speaker recognition. In this paper, we propose a dimension reduction method using local principal component analysis (PCA) to improve the performance of speaker verification systems in which a maximum a posteriori (MAP) adapted vector quantization classifier (VQ-MAP or VQ-UBM) is employed. The proposed system first partitions the UBM data into disjoint regions (clusters) via a conventional VQ algorithm, and PCA is then performed on the set of feature vectors in each region to obtain a transformation matrix. Next, multiple speaker models are constructed through MAP adaptation, using the set of transformed feature vectors closest to each cluster. Experiments on the NIST 2001 SRE show that transforming the data onto a lower dimensional space with the proposed method improves recognition accuracy. | en_US |
dc.language.iso | en | en_US |
dc.publisher | European Assoc Signal Speech & Image Processing-Eurasip | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Engineering | en_US |
dc.subject | Imaging science & photographic technology | en_US |
dc.subject | Gaussian mixture-models | en_US |
dc.subject | Identification | en_US |
dc.subject | Gmm | en_US |
dc.subject | Recognition | en_US |
dc.subject | Linear transformations | en_US |
dc.subject | Metadata | en_US |
dc.subject | Principal component analysis | en_US |
dc.subject | Signal processing | en_US |
dc.subject | Vector quantization | en_US |
dc.subject | Dimension reduction | en_US |
dc.subject | Dimension reduction method | en_US |
dc.subject | Dimensional spaces | en_US |
dc.subject | Disjoint regions | en_US |
dc.subject | Feature vectors | en_US |
dc.subject | Local principal component analysis | en_US |
dc.subject | MAP adaptation | en_US |
dc.subject | Maximum a posteriori | en_US |
dc.subject | Recognition accuracy | en_US |
dc.subject | Speaker model | en_US |
dc.subject | Speaker recognition | en_US |
dc.subject | Speaker verification | en_US |
dc.subject | Speaker verification system | en_US |
dc.subject | Transformation matrices | en_US |
dc.subject | Universal background model | en_US |
dc.subject | VQ algorithm | en_US |
dc.subject | Speech recognition | en_US |
dc.title | VQ-UBM based speaker verification through dimension reduction using local PCA | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.wos | 000377963100264 | tr_TR |
dc.identifier.scopus | 2-s2.0-84863731345 | tr_TR |
dc.relation.publicationcategory | Conference Item - International | tr_TR |
dc.contributor.department | Uludağ University/Faculty of Engineering/Department of Electrical and Electronics Engineering. | tr_TR |
dc.identifier.startpage | 1303 | tr_TR |
dc.identifier.endpage | 1306 | tr_TR |
dc.relation.journal | 19th European Signal Processing Conference (Eusipco-2011) | en_US |
dc.contributor.buuauthor | Hanilçi, Cemal | - |
dc.contributor.buuauthor | Ertaş, Figen | - |
dc.contributor.researcherid | AAH-4188-2021 | tr_TR |
dc.contributor.researcherid | S-4967-2016 | tr_TR |
dc.subject.wos | Engineering, electrical & electronic | en_US |
dc.subject.wos | Imaging science & photographic technology | en_US |
dc.indexed.wos | CPCIS | en_US |
dc.indexed.scopus | Scopus | en_US |
dc.contributor.scopusid | 35781455400 | tr_TR |
dc.contributor.scopusid | 24724154500 | tr_TR |
dc.subject.scopus | Speaker Verification; Language Recognition; Utterance | en_US |
Appears in Collections: | Scopus; Web of Science |
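
The pipeline summarized in the abstract (VQ partitioning of the UBM data, one PCA transform per region, then MAP adaptation in the reduced space) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the plain k-means codebook design, the relevance factor of 12, and the centroid-style MAP update are all assumptions made for illustration, and the record gives no details on feature extraction or scoring.

```python
import numpy as np


def vq_codebook(data, n_clusters, n_iter=20, seed=0):
    """Plain k-means, used here as a stand-in for a conventional VQ design."""
    rng = np.random.default_rng(seed)
    centroids = data[rng.choice(len(data), n_clusters, replace=False)].copy()
    for _ in range(n_iter):
        # Assign every vector to its nearest centroid (squared Euclidean distance).
        dists = ((data[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for k in range(n_clusters):
            members = data[labels == k]
            if len(members) > 0:  # keep the old centroid if a region empties
                centroids[k] = members.mean(axis=0)
    dists = ((data[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, dists.argmin(axis=1)


def local_pca(data, labels, n_clusters, n_components):
    """Return one (dim x n_components) projection matrix per VQ region."""
    dim = data.shape[1]
    transforms = []
    for k in range(n_clusters):
        members = data[labels == k]
        if len(members) < 2:
            # Assumed fallback: too few vectors for a covariance estimate.
            transforms.append(np.eye(dim)[:, :n_components])
            continue
        centered = members - members.mean(axis=0)
        cov = centered.T @ centered / (len(members) - 1)
        eigvals, eigvecs = np.linalg.eigh(cov)
        # Keep the eigenvectors with the largest eigenvalues.
        order = np.argsort(eigvals)[::-1][:n_components]
        transforms.append(eigvecs[:, order])
    return transforms


def adapt_speaker_model(speaker_data, centroids, transforms, relevance=12.0):
    """MAP-style adaptation of each UBM centroid in its own reduced space."""
    dists = ((speaker_data[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    labels = dists.argmin(axis=1)
    model = []
    for k, W in enumerate(transforms):
        prior = centroids[k] @ W  # UBM centroid projected into region k's space
        members = speaker_data[labels == k] @ W
        n_k = len(members)
        if n_k == 0:
            model.append(prior)  # no speaker data here: fall back to the UBM
            continue
        alpha = n_k / (n_k + relevance)  # more data -> trust the speaker more
        model.append(alpha * members.mean(axis=0) + (1.0 - alpha) * prior)
    return model
```

In this sketch the speaker model is a list with one adapted centroid per region, each living in that region's own lower-dimensional space; a test utterance would have to be routed through the same nearest-region assignment and projection before any distance-based scoring.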
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.