dc.contributor.author | López Moreno, Ignacio | |
dc.contributor.author | Ramos Castro, Daniel | |
dc.contributor.author | González Rodríguez, Joaquín | |
dc.contributor.author | Toledano, Doroteo T. | |
dc.contributor.other | UAM. Departamento de Ingeniería Informática | es_ES |
dc.date.accessioned | 2015-01-29T19:06:14Z | |
dc.date.available | 2015-01-29T19:06:14Z | |
dc.date.issued | 2008-09 | |
dc.identifier.citation | 9th Annual Conference of the International Speech Communication Association. September 22-26, 2008 | en_US |
dc.identifier.issn | 2308-457X | |
dc.identifier.uri | http://hdl.handle.net/10486/663473 | |
dc.description | Proceedings of Interspeech 2008, Brisbane (Australia) | en_US |
dc.description.abstract | State-of-the-art language recognition systems usually combine multiple acoustic and phonotactic subsystems. The outputs of those subsystems are fused in different ways, but the score for a trial is always obtained from the N scores of the N subsystems. In this paper, a robust novel approach to subsystem fusion in language recognition is proposed, based on the relative performance of each trial not just against the claimed model but against all available models. The proposed technique exploits the relative behavior of a given speech utterance over the cohort of anchor models from the different subsystems, resulting in the proposed anchor-model fusion. Experiments fusing seven phone-SVM subsystems submitted by the authors to NIST LRE 2007 assess its robustness to non-uniform data availability relative to rule-based and trained fusion schemes such as linear-kernel SVM, and show significant improvements in performance in both average EER and Cavg as used in NIST LRE. | en_US |
dc.description.sponsorship | This work was funded by the Spanish Ministry of Science and Technology under project TEC2006-13170-C02-01. | en_US |
dc.format.extent | 4 pages | en |
dc.format.mimetype | application/pdf | en |
dc.language.iso | eng | en |
dc.publisher | International Speech Communication Association | en_US |
dc.relation.ispartof | Interspeech | en_US |
dc.rights | © 2008 ISCA | en_US |
dc.subject.other | language recognition | en_US |
dc.subject.other | speaker recognition | en_US |
dc.title | Anchor-Model Fusion for Language Recognition | en_US |
dc.type | conferenceObject | en |
dc.subject.eciencia | Informática | es_ES |
dc.subject.eciencia | Telecomunicaciones | es_ES |
dc.relation.publisherversion | http://www.isca-speech.org/archive/interspeech_2008/i08_0727.html | |
dc.identifier.publicationfirstpage | 727 | |
dc.identifier.publicationlastpage | 730 | |
dc.relation.eventdate | September 22-26, 2008 | en_US |
dc.relation.eventnumber | 9 | |
dc.relation.eventplace | Brisbane (Australia) | en_US |
dc.relation.eventtitle | 9th Annual Conference of the International Speech Communication Association (Interspeech 2008) | en_US |
dc.type.version | info:eu-repo/semantics/publishedVersion | en |
dc.contributor.group | Análisis y Tratamiento de Voz y Señales Biométricas (ING EPS-002) | es_ES |
dc.rights.accessRights | openAccess | en |
dc.facultadUAM | Escuela Politécnica Superior | |