On the use of high-level information in speaker and language recognition
Entity
UAM. Departamento de Ingeniería InformáticaDate
2006-11-10Citation
IV Jornadas en Tecnología del Habla. Universidad de Zaragoza. Editado por Luis Buera, Eduardo Lleida, Antonio Miguel y Alfonso Ortega. Zaragoza, 2006. 355-360ISBN
84-96214-82-6Editor's Version
http://jth2006.unizar.es/finals/4jth_128.pdfSubjects
Automatic Speaker Recognition; TelecomunicacionesNote
Actas de las IV Jornadas de Tecnología del Habla (JTH 2006)Rights
© 2006 Los autoresAbstract
Automatic Speaker Recognition systems have been largely dominated by acoustic-spectral based systems, relying in proper modelling of the short-term vocal tract of speakers. However, there is scientific and intuitive evidence that speaker specific
information is embedded in the speech signal in multiple short- and long-term characteristics. In this work, a multilevel speaker recognition system combining acoustic, phonotactic and prosodic subsystems is presented and assessed using NIST 2005 Speaker Recognition Evaluation data.
For language recognition systems, the NIST 2005 Language Recognition Evaluation was selected to measure performance of a high-level language recognition systems.
Files in this item
Google Scholar:Montero-Asenjo, Alberto
-
González Domínguez, Javier
-
Ramos Castro, Daniel
-
López Moreno, Ignacio
-
Toledano, Doroteo T.
-
González Rodríguez, Joaquín
This item appears in the following Collection(s)
Related items
Showing items related by title, author, creator and subject.