On the use of high-level information in speaker and language recognition

Montero-Asenjo, Alberto; González Domínguez, Javier; Ramos Castro, Daniel; López Moreno, Ignacio; Toledano, Doroteo T.; González Rodríguez, Joaquín

Author

Montero-Asenjo, Alberto; González Domínguez, Javier; Ramos Castro, Daniel; López Moreno, Ignacio; Toledano, Doroteo T.; González Rodríguez, Joaquín

Entity

UAM. Departamento de Ingeniería Informática

Date

2006-11-10

Citation

IV Jornadas en Tecnología del Habla. Universidad de Zaragoza. Edited by Luis Buera, Eduardo Lleida, Antonio Miguel and Alfonso Ortega. Zaragoza, 2006, pp. 355-360

ISBN

84-96214-82-6

Editor's Version

http://jth2006.unizar.es/finals/4jth_128.pdf

Subjects

Automatic Speaker Recognition; Telecommunications

URI

http://hdl.handle.net/10486/663754

Note

Proceedings of the IV Jornadas de Tecnología del Habla (JTH 2006)

Abstract

Automatic Speaker Recognition has been largely dominated by acoustic-spectral systems, which rely on proper modelling of the short-term vocal tract characteristics of speakers. However, there is scientific and intuitive evidence that speaker-specific information is embedded in the speech signal in multiple short- and long-term characteristics. In this work, a multilevel speaker recognition system combining acoustic, phonotactic and prosodic subsystems is presented and assessed using NIST 2005 Speaker Recognition Evaluation data. For language recognition, the NIST 2005 Language Recognition Evaluation was selected to measure the performance of a high-level language recognition system.
