Improved language recognition using better phonetic decoders and fusion with MFCC and SDC features

Toledano, Doroteo T.; González Domínguez, Javier; Abejón González, Alejandro; Spada, Danilo; Mateos García, Ismael; González Rodríguez, Joaquín

UAM_Biblioteca

Author

Toledano, Doroteo T.; González Domínguez, Javier; Abejón González, Alejandro; Spada, Danilo; Mateos García, Ismael; González Rodríguez, Joaquín

Entity

UAM. Departamento de Ingeniería Informática

Publisher

International Speech Communication Association

Date

2007-08

Citation

8th Annual Conference of the International Speech Communication Association. August 27-31, 2007

ISSN

1990-9772

Funded by

This work was funded by the Spanish Ministry of Science and Technology under project TEC2006-13170-C02-01.

Editor's Version

http://www.isca-speech.org/archive/interspeech_2007/i07_0194.html

Subjects

Language recognition; PPRLM; SVM; Informática

URI

http://hdl.handle.net/10486/663614

Note

Proceedings of Interspeech 2007, Antwerp (Belgium)

Rights

Abstract

One of the most popular and better performing approaches to language recognition (LR) is Parallel Phonetic Recognition followed by Language Modeling (PPRLM). In this paper we report several improvements in our PPRLM system that allowed us to move from an Equal Error Rate (EER) of over 15% to less than 8% on NIST LR Evaluation 2005 data still using a standard PPRLM system. The most successful improvement was the retraining of the phonetic decoders on larger and more appropriate corpora. We have also developed a new system based on Support Vector Machines (SVMs) that uses as features both Mel Frequency Cepstral Coefficients (MFCCs) and Shifted Delta Cepstra (SDC). This new SVM system alone gives an EER of 10.5% on NIST LRE 2005 data. Fusing our PPRLM system and the new SVM system we achieve an EER of 5.43% on NIST LRE 2005 data, a relative reduction of almost 66% from our baseline system.

Show full item record

Files in this item

Name

improved_toledano_Interspeech_2007.pdf

Size

208.5Kb

Format

PDF

Google™ Scholar:Toledano, Doroteo T. - González Domínguez, Javier - Abejón González, Alejandro - Spada, Danilo - Mateos García, Ismael - González Rodríguez, Joaquín

This item appears in the following Collection(s)

Producción científica en acceso abierto de la UAM [20343]

UAM_Biblioteca