Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations
Entity
UAM. Departamento de Tecnología Electrónica y de las Comunicaciones
Publisher
Springer International Publishing
Date
2016-12
Citation
10.1186/s13636-016-0080-2
Eurasip Journal on Audio, Speech, and Music Processing 2016.1 (2016): 1-19
ISSN
1687-4722
DOI
10.1186/s13636-016-0080-2
Funded by
This research was funded by the Spanish Government ('SpeechTech4All Project' TEC2012 38939 C03 01 and 'CMC-V2 Project' TEC2012 37585 C02 01), the Galician Government through the research contract GRC2014/024 (Modalidade: Grupos de Referencia Competitiva 2014) and 'AtlantTIC Project' CN2012/160, and also by the Spanish Government and the European Regional Development Fund (ERDF) under project TACTICA.
Project
Gobierno de España. TEC2012-38939-C03-01; Gobierno de España. TEC2012-37585-C02-01
Editor's Version
http://dx.doi.org/10.1186/s13636-016-0080-2
Subjects
Query-by-example spoken term detection; International evaluation; Search on spontaneous speech; Telecomunicaciones
Rights
© 2016 Tejedor et al.
Abstract
Query-by-example spoken term detection (QbE STD) aims at retrieving data from a speech repository given an acoustic query containing the term of interest as input. It is currently receiving much interest due to the large volume of multimedia information available. This paper presents the systems submitted to the ALBAYZIN QbE STD 2014 evaluation, held as part of the ALBAYZIN 2014 Evaluation campaign within the context of the IberSPEECH 2014 conference. This is the second QbE STD evaluation in Spanish, which allows us to evaluate the progress in this technology for this language. The evaluation consists of retrieving the speech files that contain the input queries, indicating the start and end times where the input queries were found, along with a score value that reflects the confidence given to the detection of the query. Evaluation is conducted on a Spanish spontaneous speech database containing a set of talks from workshops, which amount to about 7 h of speech. We present the database, the evaluation metric, the systems submitted to the evaluation, and the results, and we compare this second evaluation with the first ALBAYZIN QbE STD evaluation, held in 2012. Four different research groups took part in the evaluations held in 2012 and 2014. In 2014, new multi-word and foreign queries were added to the single-word and in-language queries used in 2012. Systems submitted to the second evaluation are hybrid systems that integrate letter transcription-based and template matching-based systems. Despite the significant improvement obtained by the systems submitted to this second evaluation compared to those of the first evaluation, the results still show the difficulty of this task and indicate that there is still room for improvement.
Google Scholar: Tejedor, Javier; Toledano, Doroteo T.; Lopez-Otero, Paula; Docío-Fernández, Laura; García-Mateo, Carmen García
Related items
Showing items related by title, author, creator and subject.
-
Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion
Tejedor, Javier; Toledano, Doroteo T.; Lopez-Otero, Paula; Docío-Fernández, Laura; García-Mateo, Carmen García; Cardenal, Antonio; Echeverry-Correa, Julián David; Coucheiro-Limeres, Alejandro; Olcoz, Julia; Miguel, Antonio
2015-12-08