UAM | UAM_Biblioteca | Unified search engine | Scientific Production Portal | UAM Research Data Repository
Biblos-e Archivo
    • español
    • English
  • English 
    • español
    • English
  • Log in
JavaScript is disabled for your browser. Some features of this site may not work without it.

Search Biblos-e Archivo

Advanced Search

Browse

All of Biblos-e ArchivoCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsFacultiesThis CollectionBy Issue DateAuthorsTitlesSubjectsFaculties

My Account

Log inRegister

Statistics

View Usage Statistics

Help

Information about Biblos-e ArchivoI want to submit my workFrequently Asked Questions

UAM_Biblioteca

View Item 
  •   Biblos-e Archivo
  • 1 - Producción científica en acceso abierto de la UAM
  • Producción científica en acceso abierto de la UAM
  • View Item
  •   Biblos-e Archivo
  • 1 - Producción científica en acceso abierto de la UAM
  • Producción científica en acceso abierto de la UAM
  • View Item

The multi-domain international search on speech 2020 ALBAYZIN evaluation: Overview, systems, results, discussion and post-evaluation analyses

Author
Tejedor, Javier; Toledano, Doroteo T.; Ramirez, Jose M.; Montalvo, Ana R.; Alvarez-Trejos, Juan Ignacio
Entity
UAM. Departamento de Tecnología Electrónica y de las Comunicaciones
Publisher
MDPI
Date
2021-09-14
Citation
10.3390/app11188519
Applied Sciences-Basel 11.18 (2021): 8519
 
 
 
ISSN
2076-3417 (online)
DOI
10.3390/app11188519
Funded by
This research was funded by the Ministry of Science, Innovation and Universities of Spain, grant number RTI2018-095324-B-I00, and project DSForSec (grant number RTI2018-098091-B-I00). The APC was funded by the project DSForSec (grant number RTI2018-098091-B-I00) from the Ministry of Science, Innovation and Universities of Spain
Project
Gobierno de España. RTI2018-095324-B-I00; Gobierno de España. RTI2018-098091-B-I00
Editor's Version
https://doi.org/10.3390/app11188519
Subjects
International evaluation; Query-by-example spoken term detection; Search on speech; Spanish language; Spoken term detection; Telecomunicaciones
URI
http://hdl.handle.net/10486/701134
Rights
© The author(s)

Licencia Creative Commons
Esta obra está bajo una Licencia Creative Commons Atribución 4.0 Internacional.

Abstract

The large amount of information stored in audio and video repositories makes search on speech (SoS) a challenging area that is continuously receiving much interest. Within SoS, spoken term detection (STD) aims to retrieve speech data given a text-based representation of a search query (which can include one or more words). On the other hand, query-by-example spoken term detection (QbE STD) aims to retrieve speech data given an acoustic representation of a search query. This is the first paper that presents an internationally open multi-domain evaluation for SoS in Spanish that includes both STD and QbE STD tasks. The evaluation was carefully designed so that several post-evaluation analyses of the main results could be carried out. The evaluation tasks aim to retrieve the speech files that contain the queries, providing their start and end times and a score that reflects how likely the detection within the given time intervals and speech file is. Three different speech databases in Spanish that comprise different domains were employed in the evaluation: the MAVIR database, which comprises a set of talks from workshops; the RTVE database, which includes broadcast news programs; and the SPARL20 database, which contains Spanish parliament sessions. We present the evaluation itself, the three databases, the evaluation metric, the systems submitted to the evaluation, the evaluation results and some detailed post-evaluation analyses based on specific query properties (in-vocabulary/out-of-vocabulary queries, single-word/multi-word queries and native/foreign queries). The most novel features of the submitted systems are a data augmentation technique for the STD task and an end-to-end system for the QbE STD task. The obtained results suggest that there is clearly room for improvement in the SoS task and that performance is highly sensitive to changes in the data domain
Show full item record

Files in this item

Thumbnail
Name
9000157.pdf
Size
686.0Kb
Format
PDF

Refworks Export

Google™ Scholar:Tejedor, Javier - Toledano, Doroteo T. - Ramirez, Jose M. - Montalvo, Ana R. - Alvarez-Trejos, Juan Ignacio

This item appears in the following Collection(s)

  • Producción científica en acceso abierto de la UAM [17777]

Related items

Showing items related by title, author, creator and subject.

  • Query-by-example spoken term detection ALBAYZIN 2012 evaluation: Overview, systems, results, and discussion 

    Tejedor Noguerales, Javier; Toledano, Doroteo T.; Anguera, Xavier; Varona, Amparo; Hurtado, Lluís F; Miguel, Antonio; Colás Pasamontes, JoséAutoridad UAM
    2013-08-11
  • Spoken term detection ALBAYZIN 2014 evaluation: overview, systems, results, and discussion 

    Tejedor, Javier; Toledano, Doroteo T.; Lopez-Otero, Paula; Docío-Fernández, Laura; García-Mateo, Carmen García; Cardenal, Antonio; Echeverry-Correa, Julián David; Coucheiro-Limeres, Alejandro; Olcoz, Julia; Miguel, Antonio
    2015-12-08
  • Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations 

    Tejedor, Javier; Toledano, Doroteo T.; Lopez-Otero, Paula; Docío-Fernández, Laura; García-Mateo, Carmen García
    2016-12
All the documents from Biblos-e Archivo are protected by copyrights. Some rights reserved.
Universidad Autónoma de Madrid. Biblioteca
Contact Us | Send Feedback
We are onFacebookCanal BiblosYouTubeTwitterPinterestWhatsappInstagram

Declaración de accesibilidad

 

 

All the documents from Biblos-e Archivo are protected by copyrights. Some rights reserved.
Universidad Autónoma de Madrid. Biblioteca
Contact Us | Send Feedback
We are onFacebookCanal BiblosYouTubeTwitterPinterestWhatsappInstagram

Declaración de accesibilidad