
dc.contributor.author: Liu, Chao
dc.contributor.author: Wang, Dong
dc.contributor.author: Tejedor Noguerales, Javier
dc.contributor.other: UAM. Departamento de Tecnología Electrónica y de las Comunicaciones [es_ES]
dc.date.accessioned: 2015-05-28T10:17:13Z
dc.date.available: 2015-05-28T10:17:13Z
dc.date.issued: 2012
dc.identifier.citation: INTERSPEECH 2012: 13th Annual Conference of the International Speech Communication Association, ISCA, 2012. 2093-2096 [en_US]
dc.identifier.issn: 1990-9772
dc.identifier.uri: http://hdl.handle.net/10486/666456
dc.description.abstract: An efficient indexing scheme is essentially important for spoken term detection (STD) on large databases, particularly for phone-based systems that have been widely adopted to achieve vocabulary-independent detection. While the finite state transducer (FST) composition provides a standard indexing approach, the n-gram reverse indexing is more flexible in connectivity representation and confidence measuring and therefore may result in better performance than searching within the original lattices or the equivalent FSTs. In this paper we present an n-gram FST indexing approach which combines the flexibility of n-gram indexing and the efficiency of FST indexing. Specifically, we employ the n-gram indexing to relax connectivity in original lattices and then formalize the indices into an FST for online search. We demonstrate this approach with a phone-based STD task where the lattice is sparse due to strong language models. The results show that n-gram FST indexing provides not only better detection performance than lattice search, but also a faster detection than both conventional n-gram and FST indexing. Index Terms: spoken term indexing, finite state transducer, spoken term detection, speech recognition [en_US]
dc.format.extent: 4 pág. (4 pages) [es_ES]
dc.format.mimetype: application/pdf [en]
dc.language.iso: eng [en]
dc.publisher: International Speech Communication Association [en_US]
dc.relation.ispartof: Interspeech [en_US]
dc.rights: © 2012 ISCA [en_US]
dc.subject.other: Spoken term indexing [en_US]
dc.subject.other: Finite state transducer [en_US]
dc.subject.other: Spoken term detection [en_US]
dc.subject.other: Speech recognition [en_US]
dc.title: N-gram FST indexing for spoken term detection [en_US]
dc.type: conferenceObject [en]
dc.subject.eciencia: Informática (Computer Science) [es_ES]
dc.subject.eciencia: Telecomunicaciones (Telecommunications) [es_ES]
dc.relation.publisherversion: http://www.isca-speech.org/archive/interspeech_2012/i12_2093.html
dc.identifier.publicationfirstpage: 2093
dc.identifier.publicationlastpage: 2096
dc.relation.eventdate: September 9-13, 2012 [en_US]
dc.relation.eventnumber: 13
dc.relation.eventplace: Portland (United States) [en_US]
dc.relation.eventtitle: 13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012 [en_US]
dc.type.version: info:eu-repo/semantics/publishedVersion [en]
dc.contributor.group: Laboratorio de Tecnología Hombre-Computador (ING EPS-010) [es_ES]
dc.rights.accessRights: openAccess [en]
dc.authorUAM: Tejedor Noguerales, Javier (261273)
dc.facultadUAM: Escuela Politécnica Superior

