Towards Spoken-Document Retrieval for the Enterprise: Approximate Word-Lattice Indexing with Text Indexers
- Frank Seide ,
- Peng Yu ,
- Roger (Peng) Yu ,
- Yu Shi
Proc. IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) |
Enterprise-scale search engines are generally designed for linear text. Linear text is suboptimal for audio search, where accuracy can be significantly improved if the search includes alternate recognition candidates, commonly represented as word lattices. We propose two methods to enable text indexers to approximately index lattices with little or no code change: “TMI” (Time-based Merging for Indexing) aims at lattice-index size reduction, and the “sausage”-like “TALE” (Time-Anchored Lattice Expansion) approximation requires no indexer-code or data-format changes at all. On four enterprise-type data sets (meetings, phone calls, lectures, and voicemail), TMI and TALE improve accuracy by 30-60% for multi-word phrase searches and by 130% for two-term AND queries, compared to indexing linear text.