TY - BOOK
T1 - Lexicon Optimization for Dutch Speech Recognition in Spoken Document Retrieval
AU - Ordelman, Roeland J.F.
AU - van Hessen, Adrianus J.
AU - de Jong, Franciska M.G.
N1 - Imported from CTIT
PY - 2001/6
Y1 - 2001/6
N2 - In this paper, ongoing work concerning the language modelling and lexicon optimization of a Dutch speech recognition system for Spoken Document Retrieval is described: the collection and normalization of a training data set and the optimization of our recognition lexicon. Effects on lexical coverage of the amount of training data, of decompounding compound words and of different selection methods for proper names and acronyms are discussed.
AB - In this paper, ongoing work concerning the language modelling and lexicon optimization of a Dutch speech recognition system for Spoken Document Retrieval is described: the collection and normalization of a training data set and the optimization of our recognition lexicon. Effects on lexical coverage of the amount of training data, of decompounding compound words and of different selection methods for proper names and acronyms are discussed.
M3 - Report
T3 - CTIT technical report series
BT - Lexicon Optimization for Dutch Speech Recognition in Spoken Document Retrieval
PB - Centre for Telematics and Information Technology (CTIT)
CY - Enschede
ER -