Whereas nowadays within-word co-articulation effects are usually sufficiently dealt with in automatic speech recognition, this is not always the case with phrase level co-articulation effects (PLC). This paper describes a first approach in dealing with phrase level co-articulation by applying these rules on the reference transcripts used for training our recogniser and by adding a set of temporary PLC phones that later on will be mapped on the original phones. In fact we temporarily break down acoustic context into a general and a PLC context. With this method, more robust models could be trained because phones that are confused due to PLC effects like for example /v/-/f/ and /z/-/s/, receive their own models. A first attempt to apply this method is described.
|Title of host publication||Proceedings of the ESCA ETRW Workshop Accessing Information in Spoken Audio|
|Editors||Tony Robinson, Steve Renals|
|Place of Publication||Cambridge, UK|
|Number of pages||5|
|Publication status||Published - 19 Feb 1999|
Ordelman, R. J. F., van Hessen, A. J., & van Leeuwen, D. A. (1999). Dealing with Phrase Level Co-Articulation (PLC) in speech recognition: a first approach. In T. Robinson, & S. Renals (Eds.), Proceedings of the ESCA ETRW Workshop Accessing Information in Spoken Audio (pp. 64-68). Cambridge, UK: ESCA.