Contrast in concept-to-speech generation

Mariet Theune, M. Walker (Editor), O. Rambow (Editor)

    Research output: Contribution to journal, Article, Academic, peer-review

    7 Citations (Scopus)


    In concept-to-speech systems, spoken output is generated on the basis of a text that has been produced by the system itself. In such systems, linguistic information from the text generation component may be exploited to achieve a higher prosodic quality of the speech output than can be obtained in a plain text-to-speech system. In this paper we discuss how information from natural language generation can be used to compute prosody in a concept-to-speech system, focusing on the automatic marking of contrastive accents on the basis of information about the preceding discourse. We discuss and compare some formal approaches to this problem and present the results of a small perception experiment that was carried out to test which discourse contexts trigger a preference for contrastive accent, and which do not. Finally, we describe a method for marking contrastive accent in a generic concept-to-speech system called D2S. In D2S, contrastive accent is assigned to generated phrases expressing different aspects of similar events. Unlike in previous approaches, there is no restriction on the kind of entities that may be considered contrastive. This is in line with the observation that, given the ‘right’ context, any two items may stand in contrast to each other.
    Original language: Undefined
    Article number: 10.1016/S0885-2308(02)00010-4
    Pages (from-to): 491-530
    Number of pages: 40
    Journal: International journal of speech technology
    Issue number: 3-4
    Publication status: Published - Oct 2002


    • EWI-6580
    • METIS-207901
    • IR-63335
