Generating Expressive Speech for Storytelling Applications

G. Bailly (Editor), Mariet Theune, Koen Meijs, N. Campbell (Editor), W. Hamza (Editor), Dirk K.J. Heylen, Roeland J.F. Ordelman, H. Hoge (Editor), T. Jianhua (Editor)

    Research output: Contribution to journal › Article › Academic › peer-review

    70 Citations (Scopus)
    40 Downloads (Pure)

    Abstract

    Work on expressive speech synthesis has long focused on the expression of basic emotions. In recent years, however, interest in other expressive styles has been increasing. The research presented in this paper aims at the generation of a storytelling speaking style, which is suitable for storytelling applications and, more generally, for applications aimed at children. Based on an analysis of human storytellers' speech, we designed and implemented a set of prosodic rules for converting "neutral" speech, as produced by a text-to-speech system, into storytelling speech. An evaluation of our storytelling speech generation system showed encouraging results.
    Original language: Undefined
    Pages (from-to): 1137-1144
    Number of pages: 8
    Journal: IEEE transactions on audio, speech and language processing
    Volume: 14
    Issue number: 2/4
    DOIs: https://doi.org/10.1109/TASL.2006.876129
    Publication status: Published - Jul 2006

    Keywords

    • Speech Analysis
    • Child-directed speech
    • IR-57674
    • Speech synthesis
    • EWI-6842
    • expressive prosody
    • expressive speech
    • METIS-238148

    Cite this

    Bailly, G. (Editor) ; Theune, Mariet ; Meijs, Koen ; Campbell, N. (Editor) ; Hamza, W. (Editor) ; Heylen, Dirk K.J. ; Ordelman, Roeland J.F. ; Hoge, H. (Editor) ; Jianhua, T. (Editor). / Generating Expressive Speech for Storytelling Applications. In: IEEE transactions on audio, speech and language processing. 2006 ; Vol. 14, No. 2/4. pp. 1137-1144.
    @article{39bb0a19dece49d19a6749106e64e487,
    title = "Generating Expressive Speech for Storytelling Applications",
    abstract = "Work on expressive speech synthesis has long focused on the expression of basic emotions. In recent years, however, interest in other expressive styles has been increasing. The research presented in this paper aims at the generation of a storytelling speaking style, which is suitable for storytelling applications and, more generally, for applications aimed at children. Based on an analysis of human storytellers' speech, we designed and implemented a set of prosodic rules for converting {"}neutral{"} speech, as produced by a text-to-speech system, into storytelling speech. An evaluation of our storytelling speech generation system showed encouraging results.",
    keywords = "Speech Analysis, Child-directed speech, IR-57674, Speech synthesis, EWI-6842, expressive prosody, expressive speech, METIS-238148",
    author = "Mariet Theune and Koen Meijs and Heylen, {Dirk K.J.} and Ordelman, {Roeland J.F.}",
    editor = "G. Bailly and N. Campbell and W. Hamza and H. Hoge and T. Jianhua",
    note = "10.1109/TASL.2006.876129",
    year = "2006",
    month = "7",
    doi = "10.1109/TASL.2006.876129",
    language = "Undefined",
    volume = "14",
    pages = "1137--1144",
    journal = "IEEE transactions on audio, speech and language processing",
    issn = "1558-7916",
    publisher = "IEEE",
    number = "2/4",

    }

    Bailly, G (ed.), Theune, M, Meijs, K, Campbell, N (ed.), Hamza, W (ed.), Heylen, DKJ, Ordelman, RJF, Hoge, H (ed.) & Jianhua, T (ed.) 2006, 'Generating Expressive Speech for Storytelling Applications', IEEE transactions on audio, speech and language processing, vol. 14, no. 2/4, pp. 1137-1144. https://doi.org/10.1109/TASL.2006.876129

    Generating Expressive Speech for Storytelling Applications. / Bailly, G. (Editor); Theune, Mariet; Meijs, Koen; Campbell, N. (Editor); Hamza, W. (Editor); Heylen, Dirk K.J.; Ordelman, Roeland J.F.; Hoge, H. (Editor); Jianhua, T. (Editor).

    In: IEEE transactions on audio, speech and language processing, Vol. 14, No. 2/4, 07.2006, p. 1137-1144.

    TY - JOUR

    T1 - Generating Expressive Speech for Storytelling Applications

    AU - Theune, Mariet

    AU - Meijs, Koen

    AU - Heylen, Dirk K.J.

    AU - Ordelman, Roeland J.F.

    A2 - Bailly, G.

    A2 - Campbell, N.

    A2 - Hamza, W.

    A2 - Hoge, H.

    A2 - Jianhua, T.

    N1 - 10.1109/TASL.2006.876129

    PY - 2006/7

    Y1 - 2006/7

    N2 - Work on expressive speech synthesis has long focused on the expression of basic emotions. In recent years, however, interest in other expressive styles has been increasing. The research presented in this paper aims at the generation of a storytelling speaking style, which is suitable for storytelling applications and, more generally, for applications aimed at children. Based on an analysis of human storytellers' speech, we designed and implemented a set of prosodic rules for converting "neutral" speech, as produced by a text-to-speech system, into storytelling speech. An evaluation of our storytelling speech generation system showed encouraging results.

    AB - Work on expressive speech synthesis has long focused on the expression of basic emotions. In recent years, however, interest in other expressive styles has been increasing. The research presented in this paper aims at the generation of a storytelling speaking style, which is suitable for storytelling applications and, more generally, for applications aimed at children. Based on an analysis of human storytellers' speech, we designed and implemented a set of prosodic rules for converting "neutral" speech, as produced by a text-to-speech system, into storytelling speech. An evaluation of our storytelling speech generation system showed encouraging results.

    KW - Speech Analysis

    KW - Child-directed speech

    KW - IR-57674

    KW - Speech synthesis

    KW - EWI-6842

    KW - expressive prosody

    KW - expressive speech

    KW - METIS-238148

    U2 - 10.1109/TASL.2006.876129

    DO - 10.1109/TASL.2006.876129

    M3 - Article

    VL - 14

    SP - 1137

    EP - 1144

    JO - IEEE transactions on audio, speech and language processing

    JF - IEEE transactions on audio, speech and language processing

    SN - 1558-7916

    IS - 2/4

    ER -