Arousal and Valence Prediction in Spontaneous Emotional Speech: Felt versus Perceived Emotion

Khiet Phuong Truong, David A. van Leeuwen, Mark A. Neerincx, Franciska M.G. de Jong

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    21 Citations (Scopus)
    203 Downloads (Pure)

    Abstract

    In this paper, we describe emotion recognition experiments carried out for spontaneous affective speech with the aim to compare the added value of annotation of felt emotion versus annotation of perceived emotion. Using speech material available in the TNO-GAMING corpus (a corpus containing audiovisual recordings of people playing videogames), speech-based affect recognizers were developed that can predict Arousal and Valence scalar values. Two types of recognizers were developed in parallel: one trained with felt emotion annotations (generated by the gamers themselves) and one trained with perceived/observed emotion annotations (generated by a group of observers). The experiments showed that, in speech, with the methods and features currently used, observed emotions are easier to predict than felt emotions. The results suggest that recognition performance strongly depends on how and by whom the emotion annotations are carried out.
    Original languageEnglish
    Title of host publicationProceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech 2009)
    PublisherInternational Speech Communication Association
    Pages2027-2030
    Number of pages4
    Publication statusPublished - 2009
    Event10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, United Kingdom
    Duration: 6 Sept 200910 Sept 2009
    Conference number: 10
    http://www.interspeech2010.jpn.org/

    Publication series

    NamePublications Speech Processing Group, Brno University of Technology
    PublisherInternational Speech Communication Association
    ISSN (Print)1990-9772

    Conference

    Conference10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009
    Abbreviated titleINTERSPEECH
    Country/TerritoryUnited Kingdom
    CityBrighton
    Period6/09/0910/09/09
    Internet address

    Keywords

    • IR-68948
    • EWI-17024
    • METIS-264250

    Fingerprint

    Dive into the research topics of 'Arousal and Valence Prediction in Spontaneous Emotional Speech: Felt versus Perceived Emotion'. Together they form a unique fingerprint.

    Cite this