HAI Alice - An Information-Providing Closed-Domain Dialog Corpus

Jelte Barachia van Waterschoot, Guillaume Dubuisson Duplessis, Lorenzo Gatti, Merijn Bruijnes, Dirk K.J. Heylen

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    37 Downloads (Pure)


    The contribution of this paper is twofold: 1) we provide a public corpus for Human-Agent Interaction (where the agent is controlled by a Wizard of Oz) and 2) we show a study on verbal alignment in Human-Agent Interaction, to exemplify the corpus' use. In our recordings for the Human-Agent Interaction Alice-corpus (HAI Alice-corpus), participants talked to a wizarded agent, who provided them with information about the book Alice in Wonderland and its author.
    The wizard had immediate and almost full control over the agent's verbal and nonverbal behavior, as the wizard provided the agent's speech through his own voice and his facial expressions were directly copied onto the agent. The agent's hand gestures were controlled through a button interface.
    Data was collected to create a corpus with unexpected situations, such as misunderstandings, (accidental) false information, and interruptions. The HAI Alice-corpus consists of transcribed audio-video recordings of 15 conversations (more than 900 utterances) between users and the wizarded agent. As a use-case example, we measured the verbal alignment between the user and the agent. The paper contains information about the setup of the data collection, the unexpected situations and a description of our verbal alignment study.
    Original languageEnglish
    Title of host publicationProceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
    EditorsNicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
    Place of PublicationMiyazaki
    PublisherEuropean Language Resources Association (ELRA)
    Number of pages7
    ISBN (Electronic)9791095546009
    Publication statusPublished - 9 May 2018
    Event11th International Conference on Language Resources and Evaluation 2018 - Phoenix Seagaia Resort, Miyazaki, Japan
    Duration: 7 May 201812 May 2018
    Conference number: 11


    Conference11th International Conference on Language Resources and Evaluation 2018
    Abbreviated titleLREC 2018
    Internet address


    • Corpus
    • Human-Agent Interaction
    • Wizard of Oz
    • Closed Domain
    • Information-Providing
    • Unexpected Situations
    • Verbal Alignment


    Dive into the research topics of 'HAI Alice - An Information-Providing Closed-Domain Dialog Corpus'. Together they form a unique fingerprint.

    Cite this