HAI Alice - An Information-Providing Closed-Domain Dialog Corpus

Jelte Barachia van Waterschoot, Guillaume Dubuisson Duplessis, Lorenzo Gatti, Merijn Bruijnes, Dirk K.J. Heylen

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review

Abstract

The contribution of this paper is twofold: 1) we provide a public corpus for Human-Agent Interaction (where the agent is controlled by a Wizard of Oz) and 2) we show a study on verbal alignment in Human-Agent Interaction, to exemplify the corpus' use. In our recordings for the Human-Agent Interaction Alice-corpus (HAI Alice-corpus), participants talked to a wizarded agent, who provided them with information about the book Alice in Wonderland and its author.
The wizard had immediate and almost full control over the agent's verbal and nonverbal behavior, as the wizard provided the agent's speech through his own voice and his facial expressions were directly copied onto the agent. The agent's hand gestures were controlled through a button interface.
Data was collected to create a corpus with unexpected situations, such as misunderstandings, (accidental) false information, and interruptions. The HAI Alice-corpus consists of transcribed audio-video recordings of 15 conversations (more than 900 utterances) between users and the wizarded agent. As a use-case example, we measured the verbal alignment between the user and the agent. The paper contains information about the setup of the data collection, the unexpected situations and a description of our verbal alignment study.
Original language: English
Title of host publication: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Editors: Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
Place of Publication: Miyazaki
Publisher: European Language Resources Association (ELRA)
Pages: 2746-2752
Number of pages: 7
Edition: 11
ISBN (Electronic): 9791095546009
Publication status: Published - 9 May 2018
Event: 11th International Conference on Language Resources and Evaluation 2018 - Phoenix Seagaia Resort, Miyazaki, Japan
Duration: 7 May 2018 to 12 May 2018
Conference number: 11
http://lrec2018.lrec-conf.org/en/

Conference

Conference: 11th International Conference on Language Resources and Evaluation 2018
Abbreviated title: LREC 2018
Country: Japan
City: Miyazaki
Period: 7/05/18 to 12/05/18

Keywords

  • Corpus
  • Human-Agent Interaction
  • Wizard of Oz
  • Closed Domain
  • Information-Providing
  • Unexpected Situations
  • Verbal Alignment

Cite this

van Waterschoot, J. B., Dubuisson Duplessis, G., Gatti, L., Bruijnes, M., & Heylen, D. K. J. (2018). HAI Alice - An Information-Providing Closed-Domain Dialog Corpus. In N. Calzolari, K. Choukri, C. Cieri, T. Declerck, S. Goggi, K. Hasida, H. Isahara, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis, ... T. Tokunaga (Eds.), Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) (11 ed., pp. 2746-2752). [429] Miyazaki: European Language Resources Association (ELRA).
@inproceedings{27eb248359874a43bd6f55ee6ac3a39e,
title = "HAI Alice - An Information-Providing Closed-Domain Dialog Corpus",
abstract = "The contribution of this paper is twofold: 1) we provide a public corpus for Human-Agent Interaction (where the agent is controlled by a Wizard of Oz) and 2) we show a study on verbal alignment in Human-Agent Interaction, to exemplify the corpus' use. In our recordings for the Human-Agent Interaction Alice-corpus (HAI Alice-corpus), participants talked to a wizarded agent, who provided them with information about the book Alice in Wonderland and its author. The wizard had immediate and almost full control over the agent's verbal and nonverbal behavior, as the wizard provided the agent's speech through his own voice and his facial expressions were directly copied onto the agent. The agent's hand gestures were controlled through a button interface. Data was collected to create a corpus with unexpected situations, such as misunderstandings, (accidental) false information, and interruptions. The HAI Alice-corpus consists of transcribed audio-video recordings of 15 conversations (more than 900 utterances) between users and the wizarded agent. As a use-case example, we measured the verbal alignment between the user and the agent. The paper contains information about the setup of the data collection, the unexpected situations and a description of our verbal alignment study.",
keywords = "Corpus, Human-Agent Interaction, Wizard of Oz, Closed Domain, Information-Providing, Unexpected Situations, Verbal Alignment",
author = "{van Waterschoot}, {Jelte Barachia} and {Dubuisson Duplessis}, Guillaume and Lorenzo Gatti and Merijn Bruijnes and Heylen, {Dirk K.J.}",
year = "2018",
month = "5",
day = "9",
language = "English",
pages = "2746--2752",
editor = "Nicoletta Calzolari and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and H{\'e}l{\`e}ne Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga",
booktitle = "Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)",
publisher = "European Language Resources Association (ELRA)",
edition = "11",

}
