Cleaning up and standardizing a folktale corpus for humanities research

Iwe Muiser, Mariet Theune, Theo Meder

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

157 Downloads (Pure)

Abstract

Recordings in the field of folk narrative have been made around the world for many decades. By digitizing and annotating these texts, they are frozen in time and are better suited for searching, sorting and performing research on. This paper describes the first steps of the process of standardization and preparation of digital folktale metadata for scientific use and improving availability of the data for humanities and, more specifically, folktale research. The Dutch Folktale Database has been used as case study but, since these problems are common in all corpora with manually created metadata, the explanation of the process is kept as general as possible.
Original languageUndefined
Title of host publicationProceedings of the Second Workshop on Annotation of Corpora for Research in the Humanities (ACRH-2)
EditorsF. Mambrini, M. Passarotti, C. Sporleder
Place of PublicationLissabon
PublisherEdicoes Colibri
Pages63-74
Number of pages12
ISBN (Print)978-989-689-273-9
Publication statusPublished - Nov 2012
EventSecond Workshop on Annotation of Corpora for Research in the Humanities, ACRH-2 - Lissabon, Portugal
Duration: 29 Nov 201229 Nov 2012

Publication series

Name
PublisherEdições Colibri

Workshop

WorkshopSecond Workshop on Annotation of Corpora for Research in the Humanities, ACRH-2
Period29/11/1229/11/12
Other29 November 2012

Keywords

  • EWI-22627
  • IR-83440
  • METIS-293221

Cite this