Towards Affordable Disclosure of Spoken Word Archives

Roeland J.F. Ordelman, W.F.L. Heeren, M.A.H. Huijbregts, Djoerd Hiemstra, Franciska M.G. de Jong

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

94 Downloads (Pure)

Abstract

This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken word archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, the least we want to be able to provide is search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition – supporting e.g., within-document search– are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is still far from satisfactory, and requires additional research.
Original languageUndefined
Title of host publicationProceedings of the ECDL 2008 Workshop on Information Access to Cultural Heritage (IACH2008)
EditorsM Larson, K Fernie, J Oomen, J. Cigarran
Place of PublicationAmsterdam, The Netherlands
PublisherILPS, University of Amsterdam
Pages-
Number of pages15
ISBN (Print)978-90-813489-1-1
Publication statusPublished - 18 Sept 2008
EventECDL 2008 Workshop on Information Access to Cultural Heritage, IACH 2008 - Aarhus, Denmark
Duration: 18 Sept 200818 Sept 2008

Publication series

Name
PublisherILPS, University of Amsterdam
NumberSupplement

Conference

ConferenceECDL 2008 Workshop on Information Access to Cultural Heritage, IACH 2008
Abbreviated titleIACH 2008
Country/TerritoryDenmark
CityAarhus
Period18/09/0818/09/08

Keywords

  • EWI-13526
  • HMI-MR: MULTIMEDIA RETRIEVAL
  • IR-65010
  • METIS-251204
  • HMI-SLT: Speech and Language Technology

Cite this