Towards Affordable Disclosure of Spoken Heritage Archives

Roeland Ordelman, Willemijn Heeren, Franciska de Jong, Marijn Huijbregts, Djoerd Hiemstra

Research output: Contribution to journalArticleAcademicpeer-review

6 Citations (Scopus)
17 Downloads (Pure)

Abstract

This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken heritage archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, we at least want to provide search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition - supporting e.g., within-document search - are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is not yet satisfactory, and requires additional research.
Original languageEnglish
Article number687
Number of pages17
JournalJournal of digital information
Volume10
Issue number6
Publication statusPublished - Dec 2009

Fingerprint

Speech recognition
concentration camp
Feedback
interview
World War II
performance
experience

Keywords

  • DB-MMR: MULTIMEDIA RETRIEVAL
  • HMI-MR: MULTIMEDIA RETRIEVAL
  • HMI-SLT: Speech and Language Technology
  • Speech Indexing
  • Speech Recognition
  • Multimedia Retrieval
  • Workflow
  • Usability
  • Cultural heritage archives

Cite this

@article{d92f0a84eb194e11901bd50e727b82e0,
title = "Towards Affordable Disclosure of Spoken Heritage Archives",
abstract = "This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken heritage archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, we at least want to provide search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition - supporting e.g., within-document search - are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is not yet satisfactory, and requires additional research.",
keywords = "DB-MMR: MULTIMEDIA RETRIEVAL, HMI-MR: MULTIMEDIA RETRIEVAL, HMI-SLT: Speech and Language Technology, Speech Indexing, Speech Recognition, Multimedia Retrieval, Workflow, Usability, Cultural heritage archives",
author = "Roeland Ordelman and Willemijn Heeren and {de Jong}, Franciska and Marijn Huijbregts and Djoerd Hiemstra",
note = "Special Issue on Information Access to Cultural Heritage",
year = "2009",
month = "12",
language = "English",
volume = "10",
journal = "Journal of digital information",
issn = "1368-7506",
publisher = "British Computer Society",
number = "6",

}

Towards Affordable Disclosure of Spoken Heritage Archives. / Ordelman, Roeland; Heeren, Willemijn; de Jong, Franciska; Huijbregts, Marijn; Hiemstra, Djoerd.

In: Journal of digital information, Vol. 10, No. 6, 687, 12.2009.

Research output: Contribution to journalArticleAcademicpeer-review

TY - JOUR

T1 - Towards Affordable Disclosure of Spoken Heritage Archives

AU - Ordelman, Roeland

AU - Heeren, Willemijn

AU - de Jong, Franciska

AU - Huijbregts, Marijn

AU - Hiemstra, Djoerd

N1 - Special Issue on Information Access to Cultural Heritage

PY - 2009/12

Y1 - 2009/12

N2 - This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken heritage archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, we at least want to provide search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition - supporting e.g., within-document search - are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is not yet satisfactory, and requires additional research.

AB - This paper presents and discusses ongoing work aiming at affordable disclosure of real-world spoken heritage archives in general, and in particular of a collection of recorded interviews with Dutch survivors of World War II concentration camp Buchenwald. Given such collections, we at least want to provide search at different levels and a flexible way of presenting results. Strategies for automatic annotation based on speech recognition - supporting e.g., within-document search - are outlined and discussed with respect to the Buchenwald interview collection. In addition, usability aspects of the spoken word search are discussed on the basis of our experiences with the online Buchenwald web portal. It is concluded that, although user feedback is generally fairly positive, automatic annotation performance is not yet satisfactory, and requires additional research.

KW - DB-MMR: MULTIMEDIA RETRIEVAL

KW - HMI-MR: MULTIMEDIA RETRIEVAL

KW - HMI-SLT: Speech and Language Technology

KW - Speech Indexing

KW - Speech Recognition

KW - Multimedia Retrieval

KW - Workflow

KW - Usability

KW - Cultural heritage archives

M3 - Article

VL - 10

JO - Journal of digital information

JF - Journal of digital information

SN - 1368-7506

IS - 6

M1 - 687

ER -