Radio Oranje: Enhanced Access to a Historical Spoken Word Collection

Laurens Bastiaan van der Werff, W.F.L. Heeren, Roeland J.F. Ordelman, Franciska M.G. de Jong

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review

    Abstract

    Access to historical audio collections is typically very restricted: content is often only available on physical (analog) media and the metadata is usually limited to keywords, giving access at the level of relatively large fragments, e.g., an entire tape. Many spoken word heritage collections are now being digitized, which allows the introduction of more advanced search technology. This paper presents an approach that supports online access and search for recordings of historical speeches. A demonstrator has been built, based on the so-called Radio Oranje collection, which contains radio speeches by the Dutch Queen Wilhelmina that were broadcast during World War II. The audio has been aligned with its original 1940s manual transcriptions to create a time-stamped index that enables the speeches to be searched at the word level. Results are presented together with related photos from an external database.
    Original language: English
    Title of host publication: Computational Linguistics in the Netherlands
    Subtitle of host publication: Selected papers from the seventeenth CLIN meeting
    Editors: Peter Dirix, Ineke Schuurman, Vincent Vandeghinste, Frank van Eynde
    Place of publication: Utrecht
    Publisher: Landelijke Onderzoekschool Taalwetenschap
    Pages: 207-218
    Number of pages: 12
    ISBN (Print): 978-90-78328-41-4
    Publication status: Published - 12 Jan 2007
    Event: 17th Meeting of Computational Linguistics in the Netherlands, CLIN 2006 - University of Leuven, Leuven, Belgium
    Duration: 12 Jan 2007 to 12 Jan 2007
    Conference number: 17

    Publication series

    Name: LOT Occasional Series
    Publisher: Landelijke Onderzoekschool Taalwetenschap
    Number: 7

    Conference

    Conference: 17th Meeting of Computational Linguistics in the Netherlands, CLIN 2006
    Abbreviated title: CLIN
    Country: Belgium
    City: Leuven
    Period: 12/01/07 to 12/01/07
    Other: Held on January 12th, 2007

    Keywords

    • HMI-SLT: Speech and Language Technology
    • HMI-MR: Multimedia Retrieval
