Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

M.A.H. Huijbregts, Roeland J.F. Ordelman, Franciska M.G. de Jong

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    59 Citations (Scopus)
    234 Downloads (Pure)

    Abstract

    This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
    Original languageUndefined
    Title of host publicationProceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007
    Place of PublicationBerlin
    PublisherSpringer
    Pages78-90
    Number of pages13
    ISBN (Print)978-3-540-77033-6
    DOIs
    Publication statusPublished - Dec 2007
    EventSecond International Conference on Semantic and Digital Media Technologies, SAMT 2007 - Genoa, Italy
    Duration: 5 Dec 20077 Dec 2007

    Publication series

    NameLecture Notes in Computer Science
    PublisherSpringer Verlag
    Number07CH37910C
    Volume4816
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Conference

    ConferenceSecond International Conference on Semantic and Digital Media Technologies, SAMT 2007
    Period5/12/077/12/07
    Other5-7 December 2007

    Keywords

    • HMI-SLT: Speech and Language Technology
    • HMI-MR: MULTIMEDIA RETRIEVAL
    • EC Grant Agreement nr.: FP6/027685
    • METIS-245906
    • EWI-11664
    • IR-62090
    • EC Grant Agreement nr.: FP6/027413

    Cite this