Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    51 Citations (Scopus)
    58 Downloads (Pure)

    Abstract

    This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
    Original languageUndefined
    Title of host publicationProceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007
    Place of PublicationBerlin
    PublisherSpringer
    Pages78-90
    Number of pages13
    ISBN (Print)978-3-540-77033-6
    DOIs
    Publication statusPublished - Dec 2007

    Publication series

    NameLecture Notes in Computer Science
    PublisherSpringer Verlag
    Number07CH37910C
    Volume4816
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Keywords

    • HMI-SLT: Speech and Language Technology
    • HMI-MR: MULTIMEDIA RETRIEVAL
    • EC Grant Agreement nr.: FP6/027685
    • METIS-245906
    • EWI-11664
    • IR-62090
    • EC Grant Agreement nr.: FP6/027413

    Cite this

    Huijbregts, M. A. H., Ordelman, R. J. F., & de Jong, F. M. G. (2007). Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. In Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007 (pp. 78-90). [10.1007/978-3-540-77051-0_8] (Lecture Notes in Computer Science; Vol. 4816, No. 07CH37910C). Berlin: Springer. https://doi.org/10.1007/978-3-540-77051-0_8
    Huijbregts, M.A.H. ; Ordelman, Roeland J.F. ; de Jong, Franciska M.G. / Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007. Berlin : Springer, 2007. pp. 78-90 (Lecture Notes in Computer Science; 07CH37910C).
    @inproceedings{a293634230d54cf39df58ecb174ae8fe,
    title = "Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition",
    abstract = "This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.",
    keywords = "HMI-SLT: Speech and Language Technology, HMI-MR: MULTIMEDIA RETRIEVAL, EC Grant Agreement nr.: FP6/027685, METIS-245906, EWI-11664, IR-62090, EC Grant Agreement nr.: FP6/027413",
    author = "M.A.H. Huijbregts and Ordelman, {Roeland J.F.} and {de Jong}, {Franciska M.G.}",
    note = "10.1007/978-3-540-77051-0_8",
    year = "2007",
    month = "12",
    doi = "10.1007/978-3-540-77051-0_8",
    language = "Undefined",
    isbn = "978-3-540-77033-6",
    series = "Lecture Notes in Computer Science",
    publisher = "Springer",
    number = "07CH37910C",
    pages = "78--90",
    booktitle = "Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007",

    }

    Huijbregts, MAH, Ordelman, RJF & de Jong, FMG 2007, Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. in Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007., 10.1007/978-3-540-77051-0_8, Lecture Notes in Computer Science, no. 07CH37910C, vol. 4816, Springer, Berlin, pp. 78-90. https://doi.org/10.1007/978-3-540-77051-0_8

    Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. / Huijbregts, M.A.H.; Ordelman, Roeland J.F.; de Jong, Franciska M.G.

    Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007. Berlin : Springer, 2007. p. 78-90 10.1007/978-3-540-77051-0_8 (Lecture Notes in Computer Science; Vol. 4816, No. 07CH37910C).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    TY - GEN

    T1 - Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

    AU - Huijbregts, M.A.H.

    AU - Ordelman, Roeland J.F.

    AU - de Jong, Franciska M.G.

    N1 - 10.1007/978-3-540-77051-0_8

    PY - 2007/12

    Y1 - 2007/12

    N2 - This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.

    AB - This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.

    KW - HMI-SLT: Speech and Language Technology

    KW - HMI-MR: MULTIMEDIA RETRIEVAL

    KW - EC Grant Agreement nr.: FP6/027685

    KW - METIS-245906

    KW - EWI-11664

    KW - IR-62090

    KW - EC Grant Agreement nr.: FP6/027413

    U2 - 10.1007/978-3-540-77051-0_8

    DO - 10.1007/978-3-540-77051-0_8

    M3 - Conference contribution

    SN - 978-3-540-77033-6

    T3 - Lecture Notes in Computer Science

    SP - 78

    EP - 90

    BT - Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007

    PB - Springer

    CY - Berlin

    ER -

    Huijbregts MAH, Ordelman RJF, de Jong FMG. Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. In Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007. Berlin: Springer. 2007. p. 78-90. 10.1007/978-3-540-77051-0_8. (Lecture Notes in Computer Science; 07CH37910C). https://doi.org/10.1007/978-3-540-77051-0_8