Automated speech and audio analysis for semantic access to multimedia

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    10 Citations (Scopus)
    56 Downloads (Pure)

    Abstract

    The deployment and integration of audio processing tools can enhance the semantic annotation of multimedia content, and as a consequence, improve the effectiveness of conceptual access tools. This paper overviews the various ways in which automatic speech and audio analysis can contribute to increased granularity of automatically extracted metadata. A number of techniques will be presented, including the alignment of speech and text resources, large vocabulary speech recognition, key word spotting and speaker classification. The applicability of techniques will be discussed from a media crossing perspective. The added value of the techniques and their potential contribution to the content value chain will be illustrated by the description of two (complementary) demonstrators for browsing broadcast news archives.
    Original languageUndefined
    Title of host publicationProceedings of the First International Conference on Semantic and Digital Media Technologies, SAMT 2006
    EditorsY. Avrithis, Y. Kompatsiaris, S. Staab, N.E. O' Connor
    Place of PublicationBerlin
    PublisherSpringer
    Pages226-240
    Number of pages15
    ISBN (Print)3-540-49335-2
    DOIs
    Publication statusPublished - 6 Dec 2006

    Publication series

    NameLecture Notes in Computer Science
    PublisherSpringer Verlag
    Number06EX1521
    Volume4306

    Keywords

    • HMI-SLT: Speech and Language Technology
    • HMI-MR: MULTIMEDIA RETRIEVAL
    • EC Grant Agreement nr.: FP6/506811
    • IR-66586
    • EWI-8073
    • EC Grant Agreement nr.: FP6/027413
    • METIS-237582
    • EC Grant Agreement nr.: FP6/027685

    Cite this

    de Jong, F. M. G., Ordelman, R. J. F., & Huijbregts, M. A. H. (2006). Automated speech and audio analysis for semantic access to multimedia. In Y. Avrithis, Y. Kompatsiaris, S. Staab, & N. E. O' Connor (Eds.), Proceedings of the First International Conference on Semantic and Digital Media Technologies, SAMT 2006 (pp. 226-240). [10.1007/11930334_18] (Lecture Notes in Computer Science; Vol. 4306, No. 06EX1521). Berlin: Springer. https://doi.org/10.1007/11930334_18