Speech Indexing

Roeland J.F. Ordelman, Franciska M.G. de Jong, D.A. van Leeuwen

    Research output: Chapter in Book/Report/Conference proceeding; Chapter; Academic; peer-review


    This chapter will focus on the automatic extraction of information from the speech in multimedia documents. This approach is often referred to as speech indexing, and it can be regarded as a subfield of audio indexing, which also incorporates, for example, the analysis of music and sounds. If the objective of recognizing the spoken words is to support retrieval, one commonly speaks of spoken document retrieval (SDR). If the objective is the coupling of various media types, the term media mining or even cross-media mining is used. Most attention in this chapter will go to SDR. The focus is less on searching (an index of) a multimedia database than on enabling multiple views on the data by cross-linking all the available multifaceted information sources in a multimedia database. Cross-media mining will be discussed in more detail in section 1.6.
    Original language: Undefined
    Title of host publication: Multimedia Retrieval
    Editors: Henk Blanken, A.P. de Vries, H.E. Blok, L. Feng
    Place of Publication: Heidelberg
    Number of pages: 26
    ISBN (Print): 978-3-540-72894-8
    Publication status: Published - 2007

    Publication series

    Name: Data-Centric Systems and Applications
    Publisher: Springer Verlag


    • EWI-11008
    • HMI-SLT: Speech and Language Technology
    • IR-61901
    • Audio search
    • Speech Indexing
    • Speech Recognition
    • Spoken Document Retrieval
    • METIS-241883
