Spoken Content Retrieval: Searching Spontaneous Conversational Speech

Joachim Kohler, Martha Larson, Franciska de Jong, Roeland Ordelman, Wessel Kraaij

    Research output: Contribution to journal > Article > Professional

    Abstract

    The second workshop on Searching Spontaneous Conversational Speech (SSCS 2008) was held in Singapore on July 24, 2008 in conjunction with the 31st Annual International ACM SIGIR Conference. The goal of the workshop was to bring the speech community and the information retrieval community together. The forum was designed to be conducive to the close interaction and the intense discussion necessary to promote fusion of these fields into a single discipline with a concerted vision of spoken content retrieval. The talks and posters presented at the workshop covered a wide range of topics, including vocabulary-independent search, spoken term detection, combination of models/indexes, use of speech recognition lattices for search, segmentation, temporal analysis, benchmarking, exploitation of prosody, speech surrogates for user interfaces and multi-language collections. Demonstrations of speech-based retrieval systems from a variety of application domains introduced a strong practical emphasis into the workshop program. The workshop concluded with a panel discussion whose goal was to identify future research directions for speech retrieval. Among the important challenges identified during the panel discussion were: dealing with large-scale multimedia collections, representing audio/video content effectively in the user interface, focusing on perfecting the component technologies on which speech retrieval systems are based, and developing systems and approaches that will enable users (both content seekers and content providers) to actively create their own speech search applications or contribute to the indexability of their content.
    Original language: English
    Pages (from-to): 67-77
    Number of pages: 10
    Journal: SIGIR forum
    Volume: 42
    Issue number: 2
    DOIs: 10.1145/1480506.1480518
    Publication status: Published - Dec 2008
    Event: ACM/SIGIR Workshop on Searching Spontaneous Conversational Speech, SSCS 2008 - Singapore, Singapore
    Duration: 24 Jul 2008 - 24 Jul 2008

    Keywords

    • HMI-HF: Human Factors
    • Spoken document retrieval
    • EC Grant Agreement nr.: FP6/027685
    • EC Grant Agreement nr.: FP6/045480
    • EC Grant Agreement nr.: FP6/033104
    • EC Grant Agreement nr.: FP6/003812

    Cite this

    Kohler, Joachim ; Larson, Martha ; de Jong, Franciska ; Ordelman, Roeland ; Kraaij, Wessel. / Spoken Content Retrieval : Searching Spontaneous Conversational Speech. In: SIGIR forum. 2008 ; Vol. 42, No. 2. pp. 67-77.
    @article{e07a3ed6cd7848c5ba93ab1a1dd7d05a,
    title = "Spoken Content Retrieval: Searching Spontaneous Conversational Speech",
    abstract = "The second workshop on Searching Spontaneous Conversational Speech (SSCS 2008) was held in Singapore on July 24, 2008 in conjunction with the 31st Annual International ACM SIGIR Conference. The goal of the workshop was to bring the speech community and the information retrieval community together. The forum was designed to be conducive to the close interaction and the intense discussion necessary to promote fusion of these fields into a single discipline with a concerted vision of spoken content retrieval. At the workshop, talks and posters were presented covering a wide range of topics including vocabulary independent search, spoken term detection, combination of models/indexes, use of speech recognition lattices for search, segmentation, temporal analysis, benchmarking, exploitation of prosody, speech surrogates for user interfaces and multi-language collections. Demonstrations of speech-based retrieval systems from a variety of application domains introduced a strong practical emphasis into the workshop program. The workshop concluded with a panel discussion, whose goal it was to identify future research directions for speech retrieval. Among the important challenges identified during the panel discussions were: dealing with large scale multimedia collections, representing audio/video content effectively in the user interface, focusing on perfecting the component technologies on which speech retrieval systems are based, and developing systems and approaches that will enable users (both content seekers and content providers) to actively create their own speech search applications or contribute to the indexability of their content.",
    keywords = "HMI-HF: Human Factors, Spoken document retrieval, EC Grant Agreement nr.: FP6/027685, EC Grant Agreement nr.: FP6/045480, EC Grant Agreement nr.: FP6/033104, EC Grant Agreement nr.: FP6/003812",
    author = "Joachim Kohler and Martha Larson and {de Jong}, Franciska and Roeland Ordelman and Wessel Kraaij",
    year = "2008",
    month = "12",
    doi = "10.1145/1480506.1480518",
    language = "English",
    volume = "42",
    pages = "67--77",
    journal = "SIGIR forum",
    issn = "0163-5840",
    publisher = "Association for Computing Machinery (ACM)",
    number = "2",

    }

    TY - JOUR
    T1 - Spoken Content Retrieval
    T2 - Searching Spontaneous Conversational Speech
    AU - Kohler, Joachim
    AU - Larson, Martha
    AU - de Jong, Franciska
    AU - Ordelman, Roeland
    AU - Kraaij, Wessel
    PY - 2008/12
    Y1 - 2008/12
    AB - The second workshop on Searching Spontaneous Conversational Speech (SSCS 2008) was held in Singapore on July 24, 2008 in conjunction with the 31st Annual International ACM SIGIR Conference. The goal of the workshop was to bring the speech community and the information retrieval community together. The forum was designed to be conducive to the close interaction and the intense discussion necessary to promote fusion of these fields into a single discipline with a concerted vision of spoken content retrieval. At the workshop, talks and posters were presented covering a wide range of topics including vocabulary independent search, spoken term detection, combination of models/indexes, use of speech recognition lattices for search, segmentation, temporal analysis, benchmarking, exploitation of prosody, speech surrogates for user interfaces and multi-language collections. Demonstrations of speech-based retrieval systems from a variety of application domains introduced a strong practical emphasis into the workshop program. The workshop concluded with a panel discussion, whose goal it was to identify future research directions for speech retrieval. Among the important challenges identified during the panel discussions were: dealing with large scale multimedia collections, representing audio/video content effectively in the user interface, focusing on perfecting the component technologies on which speech retrieval systems are based, and developing systems and approaches that will enable users (both content seekers and content providers) to actively create their own speech search applications or contribute to the indexability of their content.
    KW - HMI-HF: Human Factors
    KW - Spoken document retrieval
    KW - EC Grant Agreement nr.: FP6/027685
    KW - EC Grant Agreement nr.: FP6/045480
    KW - EC Grant Agreement nr.: FP6/033104
    KW - EC Grant Agreement nr.: FP6/003812
    U2 - 10.1145/1480506.1480518
    DO - 10.1145/1480506.1480518
    M3 - Article
    VL - 42
    SP - 67
    EP - 77
    JO - SIGIR forum
    JF - SIGIR forum
    SN - 0163-5840
    IS - 2
    ER -