Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

51 Citations (Scopus)
49 Downloads (Pure)

Abstract

This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
Original language: Undefined
Title of host publication: Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007
Place of Publication: Berlin
Publisher: Springer
Pages: 78-90
Number of pages: 13
ISBN (Print): 978-3-540-77033-6
DOIs: https://doi.org/10.1007/978-3-540-77051-0_8
Publication status: Published - Dec 2007

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer Verlag
Number: 07CH37910C
Volume: 4816
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Keywords

  • HMI-SLT: Speech and Language Technology
  • HMI-MR: MULTIMEDIA RETRIEVAL
  • EC Grant Agreement nr.: FP6/027685
  • METIS-245906
  • EWI-11664
  • IR-62090
  • EC Grant Agreement nr.: FP6/027413

Cite this

Huijbregts, M. A. H., Ordelman, R. J. F., & de Jong, F. M. G. (2007). Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. In Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007 (pp. 78-90). [10.1007/978-3-540-77051-0_8] (Lecture Notes in Computer Science; Vol. 4816, No. 07CH37910C). Berlin: Springer. https://doi.org/10.1007/978-3-540-77051-0_8
Huijbregts, M.A.H. ; Ordelman, Roeland J.F. ; de Jong, Franciska M.G. / Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007. Berlin : Springer, 2007. pp. 78-90 (Lecture Notes in Computer Science; 07CH37910C).
@inproceedings{a293634230d54cf39df58ecb174ae8fe,
title = "Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition",
abstract = "This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.",
keywords = "HMI-SLT: Speech and Language Technology, HMI-MR: MULTIMEDIA RETRIEVAL, EC Grant Agreement nr.: FP6/027685, METIS-245906, EWI-11664, IR-62090, EC Grant Agreement nr.: FP6/027413",
author = "Huijbregts, M.A.H. and Ordelman, {Roeland J.F.} and {de Jong}, {Franciska M.G.}",
note = "10.1007/978-3-540-77051-0_8",
year = "2007",
month = "12",
doi = "10.1007/978-3-540-77051-0_8",
language = "Undefined",
isbn = "978-3-540-77033-6",
series = "Lecture Notes in Computer Science",
publisher = "Springer",
number = "07CH37910C",
pages = "78--90",
booktitle = "Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007",
volume = "4816",
address = "Berlin",
}

Huijbregts, MAH, Ordelman, RJF & de Jong, FMG 2007, Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. in Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007., 10.1007/978-3-540-77051-0_8, Lecture Notes in Computer Science, no. 07CH37910C, vol. 4816, Springer, Berlin, pp. 78-90. https://doi.org/10.1007/978-3-540-77051-0_8

Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. / Huijbregts, M.A.H.; Ordelman, Roeland J.F.; de Jong, Franciska M.G.

Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007. Berlin : Springer, 2007. p. 78-90 10.1007/978-3-540-77051-0_8 (Lecture Notes in Computer Science; Vol. 4816, No. 07CH37910C).


TY  - GEN
T1  - Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition
AU  - Huijbregts, M.A.H.
AU  - Ordelman, Roeland J.F.
AU  - de Jong, Franciska M.G.
N1  - 10.1007/978-3-540-77051-0_8
PY  - 2007/12
Y1  - 2007/12
N2  - This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
AB  - This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
KW  - HMI-SLT: Speech and Language Technology
KW  - HMI-MR: MULTIMEDIA RETRIEVAL
KW  - EC Grant Agreement nr.: FP6/027685
KW  - METIS-245906
KW  - EWI-11664
KW  - IR-62090
KW  - EC Grant Agreement nr.: FP6/027413
U2  - 10.1007/978-3-540-77051-0_8
DO  - 10.1007/978-3-540-77051-0_8
M3  - Conference contribution
SN  - 978-3-540-77033-6
T3  - Lecture Notes in Computer Science
SP  - 78
EP  - 90
BT  - Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007
PB  - Springer
CY  - Berlin
ER  -

Huijbregts MAH, Ordelman RJF, de Jong FMG. Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition. In Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007. Berlin: Springer. 2007. p. 78-90. 10.1007/978-3-540-77051-0_8. (Lecture Notes in Computer Science; 07CH37910C). https://doi.org/10.1007/978-3-540-77051-0_8