TY - GEN
T1 - Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition
AU - Huijbregts, M.A.H.
AU - Ordelman, Roeland J.F.
AU - de Jong, Franciska M.G.
N1 - 10.1007/978-3-540-77051-0_8
PY - 2007/12
Y1 - 2007/12
N2 - This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
AB - This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.
KW - HMI-SLT: Speech and Language Technology
KW - HMI-MR: MULTIMEDIA RETRIEVAL
KW - EC Grant Agreement nr.: FP6/027685
KW - METIS-245906
KW - EWI-11664
KW - IR-62090
KW - EC Grant Agreement nr.: FP6/027413
U2 - 10.1007/978-3-540-77051-0_8
DO - 10.1007/978-3-540-77051-0_8
M3 - Conference contribution
SN - 978-3-540-77033-6
T3 - Lecture Notes in Computer Science
SP - 78
EP - 90
BT - Proceedings of the Second International Conference on Semantic and Digital Media Technologies, SAMT 2007
PB - Springer
CY - Berlin
T2 - Second International Conference on Semantic and Digital Media Technologies, SAMT 2007
Y2 - 5 December 2007 through 7 December 2007
ER -