Language interpretation and generation for football commentary

Antinus Nijholt, Hendrikus J.A. op den Akker, Franciska M.G. de Jong

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review

Abstract

Our interest in the computer processing of football commentary was initially prompted by the EU/IST-funded project MUMIS (Multimedia Indexing and Searching), which started in 2000 [11, 18, 19]. The project's objective is to develop technology for the automatic indexing of multimedia programme material (texts, news streams, speech transcripts) and to develop a user interface that supports conceptual querying and browsing of the related video content over the internet. One of the innovative features of MUMIS is that it aims at the disclosure of video archives by applying information extraction techniques originally developed for the textual domain. Information extraction is typically suited to content in specific domains; in MUMIS it is applied to the domain of football. The project requires the integration of lexicons, an ontology and information extraction tools for this domain, and the development of merging algorithms to integrate the (incomplete) information coming from different sources. For example, the information extraction components should be able to extract some thirty different event types, using methods such as part-of-speech tagging, syntactic parsing, semantic tagging, and discourse analysis. Typical football events to be detected are kick-off, penalty, goal, half-time, free kick, etc. The user interface should help users formulate queries that can be matched against the annotations generated by the extraction component and linked to the time codes of the corresponding video fragments.

In parallel with the MUMIS project, we surveyed a number of research efforts that all deal with football commentary but differ in technology focus: retrieval, interpretation or generation of commentary, and related (but not necessarily language-oriented) research. There were two reasons for this investigation. One obvious reason is to find out how the new technology and tools can be adapted to similar applications. The other is to see how the domain knowledge obtained for extraction purposes can be employed for other intelligent applications in the same domain. Since at the same time several of our M.Sc. students became interested in learning multi-agent systems for developing teams for the RoboCup leagues, we decided to stay close to the football domain in this survey as well. In this paper we start with the survey of the domain-specific research. The aim of the survey was to obtain a comprehensive view of the field that could guide the selection of new research themes. In the second part of the paper we give a short introduction to the MUMIS project and show how it can be embedded in general football-related language technology research.
Original language: Undefined
Title of host publication: ACTAS-1: VIII Symposio Social
Editors: L.R. Miyares, C. E. Alvarez, M. R. Silva Alvarez
Place of Publication: Santiago de Cuba
Publisher: Centro de Lingüística Aplicada
Pages: 594-599
Number of pages: 6
ISBN (Print): 959-7174-01-4
Publication status: Published - 20 Jan 2003

Publication series

Name
Publisher: Centro de Lingüística Aplicada

Keywords

  • METIS-212961
  • EWI-6727
  • HMI-SLT: Speech and Language Technology
  • IR-63380

Cite this

Nijholt, A., op den Akker, H. J. A., & de Jong, F. M. G. (2003). Language interpretation and generation for football commentary. In L. R. Miyares, C. E. Alvarez, & M. R. Silva Alvarez (Eds.), ACTAS-1: VIII Symposio Social (pp. 594-599). Santiago de Cuba: Centro de Lingüística Aplicada.
@inproceedings{e7a705abaccf4be6b5395e6e5346e979,
title = "Language interpretation and generation for football commentary",
keywords = "METIS-212961, EWI-6727, HMI-SLT: Speech and Language Technology, IR-63380",
author = "Antinus Nijholt and {op den Akker}, {Hendrikus J.A.} and {de Jong}, {Franciska M.G.}",
year = "2003",
month = "1",
day = "20",
language = "Undefined",
isbn = "959-7174-01-4",
publisher = "Centro de Ling{\"u}{\'i}stica Aplicada",
pages = "594--599",
editor = "L.R. Miyares and Alvarez, {C. E.} and {Silva Alvarez}, {M. R.}",
booktitle = "ACTAS-1: VIII Symposio Social",

}

Nijholt, A, op den Akker, HJA & de Jong, FMG 2003, Language interpretation and generation for football commentary. in LR Miyares, CE Alvarez & MR Silva Alvarez (eds), ACTAS-1: VIII Symposio Social. Centro de Lingüística Aplicada, Santiago de Cuba, pp. 594-599.

TY - GEN

T1 - Language interpretation and generation for football commentary

AU - Nijholt, Antinus

AU - op den Akker, Hendrikus J.A.

AU - de Jong, Franciska M.G.

PY - 2003/1/20

Y1 - 2003/1/20

KW - METIS-212961

KW - EWI-6727

KW - HMI-SLT: Speech and Language Technology

KW - IR-63380

M3 - Conference contribution

SN - 959-7174-01-4

SP - 594

EP - 599

BT - ACTAS-1: VIII Symposio Social

A2 - Miyares, L.R.

A2 - Alvarez, C. E.

A2 - Silva Alvarez, M. R.

PB - Centro de Lingüística Aplicada

CY - Santiago de Cuba

ER -

Nijholt A, op den Akker HJA, de Jong FMG. Language interpretation and generation for football commentary. In Miyares LR, Alvarez CE, Silva Alvarez MR, editors, ACTAS-1: VIII Symposio Social. Santiago de Cuba: Centro de Lingüística Aplicada. 2003. p. 594-599