Document categorization using multilingual associative networks based on Wikipedia

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

12 Downloads (Pure)

Abstract

Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.
Original languageUndefined
Title of host publicationWWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web
Place of PublicationNew York
PublisherAssociation for Computing Machinery (ACM)
Pages841-846
Number of pages6
ISBN (Print)978-1-4503-3473-0
DOIs
Publication statusPublished - May 2015
Event24th International World Wide Web Conference, WWW 2015 - Florence, Italy
Duration: 18 May 201522 May 2015
Conference number: 24
http://www2015.wwwconference.org/

Publication series

Name
PublisherACM

Conference

Conference24th International World Wide Web Conference, WWW 2015
Abbreviated titleWWW
CountryItaly
CityFlorence
Period18/05/1522/05/15
Internet address

Keywords

  • EWI-25995
  • METIS-312590
  • IR-96796

Cite this

Bloom, N., Theune, M., & de Jong, F. M. G. (2015). Document categorization using multilingual associative networks based on Wikipedia. In WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web (pp. 841-846). New York: Association for Computing Machinery (ACM). https://doi.org/10.1145/2740908.2743003
Bloom, Niels ; Theune, Mariet ; de Jong, Franciska M.G. / Document categorization using multilingual associative networks based on Wikipedia. WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. New York : Association for Computing Machinery (ACM), 2015. pp. 841-846
@inproceedings{ab61e292c7cc4baeb0b2d529e725a819,
title = "Document categorization using multilingual associative networks based on Wikipedia",
abstract = "Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.",
keywords = "EWI-25995, METIS-312590, IR-96796",
author = "Niels Bloom and Mariet Theune and {de Jong}, {Franciska M.G.}",
note = "10.1145/2740908.2743003",
year = "2015",
month = "5",
doi = "10.1145/2740908.2743003",
language = "Undefined",
isbn = "978-1-4503-3473-0",
publisher = "Association for Computing Machinery (ACM)",
pages = "841--846",
booktitle = "WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web",
address = "United States",

}

Bloom, N, Theune, M & de Jong, FMG 2015, Document categorization using multilingual associative networks based on Wikipedia. in WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. Association for Computing Machinery (ACM), New York, pp. 841-846, 24th International World Wide Web Conference, WWW 2015, Florence, Italy, 18/05/15. https://doi.org/10.1145/2740908.2743003

Document categorization using multilingual associative networks based on Wikipedia. / Bloom, Niels; Theune, Mariet; de Jong, Franciska M.G.

WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. New York : Association for Computing Machinery (ACM), 2015. p. 841-846.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Document categorization using multilingual associative networks based on Wikipedia

AU - Bloom, Niels

AU - Theune, Mariet

AU - de Jong, Franciska M.G.

N1 - 10.1145/2740908.2743003

PY - 2015/5

Y1 - 2015/5

N2 - Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.

AB - Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.

KW - EWI-25995

KW - METIS-312590

KW - IR-96796

U2 - 10.1145/2740908.2743003

DO - 10.1145/2740908.2743003

M3 - Conference contribution

SN - 978-1-4503-3473-0

SP - 841

EP - 846

BT - WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web

PB - Association for Computing Machinery (ACM)

CY - New York

ER -

Bloom N, Theune M, de Jong FMG. Document categorization using multilingual associative networks based on Wikipedia. In WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. New York: Association for Computing Machinery (ACM). 2015. p. 841-846 https://doi.org/10.1145/2740908.2743003