Document categorization using multilingual associative networks based on Wikipedia

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    12 Downloads (Pure)

    Abstract

    Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.
    Original languageUndefined
    Title of host publicationWWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web
    Place of PublicationNew York
    PublisherAssociation for Computing Machinery (ACM)
    Pages841-846
    Number of pages6
    ISBN (Print)978-1-4503-3473-0
    DOIs
    Publication statusPublished - May 2015
    Event24th International World Wide Web Conference, WWW 2015: WWW2015 - Florence, Florence, Italy
    Duration: 18 May 201522 May 2015
    Conference number: 24
    http://www2015.wwwconference.org/
    http://www.www2015.it/

    Publication series

    Name
    PublisherACM

    Conference

    Conference24th International World Wide Web Conference, WWW 2015
    Abbreviated titleWWW
    CountryItaly
    CityFlorence
    Period18/05/1522/05/15
    Internet address

    Keywords

    • EWI-25995
    • METIS-312590
    • IR-96796

    Cite this

    Bloom, N., Theune, M., & de Jong, F. M. G. (2015). Document categorization using multilingual associative networks based on Wikipedia. In WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web (pp. 841-846). New York: Association for Computing Machinery (ACM). https://doi.org/10.1145/2740908.2743003
    Bloom, Niels ; Theune, Mariet ; de Jong, Franciska M.G. / Document categorization using multilingual associative networks based on Wikipedia. WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. New York : Association for Computing Machinery (ACM), 2015. pp. 841-846
    @inproceedings{ab61e292c7cc4baeb0b2d529e725a819,
    title = "Document categorization using multilingual associative networks based on Wikipedia",
    abstract = "Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.",
    keywords = "EWI-25995, METIS-312590, IR-96796",
    author = "Niels Bloom and Mariet Theune and {de Jong}, {Franciska M.G.}",
    note = "10.1145/2740908.2743003",
    year = "2015",
    month = "5",
    doi = "10.1145/2740908.2743003",
    language = "Undefined",
    isbn = "978-1-4503-3473-0",
    publisher = "Association for Computing Machinery (ACM)",
    pages = "841--846",
    booktitle = "WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web",
    address = "United States",

    }

    Bloom, N, Theune, M & de Jong, FMG 2015, Document categorization using multilingual associative networks based on Wikipedia. in WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. Association for Computing Machinery (ACM), New York, pp. 841-846, 24th International World Wide Web Conference, WWW 2015, Florence, Italy, 18/05/15. https://doi.org/10.1145/2740908.2743003

    Document categorization using multilingual associative networks based on Wikipedia. / Bloom, Niels; Theune, Mariet; de Jong, Franciska M.G.

    WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. New York : Association for Computing Machinery (ACM), 2015. p. 841-846.

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    TY - GEN

    T1 - Document categorization using multilingual associative networks based on Wikipedia

    AU - Bloom, Niels

    AU - Theune, Mariet

    AU - de Jong, Franciska M.G.

    N1 - 10.1145/2740908.2743003

    PY - 2015/5

    Y1 - 2015/5

    N2 - Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.

    AB - Associative networks are a connectionist language model with the ability to categorize large sets of documents. In this research we combine monolingual associative networks based on Wikipedia to create a larger, multilingual associative network, using the cross-lingual connections between Wikipedia articles. We prove that such multilingual associative networks perform better than monolingual associative networks in tasks related to document categorization by comparing the results of both types of associative network on a multilingual dataset.

    KW - EWI-25995

    KW - METIS-312590

    KW - IR-96796

    U2 - 10.1145/2740908.2743003

    DO - 10.1145/2740908.2743003

    M3 - Conference contribution

    SN - 978-1-4503-3473-0

    SP - 841

    EP - 846

    BT - WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web

    PB - Association for Computing Machinery (ACM)

    CY - New York

    ER -

    Bloom N, Theune M, de Jong FMG. Document categorization using multilingual associative networks based on Wikipedia. In WWW 2015 Companion. Proceedings of the 24th International Conference on World Wide Web. New York: Association for Computing Machinery (ACM). 2015. p. 841-846 https://doi.org/10.1145/2740908.2743003