Predicting Code-switching in Multilingual Communication for Immigrant Communities

Evangelos E. Papalexakis, Dong-Phuong Nguyen, A. Seza Doğruöz

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Academic > peer-review

    Abstract

    Immigrant communities host multilingual speakers who switch across languages and cultures in their daily communication practices. Although there are in-depth linguistic descriptions of code-switching across different multilingual communication settings, there is a need for automatic prediction of code-switching in large datasets. We use emoticons and multi-word expressions as novel features to predict code-switching in a large online discussion forum for the Turkish-Dutch immigrant community in the Netherlands. Our results indicate that multi-word expressions are powerful features to predict code-switching.
    Original language: Undefined
    Title of host publication: Proceedings of the First Workshop on Computational Approaches to Code Switching
    Place of Publication: Stroudsburg, PA, USA
    Publisher: Association for Computational Linguistics (ACL)
    Pages: 42-50
    Number of pages: 9
    ISBN (Print): 978-1-937284-96-1
    Publication status: Published - 25 Oct 2014

    Publication series

    Name
    Publisher: Association for Computational Linguistics

    Keywords

    • EWI-25499
    • code-switching
    • METIS-309773
    • Social Media
    • IR-94058
    • natural language processing
