Computational Sociolinguistics: A Survey

Dong-Phuong Nguyen, A. Seza Doğruöz, Carolyn P. Rosé, Franciska M.G. de Jong

    Research output: Contribution to journal > Article > Academic > peer-review

    Abstract

    Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.
    Original language: Undefined
    Pages (from-to): 537-593
    Number of pages: 57
    Journal: Computational linguistics
    Volume: 42
    Issue number: 3
    DOI: https://doi.org/10.1162/COLI_a_00258
    Publication status: Published - Sep 2016

    Keywords

    • Social Media
    • Sociolinguistics
    • computational social science
    • METIS-320914
    • CR-I.2.7
    • Computational Linguistics
    • IR-102941
    • EWI-27517

    Cite this

    Nguyen, Dong-Phuong; Doğruöz, A. Seza; Rosé, Carolyn P.; de Jong, Franciska M.G. / Computational Sociolinguistics: A Survey. In: Computational linguistics. 2016; Vol. 42, No. 3. pp. 537-593.
    @article{d23ec10f52014f5e90d9b62af37d1ae8,
    title = "Computational Sociolinguistics: A Survey",
    abstract = "Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.",
    keywords = "Social Media, Sociolinguistics, computational social science, METIS-320914, CR-I.2.7, Computational Linguistics, IR-102941, EWI-27517",
    author = "Dong-Phuong Nguyen and Doğru{\"o}z, {A. Seza} and Ros{\'e}, {Carolyn P.} and {de Jong}, {Franciska M.G.}",
    note = "eemcs-eprint-27517",
    year = "2016",
    month = "9",
    doi = "10.1162/COLI_a_00258",
    language = "Undefined",
    volume = "42",
    pages = "537--593",
    journal = "Computational linguistics",
    issn = "0891-2017",
    publisher = "MIT Press Journals",
    number = "3",

    }

    Nguyen, D-P, Doğruöz, AS, Rosé, CP & de Jong, FMG 2016, 'Computational Sociolinguistics: A Survey', Computational linguistics, vol. 42, no. 3, pp. 537-593. https://doi.org/10.1162/COLI_a_00258

    Computational Sociolinguistics: A Survey. / Nguyen, Dong-Phuong; Doğruöz, A. Seza; Rosé, Carolyn P.; de Jong, Franciska M.G.

    In: Computational linguistics, Vol. 42, No. 3, 09.2016, p. 537-593.

    TY - JOUR

    T1 - Computational Sociolinguistics: A Survey

    AU - Nguyen, Dong-Phuong

    AU - Doğruöz, A. Seza

    AU - Rosé, Carolyn P.

    AU - de Jong, Franciska M.G.

    N1 - eemcs-eprint-27517

    PY - 2016/9

    Y1 - 2016/9

    AB - Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.

    KW - Social Media

    KW - Sociolinguistics

    KW - computational social science

    KW - METIS-320914

    KW - CR-I.2.7

    KW - Computational Linguistics

    KW - IR-102941

    KW - EWI-27517

    U2 - 10.1162/COLI_a_00258

    DO - 10.1162/COLI_a_00258

    M3 - Article

    VL - 42

    SP - 537

    EP - 593

    JO - Computational linguistics

    JF - Computational linguistics

    SN - 0891-2017

    IS - 3

    ER -