Computational Sociolinguistics: A Survey

Dong Nguyen, A. Seza Doğruöz, Carolyn P. Rosé, Franciska de Jong

    Research output: Contribution to journal › Article › Academic › peer-review

    140 Citations (Scopus)
    283 Downloads (Pure)


    Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.
    Original language: English
    Pages (from-to): 537-593
    Number of pages: 57
    Journal: Computational Linguistics
    Issue number: 3
    Publication status: Published - Sept 2016


    • Social media
    • Sociolinguistics
    • Computational social science
    • CR-I.2.7
    • Computational linguistics

