Computational Sociolinguistics: A Survey

Dong-Phuong Nguyen, A. Seza Doğruöz, Carolyn P. Rosé, Franciska M.G. de Jong

Research output: Contribution to journal, Article, Academic, peer-review

19 Citations (Scopus)
133 Downloads (Pure)

Abstract

Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.
Original language: Undefined
Pages (from-to): 537-593
Number of pages: 57
Journal: Computational linguistics
Volume: 42
Issue number: 3
DOI: https://doi.org/10.1162/COLI_a_00258
Publication status: Published - Sep 2016

Keywords

  • Social Media
  • Sociolinguistics
  • computational social science
  • METIS-320914
  • CR-I.2.7
  • Computational Linguistics
  • IR-102941
  • EWI-27517

Cite this

Nguyen, Dong-Phuong ; Doğruöz, A. Seza ; Rosé, Carolyn P. ; de Jong, Franciska M.G. / Computational Sociolinguistics: A Survey. In: Computational linguistics. 2016 ; Vol. 42, No. 3. pp. 537-593.
@article{d23ec10f52014f5e90d9b62af37d1ae8,
title = "Computational Sociolinguistics: A Survey",
abstract = "Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.",
keywords = "Social Media, Sociolinguistics, computational social science, METIS-320914, CR-I.2.7, Computational Linguistics, IR-102941, EWI-27517",
author = "Nguyen, Dong-Phuong and Do{\u{g}}ru{\"o}z, {A. Seza} and Ros{\'e}, {Carolyn P.} and {de Jong}, {Franciska M.G.}",
note = "eemcs-eprint-27517",
year = "2016",
month = "9",
doi = "10.1162/COLI_a_00258",
language = "Undefined",
volume = "42",
pages = "537--593",
journal = "Computational linguistics",
issn = "0891-2017",
publisher = "MIT Press Journals",
number = "3",
}

Nguyen, D-P, Doğruöz, AS, Rosé, CP & de Jong, FMG 2016, 'Computational Sociolinguistics: A Survey', Computational linguistics, vol. 42, no. 3, pp. 537-593. https://doi.org/10.1162/COLI_a_00258

Computational Sociolinguistics: A Survey. / Nguyen, Dong-Phuong; Doğruöz, A. Seza; Rosé, Carolyn P.; de Jong, Franciska M.G.

In: Computational linguistics, Vol. 42, No. 3, 09.2016, p. 537-593.

TY - JOUR

T1 - Computational Sociolinguistics: A Survey

AU - Nguyen, Dong-Phuong

AU - Doğruöz, A. Seza

AU - Rosé, Carolyn P.

AU - de Jong, Franciska M.G.

N1 - eemcs-eprint-27517

PY - 2016/9

Y1 - 2016/9

N2 - Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.

AB - Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of “computational sociolinguistics” that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction, and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions used in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.

KW - Social Media

KW - Sociolinguistics

KW - computational social science

KW - METIS-320914

KW - CR-I.2.7

KW - Computational Linguistics

KW - IR-102941

KW - EWI-27517

U2 - 10.1162/COLI_a_00258

DO - 10.1162/COLI_a_00258

M3 - Article

VL - 42

SP - 537

EP - 593

JO - Computational linguistics

JF - Computational linguistics

SN - 0891-2017

IS - 3

ER -