Abstract
We combine social theory and NLP methods to classify English-speaking Twitter users’ online social identity in profile descriptions. We conduct two text classification experiments. In Experiment 1 we use a 5-category online social identity classification based on identity and self-categorization theories. While we are able to automatically classify two identity categories (Relational and Occupational), automatic classification of the other three identities (Political, Ethnic/religious and Stigmatized) is challenging. In Experiment 2 we test a merger of such identities based on theoretical arguments. We find that by combining these identities we can improve the predictive performance of the classifiers in the experiment. Our study shows how social theory can be used to guide NLP methods, and how such methods provide input to revisit traditional social theory that is strongly consolidated in offline setting
Original language | English |
---|---|
Title of host publication | Workshop on Natural Language Processing and Computational Social Science |
Subtitle of host publication | NLP+CSS, EMNLP |
Place of Publication | Austin, Texas |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 55-65 |
Number of pages | 11 |
ISBN (Print) | 978-1-945626-26-5 |
DOIs | |
Publication status | Published - Nov 2016 |
Event | EMNLP 2016, Workshop on Natural Language Processing and Computational Social Science: EMNLP 2016, Workshop on Natural Language Processing and Computational Social Science: proceedings of the workshop - Austin, Texas Duration: 5 Nov 2016 → … |
Conference
Conference | EMNLP 2016, Workshop on Natural Language Processing and Computational Social Science |
---|---|
City | Austin, Texas |
Period | 5/11/16 → … |
Keywords
- EWI-27810
- IR-102356
- METIS-319140