Immigrant communities host multilingual speakers who switch across languages and cultures in their daily communication practices. Although there are in-depth linguistic descriptions of code-switching across different multilingual communication settings, there is a need for automatic prediction of code-switching in large datasets. We use emoticons and multi-word expressions as novel features to predict code-switching in a large online discussion forum for the Turkish-Dutch immigrant community in the Netherlands. Our results indicate that multi-word expressions are powerful features to predict code-switching.
|Publisher||Association for Computational Linguistics|
|Workshop||First Workshop on Computational Approaches to Code Switching|
|Period||25/10/14 → 25/10/14|
|Other||25 October 2014|
- Social Media
- natural language processing