"How old do you think I am?": A study of language and age in Twitter

Dong-Phuong Nguyen, Rilana Gravel, Rudolf Berend Trieschnigg, Theo Meder

    Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

    228 Citations (Scopus)
    277 Downloads (Pure)

    Abstract

    In this paper we focus on the connection between age and language use, exploring age prediction of Twitter users based on their tweets. We discuss the construction of a fine-grained annotation effort to assign ages and life stages to Twitter users. Using this dataset, we explore age prediction in three different ways: classifying users into age categories, by life stages, and predicting their exact age. We find that an automatic system achieves better performance than humans on these tasks and that both humans and the automatic systems have difficulties predicting the age of older people. Moreover, we present a detailed analysis of variables that change with age. We find strong patterns of change, and that most changes occur at young ages.
    Original languageEnglish
    Title of host publicationProceedings of the Seventh International AAAI Conference on Weblogs and Social Media, ICWSM 2013
    Place of PublicationPalo Alto, CA, USA
    PublisherAAAI
    Pages439-448
    Number of pages10
    ISBN (Print)978-1-57735-610-3
    Publication statusPublished - Jul 2013
    Event7th International AAAI Conference on Weblogs and Social Media, ICWSM 2013 - Cambridge, United States
    Duration: 8 Jul 201310 Jul 2013
    Conference number: 7
    http://www.icwsm.org/2013/

    Conference

    Conference7th International AAAI Conference on Weblogs and Social Media, ICWSM 2013
    Abbreviated titleICWSM
    Country/TerritoryUnited States
    CityCambridge
    Period8/07/1310/07/13
    Internet address

    Keywords

    • EWI-23604
    • Twitter
    • METIS-297788
    • IR-87215
    • Age
    • Sociolinguistics

    Fingerprint

    Dive into the research topics of '"How old do you think I am?": A study of language and age in Twitter'. Together they form a unique fingerprint.

    Cite this