Improved cyberbullying detection using gender information

M. Dadvar, Franciska M.G. de Jong, Roeland J.F. Ordelman, Rudolf Berend Trieschnigg

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

4921 Downloads (Pure)

Abstract

As a result of the invention of social networks, friendships, relationships and social communication are all undergoing changes and new definitions seem to be applicable. One may have hundreds of ‘friends’ without even seeing their faces. Meanwhile, alongside this transition there is increasing evidence that online social applications are used by children and adolescents for bullying. State-of-the-art studies in cyberbullying detection have mainly focused on the content of the conversations while largely ignoring the characteristics of the actors involved in cyberbullying. Social studies on cyberbullying reveal that the written language used by a harasser varies with the author’s features including gender. In this study we used a support vector machine model to train a gender-specific text classifier. We demonstrated that taking gender-specific language features into account improves the discrimination capacity of a classifier to detect cyberbullying.
Original languageUndefined
Title of host publicationProceedings of the Twelfth Dutch-Belgian Information Retrieval Workshop (DIR 2012)
Place of PublicationGhent
PublisherGhent University
Pages23-25
Number of pages4
ISBN (Print)not assigned
Publication statusPublished - 23 Feb 2012
Event12th Dutch-Belgian Information Retrieval Workshop, DIR 2012 - Ghent, Belgium
Duration: 24 Feb 201224 Feb 2012
Conference number: 12

Publication series

Name
PublisherUniversity of Ghent

Workshop

Workshop12th Dutch-Belgian Information Retrieval Workshop, DIR 2012
Abbreviated titleDIR
Country/TerritoryBelgium
CityGhent
Period24/02/1224/02/12

Keywords

  • Gender distinction
  • Cyberharassment
  • IR-79872
  • METIS-285161
  • Support vector machine
  • EC Grant Agreement nr.: FP7/231507
  • Social Networks
  • Text Mining
  • EWI-21608

Cite this