Are We All in a Truman Show? Spotting Instagram Crowdturfing through Self-Training

  • Pier Paolo Tricomi
  • , Sousan Tarahomi
  • , Christian Cattai
  • , Francesco Martini
  • , Mauro Conti

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)
191 Downloads (Pure)

Abstract

Influencer Marketing generated 16 billion in 2022. Usually, the more popular influencers are paid more for their collaborations. Thus, many services were created to boost profiles' popularity metrics through bots or fake accounts. However, real people recently started participating in such boosting activities using their real accounts for monetary rewards, generating ungenuine content that is extremely difficult to detect. To date, no works have attempted to detect this new phenomenon, known as crowdturfing (CT), on Instagram. In this work, we propose the first Instagram CT engagement detector. Our algorithm leverages profiles' characteristics through semi-supervised learning to spot accounts involved in CT activities. Compared to the supervised approaches used so far to identify fake accounts, semi-supervised models can exploit huge quantities of unlabeled data to increase performance. We purchased and studied 1293 CT profiles from 11 providers to build our self-training classifier, which reached 95% F1-score. We tested our model in the wild by detecting and analyzing CT engagement from 20 mega-influencers (i.e., with more than one million followers), and discovered that more than 20 % was artificial. We analyzed the CT profiles and comments, showing that it is difficult to detect these activities based solely on their generated content.

Original languageEnglish
Title of host publicationICCCN 2023 - 2023 32nd International Conference on Computer Communications and Networks
PublisherIEEE
ISBN (Electronic)9798350336184
DOIs
Publication statusPublished - 1 Sept 2023
Event32nd International Conference on Computer Communications and Networks, ICCCN 2023 - Honolulu, United States
Duration: 24 Jul 202327 Jul 2023
Conference number: 32

Publication series

NameProceedings - International Conference on Computer Communications and Networks, ICCCN
Volume2023-July
ISSN (Print)1095-2055

Conference

Conference32nd International Conference on Computer Communications and Networks, ICCCN 2023
Abbreviated titleICCCN 2023
Country/TerritoryUnited States
CityHonolulu
Period24/07/2327/07/23

Keywords

  • 2024 OA procedure
  • Collusion
  • Crowdturfing Detection
  • Fake Accounts
  • Fake Engagement
  • Fake Profiles
  • Instagram
  • Self-Training
  • Semi-Supervised Learning
  • Bot Detection

Fingerprint

Dive into the research topics of 'Are We All in a Truman Show? Spotting Instagram Crowdturfing through Self-Training'. Together they form a unique fingerprint.

Cite this