Analysis and Prediction of Multilingual Controversy on Reddit

Philipp Koncar, Simon Walk, Denis Helic

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Social media users express their opinions about arbitrary subjects, including controversial matters such as the 2020 U.S. presidential election or climate change. Controversial topics typically attract user attention, which often lead to fruitful, but sometimes also heated discussions potentially segregating the community. Understanding features that are predictive of controversy in social media can improve moderation of communities and therefore the public discourse. In this paper, we analyze and predict controversy on the multilingual social platform Reddit. In particular, we compare a large set of textual and user activity features in controversial and non-controversial comments posted in six different languages. Using these features we perform a prediction task and study their predictive strengths for controversy. Our results indicate that, regardless of the language, controversial comments are harder to read, more negative and users follow up faster and more frequently to such comments. Moreover, with our prediction experiment (ROC AUC = 0.79) we find that across all languages user activity is the most predictive of controversy on Reddit. Our results contribute to an improved understanding of controversy in social media and can serve as a foundation for tools and models to automatically detect controversial content posted on such platforms.

Original languageEnglish
Title of host publicationWebSci 2021 - Proceedings of the 13th ACM Web Science Conference
PublisherAssociation of Computing Machinery
Pages215-224
Number of pages10
ISBN (Electronic)9781450383301
DOIs
Publication statusPublished - 21 Jun 2021
Event13th ACM Web Science Conference: WebSci 2021 - Virtual, Online, United Kingdom
Duration: 21 Jun 202125 Jun 2021

Publication series

NameACM International Conference Proceeding Series

Conference

Conference13th ACM Web Science Conference
Country/TerritoryUnited Kingdom
CityVirtual, Online
Period21/06/2125/06/21

Keywords

  • analysis
  • controversy
  • prediction
  • Reddit

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Analysis and Prediction of Multilingual Controversy on Reddit'. Together they form a unique fingerprint.

Cite this