Detecting Impoliteness and Incivility in Online Discussions

Classification Approaches for German User Comments.

Authors

  • Anke Stoll University of Dusseldorf
  • Marc Ziegele Heinrich-Heine-University Düsseldorf
  • Oliver Quiring Johannes Gutenberg University Mainz

DOI:

https://doi.org/10.5117/CCR2020.1.005.KATH

Keywords:

automated content analysis, incivility, impoliteness, text classification, machine learning, user comments, online discussions

Abstract

Impoliteness and incivility in online discussions have recently been discussed as relevant issues in the field of communication science. However, automatically detecting such concepts with computational methods is challenging. In our study, we develop supervised classification models to predict impoliteness and incivility in German user comments. Using a sample of 10,000 hand-coded user comments and a theory-grounded coding scheme, we train and test classifiers based on unigram, bigram, and trigram feature models and on Naïve Bayes and Support Vector Machine algorithms. Our classification models, based on word frequency distributions in user comments, predict both impoliteness and incivility with an accuracy of about 80 percent. The models also reveal predictive features that include obviously offensive language, uncivil rhetoric, and topic and context-related words. Our study thereby contributes both to the understanding of impolite and uncivil communication in user comment sections and to the development and applications of text classification using machine learning.

Downloads

Published

2020-02-03

How to Cite

Stoll, A., Ziegele, M., & Quiring, O. (2020). Detecting Impoliteness and Incivility in Online Discussions: Classification Approaches for German User Comments. Computational Communication Research, 2(1), 109–134. https://doi.org/10.5117/CCR2020.1.005.KATH

Issue

Section

Articles