A tool for tracking the propagation of words on Reddit


  • Tom Willaert Vrije Universiteit Brussel
  • Paul Van Eecke Vrije Universiteit Brussel
  • Jeroen Van Soest Vrije Universiteit Brussel
  • Katrien Beuls


language propagation, media, reddit, memetics, digital methods


The data-driven study of cultural information diffusion in online (social) media is currently an active area of research. The availability of web data generates new opportunities to examine how words propagate through online media and communities, and how these diffusion patterns are intertwined with the materiality and culture of social media platforms. In support of such efforts, this paper introduces an online tool for tracking the consecutive occurrences of words across subreddits on Reddit between 2005 and 2017. By processing the full Pushshift.io Reddit comment archive for this period (Baumgartner et al., 2020), we are able to track the first occurrences of 76 million words, allowing us to visualize which subreddits subsequently adopt any of those words over time. We illustrate this approach by addressing the spread of terms referring to famous internet controversies and the percolation of alt-right terminology. By making our instrument and the processed data publicly available, we aim to facilitate a range of exploratory analyses in computational social science, the digital humanities, and related fields.
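The idea of tracking first occurrences per subreddit can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the tokenization rule, and the assumption that each comment is a dictionary with `subreddit`, `created_utc`, and `body` fields are illustrative choices, and a real run over the full archive would need streaming and on-disk storage.

```python
import re
from collections import defaultdict

# Assumed tokenization rule (hypothetical): lowercase runs of letters, digits, apostrophes.
TOKEN = re.compile(r"[a-z0-9']+")


def first_occurrences(comments):
    """Return {word: {subreddit: earliest timestamp at which the word appears there}}.

    `comments` is any iterable of dicts with the assumed keys
    'subreddit', 'created_utc' and 'body'.
    """
    first_seen = defaultdict(dict)
    for comment in comments:
        timestamp = int(comment["created_utc"])
        subreddit = comment["subreddit"]
        # A set avoids re-checking a word repeated within one comment.
        for word in set(TOKEN.findall(comment["body"].lower())):
            previous = first_seen[word].get(subreddit)
            if previous is None or timestamp < previous:
                first_seen[word][subreddit] = timestamp
    return first_seen


def adoption_order(first_seen, word):
    """List (subreddit, first timestamp) pairs for `word`, earliest adopter first."""
    return sorted(first_seen.get(word, {}).items(), key=lambda pair: pair[1])


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    sample = [
        {"subreddit": "gaming", "created_utc": 200, "body": "Examplegate again"},
        {"subreddit": "politics", "created_utc": 100, "body": "examplegate?"},
        {"subreddit": "gaming", "created_utc": 150, "body": "what is examplegate"},
    ]
    print(adoption_order(first_occurrences(sample), "examplegate"))
```

Keeping only the earliest timestamp per word and subreddit makes the result independent of the order in which comments are read, which matters when an archive is processed in separate monthly files.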




How to Cite

Willaert, T., Van Eecke, P., Van Soest, J., & Beuls, K. (2021). A tool for tracking the propagation of words on Reddit. Computational Communication Research, 3(1), 117–132. Retrieved from https://computationalcommunication.org/ccr/article/view/86