grafzahl

fine-tuning Transformers for text data from within R

Keywords:

machine learning, transformers, r, python, automated content analysis

Abstract

This paper introduces `grafzahl`, an R package for fine-tuning Transformers for text data from within R. The package combines the ease of use of the `quanteda` R ecosystem with the state-of-the-art `Transformers` Python library. In this paper, the package is used to reproduce analyses from communication papers or of non-Germanic benchmark datasets. A substantial improvement in model accuracy is observed over traditional machine learning approaches such as the Convolutional Neural Network. `grafzahl` might play a role in bringing Transformer-based machine learning methods into the mainstream of communication research and beyond.

Published

2023-04-24

How to Cite

Chan, C.-hong. (2023). grafzahl: fine-tuning Transformers for text data from within R. Computational Communication Research, 5(1), 76–84. Retrieved from https://computationalcommunication.org/ccr/article/view/178

Section

Articles