grafzahl

fine-tuning Transformers for text data from within R

Keywords: machine learning, transformers, R, Python, automated content analysis

Abstract

This paper introduces `grafzahl`, an R package for fine-tuning Transformers for text data from within R. The package combines the ease of use of the `quanteda` R ecosystem with the state-of-the-art `Transformers` Python library. The package is used in this paper to reproduce analyses from communication papers, as well as analyses of non-Germanic benchmark datasets. A substantial improvement in model accuracy over traditional machine learning approaches, such as convolutional neural networks, is observed. `grafzahl` might help bring Transformer-based machine learning methods into the mainstream of communication research and beyond.

Published
2023-04-24
How to Cite
Chan, C.-H. (2023). grafzahl. Computational Communication Research, 5(1), 76-84. Retrieved from https://computationalcommunication.org/ccr/article/view/178
Section
Articles