Wu, P. Y. and Mebane, W. R. (2022) “MARMOT: A Deep Learning Framework for Constructing Multimodal Representations for Vision-and-Language Tasks”, Computational Communication Research, 4(1), pp. 275–322. Available at: https://computationalcommunication.org/ccr/article/view/102 (Accessed: 28 March 2024).