Wu, Patrick Y., and Walter R. Mebane. 2022. “MARMOT: A Deep Learning Framework for Constructing Multimodal Representations for Vision-and-Language Tasks.” Computational Communication Research 4 (1): 275–322. https://computationalcommunication.org/ccr/article/view/102.