1. Wu PY, Mebane WR. MARMOT: A Deep Learning Framework for Constructing Multimodal Representations for Vision-and-Language Tasks. CCR [Internet]. 2022 May 3 [cited 2024 Nov 21];4(1):275-322. Available from: https://computationalcommunication.org/ccr/article/view/102