論文-《MUREL: Multimodal Relational Reasoning for Visual Question Answering Remi》重點翻譯+擴展

時間 2020-12-25

原文原文鏈接

Multimodal attentional networks are currently state-of-the-art models for Visual Question Answering (VQA) tasks involving real images. 多模態注意力網絡是目前最先進的涉及真實圖像的VQA任務模型。 In this paper, we propose MuRe

>>阅读原文<<