論文-《MUREL: Multimodal Relational Reasoning for Visual Question Answering Remi》重點翻譯+擴展

  Multimodal attentional networks are currently state-of-the-art models for Visual Question Answering (VQA) tasks involving real images. 多模態注意力網絡是目前最先進的涉及真實圖像的VQA任務模型。   In this paper, we propose MuRe
相關文章
相關標籤/搜索