《Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models》

時間 2021-01-02

原文原文鏈接

來源：CVPR2018 一、Introduction 第一篇同時利用GAN和Reinforcement Learning(RL)做跨媒體檢索的文章。這個網絡可以同時做三個跨媒體的任務：cross-media retrieval，image caption and text-to-image synthesis（對於後兩個任務，文章只給出了可視化的結果，沒有給出定量的分析）。這篇文章發表在CVP

>>阅读原文<<