(Paper Reading) Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Date: 2020-12-30


Introduction

Within our approach, the bottom-up mechanism (based on Faster R-CNN) proposes image regions, each with an associated feature vector, while the top-down mechanism determines feature weight
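
To make the division of labour concrete, here is a minimal sketch of the top-down step, assuming the bottom-up stage has already produced one feature vector per proposed region. The names `attend`, `region_features`, and `query` are hypothetical, and the dot-product scoring is a simplified stand-in chosen for brevity, not the scoring function used in the paper. The toy vectors are illustrative only.

```python
import math


def softmax(scores):
    """Normalise raw scores into weights that sum to 1 (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(region_features, query):
    """Weight region features by their relevance to a task-specific query.

    region_features: list of k feature vectors, one per proposed region
                     (assumed to come from a bottom-up detector).
    query:           context vector from the task, e.g. a question or
                     caption-decoder state (hypothetical here).
    Scoring is a plain dot product, which is an assumption of this sketch.
    """
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in region_features]
    weights = softmax(scores)
    dim = len(region_features[0])
    # Attended feature: weighted sum of the region vectors.
    attended = [
        sum(w * feat[d] for w, feat in zip(weights, region_features))
        for d in range(dim)
    ]
    return weights, attended


# Toy input: three regions with 2-dimensional features.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [2.0, 0.0]
weights, attended = attend(regions, query)
```

With this query, the first and third regions receive equal weight and the second receives less, so the attended vector is dominated by the regions that match the query.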