(Paper Reading)Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Introduction Within our approach, the bottom-up mechanism (based on Faster R-CNN) proposes image regions, each with an associated feature vector, while the top-down mechanism determines feature weight
相關文章
相關標籤/搜索