CVPR 2020: Hierarchical Conditional Relation Networks for Video Question Answering

Abstract: Problem: Video question answering (VideoQA) is challenging because it requires the modeling capacity to distill dynamic visual artifacts and distant relations, and to associate them with linguistic concepts.