論文閱讀：Learnable pooling with Context Gating for video classification

時間 2020-12-29

標籤視頻简体版

原文原文鏈接

這篇論文是2016年Google Cloud & YouTube-8M Video Understanding Challenge比賽中冠軍得主的論文。文章的兩點貢獻：融合了VLAD, bag-of-visual-words和Fisher Vector三種編碼方式，並且每個都做了一定程度的調整。其中，VLAD改爲NetRVLAD, bag-of-visual-words改爲Soft-DBoW,

>>阅读原文<<