論文筆記--ActionVLAD: Learning spatio-temporal aggregation for action classification

時間 2021-01-02

原文原文鏈接

介紹這是去年CVPR2017的一篇動作分類的文章，用tensorflow實現，有預訓練模型，代碼鏈接如下： http://rohitgirdhar.github.io/ActionVLAD 這篇文章在時空上分別獨立提取特徵，然後做pooling聚合，採用了一種VLAD的pooling方法，端到端的訓練，主要解決兩個疑惑： 1.如何聚合視頻幀之間的特徵來表示整個視頻。 2.在多流網絡中(例如two