將3D卷積分解爲spatial convolution in each channel and linear projection across channels.
(spatial convolution + linear projection.)spa
簡單歸納就是spatial conv + linear projection,可是在spatial conv的時候用了一個residual connection,感受頗有道理,例如是一個vertical edge detector,那麼horizontal information將丟失。這個和後來的MobileNet中的depthwise conv + pointwise conv很是的像。orm