Feature maps




Why not Linear
- 4 Layers: [784, 256, 256, 256, 10]

335k or 1.3MB

em...
- 486 PC + AT&T DSP32C
Batch Xspa
Gradient Cacheit
etc.io

Receptive Field

Fully connnected

Partial connected

Locally connected

Rethink Linear layer

Fully VS Lovally

Weight sharing



Why call Convolution?

2D Convolution
\[ y(t) = x(t)*h(t) = \int_{-\infty}^{\infty}x(\tau)h(t-\tau)d\tau \]class

Convolution in Computer Vision



CNN on feature maps
