ResNet

Paper: Deep Residual Learning for Image Recognition --- Q:Is learning better networks as easy as stacking more layers? A:vanishing/exploding gradients, which is largely addressed by normalization init
相關文章
相關標籤/搜索