cs231n-notes-Lecture-7:各種優化方法介紹與比較

Lecture-7 Training Neural Networks Optimization SGD Cons Very slow progress along shallow dimension, jitter along steep direction. 2. local minima or saddle point. Saddle points are much more common i
相關文章
相關標籤/搜索