[coursera/ImprovingDL/week2] Optimization algorithms (summary & questions)

summary

2.1 Mini-batch gradient descent
- batch size = m (batch gradient descent): each iteration processes the whole training set, so a single step takes too long
- batch size = 1 (stochastic gradient descent): loses the speedup from vectorization, since each step handles only one example
- batch size in between: mini-batch gradient descent, the usual choice in practice

2.2 Bias correction
At the beginning of an exponentially weighted average, the running estimate starts from v_0 = 0, so the first few values v_t = beta * v_{t-1} + (1 - beta) * theta_t are biased toward zero. Dividing v_t by (1 - beta^t) corrects this early bias; the correction factor approaches 1 as t grows.
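The mini-batch idea above can be sketched as follows. This is a minimal illustration, not the course's assignment code: the function name `random_mini_batches` and the (n_features, m) data layout are assumptions chosen to match the course's column-per-example convention.

```python
import numpy as np

def random_mini_batches(X, Y, batch_size=64, seed=0):
    """Shuffle the training set, then split it into mini-batches.

    X: data of shape (n_features, m); Y: labels of shape (1, m).
    The last batch may be smaller when m is not divisible by batch_size.
    """
    rng = np.random.default_rng(seed)
    m = X.shape[1]
    perm = rng.permutation(m)          # random reordering of the m examples
    X_shuf, Y_shuf = X[:, perm], Y[:, perm]
    batches = []
    for start in range(0, m, batch_size):
        end = start + batch_size
        batches.append((X_shuf[:, start:end], Y_shuf[:, start:end]))
    return batches

# 10 examples with batch_size=4 -> batches of sizes 4, 4, 2
X = np.arange(20, dtype=float).reshape(2, 10)
Y = np.ones((1, 10))
batches = random_mini_batches(X, Y, batch_size=4)
print([b[0].shape[1] for b in batches])  # [4, 4, 2]
```

With batch_size=m this degenerates to batch gradient descent, and with batch_size=1 to SGD, matching the two extremes described above.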
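The effect of bias correction can be seen on a constant signal: without correction the first estimates are far too small, while the corrected values recover the true level immediately. A minimal sketch (the function name is illustrative):

```python
def ewma_with_bias_correction(xs, beta=0.9):
    """Exponentially weighted moving average, raw and bias-corrected.

    v_t = beta * v_{t-1} + (1 - beta) * x_t, starting from v_0 = 0.
    Bias-corrected estimate: v_t / (1 - beta**t).
    """
    v = 0.0
    raw, corrected = [], []
    for t, x in enumerate(xs, start=1):
        v = beta * v + (1 - beta) * x
        raw.append(v)
        corrected.append(v / (1 - beta ** t))
    return raw, corrected

# For a constant input of 10, the raw average starts at 1.0 (biased low),
# while the corrected average is exactly 10.0 from the first step.
raw, corrected = ewma_with_bias_correction([10.0] * 5, beta=0.9)
print(raw[0], corrected[0])  # 1.0 10.0
```

For constant input x, v_t = x * (1 - beta^t), which is exactly why dividing by (1 - beta^t) removes the startup bias.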