CMU 11-785 L07 Optimizers and regularizers

Optimizers Momentum and Nestorov’s method improve convergence by normalizing the mean (first moment) of the derivatives Considering the second moments RMS Prop / Adagrad / AdaDelta / ADAM1 Simple grad
相關文章
相關標籤/搜索