TensorFlow經常使用的優化器

時間 2019-11-21

標籤 tensorflow 經常使用優化简体版

原文原文鏈接

簡介

目前TensorFlow支持11種不一樣的經典優化器（參考TensorFlow API tf.train文檔）python

下面重點介紹 tf.train.GradientDescentOptimizer、tf.train.MomentumOptimizer、tf.train.AdamOptimizer算法

這個優化器主要實現的是 梯度降低算法api

實現 動量梯度降低算法，可參考簡述動量Momentum梯度降低.net

其中，即momentum，表示要在多大程度上保留原來的更新方向，這個值在0-1之間，在訓練開始時，因爲梯度可能會很大，因此初始值通常選爲0.5；當梯度不那麼大時，改成0.9。是學習率，即當前batch的梯度多大程度上影響最終更新方向，跟普通的SGD含義相同。與 code

之和不必定爲1。

實現 Adam優化算法（ Adam 這個名字來源於 adaptive moment estimation，自適應矩估計。）

learning_rate: （學習率）張量或者浮點數
beta1: 浮點數或者常量張量，表示 The exponential decay rate for the 1st moment estimates.
beta2: 浮點數或者常量張量，表示 The exponential decay rate for the 2nd moment estimates.
epsilon: A small constant for numerical stability. This epsilon is "epsilon hat" in the Kingma and Ba paper (in the formula just before Section 2.1), not the epsilon in Algorithm 1 of the paper.
use_locking: 爲True時鎖定更新
name: 梯度降低名稱，默認爲 "Adam".

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。