pytorch 顯示網絡結構，顯示優化器的設置代碼，

時間 2020-12-30

原文原文鏈接

最近注意到在一些caffe模型中，偏置項的學習率通常設置爲普通層的兩倍。具體原因可以參考(https://datascience.stackexchange.com/questions/23549/why-is-the-learning-rate-for-the-bias-usually-twice-as-large-as-the-the-lr-for-t)，貌似沒有文章提到這個。 Pytorch

>>阅读原文<<