FitNets: Hints for Thin Deep Nets

時間 2020-12-24

標籤 Knowledge Distillation 简体版

原文原文鏈接

其實應該先早點寫這篇文章的這篇文章主要是將hinton的output distillation擴展到了feature distillation 該loss用來拉進student和teacher feature的距離該loss就是與hard label、soft label做cross entroy 訓練過程需要注意：先進行hints training，即選擇某一層feature對齊後，利用H

>>阅读原文<<