深度神經網絡加速和壓縮

模型加速與壓縮方法分類總結 • Low-Rank • Pruning • Quantization • Knowledge Distillation • Compact Network Design   Low-Rank Previous low-rank based methods: • SVD - Zhang et al., 「Accelerating Very Deep Convolutio
相關文章
相關標籤/搜索