[DL Summary 5] The Transformer Model and Self Attention

Date 2020-02-14

Tags DL Summary 5, transformer, model, self attention

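1 Background

The attention model cannot be parallelized, and it ignores the relationships among words within the input sentence and among words within the target sentence. To address these problems, Google proposed the Transformer model in the 2017 paper "Attention Is All You Need". The Transformer's most distinctive feature is that it abandons the RNN and CNN architectures entirely. The model rests on two main concepts: 1. Self attention (replacing the RNN): handles, among words in the input sentence and among words in the target sentence, the…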
