Paper Reading: Neural Machine Translation by Jointly Learning to Align and Translate

Date: 2019-11-11



These are reading notes on the paper "NEURAL MACHINE TRANSLATION BY JOINTLY LEARNING TO ALIGN AND TRANSLATE", published at ICLR in 2015.

ABSTRACT

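NMT (neural machine translation) is a widely studied problem that has seen a good deal of recent progress.
Back to this paper: at the time, most approaches to NMT were built on the encoder-decoder framework, which works well and performs strongly in many areas. However, the encoder compresses the input into a single fixed-length vector, and this becomes a performance bottleneck. The model proposed in this paper instead searches the input automatically during translation for the parts that are relevant to the target word being predicted, and uses them to inform the prediction. This is the core idea of the method.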

Here is how the paper itself puts it:

In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder–decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
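
To make the "(soft-)search" idea concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the additive scoring form v · tanh(W s + U h_j) follows the alignment model given later in the paper, while the function names, the toy vectors, and the identity weight matrices are all made up for this example. In the real model W, U, and v are learned jointly with the rest of the network.

```python
import math

def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def soft_attention(prev_state, annotations, W, U, v):
    """Score every source annotation against the decoder state,
    normalize the scores, and return (weights, context vector)."""
    Ws = matvec(W, prev_state)
    scores = []
    for h in annotations:
        Uh = matvec(U, h)
        hidden = [math.tanh(a + b) for a, b in zip(Ws, Uh)]
        scores.append(sum(vi * hi for vi, hi in zip(v, hidden)))
    weights = softmax(scores)  # one soft weight per source position
    dim = len(annotations[0])
    # The context is a weighted sum of annotations, not a hard selection.
    context = [sum(w * h[k] for w, h in zip(weights, annotations))
               for k in range(dim)]
    return weights, context

# Toy inputs (hypothetical values chosen only for illustration).
annotations = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # one vector per source word
prev_state = [0.5, -0.5]                            # previous decoder state
W = [[1.0, 0.0], [0.0, 1.0]]
U = [[1.0, 0.0], [0.0, 1.0]]
v = [1.0, 1.0]

weights, context = soft_attention(prev_state, annotations, W, U, v)
print(weights, context)
```

Because the weights come from a softmax, every source position contributes a little to the context vector. This is what "without having to form these parts as a hard segment explicitly" means: the decoder gets a fresh, input-dependent context at each step instead of one fixed-length vector for the whole sentence.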