Mastering Complex Control in MOBA Games with Deep Reinforcement Learning (Paper Notes)

Date: 2020-12-24


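This paper was published by Tencent AI Lab and TiMi. It describes training an AI for 1v1 matches in Honor of Kings (王者榮耀) and reports a 99.81% win rate. These notes go through the paper in several parts.

System: the overall framework is divided into four modules: RL Learner, AI Server, Dispatch Module, and Memory Pool, as shown in the figure below.

AI Server: this module uses the current agent to interact with the game environment and collect data. Each AI Server is bound to one CPU, and the agent is copied onto that CPU, for…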
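The data flow between the four modules can be illustrated with a toy sketch. The notes above only describe the AI Server (it rolls out the current agent against the environment to collect data); the roles given here to the Dispatch Module (forwarding samples), the Memory Pool (a bounded FIFO buffer), and the RL Learner (consuming batches) are assumptions inferred from the module names, and `ToyEnv`, `Sample`, `dispatch`, and `learner_step` are hypothetical names, not the paper's implementation.

```python
import random
from collections import deque
from dataclasses import dataclass


@dataclass
class Sample:
    state: int
    action: int
    reward: float


class ToyEnv:
    """Hypothetical stand-in for the game environment."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0
        done = self.t >= self.horizon
        return self.t, reward, done


class AIServer:
    """Rolls out a local copy of the agent against the environment."""

    def __init__(self, policy, env):
        self.policy = policy  # local copy of the current agent
        self.env = env

    def collect_episode(self):
        samples = []
        state, done = self.env.reset(), False
        while not done:
            action = self.policy(state)
            next_state, reward, done = self.env.step(action)
            samples.append(Sample(state, action, reward))
            state = next_state
        return samples


class MemoryPool:
    """Assumption: a bounded FIFO buffer that drops the oldest samples."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def add(self, samples):
        self.buffer.extend(samples)

    def sample(self, batch_size, rng):
        k = min(batch_size, len(self.buffer))
        return rng.sample(list(self.buffer), k)


def dispatch(servers, pool):
    """Assumed role: forward samples from every AI Server to the pool."""
    for server in servers:
        pool.add(server.collect_episode())


def learner_step(pool, batch_size, rng):
    """Placeholder for the RL Learner: returns the mean reward of a batch."""
    batch = pool.sample(batch_size, rng)
    return sum(s.reward for s in batch) / len(batch)


rng = random.Random(0)
policy = lambda state: state % 2  # trivial hypothetical policy
servers = [AIServer(policy, ToyEnv()) for _ in range(2)]
pool = MemoryPool(capacity=8)
dispatch(servers, pool)
mean_reward = learner_step(pool, batch_size=4, rng=rng)
```

In this sketch the learner step only averages rewards; a real learner would compute a policy-gradient update and push the new parameters back to each AI Server.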
