Imitation Learning

We want RL algorithms that perform optimization, handle delayed consequences, explore, and generalize, and that do all of this in a statistically and computationally efficient way. Generalizati