Reinforcement Learning in Continuous State and Action Spaces: A Brief Note

時間 2021-01-02

原文原文鏈接

Thanks Hado van Hasselt for the great work. Introduction In the problems of sequential decision making in continuous domains with delayed reward signals, the main purpose for the algorithms is to lear