Deep Reinforcement Learning

時間 2019-11-06

標籤 deep reinforcement learning 简体版

原文原文鏈接

Reinforcement-Learning-Introduction-Adaptive-Computationhtml

http://incompleteideas.net/book/bookdraft2017nov5.pdfweb

http://incompleteideas.net/book/ebook/the-book.html算法

https://www.amazon.com/Reinforcement-Learning-Introduction-Adaptive-Computation/dp/0262193981shell

https://orbi.ulg.ac.be/bitstream/2268/27963/1/book-FA-RL-DP.pdfc#

http://videolectures.net/deeplearning2017_montreal/api

http://www.clipconverter.cc/async

Reinforcement Learning--David Silveride

http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.htmlpost

https://www.youtube.com/watch?v=2pWv7GOvuf0學習

COMBINING POLICY GRADIENT AND Q-LEARNING

https://arxiv.org/pdf/1611.01626.pdf

https://www.quora.com/Whats-the-difference-between-reinforcement-Learning-and-Deep-learning

https://stats.stackexchange.com/questions/144154/supervised-learning-unsupervised-learning-and-reinforcement-learning-workflow

https://www.quora.com/What-is-the-difference-between-supervised-unsupervised-reinforcement-and-deep-learning

https://www.quora.com/Is-reinforcement-learning-the-combination-of-unsupervised-learning-and-supervised-learning

https://www.quora.com/What-is-the-difference-between-supervised-unsupervised-reinforcement-and-deep-learning

https://www.oreilly.com/ideas/reinforcement-learning-for-complex-goals-using-tensorflow

https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-6-partial-observability-and-deep-recurrent-q-68463e9aeefc

https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-0-q-learning-with-tables-and-neural-networks-d195264329d0

最前沿：深度學習訓練方法大革新，反向傳播訓練再也不惟一

https://zhuanlan.zhihu.com/p/22143664

最前沿：讓計算機學會學習Let Computers Learn to Learn

https://zhuanlan.zhihu.com/p/21362413?refer=intelligentunit

深度加強學習之Policy Gradient方法1

https://zhuanlan.zhihu.com/p/21725498

https://deepmind.com/blog/#decoupled-neural-interfaces-using-synthetic-gradients

ore from my Simple Reinforcement Learning with Tensorflow series:

https://keon.io/deep-q-learning/

Human-level control through deep reinforcement learning

https://storage.googleapis.com/deepmind-media/dqn/DQNNaturePaper.pdf

http://rll.berkeley.edu/deeprlcourse/

https://bcourses.berkeley.edu/courses/1453965/pages/cs294-129-designing-visualizing-and-understanding-deep-neural-networks

https://cs.stanford.edu/people/karpathy/convnetjs/demo/rldemo.html

如何用簡單例子講解 Q - learning 的具體過程？

https://www.zhihu.com/question/26408259

https://deeplearning4j.org/reinforcementlearning.html

https://deeplearning4j.org/neuralnet-overview.html

https://devblogs.nvidia.com/parallelforall/deep-learning-nutshell-reinforcement-learning/

https://medium.com/beyond-intelligence/reinforcement-learning-or-evolutionary-strategies-nature-has-a-solution-both-8bc80db539b3

https://medium.com/ai-society/my-first-experience-with-deep-reinforcement-learning-1743594f0361

https://medium.com/emergent-future/simple-reinforcement-learning-with-tensorflow-part-0-q-learning-with-tables-and-neural-networks-d195264329d0

http://neuro.cs.ut.ee/demystifying-deep-reinforcement-learning/

More from my Simple Reinforcement Learning with Tensorflow series:

Part 0 — Q-Learning Agents
Part 1 — Two-Armed Bandit
Part 1.5 — Contextual Bandits
Part 2 — Policy-Based Agents
Part 3 — Model-Based RL
Part 4 — Deep Q-Networks and Beyond
Part 5 — Visualizing an Agent’s Thoughts and Actions
Part 6 — Partial Observability and Deep Recurrent Q-Networks
Part 7 — Action-Selection Strategies for Exploration
Part 8 — Asynchronous Actor-Critic Agents (A3C)

Deep Reinforcement Learning 深度加強學習資源 (持續更新）

https://zhuanlan.zhihu.com/p/20885568

深度解讀AlphaGo

https://zhuanlan.zhihu.com/p/20893777

深度學習論文閱讀路線圖 Deep Learning Papers Reading Roadmap

https://zhuanlan.zhihu.com/p/23080129

ICLR 2017 DRL相關論文

https://zhuanlan.zhihu.com/p/23807875

https://www.intelnervana.com/demystifying-deep-reinforcement-learning/

http://www.jmlr.org/papers/volume6/murphy05a/murphy05a.pdf

https://deepmind.com/research/publications/

https://deepmind.com/blog/alphago-zero-learning-scratch/

Mastering the Game of Go without Human Knowledge

https://www.nature.com/articles/doi:10.1038/nature24270

https://en.wikipedia.org/wiki/State%E2%80%93action%E2%80%93reward%E2%80%93state%E2%80%93action

DQN 從入門到放棄1 DQN與加強學習

https://zhuanlan.zhihu.com/p/21262246?refer=intelligentunit

DQN 從入門到放棄4 動態規劃與Q-Learning

https://zhuanlan.zhihu.com/p/21378532?refer=intelligentunit

DQN從入門到放棄5 深度解讀DQN算法

https://zhuanlan.zhihu.com/p/21421729

強化學習系列之九:Deep Q Network (DQN)

http://www.algorithmdog.com/drl

相關標籤/搜索

每日一句

每一个你不满意的现在，都有一个你没有努力的曾经。