Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

2019-07-15 22:23:02

Paper: https://arxiv.org/pdf/1801.01290.pdf or Updated Version: https://arxiv.org/pdf/1812.05905.pdf

Project: https://sites.google.com/view/soft-actor-critic or https://sites.google.com/view/sac-and-applications/

TensorFlow: https://github.com/haarnoja/sac

PyTorch: https://github.com/vitchyr/rlkit

Demo video: https://www.youtube.com/channel/UCxXt8Br3-wyluz9Q08-fsaA

Good Related Blog: https://zhuanlan.zhihu.com/p/70360272

==== Video Related Tutorials (A2C, A3C):

A brief review of Actor-Critic Algorithms: 　　https://www.youtube.com/watch?v=aODdNpihRwM

CS885 Lecture 7b: Actor Critic: 　　　　　　 https://www.youtube.com/watch?v=5Ke-d1Itk3k

DRL Lecture 6: Actor-Critic: 　　　　　　　 https://www.youtube.com/watch?v=j82QLgfhFiY&t=27s

Build an A2C agent that learns to play Sonic with Tensorflow (tutorial): 　　https://www.youtube.com/watch?v=GCfUdkCL7FQ

Reinforcement Learning 6: Policy Gradients and Actor Critics (Deep Mind): 　　 https://www.youtube.com/watch?v=bRfUxQs6xIM&t=27s

Actor Critic (A3C) Tutorial: 　　　　　　　　https://www.youtube.com/watch?v=O5BlozCJBSE

Actor Critic Algorithms: 　　　　　　　　　 https://www.youtube.com/watch?v=w_3mmm0P0j8&t=2s

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor的相关教程结束。

《Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.doc》

下载本文的Word格式文档，以方便收藏与打印。