Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor 2019-07-15 22:23:02 Paper: https://arxiv.org/pdf/1801.01290.pdf or Updated Version:&...