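DSAC-T: Distributional Soft Actor-Critic with Three Refinements. Jingliang Duan, Wenxuan Wang, Liming Xiao, Jiaxin Gao, and Shengbo Eben Li∗. Tsinghua University. IEEE Computational Intelligence Magazine (SCI Zone 2, Computer Science). I. Introduction: In recent years, reinforcement learning has achieved great success in complex decision-making and control tasks; combined with high-capacity function approximators such as neural networks...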
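1) Expected value substituting improves the stability and efficiency of learning; 2) variance-based critic gradient adjusting reduces the need for manual hyperparameter tuning and makes the algorithm more general; 3) twin value distribution learning further suppresses overestimation. We propose a state-of-the-art model-free algorithm that effectively suppresses overestimation in RL and achieves a substantial performance gain. Our work can...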
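As a rough illustration of the third refinement, the sketch below shows one way twin critics are commonly used to curb overestimation: keep two return-distribution estimates and build the target from the one with the smaller mean. The Gaussian parameterization, the `ReturnDist` type, and the `select_target_distribution` helper are assumptions made for illustration; none of them is taken from the snippet or from the authors' code.

```python
from typing import NamedTuple


class ReturnDist(NamedTuple):
    """Hypothetical Gaussian summary of a critic's return distribution."""
    mean: float
    std: float


def select_target_distribution(d1: ReturnDist, d2: ReturnDist) -> ReturnDist:
    """Return the estimate with the smaller mean (the more pessimistic critic).

    Assumption: pessimism is applied to the mean only; the standard
    deviation of the chosen critic is carried along unchanged.
    """
    return d1 if d1.mean <= d2.mean else d2


critic_a = ReturnDist(mean=1.0, std=0.5)
critic_b = ReturnDist(mean=0.8, std=2.0)
target = select_target_distribution(critic_a, critic_b)
```

Here `target` is `critic_b`, because its mean is lower even though its spread is larger.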
DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic. Repository: Jingliang-Duan/DSAC-v2 (branch main; 258 stars, 26 forks). Latest commit by drsssssss: Merge pull request #11 from Kirikirito/main, 81cc1a4, Nov 7, 2024 ...
This paper proposes a reinforcement-learning-based decision-making method under a framework of offline training and online correction, called the Shielded Distributional Soft Actor-critic (Shielded DSAC). The Shielded DSAC adopts the policy evaluation with safety considerations in offline training, and ...
Distributional Soft Actor Critic for Risk Sensitive Learning. Most reinforcement learning (RL) algorithms aim at maximizing the expectation of accumulated discounted returns. Since the accumulated discounted return is a random variable, its distribution includes more information than its expecta... X Ma...
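To make the point about the return distribution concrete, here is a minimal sketch comparing two sets of sampled returns that share the same mean but differ in their lower tail. The sample values, the `cvar` helper, and the choice of conditional value-at-risk as the risk measure are illustrative assumptions; the snippet does not say which measure the paper uses.

```python
from statistics import mean


def cvar(returns, alpha):
    """Average of the worst alpha-fraction of sampled returns (lower tail)."""
    ordered = sorted(returns)
    k = max(1, int(len(ordered) * alpha))  # use at least one sample
    return mean(ordered[:k])


# Hypothetical sampled returns for two policies.
safe = [9.0, 10.0, 10.0, 11.0]
risky = [-10.0, 10.0, 10.0, 30.0]

same_expectation = mean(safe) == mean(risky)  # both average 10.0
safe_tail = cvar(safe, 0.25)                  # worst quarter of `safe`
risky_tail = cvar(risky, 0.25)                # worst quarter of `risky`
```

An expectation-only objective cannot separate the two policies, while a tail statistic computed from the distribution can.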
In this paper, we introduce the minimax formulation and distributional framework to improve the generalization ability of RL algorithms and develop the Minimax Distributional Soft Actor-Critic (Minimax DSAC) algorithm. Minimax formulation aims to seek optimal policy considering the most serious ...
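In recent years, reinforcement learning has achieved great success in applications such as Go and video games. However, in existing algorithms, value-function approximation errors during learning cause severe overestimation, which greatly degrades policy performance. The Intelligent Driving Lab (iDLab) proposed the Distributional Soft Actor-Critic (DSAC) algorithm, which reduces overestimation by learning a continuous state-action return distribution to dynamically adjust the Q...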
Distributional Soft Actor-Critic (DSAC); Distributional Soft Actor-Critic with Three Refinements (DSAC-T). Requirements: Windows 7 or greater, or Linux; Python 3.8. The installation path must be in English. Installation: # Please make sure not to include Chinese characters in the installation path, as it...
Distributional stochastic Planner-Actor-Critic for deformable image registration. Du, Jianing; Chang, Qing; Lian, Li. International Journal of Imaging Systems & Technology. doi:10.1002/ima.23109
DSAC; Distributional Soft Actor-Critic. Repository: Jingliang-Duan/DSAC-v1 on GitHub.