reinforcement-learning humanoids cassie ppo bipedal-robots jvrc-1 Updated May 3, 2024 Python pocketxjl / humanoid-control Star 242 Code Issues Pull requests A humanoid bipedal walking control repo using NMPC and WBC, and simulation with mujoco. Contact: xuejl2001@mail.ustc.edu.cn humanoid...
Code Issues Pull requests BipedalWalker environment from gym, solved with Asynchronous Advantage Actor Critic algorithm using Tensorflow. learning algorithm tensorflow policy actor openai gym policy-gradient gradient a3c reinforcement walker bipedal actor-critic critic a2c Updated Dec 13, 2019 Python low...
The combination of PPO and MLP has already been proved to be feasible in learning robot locomotion policies by multiple research works [11,12,19,20]. Advanced network structures such as LSTM or Transformer can also fit into the proposed framework as policy representation, but more tuning and ...