docker pull stablebaselines/rl-baselines3-zoo Run script in the docker image: ./scripts/run_docker_cpu.sh python train.py --algo ppo --env CartPole-v1 Tests To run tests, first install pytest, then: make pytest Same for type checking with pytype: ...
PROST: Monte Carlo Tree Search (MCTS) Deep Reinforcement Learning (DQN, PPO, etc.): Popular Reinforcement Learning (RL) algorithms from Stable Baselines and RLlib Symbolic Dynamic Programming: Exact Symbolic regression-based planning and policy evaluationInstallation...
I tried to use the flatten() method to return flattened observations (return obs.flatten(), info) I still get: model.learn(total_timesteps=10000) File "/home/ubuntu/anaconda2/lib/python3.6/site-packages/stable_baselines/ppo1/pposgd_simple.py", line 230, in learn seg = seg_gen.__next...