项目作者: snowyukischnee

项目描述 :
Solving OpenAI Pendulum-v0 using PPO
高级语言: Python
项目地址: git://github.com/snowyukischnee/Pendulum_PPO.git
创建时间: 2019-01-05T11:28:38Z
项目社区:https://github.com/snowyukischnee/Pendulum_PPO

开源协议:MIT License

下载


Pendulum_PPO

Solving OpenAI Pendulum-v0 using Proximal Policy Optimization Algorithms

Quickstarted

Run this command to use the pretrained model to play the game

  1. >python pendulum.py play

Or run this command to train the model

  1. >python pendulum.py anything-(not-play)

Notes

The model in pendulum.py was able to solved Pendulum-v0 after about 110 episodes

Total rewards in 140 steps of traing:
Total rewards in 140 steps of traing


You’re free to edit the model hyperparameters and some constansts to make it better

Contributions

Special thanks to Morvan Zhou for the explanation of the PPO