项目作者： snowyukischnee

项目描述：
Solving OpenAI Pendulum-v0 using PPO

高级语言： Python

项目主页：

项目地址: git://github.com/snowyukischnee/Pendulum_PPO.git

创建时间： 2019-01-05T11:28:38Z
项目社区：https://github.com/snowyukischnee/Pendulum_PPO
开源协议：MIT License
下载

Pendulum_PPO

Quickstarted

Run this command to use the pretrained model to play the game

>python pendulum.py play

Or run this command to train the model

>python pendulum.py anything-(not-play)

The model in pendulum.py was able to solved Pendulum-v0 after about 110 episodes

Total rewards in 140 steps of traing:

You’re free to edit the model hyperparameters and some constansts to make it better

Special thanks to Morvan Zhou for the explanation of the PPO