项目作者: dtiarks

项目描述 :
Collection of policy gradient based RL agents
高级语言: Python
项目地址: git://github.com/dtiarks/pg_agents.git
创建时间: 2017-07-06T20:14:10Z
项目社区:https://github.com/dtiarks/pg_agents

开源协议:MIT License

下载


PG Agents: Policy Gradient Algorithms with Tensorflow

The idea behind pg_agents is to provide an easy to understand python package containing the state the art policy gradient algorithms.

Implemented algorithms

  • VPG: Vanilla Policy Gradient Also known as REINFORCE

  • TNPG: Truncated Natural Policy Gradient Reformulation of the batch RL problem in terms of a contrained optimization problem

  • TRPO: Trust Region Policy Optimization Extension of TNPG to ensure robustness

  • GAE: Generalized Advantage Estimator Method to estimate the advantage function from experience. Helps to reduce the variance of the gradient estimator.

  • PPO: Proximal Policy Optimization Simple but efficient extension of VPG.

Examples

  • Robotschool Ant

Ant