项目作者: acyclics

项目描述 :
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
高级语言: Python
项目地址: git://github.com/acyclics/MPO.git
创建时间: 2020-09-03T15:38:34Z
项目社区:https://github.com/acyclics/MPO

开源协议:

下载