Author: ishank-juneja

Description:
The project compares the sample efficiency of reward-search and reward-shaping in learning an optimal policy.
Language: Python
Repository: git://github.com/ishank-juneja/reward-search-shaping.git
Created: 2019-11-10T06:18:38Z
Project page: https://github.com/ishank-juneja/reward-search-shaping

License:
