Project author: WinDerek

Project description:
Reinforcement learning agents in Python (dynamic programming, temporal-difference, deep Q-learning, stochastic/deterministic policy gradients)
Primary language: Jupyter Notebook
Repository URL: git://github.com/WinDerek/reinforce-py.git
Created: 2020-05-30T15:49:19Z
Project community: https://github.com/WinDerek/reinforce-py

License: Apache License 2.0
