Project author: alexBDG

Project description:
Stanford - CS234 Reinforcement Learning - Assignment 2
Language: Python
Repository: git://github.com/alexBDG/RL2.git
Created: 2020-05-11T10:51:07Z
Project community: https://github.com/alexBDG/RL2

License:


RL2

Stanford - CS234 Reinforcement Learning - Assignment 2

Introduction (from Assignment #2 paper)

In Pong, one player scores if the ball passes by the other player. An episode is over when one of the players
reaches 21 points. Thus, the total return of an episode is between -21 (lost every point) and +21 (won every
point). Our agent plays against a decent hard-coded AI player.
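The return bounds above can be checked with a minimal sketch. It assumes a reward of +1 for each point the agent wins and -1 for each point it loses, which is consistent with the stated range but is not spelled out in this README; the function name `play_out` and its parameters are hypothetical.

```python
def play_out(point_rewards, target=21):
    """Tally one Pong episode from a sequence of per-point rewards.

    Assumption (not stated in the README): each point is rewarded
    +1 if the agent wins it and -1 if the opponent wins it.
    The episode stops as soon as either player reaches `target` points.
    Returns (agent_points, opponent_points, total_return).
    """
    agent = 0
    opponent = 0
    for reward in point_rewards:
        if reward > 0:
            agent += 1
        elif reward < 0:
            opponent += 1
        if agent == target or opponent == target:
            break
    # The total return is the sum of the rewards seen, i.e. the point difference.
    return agent, opponent, agent - opponent


if __name__ == "__main__":
    print(play_out([+1] * 21))  # agent wins every point
    print(play_out([-1] * 21))  # agent loses every point
```

Under this assumption, an agent that wins every point ends with a return of +21 and one that loses every point ends with -21, matching the bounds quoted from the assignment.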

Results on Test Environment (described in Assignment #2 paper)

Linear Approximation

  python q2_linear.py

[Figure: q2]

DeepMind’s DQN

  python q3_nature.py

[Figure: q3]

Results on Atari Pong Environment

Linear

  python q4_train_atari_linear.py

[Figure: q4]

DeepMind’s DQN

  python q5_train_atari_nature.py

[Figure: q5]

Python libraries

NumPy, TensorFlow, Gym, collections, pyglet, random, time, sys, logging, matplotlib, os
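
Before running the scripts, it may help to confirm that these libraries can be found in the current environment. The following is a small sketch using only the standard library; the helper name `missing_modules` is hypothetical, and the import names (`numpy`, `tensorflow`, `gym`, and so on) are assumed to be the usual ones for the packages listed. It does not check versions.

```python
import importlib.util

# Import names assumed for the libraries listed in this README.
REQUIRED = [
    "numpy", "tensorflow", "gym", "collections", "pyglet", "random",
    "time", "sys", "logging", "matplotlib", "os",
]


def missing_modules(names):
    """Return the subset of `names` that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing libraries:", ", ".join(missing))
    else:
        print("All listed libraries were found.")
```

Of the names listed, collections, random, time, sys, logging, and os ship with Python itself, so only the remaining packages need to be installed separately.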