项目作者: ghanzouri

项目描述 :
Posterior Goal Sampling for Hierarchical Reinforcement Learning
高级语言: Python
项目地址: git://github.com/ghanzouri/CS330-Posterior-Goal-Sampling-for-Hierarchical-Reinforcement-Learning.git


CS330 Posterior Goal Sampling for Hierarchical Reinforcement Learning

RL PyTorch

Original Github repository : Deep Reinforcement Learning Algorithms with PyTorch.

This work was done as a class project for CS 330 : Deep Multi-Task and Meta Learning. It is a replication of Hierarchical-DQN (h-DQN), (Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation (Kulkarni et al. 2016) ) on the Long Corridor Game environment.

Modifications

The work was done on the h-DQN algorithm and provided a modified version that includes a Hierarchical-Ensemble DQN RL for efficient high-level policy learning. The code is present in agents/hierarchical_agents/bh_agents.py.

We also included a new function pick_actions2() in agents/DQN_agents/DQN.py that allows goal sampling among all Q heads functions.

Execution

  1. python results/Long_Corridor.py

Usage

The repository’s high-level structure is:

  1. ├── agents
  2. ├── DQN_agents
  3. ├── actor_critic_agents
  4. ├── hierarchical_agents
  5. └── policy_gradient_agents
  6. ├── environments
  7. ├── exploration_strategies
  8. ├── results
  9. └── data_and_graphs
  10. ├── tests
  11. ├── utilities
  12. └── data structures