项目作者: aijunbai

项目描述 :
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
高级语言: C++
项目地址: git://github.com/aijunbai/hplanning.git
创建时间: 2016-01-30T06:10:04Z
项目社区:https://github.com/aijunbai/hplanning

开源协议:Other

下载


hplanning

Build Status

This is the code release of paper:

Dependencies

  • qt4-qmake
  • libboost-dev
  • libboost-program-options-dev

Usages

  • make to build hplanning
  • ./run.sh to run a problem instance with default settings
  • ./debug.sh to build and run using debug version
  • ./release.sh to build and run using release version

Options

Allowed options of hplanning:

  1. --help produce help message
  2. --test run unit tests
  3. --problem arg problem to run
  4. --map arg map to use for (continus) rooms domain
  5. --outputfile arg (=output.txt) summary output file
  6. --size arg size of problem (problem specific)
  7. --number arg number of elements in problem (problem
  8. specific)
  9. --timeout arg timeout (seconds)
  10. --mindoubles arg minimum power of two simulations
  11. --maxdoubles arg maximum power of two simulations
  12. --runs arg number of runs
  13. --accuracy arg accuracy level used to determine horizon
  14. --horizon arg horizon to use when not discounting
  15. --num steps arg number of steps to run when using average
  16. reward
  17. --verbose arg verbosity level
  18. --usetransforms arg Use transforms
  19. --useparticlefilter arg Use particle fileter
  20. --transformdoubles arg Relative power of two for transforms compared
  21. to simulations
  22. --transformattempts arg Number of attempts for each transform
  23. --treeknowledge arg Knowledge level in tree (0=Pure, 1=Legal,
  24. 2=Smart)
  25. --rolloutknowledge arg Knowledge level in rollouts (0=Pure, 1=Legal,
  26. 2=Smart)
  27. --smarttreecount arg Prior count for preferred actions during smart
  28. tree search
  29. --smarttreevalue arg Prior value for preferred actions during smart
  30. tree search
  31. --reusetree arg Reuse tree generated during previous search
  32. --seeding arg Use pid as random seed
  33. --thompsonsampling arg use Thompson Sampling instead of UCB1
  34. --timeoutperaction arg timeout per action (seconds)
  35. --polling arg use polling rollout for hplanning
  36. --stack arg use call stack for hplanning
  37. --localreward arg use local reward
  38. --hplanning arg use hplanning when possible
  39. --actionabstraction arg use hplanning w/ action abstraction when
  40. possible
  41. --memoryless arg find a memoryless policy in hplanning