Skip to content

EHadoux/HS3MDP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

HS3MDP

POMCP enhancement for Hidden Semi-Markovian Mode MDP.

HS3MDP is based on POMCP 1.0 used in the NIPS 2010 paper Online Monte-Carlo Planning in Large POMDPs by David Silver and Joel Veness.

The original code can be found in this repository under the tag POMCP-1.0.

Traffic, Elevator and Sailboat problems are modified versions of those proposed by Samuel Ping-Man Choi in his thesis Reinforcement Learning in Non-stationary Environments.

The presentation on this work, in the 8th joint NII-LIP6 workshop, can be found here.

Requirements

  • git
  • automake, autoconf, etc.
  • C++ Boost

Installation

  1. git clone this repository
  2. run autoreconf -i
  3. run ./configure (possibly with --enable-assert)
  4. run make

You will find the executable pomcp in the src directory.

Usage

Simply run pomcp --help to see all possible parameters.

About

POMCP enhancement for Hidden Semi-Markov Mode MDP.

Resources

License

Stars

Watchers

Forks

Packages

No packages published