mcts

Monte-Carlo Tree Search (MCTS) basic implementation.

The basic idea is to implement a tic tac toe player using MCTS and match it up against my previously written reinforcement learning agent (Q-learning). I'm only doing that because I know that the q-learner can learn to play optimally, so I can use it as a baseline player to evaluate and validate the MCTS player.

For now the idea is to have a very simple MCTS policy, later I'll extend it to have more sophisticated exploration/exploitation policies, and look into other features.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Debug		Debug
Release		Release
src		src
README.md		README.md
Tupfile		Tupfile
Tuprules.tup		Tuprules.tup

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Debug

Debug

Release

Release

src

src

README.md

README.md

Tupfile

Tupfile

Tuprules.tup

Tuprules.tup

Repository files navigation

mcts

About

Releases

Packages

Languages

RyannnXU/mcts

Folders and files

Latest commit

History

Repository files navigation

mcts

About

Resources

Stars

Watchers

Forks

Languages