A simple package to allow users to run Monte Carlo Tree Search on any perfect information domain

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

MCTS

This package provides a simple way of using Monte Carlo Tree Search in any perfect information domain.

It was originally authored by pbsinclair42. This fork however complies with the Python Naming Convention, provides base classes for implementing states and actions, and includes more comprehensive examples.

Installation

With pip: pip install monte-carlo-tree-search

Without pip: Download the zip/tar.gz file of the latest release, extract it, and run python setup.py install

Quick Usage

In order to run MCTS, you must implement your own State class that extends mcts.base.base.BaseState which can fully describe the state of the world. It must implement four methods:

get_current_player(): Returns 1 if it is the maximizer player's turn to choose an action, or -1 for the minimiser player
get_possible_actions(): Returns an iterable of all actions which can be taken from this state
take_action(action): Returns the state which results from taking action action
is_terminal(): Returns True if this state is a terminal state
get_reward(): Returns the reward for this state. Only needed for terminal states.

You must also choose a hashable representation for an action as used in get_possible_actions and take_action. Typically, this would be a class with a custom __hash__ method, but it could also simply be a tuple, a string, etc. A BaseAction class is provided for this purpose.

Once these have been implemented, running MCTS is as simple as initializing your starting state, then running:

from mcts.base.base import BaseState
from mcts.searcher.mcts import MCTS


class MyState(BaseState):
    def get_possible_actions(self) -> [any]:
        pass

    def take_action(self, action: any) -> 'BaseState':
        pass

    def is_terminal(self) -> bool:
        pass

    def get_reward(self) -> float:
        pass

    def get_current_player(self) -> int:
        pass


initial_state = MyState()

searcher = MCTS(time_limit=1000)
bestAction = searcher.search(initial_state=initial_state)

Here the unit of time_limit=1000 is milliseconds. You can also use for example iteration_limit=100 to specify the number of rollouts. Exactly one of time_limit and iteration_limit should be specified.

best_action = searcher.search(initial_state=initial_state)
print(best_action)  # the best action to take found within the time limit

To also receive the best reward as a return value set need_details to True in searcher.search(...).

best_action, reward = searcher.search(initial_state=initial_state, need_details=True)
print(best_action)  # the best action to take found within the time limit
print(reward)  # the expected reward for the best action

Examples

You can find some examples using the MCTS here:

naughtsandcrosses.py is a minimal runnable example by pbsinclair42
connectmnk.py is an example running a full game between two MCTS agents by LucasBorboleta

Collaborating

Feel free to raise a new issue for any new feature or bug you've spotted. Pull requests are also welcomed if you're interested in directly improving the project.

Coding Guidelines

Commit message should follow the Conventional Commits specification. This makes contributions easily comprehensible and enables us to automatically generate release notes.

Recommended tooling for developers:

JetBrains Plugin Conventional Commit by Edoardo Luppi
Visual Studio Plugin Conventional Commits by vivaxy

Example commit message

fix: prevent racing of requests

Introduce a request id and a reference to latest request. Dismiss
incoming responses other than from latest request.

Remove timeouts which were used to mitigate the racing issue but are
obsolete now.

Reviewed-by: Z
Refs: #123

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

2.0.5

Feb 29, 2024

2.0.4

Feb 23, 2023

2.0.3

Jun 29, 2022

2.0.2

May 16, 2022

2.0.1

May 13, 2022

2.0.0

May 12, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

monte_carlo_tree_search-2.0.5-py3-none-any.whl (10.7 kB view hashes)

Uploaded Feb 29, 2024 Python 3

Hashes for monte_carlo_tree_search-2.0.5-py3-none-any.whl

Hashes for monte_carlo_tree_search-2.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c3f3d7823c1ad1637d2a1fac51a1f7058f34757afd1651413e50a6b7c9c6abd2`
MD5	`4467ccc0bc6e4520577a45f6fa66a942`
BLAKE2b-256	`0d50e2f12682e0044e60dcd809997cabbeba4634ac14a9aa67e7c7725bee65b2`