#

mcts

Here are 178 public repositories matching this topic...

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

board-game reinforcement-learning tensorflow pytorch mcts gomoku rl monte-carlo-tree-search self-learning gobang alphago alphago-zero alphazero

Updated Apr 24, 2024
Python

werner-duvaud / muzero-general

MuZero

machine-learning reinforcement-learning deep-learning neural-network deep-reinforcement-learning python3 pytorch gym mcts rl tensorboard residual-network monte-carlo-tree-search self-learning alphago model-based-rl alphazero muzero muzero-general

Updated Sep 3, 2024
Python

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Updated Oct 8, 2024
Python

s-casci / tinyzero

Easily train AlphaZero-like agents on any environment you want!

reinforcement-learning mcts alphazero

Updated Jan 11, 2024
Python

hrpan / tetris_mcts

MCTS project for Tetris

game reinforcement-learning deep-learning tetris mcts tetris-bots

Updated Oct 3, 2023
Python

dylandjian / SuperGo

A student implementation of Alpha Go Zero

machine-learning reinforcement-learning python3 pytorch mcts alphago alphago-zero

Updated Aug 1, 2018
Python

DataCanvasIO / Hypernets

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

reinforcement-learning keras mcts hyperparameter-optimization evolutionary-algorithms nas monte-carlo-tree-search hyperparameter-tuning automl neural-architecture-search nasnet enas autodl

Updated Jul 19, 2024
Python

initial-h / AlphaZero_Gomoku_MPI

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

algorithm tensorflow parallel deep-reinforcement-learning mcts gomoku tree-search tensorlayer alphago mpi4py dirichlet-distribution alphazero alphazero-gomoku

Updated Jan 20, 2020
Python

thuxugang / doudizhu

AI斗地主

reinforcement-learning ai card-game dqn mcts doudizhu

Updated Jun 13, 2018
Python

akolishchak / doom-net-pytorch

Reinforcement learning models in ViZDoom environment

agent learning reinforcement-learning pytorch doom behavior-tree mcts vizdoom reinforcement ppo doomnet-track1

Updated Mar 9, 2022
Python

blanyal / alpha-zero

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

game machine-learning reinforcement-learning deep-learning tensorflow tic-tac-toe connect-four reversi mcts othello tictactoe resnet deepmind connect4 alphago-zero alpha-zero alphazero self-play

Updated Apr 14, 2018
Python

Urinx / ReinforcementLearning

Reinforcing Your Learning of Reinforcement Learning

reinforcement-learning tic-tac-toe space-invaders q-learning doom dqn mcts policy-gradient cartpole gomoku ddpg atari-2600 alphago frozenlake ppo advantage-actor-critic alphago-zero

Updated Jul 14, 2019
Python

lowrollr / turbozero

fast + parallel AlphaZero in JAX

reinforcement-learning mcts gpu-acceleration vectorization monte-carlo-tree-search alphazero jax

Updated Mar 26, 2024
Python

YoujiaZhang / AlphaGo-Zero-Gobang

Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型，主要用以了解AlphaGo Zero的运行原理的Demo，即神经网络是如何指导MCTS做出决策的，以及如何自我对弈学习。源码+教程

gui ai deep-learning tensorflow mcts residual-networks gobang alphago alphazero gomuku

Updated Dec 7, 2022
Python

masouduut94 / MCTS-agent-python

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games …

reinforcement-learning mcts markov-decision-processes monte-carlo-tree-search sequential-decisions decision-space game-of-hex

Updated Mar 10, 2024
Python

CGLemon / pyDLGO

基於深度學習的 GTP 圍棋（围棋）引擎，KGS 指引文件以及演算法教學。

deep-learning baduk weiqi goban mcts game-of-go alphago

Updated Aug 14, 2024
Python

kobanium / TamaGo

Computer go engine using Monte-Carlo Tree Search written in Python3.

go reinforcement-learning deep-learning baduk weiqi mcts go-text-protocol monte-carlo-tree-search alphago alphago-zero alphagozero gumbel-alphazero

Updated May 2, 2024
Python

hr0nix / omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

reinforcement-learning nethack mcts flax model-based-rl jax model-based-reinforcement-learning muzero minihack rlax

Updated Sep 19, 2022
Python

xuetf / AlphaZero_Gobang

Deep Learning big homework of UCAS

deep-learning pytorch mcts gomoku residual-networks gobang alphazero five-in-a-row

Updated Jan 8, 2019
Python

hayoung-kim / mcts-tic-tac-toe

Monte Carlo Tree Search for tic tac toe

tic-tac-toe mcts

Updated Jul 24, 2018
Python

Improve this page

Add a description, image, and links to the mcts topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mcts topic, visit your repo's landing page and select "manage topics."