mcts

Star

Here are 450 public repositories matching this topic...

hijkzzz / Awesome-LLM-Strawberry

Star

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

reinforcement-learning mathematics coding mcts strawberry llm chain-of-thought openai-o1

Updated Dec 17, 2025

suragnair / alpha-zero-general

Star

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

reinforcement-learning deep-learning neural-network tensorflow keras pytorch mcts othello gomoku monte-carlo-tree-search gobang alphago tf alphago-zero alpha-zero alphazero self-play

Updated Jan 1, 2025
Jupyter Notebook

junxiaosong / AlphaZero_Gomoku

Star

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

board-game reinforcement-learning tensorflow pytorch mcts gomoku rl monte-carlo-tree-search self-learning gobang alphago alphago-zero alphazero

Updated Apr 24, 2024
Python

werner-duvaud / muzero-general

Star

MuZero

machine-learning reinforcement-learning deep-learning neural-network deep-reinforcement-learning python3 pytorch gym mcts rl tensorboard residual-network monte-carlo-tree-search self-learning alphago model-based-rl alphazero muzero muzero-general

Updated Sep 3, 2024
Python

opendilab / LightZero

Star

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Updated Dec 30, 2025
Python

zzli2022 / Awesome-System2-Reasoning-LLM

Star

Latest Advances on System-2 Reasoning

benchmark mcts rl reasoning r1 prm o3 o1 slow-fast system-2 self-improve macro-action

Updated Jun 8, 2025
Python

yaotingwangofficial / Awesome-MCoT

Star

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

survey mcts reasoning cot multimodal system-2 chain-of-thought instruction-tuning large-vision-language-model multimodal-large-language-models multimodal-chain-of-thought openai-o1 slow-thinking deepseek-r1 mllm-reasoning

Updated Nov 14, 2025

chauvinSimon / My_Bibliography_for_Research_on_Autonomous_Driving

Star

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

Updated Dec 15, 2020

s-casci / tinyzero

Star

Easily train AlphaZero-like agents on any environment you want!

reinforcement-learning mcts alphazero

Updated Jan 11, 2024
Python

hrpan / tetris_mcts

Star

MCTS project for Tetris

game reinforcement-learning deep-learning tetris mcts tetris-bots

Updated Oct 9, 2024
Python

dylandjian / SuperGo

Star

A student implementation of Alpha Go Zero

machine-learning reinforcement-learning python3 pytorch mcts alphago alphago-zero

Updated Aug 1, 2018
Python

QueensGambit / CrazyAra

Star

A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜

python open-source machine-learning chess-engine deep-learning mxnet artificial-intelligence mcts gluon lichess convolutional-neural-network alphago python-chess alphazero crazyhouse mcgs

Updated Dec 19, 2025
Jupyter Notebook

DataCanvasIO / Hypernets

Star

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

reinforcement-learning keras mcts hyperparameter-optimization evolutionary-algorithms nas monte-carlo-tree-search hyperparameter-tuning automl neural-architecture-search nasnet enas autodl

Updated Aug 5, 2025
Python

vgarciasc / mcts-viz

Star

Visualization of MCTS algorithm applied to Tic-tac-toe.

visualization mcts tictactoe p5js

Updated Aug 25, 2021
JavaScript

sungyubkim / Deep_RL_with_pytorch

Star

A pytorch tutorial for DRL(Deep Reinforcement Learning)

deep-reinforcement-learning pytorch dqn mcts uct c51 iqn hedge ppo a2c gail counterfactual-regret-minimization qr-dqn random-network-distillation soft-actor-critic self-imitation-learning

Updated Apr 24, 2023
Jupyter Notebook

initial-h / AlphaZero_Gomoku_MPI

Star

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

algorithm tensorflow parallel deep-reinforcement-learning mcts gomoku tree-search tensorlayer alphago mpi4py dirichlet-distribution alphazero alphazero-gomoku

Updated Feb 28, 2025
Python

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expanding the search space and escaping local optima. On SWE-bench Verified, it achieves SOTA performance

mcts code-fix swe-agent test-time-scaling claude-code code-agent swe-bench self-evolve

Updated Sep 23, 2025
Python

thuxugang / doudizhu

Star

AI斗地主

reinforcement-learning ai card-game dqn mcts doudizhu

Updated Jun 13, 2018
Python

kaesve / muzero

Star

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

reinforcement-learning deep-learning tensorflow deep-reinforcement-learning tf2 mcts alphazero tensorflow2 muzero

Updated Mar 28, 2021
Jupyter Notebook

zjeffer / chess-deep-rl

Star

Research project: create a chess engine using Deep Reinforcement Learning

machine-learning chess-engine chess reinforcement-learning ai deep-learning neural-network deep-reinforcement-learning artificial-intelligence mcts neural-networks alphazero

Updated Jun 29, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the mcts topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mcts topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mcts

Here are 450 public repositories matching this topic...

hijkzzz / Awesome-LLM-Strawberry

suragnair / alpha-zero-general

junxiaosong / AlphaZero_Gomoku

werner-duvaud / muzero-general

opendilab / LightZero

zzli2022 / Awesome-System2-Reasoning-LLM

yaotingwangofficial / Awesome-MCoT

chauvinSimon / My_Bibliography_for_Research_on_Autonomous_Driving

s-casci / tinyzero

hrpan / tetris_mcts

dylandjian / SuperGo

QueensGambit / CrazyAra

DataCanvasIO / Hypernets

vgarciasc / mcts-viz

sungyubkim / Deep_RL_with_pytorch

initial-h / AlphaZero_Gomoku_MPI

JARVIS-Xs / SE-Agent

thuxugang / doudizhu

kaesve / muzero

zjeffer / chess-deep-rl

Improve this page

Add this topic to your repo