A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
-
Updated
Dec 24, 2024
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
Easily train AlphaZero-like agents on any environment you want!
MCTS project for Tetris
A student implementation of Alpha Go Zero
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜
Visualization of MCTS algorithm applied to Tic-tac-toe.
A pytorch tutorial for DRL(Deep Reinforcement Learning)
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Reinforcement learning models in ViZDoom environment
Allie: A UCI compliant chess engine
Research project: create a chess engine using Deep Reinforcement Learning
AlphaZero based engine for the game of Go (圍棋/围棋).
Add a description, image, and links to the mcts topic page so that developers can more easily learn about it.
To associate your repository with the mcts topic, visit your repo's landing page and select "manage topics."