Topic: "mcts"
hijkzzz/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Size: 1.63 MB - Last synced at: about 18 hours ago - Pushed at: about 20 hours ago - Stars: 6,743 - Forks: 375

suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language: Jupyter Notebook - Size: 414 MB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 4,140 - Forks: 1,089

junxiaosong/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Language: Python - Size: 7.88 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 3,484 - Forks: 993

werner-duvaud/muzero-general
MuZero
Language: Python - Size: 7.09 MB - Last synced at: 15 days ago - Pushed at: 9 months ago - Stars: 2,644 - Forks: 647

opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Language: Python - Size: 115 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,382 - Forks: 156

zzli2022/Awesome-System2-Reasoning-LLM
Latest Advances on System-2 Reasoning
Language: Python - Size: 2.63 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 993 - Forks: 45

yaotingwangofficial/Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Size: 4.63 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 576 - Forks: 15

chauvinSimon/My_Bibliography_for_Research_on_Autonomous_Driving
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
Size: 784 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 434 - Forks: 94

s-casci/tinyzero
Easily train AlphaZero-like agents on any environment you want!
Language: Python - Size: 41.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 379 - Forks: 14

hrpan/tetris_mcts 📦
MCTS project for Tetris
Language: Python - Size: 9.73 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 342 - Forks: 34

dylandjian/SuperGo
A student implementation of Alpha Go Zero
Language: Python - Size: 113 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 275 - Forks: 63

DataCanvasIO/Hypernets
A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.
Language: Python - Size: 17.8 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 267 - Forks: 41

QueensGambit/CrazyAra
A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜
Language: Jupyter Notebook - Size: 61.5 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 263 - Forks: 44

vgarciasc/mcts-viz
Visualization of MCTS algorithm applied to Tic-tac-toe.
Language: JavaScript - Size: 66.4 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 233 - Forks: 12

initial-h/AlphaZero_Gomoku_MPI
An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku
Language: Python - Size: 28 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 197 - Forks: 45

sungyubkim/Deep_RL_with_pytorch
A PyTorch tutorial for DRL (Deep Reinforcement Learning)
Language: Jupyter Notebook - Size: 521 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 188 - Forks: 42

thuxugang/doudizhu
AI for Dou Dizhu (斗地主, "Fight the Landlord")
Language: Python - Size: 9.99 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 172 - Forks: 68

kaesve/muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
Language: Jupyter Notebook - Size: 115 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 138 - Forks: 24

zjeffer/chess-deep-rl
Research project: create a chess engine using Deep Reinforcement Learning
Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 135 - Forks: 12

akolishchak/doom-net-pytorch
Reinforcement learning models in ViZDoom environment
Language: Python - Size: 262 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 130 - Forks: 20

PuYuuu/vehicle-interaction-decision-making
Decision-making for multiple vehicles at an intersection, based on a level-k game and MCTS
Language: C++ - Size: 5 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 114 - Forks: 43

manyoso/allie
Allie: A UCI compliant chess engine
Language: C++ - Size: 700 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 21

CGLemon/Sayuri
AlphaZero based engine for the game of Go (圍棋/围棋).
Language: C++ - Size: 14.8 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 95 - Forks: 10

lowrollr/turbozero
fast + parallel AlphaZero in JAX
Language: Python - Size: 28.8 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 94 - Forks: 9

Urinx/ReinforcementLearning
Reinforcing Your Learning of Reinforcement Learning
Language: Python - Size: 118 MB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 94 - Forks: 22

blanyal/alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Language: Python - Size: 124 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 88 - Forks: 28

rlglab/minizero
MiniZero: An AlphaZero and MuZero Training Framework
Language: C++ - Size: 2.32 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 79 - Forks: 21

kobanium/Ray
Computer go engine using Monte-Carlo Tree Search (MCTS)
Language: C++ - Size: 93.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 71 - Forks: 81

Wangmerlyn/MCTS-GSM8k-Demo
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
Language: Python - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 66 - Forks: 8

masouduut94/MCTS-agent-python
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project, I used a board game called "HEX" as a platform to test different simulation strategies in the MCTS field.
Language: Python - Size: 695 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 9

CGLemon/pyDLGO
A deep-learning-based GTP Go (圍棋/围棋) engine, with a KGS guide and algorithm tutorials.
Language: Python - Size: 12.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 63 - Forks: 11

kobanium/TamaGo
Computer go engine using Monte-Carlo Tree Search written in Python3.
Language: Python - Size: 2.61 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 61 - Forks: 11

gorisanson/quoridor-ai
Quoridor AI based on Monte Carlo tree search
Language: JavaScript - Size: 273 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 58 - Forks: 8

yangboz/godpaper
🐵 An AI chess-board-game framework with implementations in many programming languages.
Language: HTML - Size: 65.1 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 48 - Forks: 18

coreylowman/synthesis
A Rust implementation of the AlphaZero algorithm
Language: Rust - Size: 976 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 47 - Forks: 6

YoujiaZhang/AlphaGo-Zero-Gobang
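Meta-Zeta is a reinforcement-learning-based Gobang (five-in-a-row) model, intended mainly as a demo for understanding how AlphaGo Zero works: how the neural network guides MCTS decisions and how it learns through self-play. Source code + tutorial.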
Language: Python - Size: 13.2 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 40 - Forks: 6

masterai-top/The-strongest-AI-in-Texas-Hold-em-1-to-1
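Texas Hold'em AI. MasterAI is a poker AI dedicated to supporting n-player (single- or multi-agent) Texas Hold'em imperfect-information games. MasterAI v2.0 is an iterative algorithm derived from MasterAI v1.0; it applies general reinforcement learning plus search to imperfect-information games and achieves superhuman performance in heads-up no-limit Texas Hold'em. AI source code for sale. Tg: @xuzongbin001; E-mail: [email protected]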
Language: C++ - Size: 3.72 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 36 - Forks: 9

xuetf/AlphaZero_Gobang
Deep Learning big homework of UCAS
Language: Python - Size: 50.8 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 34 - Forks: 16

hr0nix/omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Language: Python - Size: 577 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 32 - Forks: 4

ai-boson/mcts
MCTS algorithm tutorial and its explanation with code. Application of MCTS to create an A.I. for a simple game.
Language: Ruby - Size: 378 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 30 - Forks: 0

hayoung-kim/mcts-tic-tac-toe
Monte Carlo Tree Search for tic tac toe
Language: Python - Size: 29.3 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 29 - Forks: 12

tuero/muzero-cpp
A C++ PyTorch implementation of MuZero
Language: C++ - Size: 65.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 5

DenseLance/mcts-simple
mcts-simple is a Python3 library that allows reinforcement learning problems to be solved easily with its implementations of Monte Carlo Tree Search.
Language: Python - Size: 11.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 27 - Forks: 3

TianHongZXY/CoRe
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models
Language: Python - Size: 6.01 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 4

zhangshun97/AI_Gomocup
Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.
Language: Python - Size: 6.93 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 10

OMerkel/UCThello
UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Language: JavaScript - Size: 5.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 4

WorkingRobot/Craftimizer
The best FFXIV crafting solver via hardware accelerated Genetic MCTS.
Language: C# - Size: 1.37 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 17

snowfrogdev/macao
A general purpose game playing A.I. framework based on the Monte Carlo tree search algorithm.
Language: TypeScript - Size: 2.53 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 4

timoklein/alphazero-gym 📦
AlphaZero for continuous control tasks
Language: Python - Size: 483 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 23 - Forks: 4

jdj2261/pytamp
A Python TAMP (task and motion planning) library for robots
Language: Python - Size: 47.9 MB - Last synced at: 24 days ago - Pushed at: 5 months ago - Stars: 21 - Forks: 1

bellerb/chappie.ai
Generalized AI to perform a multitude of tasks written in python3
Language: Jupyter Notebook - Size: 406 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 6

castacks/mcts-stl-planning
Online Signal Temporal Logic (STL) Monte-Carlo Tree Search for Guided Imitation Learning
Language: Python - Size: 111 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 0

bellerb/chess
Program for playing chess in the console against AI or human opponents
Language: Python - Size: 72.6 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 8

deeptexas-ai/The-strongest-AI-in-Texas-Hold-em-unlimited-Texas-Hold-em-1-vs.-1
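The strongest Texas Hold'em AI: a heads-up (1 vs. 1) Texas Hold'em AI that can defeat top human professional players. The full AI source code and trained models are for sale; contact via Telegram: @xuzongbin001 or E-mail: [email protected]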
Language: C++ - Size: 4.31 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 19 - Forks: 4

masterai-top/The-strongest-AI-in-Texas-Hold-em-MasterAI-1.0-1vs1-Extreme
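Texas Hold'em AI for heads-up (1 vs. 1) play. MasterAI decisively defeated 14 top human Texas Hold'em poker professionals in September 2020. MasterAI is a poker AI developed by the Master team for imperfect-information games; it has achieved results in heads-up limit Texas Hold'em, and in September 2020 it defeated 14 of China's top professional poker players.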
Language: C++ - Size: 35.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 19 - Forks: 6

Cognitive-AI-Systems/mats-lp
[AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a combination of Monte Carlo Tree Search and reinforcement learning for resolving conflicts.
Language: C++ - Size: 716 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 19 - Forks: 0

adepierre/Caffe_AlphaZero
Implementation of Deepmind's AlphaZero algorithm with Caffe and C++
Language: C++ - Size: 1.02 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 18 - Forks: 9

jlokitha/connect-4-game
A classic Connect Four game featuring two-player mode and an AI opponent powered by Monte Carlo Tree Search (MCTS), offering an exciting and strategic gameplay experience.
Language: Java - Size: 138 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 17 - Forks: 0

JuliaPOMDP/FactoredValueMCTS.jl
Scalable MCTS for team scenarios
Language: Julia - Size: 273 KB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 16 - Forks: 3

faameunier/MCTSnet
A PyTorch implementation of DeepMind's MCTSnet
Language: Python - Size: 728 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

ladofa/janggi
야매장기 - a Janggi (Korean chess) AI inspired by AlphaGo
Language: Python - Size: 708 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 6

patrik-ha/explainable-minichess
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
Language: PureBasic - Size: 41.2 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 15 - Forks: 5

rlglab/optionzero
[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm
Language: C++ - Size: 2.67 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 14 - Forks: 0

jianzhnie/RLZero
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
Language: Python - Size: 384 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 14 - Forks: 0

cmubig/sorts
Code base for Social Robot Tree Search (SoRTS).
Language: Python - Size: 39.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 3

melkael/vne
Virtual Network Embedding algorithms, including code for the paper "Monkey Business: Reinforcement learning meets neighborhood search for Virtual Network Embedding"
Language: Julia - Size: 3.62 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 0

aijunbai/thompson-sampling
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Language: C++ - Size: 948 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 14 - Forks: 0

martinobdl/MCTS
Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space
Language: Python - Size: 37.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 3

michaelbzms/MonteCarloTreeSearch
A fast C++ implementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To demonstrate it, I apply it to the game of Quoridor.
Language: C++ - Size: 114 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 3

kekmodel/gym-tictactoe-zero
Tic Tac Toe with Alpha Zero method - My first work
Language: Python - Size: 50.3 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 13 - Forks: 5

matgrioni/euchre-bot
An AI for the card game euchre using Monte-Carlo Tree Search.
Language: Go - Size: 645 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 13 - Forks: 3

Aleum/AlphaGo
9x9 AlphaGo
Language: Python - Size: 11.4 MB - Last synced at: about 1 year ago - Pushed at: almost 9 years ago - Stars: 13 - Forks: 5

rystrauss/dopamax
Reinforcement learning in pure JAX.
Language: Python - Size: 262 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 12 - Forks: 1

aldidoanta/TicTacToeMCTS
A Unity WebGL project for a TicTacToe game, using Monte Carlo Tree Search (MCTS) for its AI decision making.
Language: C# - Size: 15.3 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

Danielhp95/Regym
Language: Python - Size: 4.37 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 7

flixpar/AlphaTSP
AlphaGo inspired TSP Heuristic Solver
Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 5

QueensGambit/CrazyAra-Engine
CrazyAra - A Deep Learning UCI-Chess Variant Engine written in C++ 🐦
Language: C++ - Size: 1.4 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 0

Sagebati/OxyMcts
MCTS library written in Rust, for Rust.
Language: Rust - Size: 72.3 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 3

Mikeywalsh/MCTS-Visualisation
Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York
Language: C# - Size: 67.5 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 1

aijunbai/hplanning
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
Language: C++ - Size: 27.1 MB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 11 - Forks: 3

LucasColas/Poker-AI
Several agents that can play poker (using probability, Monte Carlo, etc.) and clustering to identify the types of poker players.
Language: Python - Size: 3.06 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 10 - Forks: 2

deeptexas-ai/MasterAI-1.0-1vs1-Limit
Core Texas Hold'em AI algorithms and trained large models for sale; if interested, contact via Telegram: @xuzongbin001
Language: C++ - Size: 37.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 2

AI4Science-WestlakeU/t_scend
This repo is the code for T-SCEND, a novel framework that significantly improves diffusion models' reasoning capabilities with better energy-based training and scaled-up test-time computation.
Language: Python - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 0

PaytonWebber/mcts-rs
A Rust implementation of the Monte Carlo Tree Search (MCTS) algorithm, utilizing an arena allocator for efficient memory management.
Language: Rust - Size: 38.1 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 2

lowrollr/turbozero_torch
fast + parallel AlphaZero in PyTorch
Language: Jupyter Notebook - Size: 28 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 4

epcm/QtNoGo
A Qt-based standalone NoGo (不围棋) game platform, including an MCTS-based AI opponent bot.
Language: C++ - Size: 2.55 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 3

zhihanyang2022/alpha-zero
Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.
Language: Python - Size: 6.29 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 2

JJisbug/UAV-MCTS
This code is for the paper titled "Path Planning for the Dynamic UAV-aided Wireless Systems Using Monte Carlo Tree Search", which is under review for Transactions on Vehicular Technology (Correspondence).
Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 2

koulanurag/dream-and-search 📦
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
Language: Python - Size: 1.19 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 1

MerceaOtniel/HybridAlpha
HybridAlpha - a mix between AlphaGo Zero and AlphaZero for multiple games
Language: Python - Size: 192 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 4

airaria/AlphaZero_Gomoku_WuZiQi
My implementation of AlphaZero for Gomoku (Wu Zi Qi, 五子棋); a poor man's AlphaZero
Language: Python - Size: 10 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 0

OMerkel/Oware
Oware and Ouril - traditional African Mancala games with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Language: HTML - Size: 17.1 MB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 2

haodong2000/alphacc_zero
AlphaCC Zero: A Decision-Making Algorithm in Reinforcement Learning for Chinese Chess (SRTP)
Language: Python - Size: 653 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 2

cnarutox/Gobang
Gobang Based on Monte Carlo Tree Search
Language: Python - Size: 1.95 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 2

CGLemon/ElephantArt
A Chinese chess (Xiangqi, 象棋) engine based on a convolutional neural network and Monte Carlo tree search, with support for the UCCI protocol.
Language: C++ - Size: 1.4 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 4

peldszus/alpha-zero-general-lib
An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice
Language: Python - Size: 154 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 9 - Forks: 1

0xNineteen/hyper-alpha-zero
Hyper-optimized AlphaZero implementation to play Gomoku (distributed training with Ray, MCTS with Cython)
Language: Python - Size: 864 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

PiperLiu/five-in-a-row-AI
♟️ Deploy an AI five-in-a-row game, including front-end, back-end & deep RL code. A reinforcement-learning Gomoku project in the style of AlphaGo, deployed with vue3 and flask.
Language: Python - Size: 12.4 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 2

oriyanh/Bridge-AI
In this project we try to create a sophisticated computer agent to play the Contract Bridge card game. Our goal is to develop an agent that is tough to play against, with a fast reaction time so that it can play in real time against humans. We approached this as a search problem, and implemented search-tree heuristics based on Minimax and Monte Carlo Tree Search. Implemented as a final project for the "Introduction to Artificial Intelligence" course of the Hebrew University of Jerusalem.
Language: Python - Size: 4.69 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 3

wr786/Amazons-Chess
Amazons with a GUI, not just a bot! Final project for Peking University's 2019 Introduction to Computing (A) course.
Language: Python - Size: 161 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 0
