mcts | Topic | Ecosyste.ms: Repos

Topic: "mcts"

hijkzzz/Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

Size: 1.63 MB - Last synced at: about 18 hours ago - Pushed at: about 20 hours ago - Stars: 6,743 - Forks: 375

suragnair/alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language: Jupyter Notebook - Size: 414 MB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 4,140 - Forks: 1,089

junxiaosong/AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language: Python - Size: 7.88 MB - Last synced at: 15 days ago - Pushed at: about 1 year ago - Stars: 3,484 - Forks: 993

werner-duvaud/muzero-general

MuZero

Language: Python - Size: 7.09 MB - Last synced at: 15 days ago - Pushed at: 9 months ago - Stars: 2,644 - Forks: 647

opendilab/LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Language: Python - Size: 115 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,382 - Forks: 156

zzli2022/Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Language: Python - Size: 2.63 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 993 - Forks: 45

yaotingwangofficial/Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Size: 4.63 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 576 - Forks: 15

chauvinSimon/My_Bibliography_for_Research_on_Autonomous_Driving

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

Size: 784 MB - Last synced at: 11 months ago - Pushed at: over 4 years ago - Stars: 434 - Forks: 94

s-casci/tinyzero

Easily train AlphaZero-like agents on any environment you want!

Language: Python - Size: 41.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 379 - Forks: 14

hrpan/tetris_mcts 📦

MCTS project for Tetris

Language: Python - Size: 9.73 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 342 - Forks: 34

dylandjian/SuperGo

A student implementation of Alpha Go Zero

Language: Python - Size: 113 KB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 275 - Forks: 63

DataCanvasIO/Hypernets

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

Language: Python - Size: 17.8 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 267 - Forks: 41

QueensGambit/CrazyAra

A Deep Learning UCI-Chess Variant Engine written in C++ & Python :parrot:

Language: Jupyter Notebook - Size: 61.5 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 263 - Forks: 44

vgarciasc/mcts-viz

Visualization of MCTS algorithm applied to Tic-tac-toe.

Language: JavaScript - Size: 66.4 KB - Last synced at: about 2 months ago - Pushed at: almost 4 years ago - Stars: 233 - Forks: 12

initial-h/AlphaZero_Gomoku_MPI

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

Language: Python - Size: 28 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 197 - Forks: 45

sungyubkim/Deep_RL_with_pytorch

A pytorch tutorial for DRL(Deep Reinforcement Learning)

Language: Jupyter Notebook - Size: 521 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 188 - Forks: 42

thuxugang/doudizhu

AI斗地主

Language: Python - Size: 9.99 MB - Last synced at: over 1 year ago - Pushed at: almost 7 years ago - Stars: 172 - Forks: 68

kaesve/muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Language: Jupyter Notebook - Size: 115 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 138 - Forks: 24

zjeffer/chess-deep-rl

Research project: create a chess engine using Deep Reinforcement Learning

Language: Jupyter Notebook - Size: 10.3 MB - Last synced at: 2 months ago - Pushed at: 11 months ago - Stars: 135 - Forks: 12

akolishchak/doom-net-pytorch

Reinforcement learning models in ViZDoom environment

Language: Python - Size: 262 MB - Last synced at: about 1 year ago - Pushed at: about 3 years ago - Stars: 130 - Forks: 20

PuYuuu/vehicle-interaction-decision-making

The decision-making of multiple vehicles at intersection bases on level-k game and MCTS

Language: C++ - Size: 5 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 114 - Forks: 43

manyoso/allie

Allie: A UCI compliant chess engine

Language: C++ - Size: 700 KB - Last synced at: 2 months ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 21

CGLemon/Sayuri

AlphaZero based engine for the game of Go (圍棋/围棋).

Language: C++ - Size: 14.8 MB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 95 - Forks: 10

lowrollr/turbozero

fast + parallel AlphaZero in JAX

Language: Python - Size: 28.8 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 94 - Forks: 9

Urinx/ReinforcementLearning

Reinforcing Your Learning of Reinforcement Learning

Language: Python - Size: 118 MB - Last synced at: about 2 months ago - Pushed at: almost 6 years ago - Stars: 94 - Forks: 22

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

Language: Python - Size: 124 KB - Last synced at: about 2 months ago - Pushed at: about 7 years ago - Stars: 88 - Forks: 28

rlglab/minizero

MiniZero: An AlphaZero and MuZero Training Framework

Language: C++ - Size: 2.32 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 79 - Forks: 21

kobanium/Ray

Computer go engine using Monte-Carlo Tree Search (MCTS)

Language: C++ - Size: 93.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 71 - Forks: 81

Wangmerlyn/MCTS-GSM8k-Demo

This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems

Language: Python - Size: 17.6 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 66 - Forks: 8

masouduut94/MCTS-agent-python

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.

Language: Python - Size: 695 KB - Last synced at: 5 months ago - Pushed at: about 1 year ago - Stars: 65 - Forks: 9

CGLemon/pyDLGO

基於深度學習的 GTP 圍棋（围棋）引擎，KGS 指引文件以及演算法教學。

Language: Python - Size: 12.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 63 - Forks: 11

kobanium/TamaGo

Computer go engine using Monte-Carlo Tree Search written in Python3.

Language: Python - Size: 2.61 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 61 - Forks: 11

gorisanson/quoridor-ai

Quoridor AI based on Monte Carlo tree search

Language: JavaScript - Size: 273 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 58 - Forks: 8

yangboz/godpaper

:monkey_face: An AI chess-board-game framework(by many programming languages) implementations.

Language: HTML - Size: 65.1 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 48 - Forks: 18

coreylowman/synthesis

A rust implementation of AlphaZero algorithm

Language: Rust - Size: 976 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 47 - Forks: 6

YoujiaZhang/AlphaGo-Zero-Gobang

Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型，主要用以了解AlphaGo Zero的运行原理的Demo，即神经网络是如何指导MCTS做出决策的，以及如何自我对弈学习。源码+教程

Language: Python - Size: 13.2 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 40 - Forks: 6

masterai-top/The-strongest-AI-in-Texas-Hold-em-1-to-1

德州AI，MasterAI is an AI poker dedicated to suport n-play (single- or multi-agent) Texas Hold'em imperfect-informatin games.。MasterAI v2.0是从MasterAI v1.0衍生出来的迭代算法，它在非完全信息游戏中利用了通用的强化学习+搜索，并在一对一无限押注的德州扑克中实现了超人的表现。AI源码出售。Tg：@xuzongbin001;E-mail：[email protected]

Language: C++ - Size: 3.72 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 36 - Forks: 9

xuetf/AlphaZero_Gobang

Deep Learning big homework of UCAS

Language: Python - Size: 50.8 MB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 34 - Forks: 16

hr0nix/omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

Language: Python - Size: 577 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 32 - Forks: 4

ai-boson/mcts

MCTS algorithm tutorial and it's explanation with code. Application of MCTS to create A.I for simple game.

Language: Ruby - Size: 378 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 30 - Forks: 0

hayoung-kim/mcts-tic-tac-toe

Monte Carlo Tree Search for tic tac toe

Language: Python - Size: 29.3 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 29 - Forks: 12

tuero/muzero-cpp

A C++ pytorch implementation of MuZero

Language: C++ - Size: 65.5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 5

DenseLance/mcts-simple

mcts-simple is a Python3 library that allows reinforcement learning problems to be solved easily with its implementations of Monte Carlo Tree Search.

Language: Python - Size: 11.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 27 - Forks: 3

TianHongZXY/CoRe

[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models

Language: Python - Size: 6.01 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 27 - Forks: 4

zhangshun97/AI_Gomocup

Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.

Language: Python - Size: 6.93 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 27 - Forks: 10

OMerkel/UCThello

UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)

Language: JavaScript - Size: 5.61 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 4

WorkingRobot/Craftimizer

The best FFXIV crafting solver via hardware accelerated Genetic MCTS.

Language: C# - Size: 1.37 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 25 - Forks: 17

snowfrogdev/macao

A general purpose game playing A.I. framework based on the Monte Carlo tree search algorithm.

Language: TypeScript - Size: 2.53 MB - Last synced at: 28 days ago - Pushed at: over 2 years ago - Stars: 24 - Forks: 4

timoklein/alphazero-gym 📦

AlphaZero for continuous control tasks

Language: Python - Size: 483 KB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 23 - Forks: 4

jdj2261/pytamp

Python robot's tamp library

Language: Python - Size: 47.9 MB - Last synced at: 24 days ago - Pushed at: 5 months ago - Stars: 21 - Forks: 1

bellerb/chappie.ai

Generalized AI to perform a multitude of tasks written in python3

Language: Jupyter Notebook - Size: 406 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 21 - Forks: 6

castacks/mcts-stl-planning

Online Signal Temporal Logic (STL) Monte-Carlo Tree Search for Guided Imitation Learning

Language: Python - Size: 111 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 0

bellerb/chess

Program for playing chess in the console against AI or human opponents

Language: Python - Size: 72.6 MB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 20 - Forks: 8

deeptexas-ai/The-strongest-AI-in-Texas-Hold-em-unlimited-Texas-Hold-em-1-vs.-1

德州扑克最强人工智能AI，1对1的德州AI，可以战胜人类顶尖职业牌手，先出售全套AI源代码和AI训练模型；Telegram联系： @xuzongbin001 或E-mail：[email protected]

Language: C++ - Size: 4.31 MB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 19 - Forks: 4

masterai-top/The-strongest-AI-in-Texas-Hold-em-MasterAI-1.0-1vs1-Extreme

德州AI，德州1对1的AI；MasterAI decisively defeated 14 top human Texas hold'em poker professsionals in September 2020.。MasterAI 是Master团队在非完美信息博弈中实现的的一种扑克AI，在德州扑克一对一的有限押注已经取得成果，MasterAI于2020年9月战胜了中国的14位顶级扑克职业选手

Language: C++ - Size: 35.4 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 19 - Forks: 6

Cognitive-AI-Systems/mats-lp

[AAAI-2024] MATS-LP addresses the challenging problem of decentralized lifelong multi-agent pathfinding. The proposed approach utilizes a combination of Monte Carlo Tree Search and reinforcement learning for resolving conflicts.

Language: C++ - Size: 716 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 19 - Forks: 0

adepierre/Caffe_AlphaZero

Implementation of Deepmind's AlphaZero algorithm with Caffe and C++

Language: C++ - Size: 1.02 MB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 18 - Forks: 9

jlokitha/connect-4-game

A classic Connect Four game featuring two-player mode and an AI opponent powered by Monte Carlo Tree Search (MCTS), offering an exciting and strategic gameplay experience.

Language: Java - Size: 138 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 17 - Forks: 0

JuliaPOMDP/FactoredValueMCTS.jl

Scalable MCTS for team scenarios

Language: Julia - Size: 273 KB - Last synced at: 7 days ago - Pushed at: 12 months ago - Stars: 16 - Forks: 3

faameunier/MCTSnet

A PyTorch implementation of DeepMind's MCTSnet

Language: Python - Size: 728 KB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 2

ladofa/janggi

야매장기 - 알파고를 참고한 장기 AI

Language: Python - Size: 708 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 16 - Forks: 6

patrik-ha/explainable-minichess

Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection

Language: PureBasic - Size: 41.2 MB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 15 - Forks: 5

rlglab/optionzero

[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm

Language: C++ - Size: 2.67 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 14 - Forks: 0

jianzhnie/RLZero

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

Language: Python - Size: 384 KB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 14 - Forks: 0

cmubig/sorts

Code base for Social Robot Tree Search (SoRTS).

Language: Python - Size: 39.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 3

melkael/vne

Virtual Network Embedding algorithms, including code for the paper "Monkey Business: Reinforcement learning meets neighborhood search for Virtual Network Embedding"

Language: Julia - Size: 3.62 MB - Last synced at: 6 months ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 0

aijunbai/thompson-sampling

Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs

Language: C++ - Size: 948 KB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 14 - Forks: 0

martinobdl/MCTS

Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space

Language: Python - Size: 37.1 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 3

michaelbzms/MonteCarloTreeSearch

A fast C++ impementation of Monte Carlo Tree Search with abstract classes that a user of this library can extend in order to use it. To demonstrate it I apply it to the game of Quoridor.

Language: C++ - Size: 114 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 13 - Forks: 3

kekmodel/gym-tictactoe-zero

Tic Tac Toe with Alpha Zero method - My first work

Language: Python - Size: 50.3 MB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 13 - Forks: 5

matgrioni/euchre-bot

An AI for the card game euchre using Monte-Carlo Tree Search.

Language: Go - Size: 645 KB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 13 - Forks: 3

Aleum/AlphaGo

9x9 AlphaGo

Language: Python - Size: 11.4 MB - Last synced at: about 1 year ago - Pushed at: almost 9 years ago - Stars: 13 - Forks: 5

rystrauss/dopamax

Reinforcement learning in pure JAX.

Language: Python - Size: 262 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 12 - Forks: 1

aldidoanta/TicTacToeMCTS

A Unity WebGL project for a TicTacToe game, using Monte Carlo Tree Search (MCTS) for its AI decision making.

Language: C# - Size: 15.3 MB - Last synced at: 10 days ago - Pushed at: about 2 years ago - Stars: 12 - Forks: 0

Danielhp95/Regym

Language: Python - Size: 4.37 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 12 - Forks: 7

flixpar/AlphaTSP

AlphaGo inspired TSP Heuristic Solver

Language: Jupyter Notebook - Size: 1010 KB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 5

QueensGambit/CrazyAra-Engine

CrazyAra - A Deep Learning UCI-Chess Variant Engine written in C++ :bird:

Language: C++ - Size: 1.4 MB - Last synced at: about 1 year ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 0

Sagebati/OxyMcts

Mcts library written in rust, for rust.

Language: Rust - Size: 72.3 KB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 3

Mikeywalsh/MCTS-Visualisation

Visualisation of MCTS in Unity with C# for different games, being created for my third year university project at the University of York

Language: C# - Size: 67.5 MB - Last synced at: over 2 years ago - Pushed at: almost 7 years ago - Stars: 11 - Forks: 1

aijunbai/hplanning

Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation

Language: C++ - Size: 27.1 MB - Last synced at: over 2 years ago - Pushed at: almost 9 years ago - Stars: 11 - Forks: 3

LucasColas/Poker-AI

Several agents that can play poker (using probability, monte carlo, etc.) and clustering to get the types of poker players.

Language: Python - Size: 3.06 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 10 - Forks: 2

deeptexas-ai/MasterAI-1.0-1vs1-Limit

出售德州扑克AI核心算法和训练大模型，有意者Telegram :@xuzongbin001

Language: C++ - Size: 37.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 10 - Forks: 2

AI4Science-WestlakeU/t_scend

This repo is the code for T-SCEND, a novel framework that significantly improves diffusion model’s reasoning capabilities with better energy-based training and scaling up test-time computation.

Language: Python - Size: 1.47 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 10 - Forks: 0

PaytonWebber/mcts-rs

A Rust implementation of the Monte Carlo Tree Search (MCTS) algorithm, utilizing an arena allocator for efficient memory management.

Language: Rust - Size: 38.1 KB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 10 - Forks: 2

lowrollr/turbozero_torch

fast + parallel AlphaZero in PyTorch

Language: Jupyter Notebook - Size: 28 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 4

epcm/QtNoGo

基于Qt的不围棋(nogo)单机对战平台，包含基于MCTS的AI对战Bot

Language: C++ - Size: 2.55 MB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 10 - Forks: 3

zhihanyang2022/alpha-zero

Minimal AlphaZero in PyTorch, trained on Connect4 on a 6x6 board.

Language: Python - Size: 6.29 MB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 10 - Forks: 2

JJisbug/UAV-MCTS

This code is for the paper titled Path Planning for the Dynamic UAV-aided Wireless Systems Using Monte Carlo Tree Search, which is under review for Transactions on Vehicular Technology (Correspondence).

Language: Python - Size: 50.8 KB - Last synced at: over 1 year ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 2

koulanurag/dream-and-search 📦

Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"

Language: Python - Size: 1.19 MB - Last synced at: about 1 year ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 1

MerceaOtniel/HybridAlpha

HybridAlpha - a mix between AlphaGo Zero and AlphaZero for multiple games

Language: Python - Size: 192 MB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 10 - Forks: 4

airaria/AlphaZero_Gomoku_WuZiQi

My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero

Language: Python - Size: 10 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 0

OMerkel/Oware

Oware and Ouril - traditional African Mancala games with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)

Language: HTML - Size: 17.1 MB - Last synced at: 3 months ago - Pushed at: about 7 years ago - Stars: 10 - Forks: 2

haodong2000/alphacc_zero

AlphaCC Zero: A Decision-Making Algorithm in Reinforcement Learning for Chinese Chess (SRTP)

Language: Python - Size: 653 KB - Last synced at: 5 months ago - Pushed at: almost 2 years ago - Stars: 9 - Forks: 2

cnarutox/Gobang

Gobang Based on Monte Carlo Tree Search

Language: Python - Size: 1.95 MB - Last synced at: over 2 years ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 2

CGLemon/ElephantArt

A chinese chess(Xiangqi, 象棋) engine based on convolution neural network and Monte Carlo tree search which support UCCI protocol.

Language: C++ - Size: 1.4 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 9 - Forks: 4

peldszus/alpha-zero-general-lib

An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice

Language: Python - Size: 154 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 9 - Forks: 1

0xNineteen/hyper-alpha-zero

hyper optimized alpha zero implementation to play gomoku (distributed training with ray, mcts with cython)

Language: Python - Size: 864 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

PiperLiu/five-in-a-row-AI

♟️ Deploy a AI five-in-a-row game. Including front-end, back-end & deep RL code. 基于 vue3 与 flask 部署的强化学习五子棋 AlphaGo 实践。

Language: Python - Size: 12.4 MB - Last synced at: about 2 months ago - Pushed at: about 3 years ago - Stars: 8 - Forks: 2

oriyanh/Bridge-AI

In this project we try to create a sophisticated computer agent to play the Contact Bridge card game. Our goal is to develop an agent that is tough to play against, with fast reaction time so it is able to play in real time against humans. We approached this as a search problem, and implemented search-tree heuristics based on Minimax and Monte Carlo Tree Search. Implemented as a final project for the "Introduction to Aritifical Intelligence" course of the Hebrew University of Jerusalem.

Language: Python - Size: 4.69 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 8 - Forks: 3

wr786/Amazons-Chess

带GUI的Amazons，不止是bot！——北京大学2019计算概论A大作业

Language: Python - Size: 161 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 8 - Forks: 0