Ecosyste.ms: Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: alphago-zero

Repositories

maxpumperla/deep_learning_and_the_game_of_go

Code and other material for the book "Deep Learning and the Game of Go"

Language: Python - Size: 313 MB - Last synced: 12 days ago - Pushed: over 1 year ago - Stars: 947 - Forks: 385

cestpasphoto/alpha-zero-general

A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available

Language: Python - Size: 643 MB - Last synced: 24 days ago - Pushed: 25 days ago - Stars: 25 - Forks: 8

pytorch/ELF 📦

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

Language: C++ - Size: 6.13 MB - Last synced: 27 days ago - Pushed: almost 5 years ago - Stars: 3,356 - Forks: 567

suragnair/alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language: Jupyter Notebook - Size: 414 MB - Last synced: 27 days ago - Pushed: 27 days ago - Stars: 3,681 - Forks: 1,002

SethKitchen/AlphaHearthstoneZero (fork of HearthSim/SabberStone)

Just another Hearthstone Simulator in C# .Net Core, with some A.I. approaches!

Language: C# - Size: 37.2 MB - Last synced: about 1 month ago - Pushed: over 4 years ago - Stars: 5 - Forks: 0

gooooloo/reversi-alpha-zero (fork of mokemokechicken/reversi-alpha-zero)

Reversi reinforcement learning by AlphaGo Zero methods.

Language: Python - Size: 721 KB - Last synced: about 1 month ago - Pushed: about 6 years ago - Stars: 4 - Forks: 0

Zeta36/chess-alpha-zero

Chess reinforcement learning by AlphaGo Zero methods.

Language: Jupyter Notebook - Size: 120 MB - Last synced: about 1 month ago - Pushed: about 1 year ago - Stars: 2,092 - Forks: 480

kobanium/TamaGo

Computer go engine using Monte-Carlo Tree Search written in Python3.

Language: Python - Size: 1.86 MB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 49 - Forks: 10

yffbit/gomoku

Gomoku AI based on AlphaGo Zero algorithm

Language: C++ - Size: 18.6 KB - Last synced: about 2 months ago - Pushed: almost 4 years ago - Stars: 3 - Forks: 3

yangboz/DeepReinforcementLearning (fork of AppliedDataSciencePartners/DeepReinforcementLearning)

A replica of the AlphaZero methodology for deep reinforcement learning in Python

Language: Jupyter Notebook - Size: 2.61 MB - Last synced: about 2 months ago - Pushed: over 6 years ago - Stars: 0 - Forks: 0

yhyu13/AlphaGOZero-python-tensorflow

Congratulations to DeepMind! This is a re-engineered implementation (drawing on many other git repos in /support/) of DeepMind's Oct. 19 publication, [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository is for educational purposes only.)

Language: Python - Size: 185 MB - Last synced: about 1 month ago - Pushed: over 1 year ago - Stars: 341 - Forks: 113

junxiaosong/AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language: Python - Size: 7.88 MB - Last synced: 2 months ago - Pushed: 2 months ago - Stars: 3,158 - Forks: 946

mokemokechicken/reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.

Language: Python - Size: 1.22 MB - Last synced: 2 months ago - Pushed: over 1 year ago - Stars: 669 - Forks: 168

thyeem/marizero

Experimental Crunchy AI player based on the AlphaGo-Zero algorithm.

Language: Python - Size: 11.9 MB - Last synced: 3 months ago - Pushed: almost 5 years ago - Stars: 0 - Forks: 0

michaelnny/alpha_zero

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

Language: Python - Size: 325 MB - Last synced: 3 months ago - Pushed: 3 months ago - Stars: 23 - Forks: 7

Narsil/alphagozero

Unofficial attempt to rebuild AlphaGo Zero

Language: Python - Size: 109 KB - Last synced: about 1 month ago - Pushed: about 1 month ago - Stars: 57 - Forks: 16

techeng322/alphagozero

Unofficial attempt to rebuild AlphaGo Zero

Language: Python - Size: 115 KB - Last synced: 5 months ago - Pushed: 7 months ago - Stars: 1 - Forks: 0

Urinx/ReinforcementLearning

Reinforcing Your Learning of Reinforcement Learning

Language: Python - Size: 118 MB - Last synced: about 1 month ago - Pushed: almost 5 years ago - Stars: 84 - Forks: 21

witchu/alphazero

Board game reinforcement learning using the AlphaZero method, including Makhos (Thai Checkers), Reversi, Connect Four, and Tic-tac-toe game rules

Language: Python - Size: 8.24 MB - Last synced: 6 months ago - Pushed: about 6 years ago - Stars: 25 - Forks: 9

novoselov-ab/ai-zero

Implementation of the AlphaGo Zero paper in one C++ header file without any dependencies

Language: C++ - Size: 11.5 MB - Last synced: 6 months ago - Pushed: about 6 years ago - Stars: 5 - Forks: 4

MerceaOtniel/HybridAlpha

HybridAlpha - a mix between AlphaGo Zero and AlphaZero for multiple games

Language: Python - Size: 192 MB - Last synced: 6 months ago - Pushed: about 4 years ago - Stars: 10 - Forks: 4

bupticybee/icyChessZero

An AlphaZero program for Chinese chess (Xiangqi)

Language: Jupyter Notebook - Size: 23.5 MB - Last synced: 7 months ago - Pushed: over 5 years ago - Stars: 348 - Forks: 73

CuriosAI/sai (fork of leela-zero/leela-zero)

SAI: a fork of Leela Zero with variable komi.

Language: C++ - Size: 4.65 MB - Last synced: 7 months ago - Pushed: 8 months ago - Stars: 102 - Forks: 12

Zeta36/connect4-alpha-zero

Connect4 reinforcement learning by AlphaGo Zero methods.

Language: Python - Size: 10.6 MB - Last synced: 7 months ago - Pushed: about 3 years ago - Stars: 108 - Forks: 38

dylandjian/SuperGo

A student implementation of Alpha Go Zero

Language: Python - Size: 113 KB - Last synced: 8 months ago - Pushed: almost 6 years ago - Stars: 275 - Forks: 63

evolutionsoftswiss/alpha-zero-learning

Java-based AlphaZero reinforcement learning. The generic base module allows implementation of any adversarial board game. Example implementation for Tic Tac Toe.

Language: Java - Size: 54.3 MB - Last synced: 9 months ago - Pushed: 9 months ago - Stars: 4 - Forks: 2

adamtupper/alphablooms

A Blooms implementation for the AlphaZero General library.

Language: Jupyter Notebook - Size: 555 KB - Last synced: 9 months ago - Pushed: about 3 years ago - Stars: 0 - Forks: 0

2Bear/othello-zero

An implementation of the AlphaGo Zero and AlphaZero algorithms for playing Othello.

Language: Python - Size: 3.88 MB - Last synced: 7 months ago - Pushed: almost 3 years ago - Stars: 18 - Forks: 3

SergioIommi/DQN-2048

Deep Reinforcement Learning to Play 2048 (with Keras)

Language: Python - Size: 20.6 MB - Last synced: 9 months ago - Pushed: almost 6 years ago - Stars: 17 - Forks: 3

Yangyangii/AlphaZero-connect6 (fork of reinforcement-learning-kr/alpha_omok)

DeepMind AlphaZero for Connect 6

Language: Python - Size: 81.1 MB - Last synced: 10 months ago - Pushed: over 2 years ago - Stars: 3 - Forks: 2

haodong2000/alphacc_zero

AlphaCC Zero: A Decision-Making Algorithm in Reinforcement Learning for Chinese Chess (SRTP)

Language: Python - Size: 653 KB - Last synced: about 2 months ago - Pushed: 11 months ago - Stars: 9 - Forks: 2

Simuschlatz/AlphaBing

♟️ A combination of Reinforcement Learning and Alpha-Beta Search in Chinese chess

Language: Python - Size: 160 MB - Last synced: 10 months ago - Pushed: 10 months ago - Stars: 12 - Forks: 1

chihangs/slap

Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN - Application in Gomoku Reinforcement Learning

Language: Python - Size: 32 MB - Last synced: 11 months ago - Pushed: 11 months ago - Stars: 0 - Forks: 0

lrenc/AlphaMiao

😼 AlphaZero Gomoku

Language: JavaScript - Size: 3.12 MB - Last synced: about 1 year ago - Pushed: about 1 year ago - Stars: 1 - Forks: 0

PeeteKeesel/sokoban-ai

🌲 Teaching an AI to solve Sokoban using AlphaGo-Zero style RL & Single-Player MCTS.

Language: Python - Size: 3.6 MB - Last synced: 12 months ago - Pushed: over 2 years ago - Stars: 0 - Forks: 0

adepierre/Caffe_AlphaZero

Implementation of Deepmind's AlphaZero algorithm with Caffe and C++

Language: C++ - Size: 1.02 MB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 18 - Forks: 9

yzhq97/AlphaGomokuZero

An illustration program that visualizes the MCTS mechanism inside AlphaZero to provide a better understanding of how an AI makes decisions.

Language: Python - Size: 64.9 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 16 - Forks: 6

blanyal/alpha-zero

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

Language: Python - Size: 124 KB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 78 - Forks: 24

davinwang/C2TutorialsGo

This is a tutorial written for Caffe2 which mimics Google's AlphaGo Fan and AlphaGo Zero.

Language: Jupyter Notebook - Size: 1.88 MB - Last synced: 2 months ago - Pushed: over 5 years ago - Stars: 8 - Forks: 3

water-vapor/AlphaZero

A replication of Alpha(Go) Zero.

Language: Python - Size: 748 KB - Last synced: about 1 year ago - Pushed: almost 5 years ago - Stars: 6 - Forks: 2

plkmo/AlphaZero_Connect4

PyTorch implementation of AlphaZero Connect from scratch (with results)

Language: Python - Size: 39 MB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 74 - Forks: 36

anudeep22003/alpha-go-zero

Personal implementation of deep learning algorithms in the RL space to learn.

Language: Python - Size: 122 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 0 - Forks: 0

pie972/Adversarial-Search

Research Paper about adversarial search

Language: Python - Size: 1.7 MB - Last synced: over 1 year ago - Pushed: about 2 years ago - Stars: 3 - Forks: 1

ChenDarYen/gobang_drl

A Gobang AI model based on AlphaGo Zero's algorithm, combining DRL with MCTS. The model is designed and trained in PyTorch and used in C++ (Qt).

Language: C++ - Size: 56.5 MB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 2 - Forks: 0

BIGBALLON/Toward-AGZ

Materials for AlphaGo

Size: 2.93 KB - Last synced: about 1 year ago - Pushed: about 2 years ago - Stars: 3 - Forks: 0

kekmodel/gym-tictactoe-zero

Tic Tac Toe with Alpha Zero method - My first work

Language: Python - Size: 50.3 MB - Last synced: about 1 year ago - Pushed: almost 6 years ago - Stars: 13 - Forks: 5

xuzijian629/combopt-zero

A reinforcement learning based solver for combinatorial problems

Language: C++ - Size: 1.55 MB - Last synced: about 1 year ago - Pushed: almost 2 years ago - Stars: 36 - Forks: 9

kekmodel/mcts-omok

Omok using MCTS (UCT, PUCT)

Language: Python - Size: 251 KB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 1

petosa/simple-alpha-zero

Clean, tested, & modular AlphaZero implementation with multiplayer support.

Language: Python - Size: 133 MB - Last synced: over 1 year ago - Pushed: about 5 years ago - Stars: 7 - Forks: 2

fffasttime/Gmk0

Learning 15x15 gomoku from zero!

Language: C++ - Size: 1.54 MB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 12 - Forks: 6

neoyung/connect-4

A reinforcement learning agent trained without prior human knowledge

Language: Jupyter Notebook - Size: 1.25 MB - Last synced: over 1 year ago - Pushed: about 4 years ago - Stars: 3 - Forks: 3

RubenBranco/SokobanAlphaGo

AlphaGo Zero Reinforcement Learning Sokoban Solver

Language: Python - Size: 23.4 KB - Last synced: over 1 year ago - Pushed: almost 6 years ago - Stars: 8 - Forks: 3

flixpar/AlphaTSP

AlphaGo inspired TSP Heuristic Solver

Language: Jupyter Notebook - Size: 1010 KB - Last synced: over 1 year ago - Pushed: over 4 years ago - Stars: 11 - Forks: 4

ankursharma-iitd/AlphaZero-for-Go

Implementation of Alpha Go Zero - Reinforcement Learning Project, COL870 @iit-delhi

Language: Python - Size: 513 KB - Last synced: about 1 year ago - Pushed: over 1 year ago - Stars: 1 - Forks: 1

kirarpit/connect4

Solving board games like Connect4 using Deep Reinforcement Learning

Language: Python - Size: 497 KB - Last synced: over 1 year ago - Pushed: over 1 year ago - Stars: 33 - Forks: 3

Jhyeok-lee/alphago

Apply AlphagoZero Algorithms to Gomoku

Language: Python - Size: 1.04 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 4 - Forks: 1

tkhkaeio/AlphaZero

I researched and explained the AlphaGo/AlphaGo Zero papers (AlphaGo beat world champion Go players in 2016 and 2017). In particular, I applied the AlphaZero algorithm to Othello to grasp the whole idea.

Language: Python - Size: 8.55 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 5 - Forks: 0

navreeetkaur/AlphaGoZero

Implementation of Alpha Go Zero - Reinforcement Learning Project, COL870 @iit-delhi

Language: Python - Size: 341 MB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 0

kinwo/deeprl-tennis-competition

Learning to play tennis from scratch with AlphaGo Zero style self-play using DDPG

Language: HTML - Size: 1.55 MB - Last synced: about 1 year ago - Pushed: over 5 years ago - Stars: 4 - Forks: 0

TTitcombe/AlphaDraughts

A PyTorch implementation of AlphaGo Zero applied to Draughts

Language: Python - Size: 49.8 KB - Last synced: about 1 year ago - Pushed: over 4 years ago - Stars: 3 - Forks: 2

airaria/AlphaZero_Gomoku_WuZiQi

My implementation of AlphaZero for gomoku (Wu Zi Qi, 五子棋); Poorman's AlphaZero

Language: Python - Size: 10 MB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 9 - Forks: 0

sharpobject/spender

It's splendor

Language: Lua - Size: 71.3 KB - Last synced: about 1 year ago - Pushed: about 5 years ago - Stars: 4 - Forks: 0

zhixiangli/alphagomoku-zero

An implementation of the AlphaZero algorithm for Gomoku (Gobang / Five Chess)

Language: Python - Size: 23.4 KB - Last synced: over 1 year ago - Pushed: about 6 years ago - Stars: 5 - Forks: 3

mariochampion/DeepReinforcementLearning (fork of AppliedDataSciencePartners/DeepReinforcementLearning)

A replica of the AlphaZero methodology for deep reinforcement learning in Python

Language: Jupyter Notebook - Size: 2.77 MB - Last synced: about 1 year ago - Pushed: about 6 years ago - Stars: 1 - Forks: 0

Related Keywords

alphago-zero (64), reinforcement-learning (38), alphago (29), alphazero (25), mcts (20), deep-learning (16), monte-carlo-tree-search (14), tensorflow (13), pytorch (12), keras (11), gomoku (11), alphagozero (9), self-play (7), deep-reinforcement-learning (7), alpha-zero (7), machine-learning (7), python (6), artificial-intelligence (6), gobang (5), tic-tac-toe (4), reversi (4), deep-neural-networks (4), othello (4), neural-networks (4), resnet (4), connect-four (3), policy-gradient (3), chess (3), self-learning (3), convolutional-neural-networks (3), cpp (3), board-game (3), tictactoe (3), neural-network (3), reinforcement-learning-algorithms (3), go (3), connect4 (3), deepmind (3), ddpg (2), dqn (2), q-learning (2), splendor (2), checkers (2), draughts (2), sokoban (2), monte-carlo (2), alpha-beta-pruning (2), chinese-chess (2), deep-q-network (2), openai-gym (2), game (2), openai (2), rl (2), game-playing-agent (2), libtorch (2), deeplearning (2), gym (1), muzero (1), deep (1), qt5 (1), deep-neural-network (1), trees (1), tree-search (1), tree (1), research-paper (1), ddpg-agent (1), ddpg-algorithm (1), monterey-hackintosh (1), montecarlo-simulation (1), reinforcement-learning-agent (1), checkers-reinforcement-learning (1), montecarlo (1), monte-carlo-simulation (1), monte-carlo-sampling (1), monte-carlo-methods (1), gomuku (1), alphago-master (1), algorithm-stages (1), algorithm (1), ai (1), caffe2 (1), caffe (1), gameplay (1), experience-replay (1), multiplayer (1), traveling-salesman-problem (1), uct (1), travelling-salesman-problem (1), puct (1), tsp (1), omok (1), sokoban-game (1), vertex-cover (1), tsp-heuristic (1), sokoban-solver (1), a3c-agent (1), async-dqn (1), connect4-game (1), double-dqn (1), dueling-dqn (1)