Topic: "deepmind"
google-deepmind/pysc2
StarCraft II Learning Environment
Language: Python - Size: 4.36 MB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 8,109 - Forks: 1,160

andri27-ts/Reinforcement-Learning
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Language: Jupyter Notebook - Size: 10 MB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 4,309 - Forks: 628

chris-chris/pysc2-examples
StarCraft II - pysc2 Deep Reinforcement Learning Examples
Language: Python - Size: 4.96 MB - Last synced at: 23 days ago - Pushed at: about 4 years ago - Stars: 755 - Forks: 354

Toni-SM/skrl
Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab
Language: Python - Size: 7.52 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 744 - Forks: 81

inoryy/reaver
Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.
Language: Python - Size: 291 KB - Last synced at: 23 days ago - Pushed at: over 4 years ago - Stars: 557 - Forks: 89

anantzoid/Conditional-PixelCNN-decoder
Tensorflow implementation of Gated Conditional Pixel Convolutional Neural Network
Language: Python - Size: 2.5 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 482 - Forks: 83

Robinwho/Deep-Learning
A collection of deep learning, artificial intelligence, and machine learning resources, continuously updated.
Size: 1.21 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 390 - Forks: 132

SkyWorkAIGC/SkyCode-AI-CodeX-GPT3
SkyCode is a multilingual open-source large programming model that adopts the GPT3 model structure. It supports Java, JavaScript, C, C++, Python, Go, shell, and other mainstream programming languages, and can understand Chinese comments. The model can complete code and has strong problem-solving ability, freeing you from programming so you can focus on more important problems.
Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 388 - Forks: 21

google-deepmind/spriteworld
Spriteworld: a flexible, configurable python-based reinforcement learning environment
Language: Python - Size: 446 KB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 367 - Forks: 54

vballoli/nfnets-pytorch
NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/
Language: Python - Size: 5.63 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 345 - Forks: 29

yhyu13/AlphaGOZero-python-tensorflow
Congratulations to DeepMind! This is a reengineered implementation (based on many other git repos in /support/) of DeepMind's Oct 19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository is for educational purposes only.)
Language: Python - Size: 185 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 341 - Forks: 113

minqi/learning-to-communicate-pytorch
Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch
Language: Python - Size: 85 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 336 - Forks: 73

wohlert/generative-query-network-pytorch
Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"
Language: Jupyter Notebook - Size: 43.1 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 322 - Forks: 63

google-deepmind/multi_object_datasets
Multi-object image datasets with ground-truth segmentation masks and generative factors.
Language: Python - Size: 2.92 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 248 - Forks: 24

L0SG/relational-rnn-pytorch
An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.
Language: Python - Size: 4.49 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 245 - Forks: 35

tayalmanan28/MuJoCo-Tutorial
Tutorial on how to get started with MuJoCo Simulation Platform. MuJoCo stands for Multi-Joint dynamics with Contact. It was acquired and made freely available by DeepMind in October 2021, and open sourced in May 2022. Feel free to contribute. Show your support by ✨this repository.
Language: Jupyter Notebook - Size: 31.4 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 217 - Forks: 32

denisyarats/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Language: Python - Size: 27.3 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 190 - Forks: 58

MrGemy95/visual-interaction-networks-pytorch
An implementation of DeepMind's Visual Interaction Networks paper using PyTorch
Language: Python - Size: 6.01 MB - Last synced at: 22 days ago - Pushed at: over 7 years ago - Stars: 166 - Forks: 24

MattKleinsmith/pbt
Population Based Training (in PyTorch with sqlite3). Status: Unsupported
Language: Python - Size: 120 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 166 - Forks: 25

benjs/nfnets_pytorch
Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".
Language: Python - Size: 3.54 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 160 - Forks: 13

Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind
🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉
Language: Jupyter Notebook - Size: 54.4 MB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 154 - Forks: 45

iShohei220/torch-gqn
PyTorch Implementation of Generative Query Network
Language: Python - Size: 53.7 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 135 - Forks: 30

jsikyoon/visual-interaction-networks_tensorflow
Tensorflow Implementation of Visual Interaction Networks
Language: Python - Size: 616 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 132 - Forks: 30

SoyGema/Startcraft_pysc2_minigames
StarCraft II machine learning research with DeepMind's pysc2 Python library: mini-games and agents.
Language: Python - Size: 25 MB - Last synced at: 7 days ago - Pushed at: about 6 years ago - Stars: 132 - Forks: 14

bharathgs/NALU
Basic PyTorch implementation of NAC/NALU from the Neural Arithmetic Logic Units paper by Trask et al.
Language: Python - Size: 168 KB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 115 - Forks: 21

manyoso/allie
Allie: A UCI compliant chess engine
Language: C++ - Size: 700 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 21

jsikyoon/pathnet
Tensorflow Implementation of PathNet: Evolution Channels Gradient Descent in Super Neural Networks
Language: Python - Size: 1.65 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 101 - Forks: 24

gbaptista/gemini-ai
A Ruby Gem for interacting with Gemini through Vertex AI, Generative Language API, or AI Studio, Google's generative AI services.
Language: Ruby - Size: 182 KB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 97 - Forks: 21

Zeta36/muzero
A simple implementation of MuZero algorithm for connect4 game
Language: Jupyter Notebook - Size: 59.6 KB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 97 - Forks: 20

shivamsaboo17/Overcoming-Catastrophic-forgetting-in-Neural-Networks
Elastic weight consolidation technique for incremental learning.
Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 92 - Forks: 21

navneet-nmk/Pytorch-RL-CPP
A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)
Language: C++ - Size: 15.5 MB - Last synced at: 6 months ago - Pushed at: almost 6 years ago - Stars: 91 - Forks: 18

blanyal/alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.
Language: Python - Size: 124 KB - Last synced at: 26 days ago - Pushed at: about 7 years ago - Stars: 88 - Forks: 28

rohitrango/objects-that-sound
Unofficial Implementation of Google Deepmind's paper `Objects that Sound`
Language: Python - Size: 57 MB - Last synced at: 12 days ago - Pushed at: almost 7 years ago - Stars: 83 - Forks: 16

azminewasi/online-ml-university
A curated list of FREE courses available online from top universities of the world on CS-DS-ML!
Size: 172 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 74 - Forks: 24

KokoMind/Recurrent-Environment-Simulators
Deepmind Recurrent Environment Simulators paper implementation in tensorflow
Language: Python - Size: 172 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 72 - Forks: 8

henry-prior/jax-rl
JAX implementations of core Deep RL algorithms
Language: Python - Size: 354 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 62 - Forks: 9

mtrazzi/two-step-task
Implementation of the two-step-task as described in "Prefrontal cortex as a meta-reinforcement learning system" and "Learning to Reinforcement Learn".
Language: Jupyter Notebook - Size: 8.71 MB - Last synced at: 24 days ago - Pushed at: about 6 years ago - Stars: 58 - Forks: 13

chris-chris/haiku-scalable-example
Scalable distributed reinforcement learning agents on kubernetes
Language: Python - Size: 303 KB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 57 - Forks: 4

mjpyeon/wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 57 - Forks: 11

miyosuda/scan
SCAN: Learning Abstract Hierarchical Compositional Visual Concepts
Language: Python - Size: 40.8 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 54 - Forks: 7

jihoonerd/Deep-Reinforcement-Learning-with-Double-Q-learning
📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️
Language: Python - Size: 18.7 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 52 - Forks: 16

FLAIROx/jafar
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
Language: Python - Size: 102 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 50 - Forks: 3

Sohojoe/MujocoUnity
Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).
Language: C# - Size: 44.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 50 - Forks: 6

siddk/relation-network
Tensorflow Implementation of Relation Networks for the bAbI QA Task, detailed in "A Simple Neural Network Module for Relational Reasoning," [https://arxiv.org/abs/1706.01427] by Santoro et al.
Language: Python - Size: 3.42 MB - Last synced at: 29 days ago - Pushed at: almost 8 years ago - Stars: 49 - Forks: 15

jihoonerd/Human-level-control-through-deep-reinforcement-learning
📖 Paper: Human-level control through deep reinforcement learning 🕹️
Language: Python - Size: 21.4 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 47 - Forks: 8

woctezuma/deep-learning-resources
Books, courses, videos and blogs, mostly about Deep Learning.
Size: 251 KB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 42 - Forks: 5

shivamsaboo17/Neural-Scene-Representation-and-Rendering
Generative Query Network for rendering 3D scenes from 2D images
Language: Python - Size: 7.32 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 40 - Forks: 9

masterai-top/The-strongest-AI-in-Texas-Hold-em-1-to-1
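Texas Hold'em AI. MasterAI is a poker AI dedicated to supporting n-player (single- or multi-agent) Texas Hold'em imperfect-information games. MasterAI v2.0 is an iterative algorithm derived from MasterAI v1.0; it applies general reinforcement learning plus search to imperfect-information games and achieves superhuman performance in heads-up no-limit Texas Hold'em. AI source code for sale. Tg: @xuzongbin001; E-mail: [email protected]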
Language: C++ - Size: 3.72 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 36 - Forks: 9

Sohojoe/ActiveRagdollAssaultCourse
Research into Assault Course for training Active Ragdolls (using MujocoUnity+ml_agents)
Language: C# - Size: 123 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 36 - Forks: 5

Sohojoe/ActiveRagdollControllers
Research into controllers for 2d and 3d Active Ragdolls (using MujocoUnity+ml_agents)
Language: C# - Size: 115 MB - Last synced at: 11 days ago - Pushed at: over 6 years ago - Stars: 34 - Forks: 3

mvrahden/reinforce-js
[INACTIVE] A collection of various machine learning solvers. The library takes an object-oriented approach (built with TypeScript) and tries to deliver simplified interfaces that make the algorithms easy to use.
Language: TypeScript - Size: 169 KB - Last synced at: 4 days ago - Pushed at: almost 7 years ago - Stars: 31 - Forks: 7

hoangthang1607/nfnets-Tensorflow-2
Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".
Language: Jupyter Notebook - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 12

R-Stefano/Grid-Cells
Implementation of Vector Based Navigation using Grid-like cells using Tensorflow and Numpy
Language: Python - Size: 184 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 28 - Forks: 7

benjs/DCTransformer-PyTorch 📦
Unofficial PyTorch implementation of the paper "Generating images with sparse representations"
Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 28 - Forks: 2

grananqvist/NALU-tf
Tensorflow implementation of Neural Arithmetic Logic Unit, Trask et al.
Language: Python - Size: 3.91 KB - Last synced at: 9 days ago - Pushed at: almost 7 years ago - Stars: 28 - Forks: 3

greydanus/dnc
Differentiable Neural Computer in TensorFlow
Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 28 - Forks: 11

BKHMSI/Meta-RL-TwoStep-Task
PyTorch implementation of Episodic Meta Reinforcement Learning on variants of the "Two-Step" task. Reproduces the results found in three papers. Check the ReadMe for more details!
Language: Python - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 2

Nilabhra/NALU
Neural Arithmetic Logic Units
Language: Jupyter Notebook - Size: 68.4 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 22 - Forks: 6

chiamp/muzero-cartpole
Applying DeepMind's MuZero algorithm to the cart pole environment in gym
Language: Python - Size: 70.2 MB - Last synced at: 24 days ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 1

deeptexas-ai/The-strongest-AI-in-Texas-Hold-em-unlimited-Texas-Hold-em-1-vs.-1
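The strongest Texas Hold'em AI: a heads-up (1 vs. 1) Texas Hold'em AI that can beat top human professional players. The complete AI source code and trained models are for sale. Contact via Telegram: @xuzongbin001 or E-mail: [email protected]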
Language: C++ - Size: 4.31 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 19 - Forks: 4

tiagoCuervo/CommonsGame
An OpenAI gym multi-agent environment implementing the Commons Game proposed in "A multi-agent reinforcement learning model of common-pool resource appropriation"
Language: Python - Size: 371 KB - Last synced at: 8 months ago - Pushed at: almost 5 years ago - Stars: 18 - Forks: 4

crunchiness/lernd
Lernd is a ∂ILP (dILP) framework implementation based on DeepMind's paper "Learning Explanatory Rules from Noisy Data".
Language: Python - Size: 162 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 4

SergioIommi/DQN-2048
Deep Reinforcement Learning to Play 2048 (with Keras)
Language: Python - Size: 20.6 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 3

gautam1858/Neural-Arithmetic-Logic-Units
MXNet Implementation of DeepMind's Neural Arithmetic Logic Units (NALU)
Language: Python - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 2

isaaccorley/detcon-pytorch
PyTorch implementation of DeepMind's DetCon from "Efficient Visual Pretraining with Contrastive Detection" Henaff et al. (ICCV 2021)
Language: Python - Size: 419 KB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 4

PoCInnovation/Open-Zero
Open-zero is a research project aiming to realize the various projects of the company DeepMind
Language: Python - Size: 575 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 0

astier/model-free-episodic-control
Model-Free-Episodic-Control implementation.
Language: Python - Size: 46.6 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 4

dtreai/Griffin-Jax
Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
Language: Python - Size: 53.7 KB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 13 - Forks: 0

tigerneil/reinforcementlearning.today
Made for a reading group at the Center for Safe AGI.
Size: 94.7 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 2

amirhossein-hkh/pong-dqn
RL Agent for Atari Game Pong
Language: Jupyter Notebook - Size: 12 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 7

sayakpaul/NALU
Neural Arithmetic Logic Units by Trask et al.
Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 0

seungjaeryanlee/playing-hard-exploration-games-by-watching-youtube
[WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)
Language: Jupyter Notebook - Size: 497 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 3

chiamp/fast-reinforcement-learning
Implementing DeepMind's Fast Reinforcement Learning paper, and adding additional features to generalize the algorithms
Language: Python - Size: 4.37 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 0

thundergolfer/reasoning-about-entailment-tensorflow
🏫 Tensorflow implementation of "Reasoning About Entailment with Neural Attention"
Language: Python - Size: 49.8 KB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 5

nerdimite/ntm
A PyTorch Implementation of Neural Turing Machine
Language: Python - Size: 255 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 4

angusfung/pbt-gan
Applying Population Based Training on Generative Adversarial Networks.
Language: Python - Size: 3.57 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 10 - Forks: 2

epignatelli/human-level-control-through-deep-reinforcement-learning
A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. nature, 518(7540), pp.529-533.
Language: Python - Size: 81.1 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 4

batuhan-ince/RL_pysc2
DRL algorithms for Starcraft II
Language: Python - Size: 5.1 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 2

soskek/interval-bound-propagation-chainer
Sven Gowal et al., Scalable Verified Training for Provably Robust Image Classification, ICCV 2019
Language: Jupyter Notebook - Size: 246 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 2

0xNineteen/hyper-alpha-zero
hyper optimized alpha zero implementation to play gomoku (distributed training with ray, mcts with cython)
Language: Python - Size: 864 KB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

semanticweights/tarok
♠️ Slovenian Tarok card game environment for the OpenSpiel framework.
Language: C++ - Size: 278 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

BKHMSI/Meta-RL-Harlow
PyTorch implementation of two variants of the Harlow visual fixation task (PsychLab and 1D version). Reproduces the results found in two papers. Check the ReadMe for more details!
Language: Python - Size: 30.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 1

AnatoliiPotapov/MNIST-EWC
Implementation of the EWC weight constraint described in a DeepMind paper: http://www.pnas.org/content/early/2017/03/13/1611835114.full.pdf
Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 3

rkc007/AlphaGo-Zero-Implementation-Using-Reinforcement-Learning
This is my implementation of DeepMind's AlphaZero algorithm for the game of Go
Language: Python - Size: 50.8 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

turiphro/deeplearning
Deep Learning track / hackdays
Language: Python - Size: 10.4 MB - Last synced at: 28 days ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 3

Charmve/PuppyGo
vision language model and large language model powered embodied robot
Size: 11.7 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

mwufi/pytorch-gqn
Neural Scene Rendering! With Helpful Comments and a video ->
Language: Jupyter Notebook - Size: 2.58 MB - Last synced at: 25 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

prabhatnagarajan/dqn
This repository contains a python implementation of a Deep Q-Network (DQN) for Atari gameplay using tensorflow.
Language: Python - Size: 84.4 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 1

peter-can-write/david-silver-rl-notes
Notes about David Silver's course on Reinforcement Learning
Size: 36.1 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 3

wtlow003/speculative-sampling
Implementation of Speculative Sampling in "Accelerating Large Language Model Decoding with Speculative Sampling"
Language: Python - Size: 30.3 KB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 5 - Forks: 1

Akella17/Beta-VAE
To learn and reason like humans, AI must first learn to factorise interpretable representations of independent data generative factors (preferably in an unsupervised manner!!). What does all this mean? Go through this tutorial to get an overview of disentanglement in the context of unsupervised visual disentangled representation learning.
Language: Jupyter Notebook - Size: 6.87 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 0

vickypandey14/Gemini-PHP
I've developed Gemini-PHP, a PHP application designed to work smoothly with the Gemini API. This tool makes it easy for users to create content simply by providing their prompts.
Language: Hack - Size: 669 KB - Last synced at: 27 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

imgeorgiev/dmc2gymnasium
Gymnasium integration for the DeepMind Control (DMC) suite
Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

RoboticsDesignLab/jitterbug
A Jitterbug dm_control Reinforcement Learning domain
Language: Jupyter Notebook - Size: 44.4 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

AI-Huang/WaveNet
Keras and PyTorch implementations for Google's WaveNet
Language: Python - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

jinmang2/Awesome-Papers
❄️ Papers I am interested in, with reviews :)
Language: HTML - Size: 810 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

R-Stefano/DQN
Implementation of Deep Q-Network (DQN) on OpenAI games: Pong and Breakout using Tensorflow and Numpy
Language: Python - Size: 95.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 3

adiyadav123/Gemini-Chat
Gemini Chat is an AI chatbot website created by Aditya Yadav which uses Google's Gemini. It is designed to be informative, engaging, and helpful. Gemini can answer questions on a wide range of topics, including general knowledge, current events, and entertainment. It can also provide weather updates, sports scores, and other useful information.
Language: JavaScript - Size: 321 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

kochlisGit/Shadow-Hand-Controller
Construction of controllers for Shadow-Hand in Mujoco environment, using Deep Learning. 2 Different methods were used to create the controllers: a) Behavioral Cloning b) Deep Reinforcement Learning
Language: Python - Size: 11.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

Neo-47/Atari-DQN
The code for the famous DQN paper applied on Atari's Breakout.
Language: Python - Size: 52.5 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 4
