An open API service providing repository metadata for many open source software ecosystems.

Topic: "deepmind"

google-deepmind/pysc2

StarCraft II Learning Environment

Language: Python - Size: 4.36 MB - Last synced at: 18 days ago - Pushed at: 10 months ago - Stars: 8,109 - Forks: 1,160

andri27-ts/Reinforcement-Learning

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

Language: Jupyter Notebook - Size: 10 MB - Last synced at: 22 days ago - Pushed at: almost 5 years ago - Stars: 4,309 - Forks: 628

chris-chris/pysc2-examples

StarCraft II - pysc2 Deep Reinforcement Learning Examples

Language: Python - Size: 4.96 MB - Last synced at: 23 days ago - Pushed at: about 4 years ago - Stars: 755 - Forks: 354

Toni-SM/skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Omniverse Isaac Gym and Isaac Lab

Language: Python - Size: 7.52 MB - Last synced at: 5 days ago - Pushed at: 15 days ago - Stars: 744 - Forks: 81

inoryy/reaver

Reaver: Modular Deep Reinforcement Learning Framework. Focused on StarCraft II. Supports Gym, Atari, and MuJoCo.

Language: Python - Size: 291 KB - Last synced at: 23 days ago - Pushed at: over 4 years ago - Stars: 557 - Forks: 89

anantzoid/Conditional-PixelCNN-decoder

Tensorflow implementation of Gated Conditional Pixel Convolutional Neural Network

Language: Python - Size: 2.5 MB - Last synced at: about 1 month ago - Pushed at: almost 7 years ago - Stars: 482 - Forks: 83

Robinwho/Deep-Learning

深度学习/人工智能/机器学习资料汇总(Deep Learning/Artificial Intelligent/Machine Learning) 持续更新……

Size: 1.21 MB - Last synced at: about 1 year ago - Pushed at: about 4 years ago - Stars: 390 - Forks: 132

SkyWorkAIGC/SkyCode-AI-CodeX-GPT3

SkyCode是一个多语言开源编程大模型,采用GPT3模型结构,支持Java, JavaScript, C, C++, Python, Go, shell等多种主流编程语言,并能理解中文注释。模型可以对代码进行补全,拥有强大解题能力,使您从编程中解放出来,专心于解决更重要的问题。| SkyCode is an open source programming model, which adopts the GPT3 model structure. It supports Java, JavaScript, C, C++, Python, Go, shell and other languages, and can understand Chinese comments.

Size: 34.2 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 388 - Forks: 21

google-deepmind/spriteworld

Spriteworld: a flexible, configurable python-based reinforcement learning environment

Language: Python - Size: 446 KB - Last synced at: 12 months ago - Pushed at: almost 5 years ago - Stars: 367 - Forks: 54

vballoli/nfnets-pytorch

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch. Find explanation at tourdeml.github.io/blog/

Language: Python - Size: 5.63 MB - Last synced at: 27 days ago - Pushed at: over 1 year ago - Stars: 345 - Forks: 29

yhyu13/AlphaGOZero-python-tensorflow

Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)

Language: Python - Size: 185 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 341 - Forks: 113

minqi/learning-to-communicate-pytorch

Learning to Communicate with Deep Multi-Agent Reinforcement Learning in PyTorch

Language: Python - Size: 85 KB - Last synced at: about 1 year ago - Pushed at: about 6 years ago - Stars: 336 - Forks: 73

wohlert/generative-query-network-pytorch

Generative Query Network (GQN) in PyTorch as described in "Neural Scene Representation and Rendering"

Language: Jupyter Notebook - Size: 43.1 MB - Last synced at: 5 months ago - Pushed at: almost 6 years ago - Stars: 322 - Forks: 63

google-deepmind/multi_object_datasets

Multi-object image datasets with ground-truth segmentation masks and generative factors.

Language: Python - Size: 2.92 MB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 248 - Forks: 24

L0SG/relational-rnn-pytorch

An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.

Language: Python - Size: 4.49 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 245 - Forks: 35

tayalmanan28/MuJoCo-Tutorial

Tutorial on how to get started with MuJoCo Simulation Platform. MuJoCo stands for Multi-Joint dynamics with Contact. It was acquired and made freely available by DeepMind in October 2021, and open sourced in May 2022. Feel free to contribute. Show your support by ✨this repository.

Language: Jupyter Notebook - Size: 31.4 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 217 - Forks: 32

denisyarats/dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language: Python - Size: 27.3 KB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 190 - Forks: 58

MrGemy95/visual-interaction-networks-pytorch

This's an implementation of deepmind Visual Interaction Networks paper using pytorch

Language: Python - Size: 6.01 MB - Last synced at: 22 days ago - Pushed at: over 7 years ago - Stars: 166 - Forks: 24

MattKleinsmith/pbt

Population Based Training (in PyTorch with sqlite3). Status: Unsupported

Language: Python - Size: 120 KB - Last synced at: about 2 months ago - Pushed at: over 7 years ago - Stars: 166 - Forks: 25

benjs/nfnets_pytorch

Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".

Language: Python - Size: 3.54 MB - Last synced at: 5 months ago - Pushed at: about 4 years ago - Stars: 160 - Forks: 13

Zhenye-Na/advanced-deep-learning-and-reinforcement-learning-deepmind

🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉

Language: Jupyter Notebook - Size: 54.4 MB - Last synced at: 4 days ago - Pushed at: over 5 years ago - Stars: 154 - Forks: 45

iShohei220/torch-gqn

PyTorch Implementation of Generative Query Network

Language: Python - Size: 53.7 KB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 135 - Forks: 30

jsikyoon/visual-interaction-networks_tensorflow

Tensorflow Implementation of Visual Interaction Networks

Language: Python - Size: 616 KB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 132 - Forks: 30

SoyGema/Startcraft_pysc2_minigames

Startcraft II Machine Learning research with DeepMind pysc2 python library .mini-games and agents.

Language: Python - Size: 25 MB - Last synced at: 7 days ago - Pushed at: about 6 years ago - Stars: 132 - Forks: 14

bharathgs/NALU

Basic pytorch implementation of NAC/NALU from Neural Arithmetic Logic Units paper by trask et.al

Language: Python - Size: 168 KB - Last synced at: 12 days ago - Pushed at: over 6 years ago - Stars: 115 - Forks: 21

manyoso/allie

Allie: A UCI compliant chess engine

Language: C++ - Size: 700 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 105 - Forks: 21

jsikyoon/pathnet

Tensorflow Implementation of PathNet: Evolution Channels Gradient Descent in Super Neural Networks

Language: Python - Size: 1.65 MB - Last synced at: over 1 year ago - Pushed at: almost 8 years ago - Stars: 101 - Forks: 24

gbaptista/gemini-ai

A Ruby Gem for interacting with Gemini through Vertex AI, Generative Language API, or AI Studio, Google's generative AI services.

Language: Ruby - Size: 182 KB - Last synced at: 7 months ago - Pushed at: 10 months ago - Stars: 97 - Forks: 21

Zeta36/muzero

A simple implementation of MuZero algorithm for connect4 game

Language: Jupyter Notebook - Size: 59.6 KB - Last synced at: 7 days ago - Pushed at: over 4 years ago - Stars: 97 - Forks: 20

shivamsaboo17/Overcoming-Catastrophic-forgetting-in-Neural-Networks

Elastic weight consolidation technique for incremental learning.

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 92 - Forks: 21

navneet-nmk/Pytorch-RL-CPP

A Repository with C++ implementations of Reinforcement Learning Algorithms (Pytorch)

Language: C++ - Size: 15.5 MB - Last synced at: 6 months ago - Pushed at: almost 6 years ago - Stars: 91 - Forks: 18

blanyal/alpha-zero

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

Language: Python - Size: 124 KB - Last synced at: 26 days ago - Pushed at: about 7 years ago - Stars: 88 - Forks: 28

rohitrango/objects-that-sound

Unofficial Implementation of Google Deepmind's paper `Objects that Sound`

Language: Python - Size: 57 MB - Last synced at: 12 days ago - Pushed at: almost 7 years ago - Stars: 83 - Forks: 16

azminewasi/online-ml-university

A curated list of FREE courses available online from top universities of the world on CS-DS-ML!

Size: 172 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 74 - Forks: 24

KokoMind/Recurrent-Environment-Simulators

Deepmind Recurrent Environment Simulators paper implementation in tensorflow

Language: Python - Size: 172 KB - Last synced at: about 2 years ago - Pushed at: over 7 years ago - Stars: 72 - Forks: 8

henry-prior/jax-rl

JAX implementations of core Deep RL algorithms

Language: Python - Size: 354 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 62 - Forks: 9

mtrazzi/two-step-task

Implementation of the two-step-task as described in "Prefrontal cortex as a meta-reinforcement learning system" and "Learning to Reinforcement Learn".

Language: Jupyter Notebook - Size: 8.71 MB - Last synced at: 24 days ago - Pushed at: about 6 years ago - Stars: 58 - Forks: 13

chris-chris/haiku-scalable-example

Scalable distributed reinforcement learning agents on kubernetes

Language: Python - Size: 303 KB - Last synced at: 16 days ago - Pushed at: almost 2 years ago - Stars: 57 - Forks: 4

mjpyeon/wavenet-classifier

Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

Language: Python - Size: 12.7 KB - Last synced at: about 2 years ago - Pushed at: about 6 years ago - Stars: 57 - Forks: 11

miyosuda/scan

SCAN: Learning Abstract Hierarchical Compositional Visual Concepts

Language: Python - Size: 40.8 MB - Last synced at: about 1 year ago - Pushed at: over 7 years ago - Stars: 54 - Forks: 7

jihoonerd/Deep-Reinforcement-Learning-with-Double-Q-learning

📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️

Language: Python - Size: 18.7 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 52 - Forks: 16

FLAIROx/jafar

JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"

Language: Python - Size: 102 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 50 - Forks: 3

Sohojoe/MujocoUnity

Reproducing MuJoCo benchmarks in a modern, commercial game /physics engine (Unity + PhysX).

Language: C# - Size: 44.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 50 - Forks: 6

siddk/relation-network

Tensorflow Implementation of Relation Networks for the bAbI QA Task, detailed in "A Simple Neural Network Module for Relational Reasoning," [https://arxiv.org/abs/1706.01427] by Santoro et. al.

Language: Python - Size: 3.42 MB - Last synced at: 29 days ago - Pushed at: almost 8 years ago - Stars: 49 - Forks: 15

jihoonerd/Human-level-control-through-deep-reinforcement-learning

📖 Paper: Human-level control through deep reinforcement learning 🕹️

Language: Python - Size: 21.4 MB - Last synced at: 23 days ago - Pushed at: 12 months ago - Stars: 47 - Forks: 8

woctezuma/deep-learning-resources

Books, courses, videos and blogs, mostly about Deep Learning.

Size: 251 KB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 42 - Forks: 5

shivamsaboo17/Neural-Scene-Representation-and-Rendering

Generative Query Network for rendering 3D scenes from 2D images

Language: Python - Size: 7.32 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 40 - Forks: 9

masterai-top/The-strongest-AI-in-Texas-Hold-em-1-to-1

德州AI,MasterAI is an AI poker dedicated to suport n-play (single- or multi-agent) Texas Hold'em imperfect-informatin games.。MasterAI v2.0是从MasterAI v1.0衍生出来的迭代算法,它在非完全信息游戏中利用了通用的强化学习+搜索,并在一对一无限押注的德州扑克中实现了超人的表现。AI源码出售。Tg:@xuzongbin001;E-mail:[email protected]

Language: C++ - Size: 3.72 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 36 - Forks: 9

Sohojoe/ActiveRagdollAssaultCourse

Research into Assault Course for training Active Ragdolls (using MujocoUnity+ml_agents)

Language: C# - Size: 123 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 36 - Forks: 5

Sohojoe/ActiveRagdollControllers

Research into controllers for 2d and 3d Active Ragdolls (using MujocoUnity+ml_agents)

Language: C# - Size: 115 MB - Last synced at: 11 days ago - Pushed at: over 6 years ago - Stars: 34 - Forks: 3

mvrahden/reinforce-js

[INACTIVE] A collection of various machine learning solver. The library is an object-oriented approach (baked with Typescript) and tries to deliver simplified interfaces that make using the algorithms pretty simple.

Language: TypeScript - Size: 169 KB - Last synced at: 4 days ago - Pushed at: almost 7 years ago - Stars: 31 - Forks: 7

hoangthang1607/nfnets-Tensorflow-2

Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".

Language: Jupyter Notebook - Size: 160 KB - Last synced at: about 1 month ago - Pushed at: about 4 years ago - Stars: 30 - Forks: 12

R-Stefano/Grid-Cells

Implementation of Vector Based Navigation using Grid-like cells using Tensorflow and Numpy

Language: Python - Size: 184 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 28 - Forks: 7

benjs/DCTransformer-PyTorch 📦

Unofficial PyTorch implementation of the paper "Generating images with sparse representations"

Language: Python - Size: 15.6 KB - Last synced at: about 2 years ago - Pushed at: about 4 years ago - Stars: 28 - Forks: 2

grananqvist/NALU-tf

Tensorflow implementation of Neural Arithmetic Logic Unit, Trask et al.

Language: Python - Size: 3.91 KB - Last synced at: 9 days ago - Pushed at: almost 7 years ago - Stars: 28 - Forks: 3

greydanus/dnc

Differentiable Neural Computer in TensorFlow

Language: Jupyter Notebook - Size: 31.8 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 28 - Forks: 11

BKHMSI/Meta-RL-TwoStep-Task

PyTorch implementation of Episodic Meta Reinforcement Learning on variants of the "Two-Step" task. Reproduces the results found in three papers. Check the ReadMe for more details!

Language: Python - Size: 1.03 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 23 - Forks: 2

Nilabhra/NALU

Neural Arithmetic Logic Units

Language: Jupyter Notebook - Size: 68.4 KB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 22 - Forks: 6

chiamp/muzero-cartpole

Applying DeepMind's MuZero algorithm to the cart pole environment in gym

Language: Python - Size: 70.2 MB - Last synced at: 24 days ago - Pushed at: almost 2 years ago - Stars: 21 - Forks: 1

deeptexas-ai/The-strongest-AI-in-Texas-Hold-em-unlimited-Texas-Hold-em-1-vs.-1

德州扑克最强人工智能AI,1对1的德州AI,可以战胜人类顶尖职业牌手,先出售全套AI源代码和AI训练模型;Telegram联系: @xuzongbin001 或E-mail:[email protected]

Language: C++ - Size: 4.31 MB - Last synced at: 29 days ago - Pushed at: 3 months ago - Stars: 19 - Forks: 4

tiagoCuervo/CommonsGame

An OpenAI gym multi-agent environment implementing the Commons Game proposed in "A multi-agent reinforcement learning model of common-pool resource appropriation"

Language: Python - Size: 371 KB - Last synced at: 8 months ago - Pushed at: almost 5 years ago - Stars: 18 - Forks: 4

crunchiness/lernd

Lernd is ∂ILP (dILP) framework implementation based on Deepmind's paper Learning Explanatory Rules from Noisy Data.

Language: Python - Size: 162 KB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 17 - Forks: 4

SergioIommi/DQN-2048

Deep Reinforcement Learning to Play 2048 (with Keras)

Language: Python - Size: 20.6 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 3

gautam1858/Neural-Arithmetic-Logic-Units

MXNet Implementation of DeepMind's Neural Arithmetic Logic Units (NALU)

Language: Python - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: over 6 years ago - Stars: 17 - Forks: 2

isaaccorley/detcon-pytorch

PyTorch implementation of DeepMind's DetCon from "Efficient Visual Pretraining with Contrastive Detection" Henaff et al. (ICCV 2021)

Language: Python - Size: 419 KB - Last synced at: 23 days ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 4

PoCInnovation/Open-Zero

Open-zero is a research project aiming to realize the various projects of the company DeepMind

Language: Python - Size: 575 KB - Last synced at: about 1 year ago - Pushed at: over 3 years ago - Stars: 14 - Forks: 0

astier/model-free-episodic-control

Model-Free-Episodic-Control implementation.

Language: Python - Size: 46.6 MB - Last synced at: about 2 years ago - Pushed at: almost 6 years ago - Stars: 14 - Forks: 4

dtreai/Griffin-Jax

Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"

Language: Python - Size: 53.7 KB - Last synced at: 3 months ago - Pushed at: 12 months ago - Stars: 13 - Forks: 0

tigerneil/reinforcementlearning.today

Made for a reading group at the Center for Safe AGI.

Size: 94.7 KB - Last synced at: 17 days ago - Pushed at: over 2 years ago - Stars: 12 - Forks: 2

amirhossein-hkh/pong-dqn

RL Agent for Atari Game Pong

Language: Jupyter Notebook - Size: 12 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 12 - Forks: 7

sayakpaul/NALU

Neural Arithmetic Logic Units by Trask et al.

Language: Jupyter Notebook - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: about 6 years ago - Stars: 12 - Forks: 0

seungjaeryanlee/playing-hard-exploration-games-by-watching-youtube

[WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)

Language: Jupyter Notebook - Size: 497 KB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 12 - Forks: 3

chiamp/fast-reinforcement-learning

Implementing DeepMind's Fast Reinforcement Learning paper, and adding additional features to generalize the algorithms

Language: Python - Size: 4.37 MB - Last synced at: about 1 month ago - Pushed at: about 1 year ago - Stars: 10 - Forks: 0

thundergolfer/reasoning-about-entailment-tensorflow

:school: Tensorflow implementation of "Reasoning About Entailment with Neural Attention"

Language: Python - Size: 49.8 KB - Last synced at: 3 days ago - Pushed at: about 2 years ago - Stars: 10 - Forks: 5

nerdimite/ntm

A PyTorch Implementation of Neural Turing Machine

Language: Python - Size: 255 KB - Last synced at: about 1 year ago - Pushed at: almost 5 years ago - Stars: 10 - Forks: 4

angusfung/pbt-gan

Applying Population Based Training on Generative Adversarial Networks.

Language: Python - Size: 3.57 MB - Last synced at: about 1 month ago - Pushed at: over 6 years ago - Stars: 10 - Forks: 2

epignatelli/human-level-control-through-deep-reinforcement-learning

A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. nature, 518(7540), pp.529-533.

Language: Python - Size: 81.1 KB - Last synced at: about 1 month ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 4

batuhan-ince/RL_pysc2

DRL algorithms for Starcraft II

Language: Python - Size: 5.1 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 9 - Forks: 2

soskek/interval-bound-propagation-chainer

Sven Gowal et al., Scalable Verified Training for Provably Robust Image Classification, ICCV 2019

Language: Jupyter Notebook - Size: 246 KB - Last synced at: 21 days ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 2

0xNineteen/hyper-alpha-zero

hyper optimized alpha zero implementation to play gomoku (distributed training with ray, mcts with cython)

Language: Python - Size: 864 KB - Last synced at: 9 days ago - Pushed at: over 2 years ago - Stars: 8 - Forks: 0

semanticweights/tarok

:spades: Slovenian Tarok card game environment for the OpenSpiel framework.

Language: C++ - Size: 278 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 2

BKHMSI/Meta-RL-Harlow

PyTorch implementation of two variants of the Harlow visual fixation task (PsychLab and 1D version). Reproduces the results found in two papers. Check the ReadMe for more details!

Language: Python - Size: 30.3 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 8 - Forks: 1

AnatoliiPotapov/MNIST-EWC

Implementation of ews weight constraint mentioned in recent Deep Mind paper: http://www.pnas.org/content/early/2017/03/13/1611835114.full.pdf

Language: Jupyter Notebook - Size: 19.2 MB - Last synced at: about 2 years ago - Pushed at: about 8 years ago - Stars: 8 - Forks: 3

rkc007/AlphaGo-Zero-Implementation-Using-Reinforcement-Learning

This is my implementation of the DeepMind's AlphaZero algorithm for the Game of Go

Language: Python - Size: 50.8 KB - Last synced at: 13 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2

turiphro/deeplearning

Deep Learning track / hackdays

Language: Python - Size: 10.4 MB - Last synced at: 28 days ago - Pushed at: over 7 years ago - Stars: 7 - Forks: 3

Charmve/PuppyGo

vision language model and large language model powered embodied robot

Size: 11.7 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 6 - Forks: 0

mwufi/pytorch-gqn

Neural Scene Rendering! With Helpful Comments and a video ->

Language: Jupyter Notebook - Size: 2.58 MB - Last synced at: 25 days ago - Pushed at: almost 6 years ago - Stars: 6 - Forks: 1

prabhatnagarajan/dqn

This repository contains a python implementation of a Deep Q-Network (DQN) for Atari gameplay using tensorflow.

Language: Python - Size: 84.4 MB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 1

peter-can-write/david-silver-rl-notes

Notes about David Silver's course on Reinforcement Learning

Size: 36.1 KB - Last synced at: about 2 years ago - Pushed at: about 7 years ago - Stars: 6 - Forks: 3

wtlow003/speculative-sampling

Implementation of Speculative Sampling in "Accelerating Large Language Model Decoding with Speculative Sampling"

Language: Python - Size: 30.3 KB - Last synced at: 29 days ago - Pushed at: 9 months ago - Stars: 5 - Forks: 1

Akella17/Beta-VAE

To learn and reason like humans, AI must first learn to factorise interpretable representations of independent data generative factors (preferably in an unsupervised manner!!). What does all this mean? Go through this tutorial to get an overview of disentanglement in the context of unsupervised visual disentangled representation learning.

Language: Jupyter Notebook - Size: 6.87 MB - Last synced at: almost 2 years ago - Pushed at: over 6 years ago - Stars: 5 - Forks: 0

vickypandey14/Gemini-PHP

I've developed Gemini-PHP, a PHP application designed to work smoothly with the Gemini API. This tool makes it easy for users to create content simply by providing their prompts.

Language: Hack - Size: 669 KB - Last synced at: 27 days ago - Pushed at: 7 months ago - Stars: 4 - Forks: 0

imgeorgiev/dmc2gymnasium

Gymnasium integration for the DeepMind Control (DMC) suite

Language: Python - Size: 6.84 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

RoboticsDesignLab/jitterbug

A Jitterbug dm_control Reinforcement Learning domain

Language: Jupyter Notebook - Size: 44.4 MB - Last synced at: almost 2 years ago - Pushed at: over 4 years ago - Stars: 4 - Forks: 1

AI-Huang/WaveNet

Keras and PyTorch implementations for Google's WaveNet

Language: Python - Size: 14.6 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

jinmang2/Awesome-Papers

:snowflake: All about my interest Papers and Review :)

Language: HTML - Size: 810 KB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 3 - Forks: 2

R-Stefano/DQN

Implementation of Deep Q-Network (DQN) on OpenAI games: Pong and Breakout using Tensorflow and Numpy

Language: Python - Size: 95.4 MB - Last synced at: about 2 years ago - Pushed at: almost 5 years ago - Stars: 3 - Forks: 3

adiyadav123/Gemini-Chat

Gemini Chat is an AI chatbot website created by Aditya Yadav which uses Google's Gemini. It is designed to be informative, engaging, and helpful. Gemini can answer questions on a wide range of topics, including general knowledge, current events, and entertainment. It can also provide weather updates, sports scores, and other useful information.

Language: JavaScript - Size: 321 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

kochlisGit/Shadow-Hand-Controller

Construction of controllers for Shadow-Hand in Mujoco environment, using Deep Learning. 2 Different methods were used to create the controllers: a) Behavioral Cloning b) Deep Reinforcement Learning

Language: Python - Size: 11.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 2

Neo-47/Atari-DQN

The code for the famous DQN paper applied on Atari's Breakout.

Language: Python - Size: 52.5 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 4