An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: value-function

cwzhou/itrSurv

multi-utility optimal individualized treatment regime estimation for survival data

Language: R - Size: 13 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Samahussien7/Bellman

Language: Jupyter Notebook - Size: 17.6 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Multi-Shot-Approximation-of-MDPs/Self-Guided-ALPs-Discounted-Cost

Multi-Shot Approximation of Discounted Cost MDPs

Language: Python - Size: 4.99 MB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 11 - Forks: 4

mamello-justice/research-gsslgovf

Goal Selection Strategies for Learning Goal-Oriented Value Functions

Language: TeX - Size: 1020 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

rgriva/numerical-methods

This is code from the Numerical Methods course at EPGE/FGV in 2018

Language: MATLAB - Size: 10.8 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 3 - Forks: 4

rgriva/macro3

Code for the Macro III course at the M.Sc/Ph.D programs at EPGE/FGV

Language: Matlab - Size: 3.91 MB - Last synced at: over 1 year ago - Pushed at: over 6 years ago - Stars: 7 - Forks: 5

dellalibera/td-gammon

TD-Gammon implementation

Language: Python - Size: 1.06 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 25 - Forks: 8

YyzHarry/SV-RL

[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning

Language: Python - Size: 1.47 MB - Last synced at: about 2 years ago - Pushed at: about 5 years ago - Stars: 31 - Forks: 6

MikeS96/rl_openai

RL with OpenAI Gym

Language: Jupyter Notebook - Size: 2.75 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 4

ard333/flappy-bird

Flappy Bird for artificial intelligence/machine learning (Agent available: Q-Learning, SARSA, and combined with Backpropagation)

Language: Java - Size: 62.5 KB - Last synced at: about 2 years ago - Pushed at: almost 7 years ago - Stars: 5 - Forks: 1