Topic: "stochastic-gradient-ascent"
RFLeijenaar/RL-KArmed-Bandit
K-armed bandit problem approached with a variety of action-selection learning algorithms.
Language: C - Size: 638 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
