An open API service providing repository metadata for many open source software ecosystems.

GitHub / CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CSKrishna%2FOptimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Stars: 10
Forks: 4
Open issues: 0

License: None
Language: Jupyter Notebook
Size: 86.9 KB
Dependencies parsed at: Pending

Created at: about 7 years ago
Updated at: over 3 years ago
Pushed at: about 7 years ago
Last synced at: about 2 years ago

Topics: bandit, contextual, gradient, learning, multi-agent, policy, reinforcement

    Loading...