q-values-tracking | Topic | Ecosyste.ms: Repos

Topic: "q-values-tracking"

ChaitanyaC22/Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver

The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is formulated as a Markov Decision Process i.e. MDP.

Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: 26 days ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 3

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

Topic: "q-values-tracking"

ChaitanyaC22/Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver