Topic: "q-values-tracking"
ChaitanyaC22/Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver
The goal of this project is to build a reinforcement learning (RL) algorithm that helps cab drivers maximize their profits by improving their decision-making in the field. Taking long-term profit as the objective, it proposes an RL-based method for optimizing taxi driving strategies. The optimization problem is formulated as a Markov Decision Process (MDP).
Language: Jupyter Notebook - Size: 1.61 MB - Last synced at: 26 days ago - Pushed at: almost 4 years ago - Stars: 13 - Forks: 3
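
To illustrate what tracking Q-values on an MDP can look like, here is a minimal sketch. It is not taken from the repository. It assumes plain tabular Q-learning on a hypothetical two-zone toy problem; the zone names, actions, rewards, and hyperparameters are all made up for illustration.

```python
import random
from collections import defaultdict


def q_learning_with_tracking(transitions, rewards, actions, tracked_pairs,
                             episodes=500, steps_per_episode=20,
                             alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning on a deterministic toy MDP.

    After every episode, the current Q-value of each (state, action) pair in
    tracked_pairs is appended to a history list so convergence can be inspected.
    """
    rng = random.Random(seed)
    q = defaultdict(float)  # Q[(state, action)], unseen pairs default to 0.0
    history = {pair: [] for pair in tracked_pairs}
    states = sorted({state for state, _ in transitions})

    for _ in range(episodes):
        state = rng.choice(states)
        for _ in range(steps_per_episode):
            # Epsilon-greedy action selection.
            if rng.random() < epsilon:
                action = rng.choice(actions)
            else:
                action = max(actions, key=lambda a: q[(state, a)])

            next_state = transitions[(state, action)]
            reward = rewards[(state, action)]

            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            best_next = max(q[(next_state, a)] for a in actions)
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = next_state

        # Record the tracked Q-values once per episode.
        for pair in tracked_pairs:
            history[pair].append(q[pair])

    return dict(q), history


# Hypothetical toy MDP: two zones, staying in zone "B" pays more per step.
ACTIONS = ["stay", "move"]
TRANSITIONS = {("A", "stay"): "A", ("A", "move"): "B",
               ("B", "stay"): "B", ("B", "move"): "A"}
REWARDS = {("A", "stay"): 1.0, ("A", "move"): 0.0,
           ("B", "stay"): 2.0, ("B", "move"): 0.0}

q_table, q_history = q_learning_with_tracking(
    TRANSITIONS, REWARDS, ACTIONS, tracked_pairs=[("B", "stay"), ("B", "move")])
```

Plotting each list in `q_history` against the episode index shows whether the tracked Q-values have flattened out. The same logging idea carries over when a neural network approximates the Q-function.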
