Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 348 Bytes

README.md

File metadata and controls

2 lines (2 loc) · 348 Bytes

Racetrack-Environment

In this jupyter notebook , you will compare the performance of three reinforcement learning algorithms - On-Policy First-Visit Monte-Carlo Control, Sarsa, and Q-Learning - in a simple racetrack environment. You will then implement a modified TD agent that improves upon the learning performance of a basic Q-Learning agent.