abhaybodhe/Racetrack-Environment

Racetrack-Environment

In this Jupyter notebook, you will compare the performance of three reinforcement learning algorithms (On-Policy First-Visit Monte Carlo Control, Sarsa, and Q-Learning) in a simple racetrack environment. You will then implement a modified temporal-difference (TD) agent that improves upon the learning performance of a basic Q-Learning agent.
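For orientation, the three algorithms differ mainly in the target they move each action-value estimate toward. The sketch below is not taken from the notebook: it assumes a tabular `Q` stored as a list of lists indexed by integer state and action, and the function names, the `alpha` and `gamma` defaults, and the `(state, action, reward)` episode format are all hypothetical choices made for illustration.

```python
def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=1.0, done=False):
    """On-policy TD: bootstrap from the action the agent actually takes next."""
    target = r if done else r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])


def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=1.0, done=False):
    """Off-policy TD: bootstrap from the greedy action in the next state."""
    target = r if done else r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def first_visit_mc_update(Q, counts, episode, gamma=1.0):
    """Monte Carlo: average full returns, counting only first visits.

    episode is a list of (state, action, reward) tuples (assumed format).
    """
    # Record the first time step at which each (state, action) pair occurs.
    first_visit = {}
    for t, (s, a, _) in enumerate(episode):
        first_visit.setdefault((s, a), t)

    # Walk backwards so the return G accumulates incrementally.
    G = 0.0
    for t in reversed(range(len(episode))):
        s, a, r = episode[t]
        G = gamma * G + r
        if first_visit[(s, a)] == t:
            counts[s][a] += 1
            # Incremental sample average of observed returns.
            Q[s][a] += (G - Q[s][a]) / counts[s][a]
```

Sarsa and Q-Learning update after every step and differ only in the bootstrap term, while the Monte Carlo update waits until the episode ends and uses the complete return.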
