multi-armed_bandits

An experiment that applies reinforcement learning techniques to the multi-armed bandit problem.

Description

The multi-armed bandit is a common problem for experimenting with reinforcement learning techniques. You have one or more bandit machines with arms; having X machines with one arm each is equivalent to having one machine with X arms. Each time you pull an arm, you get a reward. Each arm's reward can follow any kind of distribution, and your algorithm has to obtain the best cumulative reward.
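To make the setup concrete, here is a minimal sketch of one well-known strategy, epsilon-greedy. It is not the code in this repository: the names `GaussianBandit` and `epsilon_greedy`, the normal reward distribution, and the parameter values are all assumptions chosen for illustration.

```python
import random


class GaussianBandit:
    """Hypothetical bandit: arm i pays a reward drawn from Normal(means[i], 1)."""

    def __init__(self, means, seed=None):
        self.means = list(means)
        self.rng = random.Random(seed)

    def pull(self, arm):
        # The agent never sees self.means, only the sampled reward.
        return self.rng.gauss(self.means[arm], 1.0)


def epsilon_greedy(bandit, steps=2000, epsilon=0.1, seed=None):
    """Run epsilon-greedy and return (estimates, counts, total_reward)."""
    rng = random.Random(seed)
    n_arms = len(bandit.means)
    counts = [0] * n_arms        # how many times each arm was pulled
    estimates = [0.0] * n_arms   # running average reward per arm
    total = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = bandit.pull(arm)
        counts[arm] += 1
        # Incremental mean update, so no reward history has to be stored.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward

    return estimates, counts, total


if __name__ == "__main__":
    bandit = GaussianBandit([0.0, 0.5, 2.0], seed=0)
    estimates, counts, total = epsilon_greedy(bandit, seed=1)
    print("pull counts:", counts)
    print("estimated means:", [round(e, 2) for e in estimates])
    print("cumulative reward:", round(total, 2))
```

With a small `epsilon`, most pulls should end up on the arm with the highest average reward, while the occasional random pull keeps the estimates of the other arms from going stale.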

See the Wikipedia article on the multi-armed bandit for more details about the problem.

See the Wikipedia article on reinforcement learning for more details about reinforcement learning.

You can run the main file, study what it does, and modify it however you like.

I will improve the code over time. This is the first version.



DavidDB33/multi-armed_bandits
