snake_AI

Using machine learning algorithms to teach a A.I. how to play the classic game snake. Implements reinforced learning and Deep Q Networks to maximize the cumulative reward based on the current state of the environment.

snake_pygame

Made using pygame module -play_step(action) --returns reward, done(game_over bool), score

Model

Made with PyTorch

Linear_QNet (DQN) uses a feed foreward neural network with one hidden layer size 256 and a Relu activation function.

Q Value = Quality of action

Init Q Value (= init model)
Choose action (model.predict) ->returns action based on maximized cumulative reward
Perform action
Measure Reward
Update Q value (+train model)
Repeat

Agent

Main file

Training method:

-state = get_state(from game) -action = get_move(based on state of game environment): ->model.predict(best action based on reward)

reward, game_over, score = game.play_step(action) new_state = get_state(game)

add single move to short term memory once game_over = True add batch of short term memories to long term memory

model.train()

How new Q-Value is calculated

s = state

a = action

lr = learning rate

gamma = discount rate

R = reward

** Function **

NewQ(s,a) = Q(s,a) + lr[R(s,a) + gamma*maxQ'(s',a') - Q(s,a)

Uses optimizer.Adam() method an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.

Criterion of the Mean Squared error is pair with the built in backwards() method to complete the training process.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
model		model
snake_AI		snake_AI
README.md		README.md
Snake_AI_Demo.mp4		Snake_AI_Demo.mp4
agent.py		agent.py
helper.py		helper.py
model.py		model.py
snake_pygame.py		snake_pygame.py
tempCodeRunnerFile.py		tempCodeRunnerFile.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

snake_AI

snake_pygame

Model

Agent

Main file

Training method:

model.train()

About

Releases

Packages

Languages

TravisH18/snake_AI

Folders and files

Latest commit

History

Repository files navigation

snake_AI

snake_pygame

Model

Agent

Main file

Training method:

model.train()

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages