pg-cartpole-tf2

What is this

play CartPole to 200 step with Policy Gradient using TensorFlow2. this code is refined from Aurelien Geron's <Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow>.

What is in it

policygradient_cartpole_train.py : train models ,write these models in "model-save-path" folder;

policygradient_cartpole_test.py : test models ,read models from "model-save-path" folder ,test these model with several episodes ,write a result csv in "result-save-path"folder;

pg_cartpole_0300.h5 ：a good model plays well;

pg_cartpole_test_result.csv : an example of result csv;

How to use

train

(1) give a folder to reserve the trained models # default : ./

(2) python3 policygradient_cartpole_train.py #default parameters is recommended

test

python3 policygradient_cartpole_test.py

other

(1) when testing , you can use "--render=True" to show the cartpole,but it will be slow.

(2) the model will saved every 20 iteration(default),the example max-step curve shows below:

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
README.md		README.md
curve of mean max step.png		curve of mean max step.png
pg_cartpole_0360.h5		pg_cartpole_0360.h5
pg_cartpole_test_result.csv		pg_cartpole_test_result.csv
policygradient_cartpole_test.py		policygradient_cartpole_test.py
policygradient_cartpole_train.py		policygradient_cartpole_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pg-cartpole-tf2

What is this

What is in it

How to use

train

test

other

About

Releases

Packages

Languages

Song-xx/policygradient-cartpole-tf2

Folders and files

Latest commit

History

Repository files navigation

pg-cartpole-tf2

What is this

What is in it

How to use

train

test

other

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages