Pong Q-learning

Structure

src
├── game.py  # game logic (using pygame)
├── main.py  # parse args and run
└── world.py # main module: Q-learning + game interactions
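
For orientation, here is a minimal sketch of how the flags listed under Usage could be declared with argparse. The flag names are taken from the example command; the types, defaults, and the build_parser helper are assumptions for illustration, not the contents of src/main.py.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags in the example command."""
    p = argparse.ArgumentParser(description="Pong Q-learning (sketch)")
    # Environment settings (defaults copied from the example command).
    p.add_argument("--canvas_size", type=int, nargs=2, default=[32, 24],
                   metavar=("W", "H"))
    p.add_argument("--paddle_length", type=int, default=7)
    p.add_argument("--velocity", type=int, default=1)
    # Q-learning hyperparameters.
    p.add_argument("--epsilon", type=float, default=0.03)
    p.add_argument("--learning_rate", type=float, default=0.7)
    p.add_argument("--discount", type=float, default=0.99)
    # Training and evaluation schedule.
    p.add_argument("--train_episodes", type=int, default=50000)
    p.add_argument("--eval_episodes", type=int, default=10)
    p.add_argument("--eval_every", type=int, default=200)
    p.add_argument("--max_iter", type=int, default=1000)
    # Strategies.
    p.add_argument("--agent_strategy", default="eps_greedy")
    p.add_argument("--opponent_strategy", default="almost_perfect")
    p.add_argument("--alpha", type=float, default=0.8)
    # Persistence and plotting.
    p.add_argument("--load", action="store_true")
    p.add_argument("--filename", default=None)
    p.add_argument("--plot_scores", action="store_true")
    return p
```

Calling build_parser().parse_args() with no arguments reads the flags from the command line.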

Usage

python src/main.py \
	--canvas_size 32 24 \
	--paddle_length 7 \
	--velocity 1 \
	--epsilon 0.03 \
	--learning_rate 0.7 \
	--discount 0.99 \
	--train_episodes 50000 \
	--eval_episodes 10 \
	--eval_every 200 \
	--max_iter 1000 \
	--agent_strategy eps_greedy \
	--opponent_strategy almost_perfect --alpha 0.8 \
	--load --filename q-values/32_24_7_003_07_50k_eg_ap_08.p

The command above loads saved Q-values from the given file. To train with custom parameters instead, omit the --load --filename <file> arguments.
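
The --epsilon, --learning_rate, and --discount flags correspond to the usual hyperparameters of tabular Q-learning with an epsilon-greedy policy. The sketch below shows the standard update rule; the action set, the state encoding, and the function names are illustrative assumptions and need not match src/world.py.

```python
import random
from collections import defaultdict

ACTIONS = ("up", "stay", "down")  # assumed action set, for illustration only


def choose_action(q, state, epsilon, rng=random):
    """Epsilon-greedy: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, learning_rate, discount):
    """Standard Q-learning update for one observed transition."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    target = reward + discount * best_next
    q[(state, action)] += learning_rate * (target - q[(state, action)])


q = defaultdict(float)  # unseen (state, action) pairs default to 0.0
q_update(q, "s0", "up", 1.0, "s1", learning_rate=0.7, discount=0.99)
```

With a high learning rate such as 0.7, each update moves the stored value most of the way toward the new target, so recent transitions dominate the estimate.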

Pass --plot_scores to plot the training and evaluation scores once training has finished.

Passing --filename <file> without --load saves the final Q-values to the specified file as a pickled dictionary.
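
A minimal sketch of that save/load round trip with the standard pickle module. The helper names and the (state, action) key layout in the usage note are assumptions; the actual dictionary layout used by the project is not documented here.

```python
import pickle


def save_q_values(q_values: dict, filename: str) -> None:
    """Write the Q-value dictionary to disk in binary pickle format."""
    with open(filename, "wb") as f:
        pickle.dump(q_values, f)


def load_q_values(filename: str) -> dict:
    """Read a Q-value dictionary previously written by save_q_values."""
    with open(filename, "rb") as f:
        return pickle.load(f)
```

For example, save_q_values({(("s0",), "up"): 0.7}, "q.p") followed by load_q_values("q.p") returns an equal dictionary. Since unpickling can execute arbitrary code, only load files from a trusted source.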

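Pickle file naming convention:

<w>_<h>_<paddle_len>_<eps>_<lr>_<train_episodes>_<agent_strategy>_<opponent_strategy>

The file in the example command, 32_24_7_003_07_50k_eg_ap_08.p, follows this pattern with decimal points dropped (0.03 becomes 003) and strategy names abbreviated (eps_greedy becomes eg). Its trailing 08 appears to encode --alpha 0.8.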
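
The convention can be sketched as a small helper that rebuilds the example file name from the command-line values. The abbreviation rules here (dropping the decimal point, writing thousands as "k", taking the initials of each strategy name, appending alpha, and the .p extension) are inferred from the single example file name and are assumptions, as is the q_values_filename helper itself.

```python
def _compact(x: float) -> str:
    """Drop the decimal point: 0.03 -> '003', 0.7 -> '07'."""
    return str(x).replace(".", "")


def _initials(name: str) -> str:
    """Abbreviate a strategy name: 'eps_greedy' -> 'eg'."""
    return "".join(part[0] for part in name.split("_") if part)


def _episodes(n: int) -> str:
    """Write whole thousands with a 'k' suffix: 50000 -> '50k'."""
    return f"{n // 1000}k" if n >= 1000 and n % 1000 == 0 else str(n)


def q_values_filename(w, h, paddle_len, eps, lr, train_episodes,
                      agent_strategy, opponent_strategy, alpha=None):
    """Build a file name following the (inferred) naming convention."""
    fields = [
        str(w), str(h), str(paddle_len),
        _compact(eps), _compact(lr), _episodes(train_episodes),
        _initials(agent_strategy), _initials(opponent_strategy),
    ]
    if alpha is not None:  # trailing field seen in the example file name
        fields.append(_compact(alpha))
    return "_".join(fields) + ".p"
```

Called with the values from the example command, this reproduces 32_24_7_003_07_50k_eg_ap_08.p.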

Results

Figure: the red opponent has a 100% chance of moving perfectly.

Figure: the red opponent has an 80% chance of moving perfectly.
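
One plausible reading of these captions, together with the --opponent_strategy almost_perfect --alpha 0.8 flags, is that the opponent makes the perfect move with probability alpha and a random move otherwise. The sketch below works under that assumption; the function names and the definition of a "perfect" move (stepping toward the ball's vertical position) are hypothetical and may differ from the project's code.

```python
import random


def perfect_move(paddle_y: int, ball_y: int) -> int:
    """Step toward the ball: -1 (up), 0 (stay), or +1 (down)."""
    return (ball_y > paddle_y) - (ball_y < paddle_y)


def almost_perfect_move(paddle_y: int, ball_y: int, alpha: float,
                        rng=random) -> int:
    """With probability alpha play the perfect move, else a random one."""
    if rng.random() < alpha:
        return perfect_move(paddle_y, ball_y)
    return rng.choice((-1, 0, 1))
```

Under this reading, alpha = 1.0 gives the 100% opponent and alpha = 0.8 gives the 80% opponent.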

About

Languages

License

Repository: alexandru-dinu/pong-qlearning

Folders and files

Latest commit

History

Repository files navigation

Pong Q-learning

Structure

Usage

Results

red opponent has 100% chance of moving perfectly

red opponent has 80% chance of moving perfectly

About

Topics

Resources

License

Stars

Watchers

Forks

Languages