GitHub - piteren/pypoks: Poker with Deep Reinforcement Learning, multi processing, genetic algorithms

Deep Reinforcement Learning (DRL) with Neural Network (NN) based Agent in NL Texas Hold'em Poker Game Environment with Python & PyTorch

It is a pure DRL with implemented algorithms such as PG, AC and PPO.
pypoks does not use any search algorithm while training or playing.
No prior knowledge of poker game rules is required by the RL algorithm.
The algorithm can be easily adjusted for any bet sizes, table sizes, starting stacks, etc.

Research Scope of the Project (ML/RL):

Testbed for different RL concepts like PG, A3C, PPO, and their modifications
Efficient NN Agent (PyTorch-based) architecture details
Asynchronous self-play: multi-GPU, many subprocesses, hundreds of tables at once
Efficient environment events (data) representation (multiplayer, many bets)
Efficient process (and subprocesses) monitoring
Genetic Algorithms (GA) for policies (with PyTorch)
High (poker) variance & backpropagation
High (poker) variance & policy evaluation

How to Read the Docs

In some sub-folders, there are separate READMEs (.md). Please follow them for more detailed concepts of the code from the sub-folders.

Setup

The project may be set up with Python=<3.11. Install requirements from requirements.txt
For instructions on how to install tkinter for Python 3.11, please go to the gui/tkinter folder.

Training

To run training scripts, you will need about 50 CPU cores, 120GB RAM, and a 2x GPU system.
You may just play run_human_game.py with trained agents downloaded from here To play a human game with agents, you will also need tkinter for GUI. Please install it.

To train a poker agent (DMK) from scratch, run:

$ python run/run_train_loop.py

This script will train a set of agents with RL self-play. The script is preconfigured with many options that will fit a system with 2x GPUs (11GB). Trained agents available for download with the link above took about 5 days to train.

While training, you may check the progress with TensorBoard (run run_TB.sh)

In case of OSError: [Errno 24] Too many open files, you may need to increase the open-files limit: $ ulimit -n 65535

Human Game - Playing with Trained Agents

To play a game with trained agents:

$ python run/run_human_game.py

Allowed moves are defined in the /game_configs yaml file.

While playing, a debug of the game is logged to the terminal - you can always check the cards played by each agent.

Name		Name	Last commit message	Last commit date
Latest commit History 292 Commits
code_concepts		code_concepts
game_configs		game_configs
gui		gui
images		images
podecide		podecide
pologic		pologic
run		run
tests		tests
.gitignore		.gitignore
README.md		README.md
envy.py		envy.py
requirements.txt		requirements.txt
run_TB.sh		run_TB.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Reinforcement Learning (DRL) with Neural Network (NN) based Agent in NL Texas Hold'em Poker Game Environment with Python & PyTorch

Research Scope of the Project (ML/RL):

How to Read the Docs

Setup

Training

Human Game - Playing with Trained Agents

About

Releases

Packages

Languages

piteren/pypoks

Folders and files

Latest commit

History

Repository files navigation

Deep Reinforcement Learning (DRL) with Neural Network (NN) based Agent in NL Texas Hold'em Poker Game Environment with Python & PyTorch

Research Scope of the Project (ML/RL):

How to Read the Docs

Setup

Training

Human Game - Playing with Trained Agents

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages