TensorFlow 2.0 implementation of the Attention, Learn to Solve Routing Problems! article.
This work was done as part of a final project for the DeepPavlov course Advanced Topics in Deep Reinforcement Learning.
Code of the full project (dynamic version) is located at https://github.com/d-eremeev/ADM-VRP
The current environment implementation is located in the Enviroment.py file (the AgentVRP class).
The class stores information about the current state and the actions taken by the agent.
Main methods (a usage sketch follows the list):
- step(action): transitions to a new state according to the given action.
- get_costs(dataset, pi): returns the cost of each graph in the batch for the given routes pi.
- get_mask(): returns a mask of available actions (allowed nodes).
- all_finished(): checks whether all episodes in the batch are finished (all graphs are solved).
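For orientation, here is a minimal sketch of how these methods could be combined into a greedy rollout. The constructor call, the policy interface, and the mask convention (True = allowed) are assumptions for illustration, not the exact API of this repository:

```python
import tensorflow as tf
from Enviroment import AgentVRP

def greedy_rollout(policy, dataset):
    """Greedy rollout over a batch of VRP instances via the AgentVRP interface."""
    env = AgentVRP(dataset)                      # assumed constructor: holds the batched state
    actions = []
    while not env.all_finished():
        mask = env.get_mask()                    # allowed nodes (assumed convention: True = allowed)
        logits = policy(env, mask)               # assumed policy call returning per-node scores
        masked = tf.where(mask, logits, -1e9 * tf.ones_like(logits))
        action = tf.argmax(masked, axis=-1)      # pick the best allowed node per instance
        env.step(action)                         # transition to the new state
        actions.append(action)
    pi = tf.stack(actions, axis=1)               # routes: node indices, one row per instance
    return env.get_costs(dataset, pi), pi        # the negative of this cost is the RL reward
```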
Let's connect these terms with RL language (a small dictionary):
- State: $X$ - the graph instance (coordinates, demands, etc.) together with information about the node the agent is currently in.
- Action: $\pi_t$ - the decision about which node the agent should go to next.
- Reward: the (negative) tour length.
The AM is trained by policy gradient using the REINFORCE algorithm with a baseline.
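In the notation above (with $L(\pi)$ the tour length of route $\pi$ and $b(X)$ the baseline cost for instance $X$), the standard REINFORCE-with-baseline gradient estimator is

$$\nabla_\theta \mathcal{L}(\theta \mid X) = \mathbb{E}_{p_\theta(\pi \mid X)}\left[\left(L(\pi) - b(X)\right)\nabla_\theta \log p_\theta(\pi \mid X)\right].$$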
Baseline
- The baseline is a copy of the model with fixed weights from one of the preceding epochs.
- A warm-up is used for the early epochs: an exponential moving average of the model's cost over past epochs is mixed with the baseline model's cost.
- The baseline is updated at the end of an epoch if the difference in costs between the candidate model and the baseline is statistically significant (t-test); see the sketch after this list.
- The baseline uses a separate dataset for this evaluation. This dataset is regenerated after each baseline renewal.
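A minimal sketch of the end-of-epoch baseline update described above, assuming the rollouts return NumPy arrays of per-instance costs; the helper names (`rollout`, `candidate_model`, `baseline_model`) are hypothetical and not this repository's actual API:

```python
from scipy.stats import ttest_rel

def maybe_update_baseline(candidate_model, baseline_model, eval_dataset, rollout, alpha=0.05):
    """Copy the candidate's weights into the baseline if the candidate is
    significantly better on the held-out evaluation dataset (one-sided t-test)."""
    candidate_costs = rollout(candidate_model, eval_dataset)   # cost per instance, shape (n,)
    baseline_costs = rollout(baseline_model, eval_dataset)

    # The candidate must at least be better on average.
    if candidate_costs.mean() >= baseline_costs.mean():
        return False

    # Paired t-test; halve the p-value for the one-sided alternative "candidate < baseline".
    _, p_value = ttest_rel(candidate_costs, baseline_costs)
    if p_value / 2 < alpha:
        baseline_model.set_weights(candidate_model.get_weights())
        return True   # the caller should regenerate eval_dataset after this renewal
    return False
```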
- Enviroment.py - environment for the VRP RL agent
- layers.py - MHA layers for encoder
- attention_graph_encoder.py - Graph Attention Encoder
- attention_graph_decoder.py - Graph Attention Decoder
- attention_model.py - Attention Model
- reinforce_baseline.py - class for REINFORCE baseline
- train.py - defines the training loop used in train_with_checkpoint.ipynb (a sketch of a single REINFORCE step follows this list)
- train_with_checkpoint.ipynb - from this notebook one can start training or resume training from a checkpoint
- generate_data.py - various auxiliary functions for data creation, saving and visualisation
- results folder: each run is saved to a folder named ADM_VRP_{graph_size}_{batch_size}, containing training logs, learning curves, and saved models
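For context, a single REINFORCE-with-baseline update in TensorFlow 2 might look like the sketch below. It assumes that calling the model on a batch returns the sampled tour costs and the log-likelihood of the sampled routes; this interface and the function name are assumptions, not the exact code in train.py:

```python
import tensorflow as tf

def reinforce_train_step(model, baseline_model, batch, optimizer):
    """One policy-gradient update: advantage = cost - baseline cost."""
    baseline_cost, _ = baseline_model(batch)          # baseline rollout (no gradients flow here)
    with tf.GradientTape() as tape:
        cost, log_likelihood = model(batch)           # sampled tour lengths and their log-probs
        advantage = tf.stop_gradient(cost - baseline_cost)
        loss = tf.reduce_mean(advantage * log_likelihood)
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return tf.reduce_mean(cost), loss
```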
- Open train_with_checkpoint.ipynb and choose the training parameters.
- All outputs will be saved in the current directory.
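As a rough orientation, a typical configuration follows the hyperparameters of the original Attention Model paper; the parameter names below are illustrative and may not match the notebook exactly:

```python
# Illustrative values (taken from the original paper); the names are assumptions
params = {
    "graph_size": 20,           # number of customer nodes per VRP instance
    "batch_size": 512,          # instances per training batch
    "epochs": 100,              # number of training epochs
    "batches_per_epoch": 2500,  # 2500 * 512 = 1,280,000 instances per epoch
    "embedding_dim": 128,       # node embedding size
    "n_encode_layers": 3,       # number of MHA layers in the encoder
    "learning_rate": 1e-4,      # Adam learning rate
}
```

With these values the corresponding results folder would be named ADM_VRP_20_512.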