- Tensorflow implementation of "Show, attend and Tell" http:https://arxiv.org/abs/1502.03044
- Borrowed most of the idea from the author's source code https://github.com/kelvinxu/arctic-captions
- make_flickr_dataset.py: Extracts conv5_3 layer activations of VGG Network for flickr30k images, and save them in 'data/feats.npy'
- model_tensorflow.py: Main codes
- Download flickr30k Dataset.
- Extract VGG conv5_3 features using make_flickr_dataset.py
- Train: run train() in model_tensorflow.py
- Test: run test() in model_tensorflow.py