seq2seq

A model that combines two sub-captions for image captioning on the VizWiz dataset.

Split_captions.ipynb creates the dataset used by the seq2seq model.

eval.ipynb evaluates the trained model on the validation/test dataset and creates captions from smaller sub-captions.

opts.py gathers the main parameters of the network so that other Python modules can import them.
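As an illustration of the pattern, here is a minimal sketch of a central options module. The parameter names and defaults (`hidden_size`, `batch_size`, `learning_rate`, `use_visual_features`) are hypothetical and are not taken from opts.py.

```python
import argparse


def parse_opts(argv=None):
    """Collect network parameters in one place (hypothetical names and defaults)."""
    parser = argparse.ArgumentParser(description="seq2seq options (illustrative)")
    parser.add_argument("--hidden_size", type=int, default=256,
                        help="size of the recurrent hidden state")
    parser.add_argument("--batch_size", type=int, default=32,
                        help="number of caption pairs per batch")
    parser.add_argument("--learning_rate", type=float, default=1e-3,
                        help="optimizer step size")
    parser.add_argument("--use_visual_features", action="store_true",
                        help="condition the model on image features")
    # argv=None makes argparse read sys.argv; a list is convenient for tests.
    return parser.parse_args(argv)
```

Another module could then call `opts = parse_opts()` once and read values such as `opts.batch_size`.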

prepro_labels.py preprocesses the captions and creates a word embedding/language model. It generates language.pkl, which can be imported by other modules of the project.
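The general idea can be sketched as follows. This is an assumption about what such a preprocessing step might look like, not the actual contents of prepro_labels.py: the tokenizer, the special tokens, and the dictionary layout are all hypothetical.

```python
import pickle
import re
from collections import Counter

# Hypothetical special tokens; the real project may use different ones.
SPECIAL_TOKENS = ["<pad>", "<sos>", "<eos>", "<unk>"]


def tokenize(caption):
    """Lowercase a caption and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", caption.lower())


def build_language(captions, min_count=1):
    """Build word/index mappings from an iterable of caption strings."""
    counts = Counter(tok for cap in captions for tok in tokenize(cap))
    words = SPECIAL_TOKENS + sorted(w for w, n in counts.items() if n >= min_count)
    return {"word2index": {w: i for i, w in enumerate(words)}, "index2word": words}


def encode(caption, language):
    """Map a caption to token ids, wrapped in <sos> ... <eos>."""
    w2i = language["word2index"]
    ids = [w2i.get(tok, w2i["<unk>"]) for tok in tokenize(caption)]
    return [w2i["<sos>"]] + ids + [w2i["<eos>"]]


def save_language(language, path):
    """Pickle the vocabulary so other modules can load it."""
    with open(path, "wb") as f:
        pickle.dump(language, f)


def load_language(path):
    """Load a vocabulary written by save_language."""
    with open(path, "rb") as f:
        return pickle.load(f)
```

A training script would call `load_language` on the pickle file and pass the result to `encode` for each caption.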

read_tsv.py reads the TSV file containing the bottom-up features and stores them in .npz files.
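A conversion of this kind might look like the sketch below. It assumes the column layout commonly used for bottom-up attention feature files (`image_id`, `image_w`, `image_h`, `num_boxes`, `boxes`, `features`, with the last two stored as base64-encoded float32 arrays) and writes one .npz file per image. The function name `tsv_to_npz` and the output naming are hypothetical, and the actual read_tsv.py may differ.

```python
import base64
import csv
from pathlib import Path

import numpy as np

# Assumed column order of the TSV file (no header row).
FIELDNAMES = ["image_id", "image_w", "image_h", "num_boxes", "boxes", "features"]


def tsv_to_npz(tsv_path, out_dir):
    """Decode each TSV row and save its boxes and features to <image_id>.npz."""
    # Feature fields are far longer than the csv module's default limit.
    csv.field_size_limit(2**31 - 1)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with open(tsv_path, newline="") as f:
        reader = csv.DictReader(f, delimiter="\t", fieldnames=FIELDNAMES)
        for row in reader:
            num_boxes = int(row["num_boxes"])
            boxes = np.frombuffer(
                base64.b64decode(row["boxes"]), dtype=np.float32
            ).reshape(num_boxes, -1)
            features = np.frombuffer(
                base64.b64decode(row["features"]), dtype=np.float32
            ).reshape(num_boxes, -1)
            out_path = out_dir / f"{row['image_id']}.npz"
            np.savez_compressed(out_path, boxes=boxes, features=features)
            written.append(out_path)
    return written
```

Storing one file per image keeps memory use low during training, because a data loader can read only the images in the current batch.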

train.py is the main file used to train the network.

utils.py contains some functions used in the project.

For final evaluation of generated captions, we use the API presented by VizWiz at https://github.com/Yinan-Zhao/vizwiz-caption
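Before running the evaluation, the generated captions have to be written to a results file. The sketch below assumes the COCO-style results layout (a JSON list of `{"image_id", "caption"}` records); check the evaluation repository for the exact format it expects. The helper name `write_results` is hypothetical.

```python
import json


def write_results(captions_by_image, out_path):
    """Write {image_id: caption} pairs as a JSON list of result records."""
    results = [
        {"image_id": int(image_id), "caption": caption.strip()}
        for image_id, caption in sorted(captions_by_image.items())
    ]
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(results, f)
    return results
```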

The results folder contains the captions generated by two of our models: V9 (without visual features) and V12 (with visual features).
