Visionizer

Fashion e-commerce platforms are rising in popularity. However, scanning, rendering, and captioning fashion products still requires a large amount of manual work. This research addresses fashion image captioning: the task of generating a textual description of a fashion item from a photograph of it. For this work, we create the FashionCap dataset, which consists of 40,000 captions for over 290,000 photos and is derived from the original FashionGen dataset. We conduct a thorough analysis of several neural architectures on three fashion captioning datasets.

Datasets

The following datasets were used and are available at https://doi.org/10.5281/zenodo.7196078

FashionCap
ReducedFACAD
ReducedInFashAI

Dependencies

tensorflow
numpy
pandas
nltk
bert_score
matplotlib
tqdm
numba
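
Before opening Visionizer.ipynb, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository: the helper name `missing_packages` is hypothetical, and it only checks that each import name resolves, not which version is installed.

```python
from importlib.util import find_spec

# Import names taken from the dependency list above.
REQUIRED = [
    "tensorflow", "numpy", "pandas", "nltk",
    "bert_score", "matplotlib", "tqdm", "numba",
]

def missing_packages(names=REQUIRED):
    """Return the import names that cannot be found in this environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if find_spec(name) is None]

missing = missing_packages()
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All listed dependencies are importable.")
```

Any names reported as missing need to be installed before the notebook will run.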

Best BLEU score instance

Worst BLEU score instance
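
Best and worst instances like the ones named in these headings can be found by scoring every generated caption against its reference and taking the maximum and the minimum. The sketch below illustrates the idea under stated assumptions: `simple_bleu` is a hypothetical, simplified sentence-level BLEU (unigrams and bigrams, one reference, add-one smoothing), not the notebook's implementation, and the captions are invented for illustration. NLTK, listed in the dependencies, provides a full implementation as `sentence_bleu` in `nltk.translate.bleu_score`.

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def simple_bleu(reference, hypothesis, max_n=2):
    """Simplified BLEU: smoothed n-gram precisions times a brevity penalty."""
    if not hypothesis:
        return 0.0
    log_precision = 0.0
    for n in range(1, max_n + 1):
        hyp, ref = ngrams(hypothesis, n), ngrams(reference, n)
        # Clip each hypothesis n-gram count by its count in the reference.
        overlap = sum(min(count, ref[gram]) for gram, count in hyp.items())
        total = sum(hyp.values())
        # Add-one smoothing keeps the logarithm defined when overlap is zero.
        log_precision += math.log((overlap + 1) / (total + 1)) / max_n
    # Penalize hypotheses that are shorter than the reference.
    if len(hypothesis) >= len(reference):
        brevity = 1.0
    else:
        brevity = math.exp(1 - len(reference) / len(hypothesis))
    return brevity * math.exp(log_precision)

# Invented (reference, generated) caption pairs, for illustration only.
pairs = [
    ("long sleeve cotton shirt in white", "long sleeve cotton shirt in white"),
    ("slim fit jeans in blue denim", "black leather ankle boots"),
    ("short sleeve cotton t-shirt in black", "short sleeve t-shirt in black"),
]

scores = [simple_bleu(ref.split(), hyp.split()) for ref, hyp in pairs]
best = max(range(len(scores)), key=scores.__getitem__)
worst = min(range(len(scores)), key=scores.__getitem__)
print("best instance:", pairs[best][1])
print("worst instance:", pairs[worst][1])
```

Sentence-level scores are sensitive to the smoothing choice, so the absolute values from this sketch are not comparable to scores reported elsewhere.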

Attention Weights during generation
