Img2Cap: Image captioning using LM T5 or VitGPT

This application generates a caption of an image based on detected objects in the image. If the number of objects is not less than two, the application generates the caption applying the T5 model, if less, the application uses the VitGPT model.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Img2Cap_Image_captioning_using_LM_T5_or_VitGPT.ipynb		Img2Cap_Image_captioning_using_LM_T5_or_VitGPT.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Img2Cap: Image captioning using LM T5 or VitGPT

About

Releases

Packages

Languages

TonmoyTalukder/Img2Cap

Folders and files

Latest commit

History

Repository files navigation

Img2Cap: Image captioning using LM T5 or VitGPT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages