Highlights
- Pro
Block or Report
Block or report cc288
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Machine learning datasets used in tutorials on MachineLearningMastery.com
Simple image captioning model
This is official Pytorch implementation of "Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic …
Image Captioning using CNN and Transformer.
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
PyTorch implementation of Image captioning with Bottom-up, Top-down Attention
Meshed-Memory Transformer for Image Captioning. CVPR 2020
A PyTorch reimplementation of bottom-up-attention models
Transformer-based image captioning extension for pytorch/fairseq
GIT: A Generative Image-to-text Transformer for Vision and Language
A list of awesome remote sensing image captioning resources
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
Awesome radiology report generation and image captioning papers.
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.
Hyperparameter analysis for Image Captioning using LSTMs and Transformers
Transformer & CNN Image Captioning model in PyTorch.
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
This is implementation of finetuning BLIP model for Visual Question Answering
Medical Image captioning on chest X-rays
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the featur…
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023
Using LLMs and pre-trained caption models for super-human performance on image captioning.
Pytorch implementation of image captioning using transformer-based model.
Official LEVIR-CC dataset and Pytorch implementation for Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset