Skip to content
View cc288's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report cc288

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Machine learning datasets used in tutorials on MachineLearningMastery.com

1,065 1,477 Updated Aug 15, 2023

Simple image captioning model

Jupyter Notebook 1,253 210 Updated Jun 9, 2024

This is official Pytorch implementation of "Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic …

Python 102 2 Updated Apr 27, 2024

Image Captioning using CNN and Transformer.

Python 47 9 Updated Nov 9, 2021

GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)

Python 174 27 Updated May 9, 2023
Python 7 Updated May 5, 2024
Python 5 Updated May 5, 2024

PyTorch implementation of Image captioning with Bottom-up, Top-down Attention

Python 160 37 Updated Jan 6, 2019

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Python 507 136 Updated Dec 21, 2022

A PyTorch reimplementation of bottom-up-attention models

Jupyter Notebook 289 75 Updated Apr 7, 2022

Transformer-based image captioning extension for pytorch/fairseq

Python 313 55 Updated Dec 18, 2020

GIT: A Generative Image-to-text Transformer for Vision and Language

Python 533 65 Updated Dec 2, 2023

A list of awesome remote sensing image captioning resources

Python 76 1 Updated Jun 23, 2024

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Python 4,022 415 Updated Jun 14, 2024

[ICCV 2017] Torch code for Grad-CAM

Lua 1,447 221 Updated Sep 17, 2022

Awesome radiology report generation and image captioning papers.

30 5 Updated Jun 6, 2024

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 566 30 Updated Mar 4, 2024

A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.

Python 264 24 Updated Dec 12, 2023

Hyperparameter analysis for Image Captioning using LSTMs and Transformers

Jupyter Notebook 26 2 Updated Oct 3, 2023

Transformer & CNN Image Captioning model in PyTorch.

Python 39 6 Updated Mar 7, 2023

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Python 989 280 Updated Oct 5, 2023

This is implementation of finetuning BLIP model for Visual Question Answering

Python 38 5 Updated Dec 22, 2023

Medical Image captioning on chest X-rays

Jupyter Notebook 35 19 Updated Mar 21, 2023

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 7,390 721 Updated Jun 25, 2024

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 421 21 Updated May 24, 2024

Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help customers without fashion knowledge to better understand the featur…

Jupyter Notebook 40 6 Updated Aug 7, 2023

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023

Python 49 4 Updated Jun 15, 2023

Using LLMs and pre-trained caption models for super-human performance on image captioning.

Python 39 4 Updated Oct 13, 2023

Pytorch implementation of image captioning using transformer-based model.

Jupyter Notebook 49 10 Updated Apr 13, 2023

Official LEVIR-CC dataset and Pytorch implementation for Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset

Python 96 6 Updated May 11, 2024
Next