⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python CLI, WeChat Applet). If you find it useful, please star this project, thanks~

Vue 6,025 423 Updated Oct 7, 2024

ChaoningZhang / MobileSAM

This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,729 487 Updated Jan 29, 2024

yformer / EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,105 151 Updated Jun 6, 2024

ytongbai / LVM

Python 1,747 54 Updated Jun 28, 2024

pytorch-labs / segment-anything-fast

A batched offline inference oriented version of segment-anything

Python 1,190 70 Updated Sep 13, 2024

ymgw55 / segment-anything-edge-detection

Unofficial edge detection implementation using the Automatic Mask Generation (AMG) of the Segment Anything Model (SAM).

C++ 52 5 Updated Apr 16, 2024

XinyuZhou2000 / Spoken-Dialogue

Jupyter Notebook 18 1 Updated Dec 7, 2023

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,891 373 Updated Aug 7, 2024

ChenDelong1999 / RemoteCLIP

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

Jupyter Notebook 282 18 Updated Jun 27, 2024

UX-Decoder / Semantic-SAM

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,282 109 Updated Jul 19, 2024

ChenDelong1999 / polite-flamingo

🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)

Python 63 3 Updated Dec 9, 2023

om-ai-lab / awesome-RSVLM

Collection of Remote Sensing Vision-Language Models

122 4 Updated May 13, 2024

X-PLUG / mPLUG-Owl

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,261 171 Updated Sep 23, 2024

kohjingyu / fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Jupyter Notebook 475 35 Updated Oct 30, 2023

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,393 4,031 Updated Jul 17, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,576 2,214 Updated Jul 29, 2024

lupantech / ScienceQA

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 592 64 Updated Sep 19, 2024

om-ai-lab / VL-CheckList

Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]

Python 126 4 Updated Sep 29, 2024

serre-lab / CVR

A Benchmark for Efficient and Compositional Visual Reasoning

Python 17 6 Updated Aug 2, 2023

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,686 280 Updated Aug 31, 2024

1e12Leon / ProbDet

Python 21 Updated May 6, 2023

othneildrew / Best-README-Template

An awesome README template to jumpstart your projects!

14,064 22,876 Updated Aug 12, 2024

eric-ai-lab / awesome-vision-language-navigation

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

363 19 Updated May 2, 2024

Delong Chen (陈德龙) ChenDelong1999

Stars

1e12Leon / UEMM-Air

HLTCHKUST / UniVaR

ZhanYang-nwpu / Awesome-Remote-Sensing-Multimodal-Large-Language-Model

qqlu / Entity

arampacha / CLIP-rsicd

mit-han-lab / efficientvit

allenai / unified-io-2

ccfddl / ccf-deadlines