Skip to content
View ChenDelong1999's full-sized avatar
🚀
🚀

Block or report ChenDelong1999

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
10 Updated Sep 5, 2024

Official reposity for paper "High-Dimension Human Value Representation in Large Language Models"

Python 19 1 Updated Jul 9, 2024

Multimodal Large Language Models for Remote Sensing (RS-MLLMs): A Survey

131 4 Updated Sep 16, 2024

EntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation

Jupyter Notebook 693 58 Updated Nov 30, 2023
Jupyter Notebook 202 31 Updated Feb 19, 2022

EfficientViT is a new family of vision models for efficient high-resolution vision.

Python 1,799 164 Updated Aug 9, 2024
Python 561 27 Updated Feb 15, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Vue 6,025 423 Updated Oct 7, 2024

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Jupyter Notebook 4,729 487 Updated Jan 29, 2024

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,105 151 Updated Jun 6, 2024
Python 1,747 54 Updated Jun 28, 2024

A batched offline inference oriented version of segment-anything

Python 1,190 70 Updated Sep 13, 2024

Unofficial edge detection implementation using the Automatic Mask Generation (AMG) of the Segment Anything Model (SAM).

C++ 52 5 Updated Apr 16, 2024
Jupyter Notebook 18 1 Updated Dec 7, 2023

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,891 373 Updated Aug 7, 2024

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

Jupyter Notebook 282 18 Updated Jun 27, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,282 109 Updated Jul 19, 2024

🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)

Python 63 3 Updated Dec 9, 2023

Collection of Remote Sensing Vision-Language Models

122 4 Updated May 13, 2024

mPLUG-Owl: The Powerful Multi-modal Large Language Model Family

Python 2,261 171 Updated Sep 23, 2024

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Jupyter Notebook 475 35 Updated Oct 30, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,393 4,031 Updated Jul 17, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,576 2,214 Updated Jul 29, 2024

Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

Python 592 64 Updated Sep 19, 2024

Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations. [EMNLP 2022]

Python 126 4 Updated Sep 29, 2024

A Benchmark for Efficient and Compositional Visual Reasoning

Python 17 6 Updated Aug 2, 2023

An open-source framework for training large multimodal models.

Python 3,686 280 Updated Aug 31, 2024
Python 21 Updated May 6, 2023

An awesome README template to jumpstart your projects!

14,064 22,876 Updated Aug 12, 2024

A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"

363 19 Updated May 2, 2024
Next