Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval, and image captioning.

Python 558 99 Updated May 7, 2023

L4Clippers / Patent-Image-Retrieval-Transformer-DML

Jupyter Notebook 8 2 Updated Dec 18, 2023

ABaldrati / CLIP4CirDemo

[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features

SCSS 70 8 Updated Mar 28, 2023

bbbdbbb / MiniGPT-4-captions

Generating captions on image datasets using MiniGPT-v2

Python 4 Updated Dec 23, 2023

Vision-CAIR / MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,103 2,897 Updated Apr 22, 2024

bentoml / OpenLLM

Run any open-source LLM, such as Llama 2 or Mistral, as an OpenAI-compatible API endpoint in the cloud.

Python 9,168 581 Updated Jun 17, 2024

Facico / Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca.

C 4,140 427 Updated Mar 7, 2024

ellenzhuwang / implicitOOD

An end-to-end vision and language model incorporating explicit knowledge graphs and OOD-detection.

Python 3 Updated May 3, 2024

chu-tianxiang / QuIP-for-all

QuIP quantization

Python 34 3 Updated Mar 17, 2024

xieyuquanxx / awesome-Large-MultiModal-Hallucination

😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.

129 11 Updated Mar 23, 2024

kuleshov-group / llmtools

Finetuning Large Language Models on One Consumer GPU in Under 4 Bits

Python 682 73 Updated May 25, 2024

IST-DASLab / gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,771 143 Updated Mar 27, 2024

chaoyi-wu / Finetune_LLAMA

A simple, easy-to-understand guide to fine-tuning LLaMA.

Python 306 33 Updated Jul 5, 2023

suu990901 / LLaMA-MiLe-Loss

Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models

Python 43 3 Updated Jun 17, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

10,167 673 Updated Jun 22, 2024

xinyu1205 / recognize-anything

Open-source and strong foundation image recognition models.

Jupyter Notebook 2,542 240 Updated Jun 12, 2024

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,059 275 Updated May 4, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 17,635 1,893 Updated May 28, 2024

CASIA-IVA-Lab / FastSAM

Fast Segment Anything

Python 7,053 658 Updated Feb 29, 2024

nightrome / cocostuff

The official homepage of the COCO-Stuff dataset.

Shell 817 145 Updated Sep 9, 2022

yeates / MaGIC

[ICLR 24] MaGIC: Multi-modality Guided Image Completion

Python 40 3 Updated Apr 24, 2024

bleedline / aimoneyhunter

A collection of ways to earn money from AI side hustles, showing how to use AI for side projects and earn extra income. The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…

10,918 1,001 Updated Jun 19, 2024

songrise / CLIP-Count

[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting

Python 73 6 Updated Mar 20, 2024

IDEA-Research / OpenSeeD

[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"

Python 606 37 Updated Jan 22, 2024

ylqi / Count-Anything

This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.

Python 119 14 Updated Apr 22, 2023

anuragxel / salt

Segment Anything Labelling Tool

Python 996 127 Updated Feb 19, 2024

berkeley-hipie / HIPIE

[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"

Jupyter Notebook 246 18 Updated Mar 21, 2024

Ellen Wang ellenzhuwang

Stars

hhshomee / designpatent_dataset

beichenzbc / Long-CLIP

mlfoundations / open_clip

njustkmg / OMML