Stars
[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
An open source implementation of CLIP.
Multi-modal learning toolkit based on PaddlePaddle and PyTorch, supporting applications such as multi-modal classification, cross-modal retrieval, and image captioning.
[CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features
Generating captions on image datasets using MiniGPT-v2
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model — a low-resource Chinese llama+lora approach, with a structure modeled on alpaca.
An end-to-end vision and language model incorporating explicit knowledge graphs and OOD-detection.
😎 An up-to-date, curated list of awesome papers, methods, and resources on LMM hallucinations.
Fine-tuning Large Language Models on One Consumer GPU in Under 4 Bits
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
✨✨Latest Advances on Multimodal Large Language Models
Strong, open-source foundation models for image recognition.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
The official homepage of the COCO-Stuff dataset.
[ICLR 24] MaGIC: Multi-modality Guided Image Completion
The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English versi…
[ACM MM23] CLIP-Count: Towards Text-Guided Zero-Shot Object Counting
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
This method uses Segment Anything and CLIP to ground and count any object that matches a custom text prompt, without requiring any point or box annotation.
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"