Multimodal
ImageBind: One Embedding Space to Bind Them All
Learning audio concepts from natural language supervision
An open source implementation of CLIP.
Implementation of PALI3 from the paper "PaLI-3 Vision Language Models: Smaller, Faster, Stronger"
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
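The CLIP entry above describes matching an image to the most relevant text snippet. A minimal sketch of that scoring step, using random NumPy vectors as stand-ins for real encoder outputs (the function name, embeddings, and temperature value here are illustrative assumptions, not CLIP's actual API):

```python
import numpy as np

def clip_zero_shot_scores(image_emb, text_embs, temperature=0.07):
    """CLIP-style zero-shot scoring: cosine similarity between one image
    embedding and several text embeddings, softmaxed over the texts.
    Real CLIP produces these embeddings with separate image/text encoders."""
    img = image_emb / np.linalg.norm(image_emb)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = txt @ img / temperature      # scaled cosine similarities
    exp = np.exp(logits - logits.max())   # numerically stable softmax
    return exp / exp.sum()

# Toy embeddings (random stand-ins, not real encoder outputs).
rng = np.random.default_rng(0)
image_emb = rng.normal(size=8)
text_embs = rng.normal(size=(3, 8))
probs = clip_zero_shot_scores(image_emb, text_embs)
```

The text with the highest probability would be the predicted caption; the low temperature sharpens the distribution, as in CLIP's learned logit scale.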
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
LAVIS - A One-stop Library for Language-Vision Intelligence
✨✨Latest Advances on Multimodal Large Language Models
SpeechGPT Series: Speech Large Language Models
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.