Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
330
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
cjpais/llava-v1.6-34B-gguf
Image-Text-to-Text
•
Updated
Mar 7
•
2.55k
•
37
cjpais/llava-1.6-mistral-7b-gguf
Image-Text-to-Text
•
Updated
Mar 6
•
19.5k
•
67
Trelis/llava-v1.6-mistral-7b-PATCHED
Image-Text-to-Text
•
Updated
Mar 6
•
37
•
8
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text
•
Updated
Mar 7
•
2.08k
•
6
microsoft/udop-large
Image-Text-to-Text
•
Updated
Mar 11
•
8.77k
•
100
microsoft/udop-large-512-300k
Image-Text-to-Text
•
Updated
Mar 11
•
919
•
30
deepseek-ai/deepseek-vl-1.3b-chat
Image-Text-to-Text
•
Updated
Mar 15
•
5.03k
•
35
llava-hf/llava-v1.6-vicuna-13b-hf
Image-Text-to-Text
•
Updated
24 days ago
•
201k
•
9
Xenova/moondream2
Image-Text-to-Text
•
Updated
27 days ago
•
39
•
15
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text
•
Updated
14 days ago
•
11.2k
•
24
xtuner/llava-llama-3-8b
Image-Text-to-Text
•
Updated
Apr 26
•
394
•
27
xtuner/llava-llama-3-8b-v1_1
Image-Text-to-Text
•
Updated
Apr 28
•
1.84k
•
110
xtuner/llava-phi-3-mini
Image-Text-to-Text
•
Updated
Apr 25
•
478
•
20
xtuner/llava-llama-3-8b-transformers
Image-Text-to-Text
•
Updated
Apr 26
•
346
•
4
jiajunlong/TinyLLaVA-OpenELM-450M-SigLIP-0.89B
Image-Text-to-Text
•
Updated
14 days ago
•
1.76k
•
3
Salesforce/xgen-mm-phi3-mini-instruct-r-v1
Image-Text-to-Text
•
Updated
16 days ago
•
10k
•
143
Baron-GG/LLAUS
Image-Text-to-Text
•
Updated
about 7 hours ago
•
1
google/paligemma-3b-ft-rsvqa-hr-224
Image-Text-to-Text
•
Updated
about 1 month ago
•
140
•
2
google/paligemma-3b-ft-vqav2-448
Image-Text-to-Text
•
Updated
about 1 month ago
•
2.95k
•
7
google/paligemma-3b-ft-vizwizvqa-448
Image-Text-to-Text
•
Updated
about 1 month ago
•
53
•
1
google/paligemma-3b-mix-448
Image-Text-to-Text
•
Updated
27 days ago
•
13.3k
•
47
google/paligemma-3b-pt-448
Image-Text-to-Text
•
Updated
27 days ago
•
3.21k
•
16
google/paligemma-3b-ft-cococap-448
Image-Text-to-Text
•
Updated
about 1 month ago
•
2.8k
•
2
gokaygokay/paligemma-rich-captions
Image-Text-to-Text
•
Updated
28 days ago
•
1.16k
•
5
Lin-Chen/open-llava-next-llama3-8b
Image-Text-to-Text
•
Updated
17 days ago
•
2.61k
•
19
yifanzhang114/SliME-vicuna-7B
Image-Text-to-Text
•
Updated
about 9 hours ago
•
38
•
1
yifanzhang114/SliME-Llama3-8B
Image-Text-to-Text
•
Updated
about 9 hours ago
•
1
BUAADreamer/PaliGemma-3B-Chat-v0.2
Image-Text-to-Text
•
Updated
8 days ago
•
55
•
4
llava-hf/LLaVA-NeXT-Video-34B-hf
Image-Text-to-Text
•
Updated
6 days ago
•
1
•
1
rulins/blip2-t5-llava
Image-Text-to-Text
•
Updated
Apr 21
•
1
Previous
1
2
3
4
...
11
Next