llava

Here are 85 public repositories matching this topic...

open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks

computer-vision evaluation pytorch gemini openai vqa vit gpt multi-modal clip claude openai-api gpt4 large-language-models llm chatgpt llava qwen gpt-4v

Updated Jul 18, 2024
Python

modelscope / swift

Star

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

agent deploy llama lora gemma peft multimodal sft dpo pre-training awq llm modelscope llava ollama qwen2 unsloth llama3 glm4 internvl

Updated Jul 18, 2024
Python

wangclnlp / Vision-LLM-Alignment

Star

This repo contains the codes for supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) designed for vision LLMs.

vision alignment multi-model reward ppo sft dpo llm rlhf mllm llava

Updated Jul 18, 2024
Python

modelscope / data-juicer

Star

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Updated Jul 18, 2024
Python

jcassady / llava-benchmark

Star

LLaVA Bechmark evaluates image & audio processing capabilities of AI models with Ollama.

python ai pytest llava ollama

Updated Jul 18, 2024
Python

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

image-to-text clip text-to-image dit multimodal sora text-to-video aigc stable-diffusion controlnet llava blip2 minigpt4 sd-xl ppdiffusers eva-clip stablevideodiffusion qwen-vl

Updated Jul 18, 2024
Python

jakobdylanc / discord-llm-chatbot

Sponsor

Star

llmcord.py • Talk to LLMs with your friends!

bot ai discord chatbot openai llama gpt streamed clyde gpt-4 llm chatgpt llava llamacpp oobabooga ollama litellm llmcord llama3 gpt-4o

Updated Jul 17, 2024
Python

InternLM / xtuner

Star

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent chatbot conversational-ai peft baichuan msagent large-language-models llm supervised-finetuning llava llm-training chatglm2 internlm llama2 qwen chatglm3 mixtral llama3 phi3

Updated Jul 17, 2024
Python

TinyLLaVA / TinyLLaVA_Factory

Star

A Framework of Small-scale Large Multimodal Models

nlp transformers llama vision-language llava large-multimodal-models tinyllama

Updated Jul 17, 2024
Python

WisconsinAIVision / ViP-LLaVA

Star

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

chatbot llama multi-modal clip vision-language gpt-4 foundation-models visual-prompting llava llama2 cvpr2024 gpt-4-vision

Updated Jul 17, 2024
Python

the-smart-home-maker / hass_ollama_image_analysis

Star

Image analysis with Ollama (AI models) from within Home Assistant

ai image-processing home-assistant hacs-integration llava ollama

Updated Jul 17, 2024
Python

Victorwz / MLM_Filter

Star

Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".

data-filtering data-quality-assessment large-language-models llava multimodal-large-language-models image-text-data

Updated Jul 16, 2024
Python

Uminosachi / open-llm-webui

Star

This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs).

nlp chatbot transformers llama language-model gradio huggingface llm llava ggml llama2 llama3 llava-llama3

Updated Jul 16, 2024
Python

LostXine / LLaRA

Star

LLaRA: Large Language and Robotics Assistant

robotics behavioral-cloning vlm self-supervised-learning instruction-tuning llava

Updated Jul 16, 2024
Python

apocas / restai

Sponsor

Star

RestAI is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex, Ollama and HF Pipelines. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama. Precise embeddings usage and tuning.

python transformers embeddings openai llama rag fastapi llm stable-diffusion langchain openaiapi llava llamaindex ollama

Updated Jul 15, 2024
Python

NotYuSheng / Multimodal-Large-Language-Model

Sponsor

Star

Localized Multimodal Large Language Model (MLLM) integrated with Streamlit and Ollama for text and image processing tasks.

multimodal large-language-models llm llava multimodal-large-language-models ollama visual-large-language-models

Updated Jul 15, 2024
Python

haotian-liu / LLaVA

Star

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

chatbot llama multimodal multi-modality gpt-4 foundation-models visual-language-learning chatgpt instruction-tuning vision-language-model llava llama2 llama-2

Updated Jul 14, 2024
Python

eliranwong / freegenius

Star

FreeGenius AI, an advanced AI assistant that can talk and take multi-step actions. Supports numerous open-source LLMs via Llama.cpp or Ollama or Groq Cloud API, with optional integration with AutoGen agents, OpenAI API, Google Gemini Pro and unlimited plugins.

google ai gemini vision openai mistral autogen groq stable-diffusion chatgpt llava llamacpp ollama llama3

Updated Jul 14, 2024
Python

mbzuai-oryx / VideoGPT-plus

Star

Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding

chatbot clip image-encoder video-encoder multimodal dual-encoder vision-language vicuna gpt4 vision-language-pretraining llava video-conversation video-chatbot llama3 gpt4o phi-3-mini

Updated Jul 14, 2024
Python

FuxiaoLiu / MMC

Star

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

chart benchmark resource stock dataset arxiv gpt otter multimodal instruction-tuning llava minigpt4 mplug-owl

Updated Jul 11, 2024
Python

Improve this page

Add a description, image, and links to the llava topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llava topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llava

Here are 85 public repositories matching this topic...

open-compass / VLMEvalKit

modelscope / swift

wangclnlp / Vision-LLM-Alignment

modelscope / data-juicer

jcassady / llava-benchmark

PaddlePaddle / PaddleMIX

jakobdylanc / discord-llm-chatbot

InternLM / xtuner

TinyLLaVA / TinyLLaVA_Factory

WisconsinAIVision / ViP-LLaVA

the-smart-home-maker / hass_ollama_image_analysis

Victorwz / MLM_Filter

Uminosachi / open-llm-webui

LostXine / LLaRA

apocas / restai

NotYuSheng / Multimodal-Large-Language-Model

haotian-liu / LLaVA

eliranwong / freegenius

mbzuai-oryx / VideoGPT-plus

FuxiaoLiu / MMC

Improve this page

Add this topic to your repo