⚗️ Llava 13b model repository trained by liuhaotian managed by DVC
Updated May 15, 2024 · Python
Localized Multimodal Large Language Model (MLLM) integrated with Streamlit and Ollama for text and image processing tasks.
Image analysis with Ollama (AI models) from within Home Assistant
Text-to-text / image-to-text architecture prompt generator for Stable Diffusion or any image generation platform, based on Ollama LLMs
⚗️ Zephyr 7b model repository trained by HuggingFaceH4 managed by DVC
Leveraging state-of-the-art (SOTA) large language models (LLMs) and orchestration frameworks like LangChain, Vox optimizes multimodal interactions, enhancing communication tasks with personalized assistance and enriched user experiences.
Generate text from images and translate text from English to Arabic
Build your own ChatGPT locally, with the ability to interact with images, PDFs, and audio files
Pheye - a family of efficient small vision-language models
Computer Vision Research for Multimedia Understanding at DSO National Laboratories Internship 2023 under the DSTA JC Scholarship
Python-based WebSocket for CLI LLaVA inference.
Upload images to Slack with automatic alt text generation using Llava on Ollama
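A workflow like the Slack alt-text repository above can be sketched against Ollama's `/api/generate` endpoint, which accepts base64-encoded images for multimodal models such as Llava. This is a minimal illustrative sketch, not the repository's actual code; the prompt wording, function names, and default host are assumptions.

```python
import base64
import json
import urllib.request


def build_alt_text_request(image_path: str, model: str = "llava") -> dict:
    """Build an Ollama /api/generate payload asking Llava to describe an image."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("ascii")
    return {
        "model": model,
        # Hypothetical prompt; tune for your own alt-text style guidelines.
        "prompt": "Describe this image in one concise sentence for use as alt text.",
        "images": [image_b64],
        "stream": False,  # ask for a single JSON response instead of a stream
    }


def generate_alt_text(image_path: str, host: str = "http://localhost:11434") -> str:
    """Send the request to a locally running Ollama server and return the text."""
    payload = json.dumps(build_alt_text_request(image_path)).encode("utf-8")
    req = urllib.request.Request(
        f"{host}/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"].strip()
```

The returned string could then be attached as the `alt_text` when uploading the file to Slack; the Slack upload step itself is omitted here.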
Joint work as part of a bachelor's thesis on utilizing a combination of NLP and CV methods in implementing multimodal approaches to combat hate speech in memes.
Vision Language Dataset Construction Library for Remote Sensing Domain
Explore the rich flavors of Indian desserts with TunedLlavaDelights. Using LLaVA fine-tuning, our project unveils detailed nutritional profiles, taste notes, and optimal consumption times for beloved sweets. Dive into a fusion of AI innovation and culinary tradition.