Website for data science undergraduate capstone project "Improving Performance of Vision Encoding Large Language Models with Contextual Prompts" @UCSD HDSI 2024
Updated Mar 17, 2024 - HTML
A simple, fun app that generates a roast of your LinkedIn profile
⚗️ LLaVA 13B model repository, trained by liuhaotian and managed by DVC
List of AI tools that can interact with user interfaces
A unique blend of features from your favorite social media platforms, such as Facebook, Twitter, Reddit, and Instagram, all in one convenient place
Vox uses state-of-the-art (SOTA) large language models (LLMs) and orchestration frameworks like LangChain to optimize multimodal interactions, enhancing communication tasks with personalized assistance and enriched user experiences.
Generate text from images and translate text from English to Arabic
Computer Vision Research for Multimedia Understanding at DSO National Laboratories Internship 2023 under the DSTA JC Scholarship
Python-based WebSocket for CLI LLaVA inference.
Build your own ChatGPT locally, with the ability to interact with images, PDFs, and audio
Pheye - a family of efficient small vision-language models
Ask LLaMA about the image in your clipboard
⚗️ Zephyr 7B model repository, trained by HuggingFaceH4 and managed by DVC
Text-to-text / image-to-text architecture prompt generator for Stable Diffusion or any image generation platform, based on Ollama LLM
Visualizing the attention of vision-language models
Hands-on with some multimodal models