Skip to content
View darkpromise98's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report darkpromise98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Multimodal Large Language Models: A Survey

225 8 Updated Aug 16, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 366 35 Updated Aug 28, 2024

Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in LVLMs"

Python 8 Updated Jul 21, 2024

This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of rel…

29 3 Updated Jul 26, 2024

This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.

Python 66 2 Updated Mar 28, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 625 35 Updated Aug 5, 2024

[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".

Python 200 10 Updated Jun 13, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 621 36 Updated Aug 22, 2024
HTML 63 6 Updated May 10, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 197 6 Updated Sep 2, 2024

[ACL 2024 πŸ”₯] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,142 97 Updated Aug 27, 2024

Explainability for Vision Transformers

Python 805 93 Updated Mar 12, 2022

A RLHF Infrastructure for Vision-Language Models

Python 85 5 Updated Jun 12, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,178 713 Updated Aug 5, 2024

When do we not need larger vision models?

Python 306 9 Updated Aug 19, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,497 124 Updated Aug 4, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,158 277 Updated May 4, 2024

Dense Connector for MLLMs

Python 96 3 Updated Aug 19, 2024

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 1,901 116 Updated Sep 3, 2024

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Python 127 10 Updated Jun 8, 2024

πŸ“– A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

340 10 Updated Aug 20, 2024

πŸ™Œ OpenHands: Code Less, Make More

Python 30,944 3,566 Updated Sep 3, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 1,984 187 Updated Apr 24, 2024

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

71 2 Updated Jun 23, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,894 121 Updated May 15, 2024

πŸ”₯πŸ”₯ LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 786 55 Updated Jul 10, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,305 81 Updated Sep 3, 2024

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Python 275 17 Updated Jul 19, 2024

A family of lightweight multimodal models.

Python 868 66 Updated Aug 2, 2024

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 2,286 326 Updated Feb 5, 2024
Next