Block or Report
Block or report zwhus
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Evaluation code and datasets for the ACL 2024 paper, VISTA: Visualized Text Embedding for Universal Multi-Modal Retrieval. The original code and model can be accessed at FlagEmbedding.
Retrieval and Retrieval-augmented LLMs
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Boosting Driving Scene Understanding with Advanced Vision-Language Models
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Awesome Incremental Learning
[AAAI2024]Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
LAVIS - A One-stop Library for Language-Vision Intelligence
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
GIT: A Generative Image-to-text Transformer for Vision and Language
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
The pytorch re-implement of the official efficientdet with SOTA performance in real time and pretrained weights.
OpenMMLab Detection Toolbox and Benchmark