Block or Report
Block or report isfinne
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Example applications, microservices, and code samples for the Internet Computer
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution
Survey on Data-centric Large Language Models
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
Evaluation code for Ref-L4, a new REC benchmark in the LMM era
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
An Open-source Toolkit for LLM Development
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
A Framework of Small-scale Large Multimodal Models
The official implementation of Hierarchical Semantic Decoding with Counting Assitance for Generalized Referring Expression Segmentation
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
[ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"
We write your reusable computer vision tools. 💜
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
✨✨Latest Advances on Multimodal Large Language Models
A benchmark dataset for GRES and GREC [CVPR2023 Highlight]
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
evm 系列 以太坊 bsc matic avax okx 等 区块链 通用 快速 打铭文工具