Highlights
- Pro
Block or Report
Block or report MaureenZOU
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Empowering Multimodal LLMs with Set-of-Mark Prompting and Improved Visual Reasoning Ability.
Open-Sora: Democratizing Efficient Video Production for All
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"
GenSim: Generating Robotic Simulation Tasks via Large Language Models
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
✨✨Latest Advances on Multimodal Large Language Models
Emu Series: Generative Multimodal Models from BAAI
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Generate 3D objects conditioned on text or images
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.