Skip to content
View MaureenZOU's full-sized avatar
🐿️
愉快搬砖 : )
🐿️
愉快搬砖 : )

Highlights

  • Pro
Block or Report

Block or report MaureenZOU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 955 53 Updated Jun 28, 2024

Empowering Multimodal LLMs with Set-of-Mark Prompting and Improved Visual Reasoning Ability.

Python 91 1 Updated May 9, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 19,843 1,884 Updated Jun 27, 2024

Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"

Python 889 89 Updated Mar 2, 2024

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Python 263 18 Updated Mar 23, 2024
Python 533 25 Updated Feb 15, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Python 3,770 290 Updated Jun 20, 2024
Python 290 11 Updated Jan 22, 2024

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 304 10 Updated Apr 8, 2024

API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 1,969 120 Updated Jun 25, 2024

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 652 50 Updated Feb 1, 2024

LLaVA-Interactive-Demo

Python 328 25 Updated Jun 10, 2024

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

JavaScript 912 84 Updated Jan 31, 2024

Set-of-Mark Prompting for LMMs

Python 1,008 80 Updated Jun 5, 2024
Python 8,180 475 Updated Jan 27, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,341 356 Updated Mar 20, 2024

✨✨Latest Advances on Multimodal Large Language Models

10,300 691 Updated Jun 28, 2024

Emu Series: Generative Multimodal Models from BAAI

Python 1,556 79 Updated Mar 8, 2024

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,273 119 Updated Oct 5, 2023

Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,036 102 Updated Mar 21, 2024

[CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation

Python 654 16 Updated Sep 5, 2023

Fast Segment Anything

Python 7,072 663 Updated Jun 25, 2024

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 1,890 198 Updated Jun 27, 2024

Generate 3D objects conditioned on text or images

Python 11,427 908 Updated Jun 22, 2024

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,467 98 Updated Aug 16, 2023

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,171 351 Updated Apr 9, 2024
JavaScript 3 Updated Apr 10, 2023

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,029 1,295 Updated May 23, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 45,277 5,347 Updated Jun 24, 2024
Next