Skip to content
View georgegu1997's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report georgegu1997

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Segment Everything All at Once

Python 82 3 Updated Jul 9, 2024
MATLAB 676 188 Updated Jul 9, 2024

Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"

Python 895 90 Updated Mar 2, 2024

Code for "Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers"

Python 69 4 Updated May 16, 2024

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Python 201 6 Updated Jul 1, 2024

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,022 99 Updated Jul 2, 2024

Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want

Python 46 2 Updated Apr 3, 2024

Empowering Multimodal LLMs with Set-of-Mark Prompting and Improved Visual Reasoning Ability.

Python 91 2 Updated May 9, 2024

OpenEQA Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 184 13 Updated May 31, 2024

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 602 20 Updated Jun 13, 2024

Grounded Segment Anything: From Objects to Parts

Jupyter Notebook 374 17 Updated May 19, 2023

OpenSUN3D Workshop Challenge - CVPR '24

Python 16 Updated May 31, 2024
Python 22 1 Updated May 18, 2024

Grounded Language-Image Pre-training

Python 2,071 186 Updated Jan 24, 2024

SceneFun3D ToolKit

50 Updated Apr 21, 2024

This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…

Python 611 55 Updated Dec 18, 2023
JavaScript 2 Updated Jul 8, 2024

The official Meta Llama 3 GitHub site

Python 23,017 2,437 Updated Jul 3, 2024

A new markup-based typesetting system that is powerful and easy to learn.

Rust 29,911 819 Updated Jul 9, 2024

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Python 224 7 Updated Apr 22, 2024

[CVPR'24] Group Anything with Radiance Fields

Python 349 27 Updated Jun 25, 2024

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Jupyter Notebook 1,291 69 Updated Jun 28, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,391 1,933 Updated Jul 9, 2024

Webpage

JavaScript 3 Updated Jul 8, 2024

Gauzilla: a 3D Gaussian Splatting renderer written in Rust for WebAssembly with lock-free multithreading

Rust 256 21 Updated May 31, 2024

A script for cloning a non-relocatable virtualenv. originated here: https://gist.github.com/860822

Python 209 56 Updated Dec 19, 2023

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,546 242 Updated Jul 5, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,157 490 Updated Jul 9, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 3,940 383 Updated Jul 8, 2024
Next