- University of Toronto
- Toronto, Canada
- https://georgegu1997.github.io/
Stars
Code release for "Cut and Learn for Unsupervised Object Detection and Instance Segmentation" and "VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation"
Code for "Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers"
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
Empowering Multimodal LLMs with Set-of-Mark Prompting and Improved Visual Reasoning Ability.
OpenEQA: Embodied Question Answering in the Era of Foundation Models
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Grounded Segment Anything: From Objects to Parts
This repo accompanies the research paper, ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data and contains the data, scripts to visualize and proces…
A new markup-based typesetting system that is powerful and easy to learn.
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models
[CVPR'24] Group Anything with Radiance Fields
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
Open-Sora: Democratizing Efficient Video Production for All
Gauzilla: a 3D Gaussian Splatting renderer written in Rust for WebAssembly with lock-free multithreading
A script for cloning a non-relocatable virtualenv. Originated here: https://gist.github.com/860822
PyTorch code and models for V-JEPA self-supervised learning from video.
The official PyTorch implementation of Google's Gemma models
[CVPR 2024] Real-Time Open-Vocabulary Object Detection