
[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Python 237 11 Updated Jul 9, 2024
Python 3 Updated Aug 23, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

720 18 Updated Jul 31, 2024

😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D

20 1 Updated Jun 25, 2024

[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Python 378 26 Updated Aug 13, 2024

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,548 99 Updated Aug 20, 2024

Generalizable 3D Scene Reconstruction via Divide and Conquer from a Single View

Python 95 2 Updated Jul 15, 2024

A large-scale NOCS dataset.

Jupyter Notebook 46 4 Updated Jul 12, 2024

The codebase for our paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Python 37 2 Updated Aug 8, 2024

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Python 43 4 Updated Jul 10, 2024

Official Repository of Multi-Object Hallucination in Vision-Language Models

Python 19 1 Updated Aug 2, 2024

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 2,158 114 Updated Aug 20, 2024

Daily updates of NeRF-related papers on arXiv

Python 174 7 Updated Sep 4, 2024
Python 20 Updated Mar 17, 2024

Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"

Python 134 Updated Jun 13, 2024

Chat with NeRF enables users to interact with a NeRF model by typing in natural language.

Python 285 19 Updated Apr 17, 2024

Official Implementation of 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs

27 Updated Jun 13, 2024
Python 494 23 Updated Jun 19, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 14,653 1,354 Updated Aug 31, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,317 109 Updated Jul 17, 2024

Dream2DGS

Python 107 4 Updated Jun 13, 2024
Python 153 10 Updated Aug 16, 2024

Code for Toon3D https://toon3d.studio/

Python 189 9 Updated Jun 13, 2024
Python 444 25 Updated Nov 29, 2023

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,062 80 Updated Aug 20, 2024

A unified framework for 3D content generation.

Python 6,101 469 Updated Aug 9, 2024

Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.

Python 248 7 Updated Aug 6, 2024

Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)

Python 131 10 Updated May 3, 2024

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Python 859 83 Updated Jun 13, 2024