Stars
We introduce LT3SD, a novel latent 3D scene diffusion approach enabling high-fidelity generation of infinite 3D environments in a patch-by-patch and coarse-to-fine fashion.
Learn to create a desktop app with Python and Qt
Python package for importing and loading external assets into AI2THOR
GRUtopia: Dream General Robots in a City at Scale
The code of "[TPAMI] SceneHGN: Hierarchical Graph Networks for 3D Indoor Scene Generation with Fine-Grained Geometry"
[CVPR 2024✨Highlight] Official repository for HOLD, the first method that jointly reconstructs articulated hands and objects from monocular videos without assuming a pre-scanned object template and…
A modular graph-based Retrieval-Augmented Generation (RAG) system
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
The Complete Street Rule for ArcGIS CityEngine is a scenario-oriented design tool intended to enable users to quickly create procedurally generated multimodal streets.
[ACL2023 Area Chair Award] Official repo for the paper "Tell2Design: A Dataset for Language-Guided Floor Plan Generation".
Official code for VisProg (CVPR 2023 Best Paper!)
Utility functions for working with Ai2-THOR. Try to do one thing once.
CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.
Open-Sora: Democratizing Efficient Video Production for All
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
OmniGibson: a platform for accelerating Embodied AI research built upon NVIDIA's Omniverse engine. Join our Discord for support: https://discord.gg/bccR5vGFEx
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
PyViz3D is a web-based visualizer for 3D objects and point clouds.
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Infinite Photorealistic Worlds using Procedural Generation
[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"
[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".