Skip to content
View evelinehong's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report evelinehong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Python 1,276 108 Updated Jul 14, 2024

Dynamic Thresholding (CFG Scale Fix) for Stable Diffusion (eSwarmUI, ComfyUI, and Auto WebUI)

Python 1,082 102 Updated Aug 3, 2024

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,223 174 Updated Jul 19, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,387 322 Updated Jul 10, 2024
Python 77 4 Updated Mar 31, 2024

Stable Video Diffusion Training Code and Extensions.

Python 501 45 Updated Jul 25, 2024

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

Python 770 44 Updated Feb 3, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 2,818 249 Updated Jun 25, 2024

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 648 105 Updated Dec 14, 2023

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,139 105 Updated Apr 10, 2024

Official implementation for CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Python 39 4 Updated Nov 7, 2023

Official code release for ConceptGraphs

Python 333 55 Updated Jul 26, 2024

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 869 53 Updated Jun 6, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,302 2,458 Updated Jul 15, 2024

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Python 71 3 Updated Jan 20, 2024

Code for CVPR 2021 oral paper "Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts"

Python 219 29 Updated Jun 30, 2022

[ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes

Python 916 65 Updated Apr 21, 2024

An open-source framework for training large multimodal models.

Python 3,605 273 Updated May 25, 2024

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,616 188 Updated Dec 5, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,223 4,024 Updated Jul 17, 2024

Codes for Switch-NeRF (ICLR 2023)

Python 191 7 Updated Nov 24, 2023

3D generation on ImageNet [ICLR 2023]

Python 207 9 Updated May 23, 2023

gradslam is an open source differentiable dense SLAM library for PyTorch

Python 1,301 155 Updated Sep 2, 2023

Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"

Python 15 Updated Feb 13, 2023

Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training

Python 159 16 Updated Apr 27, 2023

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch

Python 1,180 60 Updated Oct 18, 2022

🔥Urban-scale point cloud dataset (CVPR 2021 & IJCV 2022)

C++ 480 57 Updated Jul 22, 2022

Audio Visual Floorplan Reconstruction

Python 12 1 Updated Sep 28, 2021

Direct voxel grid optimization for fast radiance field reconstruction.

Python 1,034 110 Updated May 15, 2023
Next