-
NJU
- Nanjing
Block or Report
Block or report sijeh
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Open-Sora: Democratizing Efficient Video Production for All
Stable Video Diffusion Training Code and Extensions.
[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
An efficient video loader for deep learning with smart shuffling that's super easy to digest
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
High-Resolution Image Synthesis with Latent Diffusion Models
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Implementation of MagViT2 Tokenizer in Pytorch
repository for 360 panorama image generation based on Stable Diffusion
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
🎥 Python and OpenCV-based scene cut/transition detection program & library.
Official Code for Stable Cascade
Generative Models by Stability AI
Iterable datapipelines for pytorch training.
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
[ICLR 2024]EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling(https://arxiv.org/abs/2310.04691)
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)