
Open-Sora: Democratizing Efficient Video Production for All

Python 20,783 1,966 Updated Jul 16, 2024

Stable Video Diffusion Training Code and Extensions.

Python 476 45 Updated Jul 15, 2024

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Python 871 52 Updated Jan 2, 2024

Multimodal Models in Real World

Jupyter Notebook 339 17 Updated Jul 12, 2024

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 181 5 Updated Jun 7, 2024

Text-to-video generation. The repo for the ICLR 2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

Python 3,552 381 Updated Jun 14, 2023

[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,160 169 Updated Jul 19, 2024

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,737 149 Updated Jul 17, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 3,997 388 Updated Jul 17, 2024

Official code for the paper "LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes".

Python 1,280 96 Updated Jun 14, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 10,932 975 Updated Jul 19, 2024

[ICCV 2023] Online Clustered Codebook

Python 126 5 Updated Dec 1, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,151 1,454 Updated Feb 29, 2024

My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"

Python 149 8 Updated Jun 17, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 495 29 Updated Jun 26, 2024

Repository for 360 panorama image generation based on Stable Diffusion

Python 173 24 Updated May 20, 2024

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 449 15 Updated Jun 26, 2024

🎥 Python and OpenCV-based scene cut/transition detection program & library.

Python 3,008 376 Updated Jul 8, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,448 521 Updated Mar 12, 2024

The uncompromising Python code formatter

Python 37,872 2,395 Updated Jul 15, 2024

Modern, extensible Python project management

Python 5,729 280 Updated Jul 17, 2024

Generative Models by Stability AI

Python 23,382 2,590 Updated Jul 9, 2024

Iterable datapipelines for pytorch training.

Python 71 17 Updated Nov 1, 2023

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 2,910 197 Updated Mar 9, 2024

Mamba SSM architecture

Python 11,795 972 Updated Jul 19, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,604 90 Updated Jun 6, 2024

[ICLR 2024] EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling (https://arxiv.org/abs/2310.04691)

Python 109 13 Updated Mar 7, 2024

DeepSeek LLM: Let there be answers

Makefile 1,337 88 Updated Feb 4, 2024

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Python 870 52 Updated Jun 19, 2023