Skip to content
View songweige's full-sized avatar

Highlights

  • Pro

Block or report songweige

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?

Python 75 2 Updated Nov 8, 2024

A Video Tokenizer Evaluation Dataset

Python 43 2 Updated Nov 6, 2024

A suite of image and video neural tokenizers

Python 764 17 Updated Nov 13, 2024

Patch convolution to avoid large GPU memory usage of Conv2D

Python 79 5 Updated May 26, 2024

ElasticTok: Adaptive Tokenization for Image and Video

Python 32 Updated Nov 4, 2024

FlashTex: Fast Relightable Mesh Texturing with LightControlNet

Python 83 2 Updated Sep 26, 2024

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 316 43 Updated Oct 30, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,350 5,941 Updated Aug 19, 2024

Official inference repo for FLUX.1 models

Python 15,899 1,156 Updated Nov 14, 2024

Ongoing research training transformer models at scale

Python 10,563 2,360 Updated Nov 15, 2024

Official Implementation of Rethinking Score Distillation as a Bridge Between Image Distributions

Python 60 3 Updated Jul 7, 2024

Evaluating text-to-image/video/3D models with VQAScore

Python 227 20 Updated Sep 9, 2024

[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 259 7 Updated Jul 9, 2024

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition (ICLR 2024)

Python 27 Updated May 14, 2024

A framework for 4D reconstruction from monocular videos.

Python 269 17 Updated Sep 23, 2024

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 747 119 Updated Oct 10, 2024

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Python 1,633 187 Updated Sep 8, 2024

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 589 23 Updated Nov 4, 2024

Finetune ModelScope's Text To Video model using Diffusers 🧨

Python 666 107 Updated Dec 14, 2023

Machine Learning Engineering Open Book

Python 11,641 711 Updated Nov 12, 2024

Web-based 3D visualization + Python

Python 835 49 Updated Nov 14, 2024

Consistency Distilled Diff VAE

Python 2,136 75 Updated Nov 7, 2023

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

Python 1,555 212 Updated Apr 9, 2024

Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).

Python 303 15 Updated May 14, 2024

get things from one computer to another, safely

Python 20,385 643 Updated Nov 13, 2024

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,623 1,214 Updated Aug 13, 2024

Text2Cinemagraph: Text-Guided Synthesis of Eulerian Cinemagraphs [SIGGRAPH ASIA 2023]

Python 370 44 Updated Oct 7, 2023

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,374 136 Updated Dec 8, 2023

Stable Diffusion web UI

Python 142,890 26,934 Updated Nov 6, 2024
Python 475 50 Updated Mar 4, 2024
Next