🍭
A done thesis is better than a perfect thesis.

Starred repositories

Scaling Diffusion Transformers with Mixture of Experts

Python 93 3 Updated Jul 21, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,027 75 Updated Jul 7, 2024

CUBE is a benchmark to evaluate the Cultural Competence of T2I models

3 Updated Jul 18, 2024

Recursive Visual Programming

Python 8 Updated Jul 14, 2024

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,224 219 Updated Jun 14, 2024

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 114 4 Updated Jul 20, 2024

[ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Python 18 Updated Jul 19, 2024

This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https://openreview.net/pdf?id=P5D2gfi4Gg

Python 6 Updated Jul 5, 2024

Understand Human Behavior to Align True Needs

Python 2,877 242 Updated Jul 20, 2024
HTML 10 Updated Jul 10, 2024

Vico: Compositional Video Generation as Flow Equalization

Python 39 Updated Jul 9, 2024

Reward Guided Latent Consistency Distillation

Python 12 Updated May 28, 2024

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Python 37 3 Updated Jul 10, 2024
Python 43 1 Updated Jul 12, 2024

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,676 324 Updated Jun 12, 2024
Python 113 1 Updated Jul 15, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 762 40 Updated Jul 14, 2024

Kolors Team

Python 2,583 146 Updated Jul 19, 2024

Dive into LLMs (动手学大模型): a series of hands-on programming tutorials

2,666 220 Updated Jul 3, 2024

[ECCV 2024] Official PyTorch implementation of the paper "ParCo: Part-Coordinating Text-to-Motion Synthesis": https://arxiv.org/abs/2403.18512

Python 35 1 Updated Jul 1, 2024

Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024

Python 46 2 Updated Jun 12, 2024
Python 37 1 Updated Jun 27, 2024

Awesome List of Consistency Models

19 Updated Jul 1, 2024

Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 329 11 Updated Jul 16, 2024

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

62 3 Updated Jul 21, 2024

[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google

12 Updated Jul 9, 2024
Python 39 Updated Apr 10, 2024

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou

Python 64 3 Updated Jun 11, 2024