ArcherFMY
💭 Fighting
  • Hangzhou, Zhejiang, China


Text-to-Music Generation with Rectified Flow Transformers

Python 1,297 97 Updated Sep 6, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

Python 392 35 Updated Jul 26, 2024

Your image is almost there!

Python 7,186 416 Updated Jul 26, 2024

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Python 3,265 282 Updated Aug 15, 2024

More relighting!

Python 4,835 326 Updated Jun 27, 2024

Create Magic Story!

Jupyter Notebook 5,761 574 Updated Jul 24, 2024

Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".

Python 723 23 Updated Apr 7, 2024

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Python 1,347 136 Updated Jul 29, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 30,533 3,755 Updated Sep 9, 2024

A state-of-the-art open visual language model | multimodal pretrained model

Python 5,845 401 Updated May 29, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,746 517 Updated Jul 17, 2024

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 5,372 791 Updated May 13, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 11,231 1,002 Updated Sep 10, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,628 170 Updated Sep 9, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,976 533 Updated May 31, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,672 168 Updated Aug 1, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,120 188 Updated Sep 4, 2024

Unofficial Implementation of Animate Anyone

Python 2,895 233 Updated Jul 9, 2024

Official implementation of DreaMoving

1,790 99 Updated Jan 9, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,295 958 Updated Jul 26, 2024
Python 371 82 Updated Jul 17, 2024
Python 7,636 497 Updated Apr 14, 2024

[CVPR 2023] 3D Cinemagraphy from a Single Image

Python 256 12 Updated May 5, 2024

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,612 5,525 Updated Sep 3, 2024

Using low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 6,947 480 Updated Mar 22, 2024

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,282 132 Updated Oct 5, 2023
Jupyter Notebook 3,045 285 Updated May 14, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,622 941 Updated Aug 23, 2024

Simple image captioning model

Jupyter Notebook 1,283 214 Updated Jun 9, 2024