yu20103983

Starred repositories

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 236 7 Updated Sep 30, 2024

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,050 83 Updated Jan 23, 2024

Segment Anything for Stable Diffusion WebUI

Python 3,395 205 Updated Apr 30, 2024

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Python 3,081 255 Updated Sep 22, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,520 335 Updated Jul 10, 2024

A Survey on Text-to-Video Generation/Synthesis.

600 80 Updated Jul 24, 2024

Generative Models by Stability AI

Python 24,393 2,714 Updated Sep 4, 2024

An arbitrary face-swapping framework on images and videos with one single trained model!

Python 4,505 891 Updated Aug 6, 2024

Real-time face swap for PC streaming or video calls

Python 26,498 4,532 Updated Jul 28, 2023

Industry leading face manipulation platform

Python 18,941 2,868 Updated Oct 18, 2024

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 3,949 456 Updated Oct 15, 2024

Booru style tag autocompletion for AUTOMATIC1111's Stable Diffusion web UI

JavaScript 2,575 305 Updated Sep 4, 2024

WebUI extension for ControlNet

Python 16,968 1,954 Updated Aug 12, 2024

Nightly release of ControlNet 1.1

Python 4,702 372 Updated Aug 8, 2024

✨✨Latest Advances on Multimodal Large Language Models

12,260 782 Updated Oct 16, 2024

AI based multi-label girl image classification system, implemented by using TensorFlow.

Python 2,621 260 Updated Aug 27, 2024

A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.

Python 1,121 122 Updated Mar 29, 2024

Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.

Python 1,254 133 Updated Oct 18, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,181 5,586 Updated Sep 18, 2024

An Autonomous LLM Agent for Complex Task Solving

Python 8,102 831 Updated Aug 12, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 38,829 5,009 Updated Oct 10, 2024

T2I-Adapter

Python 3,447 206 Updated Jun 21, 2024

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,828 2,203 Updated Jun 26, 2024

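This project implements Wav2lip video lip synthesis based on SadTalkers. It generates lip shapes driven by speech from a video file, applies a configurable enhancement to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips, and uses the DAIN deep-learning frame-interpolation algorithm to fill in frames of the generated video, smoothing the motion transitions between frames so that the synthesized lips look more fluid, realistic, and natural.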

Python 1,852 318 Updated Jun 4, 2023

Extension to edit dataset captions for SD web UI by AUTOMATIC1111

Python 688 56 Updated Jun 27, 2024

Labeling extension for Automatic1111's Web UI

Python 1,328 232 Updated Jul 17, 2023

Labeling extension for Automatic1111's Web UI

Python 587 71 Updated May 14, 2024

📷 EasyPhoto | Your Smart AI Photo Generator.

Python 4,953 390 Updated Jul 10, 2024

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 8,964 843 Updated Oct 17, 2024