yu20103983

yu20103983

Achievements

Starred repositories

tianyi-lab / HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Python 236 7 Updated Sep 30, 2024

hotshotco / Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Python 1,050 83 Updated Jan 23, 2024

continue-revolution / sd-webui-segment-anything

Segment Anything for Stable Diffusion WebUI

Python 3,395 205 Updated Apr 30, 2024

continue-revolution / sd-webui-animatediff

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Python 3,081 255 Updated Sep 22, 2024

AILab-CVC / VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,520 335 Updated Jul 10, 2024

jianzhnie / awesome-text-to-video

A Survey on Text-to-Video Generation/Synthesis.

600 80 Updated Jul 24, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 24,393 2,714 Updated Sep 4, 2024

neuralchen / SimSwap

An arbitrary face-swapping framework on images and videos with one single trained model!

Python 4,505 891 Updated Aug 6, 2024

iperov / DeepFaceLive

Real-time face swap for PC streaming or video calls

Python 26,498 4,532 Updated Jul 28, 2023

facefusion / facefusion

Industry leading face manipulation platform

Python 18,941 2,868 Updated Oct 18, 2024

CVHub520 / X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 3,949 456 Updated Oct 15, 2024

Coyote-A / ultimate-upscale-for-automatic1111

Python 1,638 163 Updated Jun 30, 2024

DominikDoom / a1111-sd-webui-tagcomplete

Booru style tag autocompletion for AUTOMATIC1111's Stable Diffusion web UI

JavaScript 2,575 305 Updated Sep 4, 2024

Mikubill / sd-webui-controlnet

WebUI extension for ControlNet

Python 16,968 1,954 Updated Aug 12, 2024

lllyasviel / ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Python 4,702 372 Updated Aug 8, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

12,260 782 Updated Oct 16, 2024

KichangKim / DeepDanbooru

AI based multi-label girl image classification system, implemented by using TensorFlow.

Python 2,621 260 Updated Aug 27, 2024

open-mmlab / playground

A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.

Python 1,121 122 Updated Mar 29, 2024

yatengLG / ISAT_with_segment_anything

Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

Python 1,254 133 Updated Oct 18, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 47,181 5,586 Updated Sep 18, 2024

OpenBMB / XAgent

An Autonomous LLM Agent for Complex Task Solving

Python 8,102 831 Updated Aug 12, 2024

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 38,829 5,009 Updated Oct 10, 2024

TencentARC / T2I-Adapter

T2I-Adapter

Python 3,447 206 Updated Jun 21, 2024

OpenTalker / SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 11,828 2,203 Updated Jun 26, 2024

Zz-ww / SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形，设置面部区域可配置的增强方式进行合成唇形（人脸）区域画面增强，提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧，补充帧间合成唇形的动作过渡，使合成的唇形更为流畅、真实以及自然。

Python 1,852 318 Updated Jun 4, 2023