Starred repositories
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
Segment Anything for Stable Diffusion WebUI
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
A Survey on Text-to-Video Generation/Synthesis.
Generative Models by Stability AI
An arbitrary face-swapping framework on images and videos with one single trained model!
Real-time face swap for PC streaming or video calls
Industry leading face manipulation platform
Effortless data labeling with AI support from Segment Anything and other awesome models.
Booru style tag autocompletion for AUTOMATIC1111's Stable Diffusion web UI
WebUI extension for ControlNet
Nightly release of ControlNet 1.1
✨✨Latest Advances on Multimodal Large Language Models
AI based multi-label girl image classification system, implemented by using TensorFlow.
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
An Autonomous LLM Agent for Complex Task Solving
High-Resolution Image Synthesis with Latent Diffusion Models
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
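This project builds on SadTalkers to implement Wav2lip for video lip synthesis. It takes a video file as input and generates speech-driven lip shapes, with a configurable enhancement of the face region that sharpens the synthesized lip (face) area. It uses DAIN, a deep-learning frame-interpolation algorithm, to add frames to the generated video and fill in the motion transitions between synthesized lip frames, so the result looks smoother, more realistic, and more natural.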
Extension to edit dataset captions for SD web UI by AUTOMATIC1111
Labeling extension for Automatic1111's Web UI
picobyte / stable-diffusion-webui-wd14-tagger
Forked from toriato/stable-diffusion-webui-wd14-tagger
Labeling extension for Automatic1111's Web UI
📷 EasyPhoto | Your Smart AI Photo Generator.
FaceChain is a deep-learning toolchain for generating your Digital-Twin.