-
SUSTech
- Shenzhen
Block or Report
Block or report Zoky-2020
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
[CVPR 2023 Highlight] Official implementation of the paper: "AltFreezing for More General Video Face Forgery Detection"
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Official Implementation of Safe Latent Diffusion for Text2Image
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
Real-time face swap for PC streaming or video calls
Official PyTorch repo for JoJoGAN: One Shot Face Stylization
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Universal and Transferable Attacks on Aligned Language Models
A unified evaluation framework for large language models
Open-Sora: Democratizing Efficient Video Production for All
[CVPR 2024🔥] EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
[ICLR 2024 Spotlight] "Negative Label Guided OOD Detection with Pretrained Vision-Language Models"
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
✨✨Latest Advances on Multimodal Large Language Models
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)