Starred repositories
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
State-of-the-art zero-shot voice conversion & singing voice conversion with in-context learning
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simpl…
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
An open-source RAG-based tool for chatting with your documents.
A High-Quality Real Time Upscaler for Anime Video
A machine learning-based lossless video super resolution framework. Est. Hack the Valley II, 2018.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Generative models for conditional audio generation
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
Colab for making Wav2Lip high quality and easy to use
Bringing Old Photos Back to Life (CVPR 2020 oral)
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
Real-time face swap and one-click video deepfake with only a single image
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
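Real-time interactive streaming digital human
Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.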
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Easily train a good VC model with voice data <= 10 mins!
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
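2024 recommendations for VPN and censorship-circumvention software in China, with pitfalls to avoid; stable and easy to use. Compares SSR subscription services, Lantern, V2ray, Laowang VPN, self-hosted VPS proxies, and other circumvention tools. The latest recommended circumvention VPN downloads for China, including access to ChatGPT.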