- Shanghai
- https://snakehacker.cmsr.cloud/
Stars
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Machine Learning From Scratch. Bare bones NumPy implementations of machine learning models and algorithms with a focus on accessibility. Aims to cover everything from linear regression to deep lear…
Python bindings for the Chromium Embedded Framework (CEF)
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
AI Chinese prompt handbook: ChatGPT Chinese prompt handbook (the Prompt Bible), compiled by K-Render
🥣 Visual prompt editor for AIGC | OPS | Open Prompt Studio
Unofficial bilingual (Chinese and English) subtitles for "ChatGPT Prompt Engineering for Developers"
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
3D Slicer Plugin for Segment anything in medical images
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Segment Anything in Medical Images
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
DeepSeek-VL: Towards Real-World Vision-Language Understanding
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Open standard for machine learning interoperability
Open-sourced dialogue foundation model for Chemistry and molecule science
Real time interactive streaming digital human
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A modular graph-based Retrieval-Augmented Generation (RAG) system
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation