-
Zhejiang University
- Shenzhen, Guangdong
- https://www.jianshu.com/u/31c221f09d8a
- in/%E9%92%9F%E8%8E%B9-%E8%8C%B9-8b4732187
- https://music.163.com/#/user/home?id=273265199
Block or Report
Block or report ZillaRU
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (5)
Sort Name ascending (A-Z)
Language
Sort by: Recently starred
Starred repositories
《Machine Learning Systems: Design and Implementation》- Chinese Version
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Making large AI models cheaper, faster and more accessible
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
NVR with realtime local object detection for IP cameras
一款漂亮且功能强大的在线海报设计器,图片编辑器,仿稿定设计,适用于多种场景:海报生成、电商产品图、文章长图、视频/公众号封面等。A beautiful online image designer, suitable for various scenarios like generate posters, making design easier!
Official implementation of "Separate Anything You Describe"
Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆
Suno AI's Bark model in C/C++ for fast text-to-speech
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
🍦 ChatTTS-Forge is a project developed around the TTS generation model ChatTTS, implementing an API Server and a Gradio-based WebUI.
A small package to create visualizations of PyTorch execution graphs
llama3 implementation one matrix multiplication at a time
官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Kaldi-compatible online fbank extractor without external dependencies
A generative speech model for daily dialogue.
【三年面试五年模拟】算法工程师秘籍。AIGC、传统深度学习、自动驾驶、机器学习、计算机视觉、自然语言处理、图像处理、元宇宙、AGI、SLAM等AI行业面试笔试经验分享
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!
zifeng-radxa / FACEXLIB
Forked from xinntao/facexlibFaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.
A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
This repository offers various extension nodes for ComfyUI. Nodes here have different characteristics compared to those in the ComfyUI Impact Pack. The Impact Pack has become too large now...
A minimal GPU design in Verilog to learn how GPUs work from the ground up