- Shanghai, China
- https://ymzhang0319.github.io/
Highlights
- Pro
Block or Report
Block or report ymzhang0319
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language: Python
Sort by: Most stars
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Generative Models by Stability AI
Open-Sora: Democratizing Efficient Video Production for All
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
WebUI extension for ControlNet
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
ImageBind One Embedding Space to Bind Them All
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
A UI-Focused Agent for Windows OS Interaction.
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
ModelScope: bring the notion of Model-as-a-Service to life.
Enjoy the magic of Diffusion models!
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
a state-of-the-art-level open visual language model | 多模态预训练模型
a research paper for generative cartoon interpolation
Nightly release of ControlNet 1.1