Skip to content
View ymzhang0319's full-sized avatar
🌴
On vacation
🌴
On vacation

Highlights

  • Pro
Block or Report

Block or report ymzhang0319

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
136 stars written in Python
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 65,810 7,724 Updated Aug 8, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 45,579 4,843 Updated Aug 11, 2024

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Python 32,134 3,359 Updated Jul 20, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 31,495 2,345 Updated Aug 10, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,060 4,666 Updated Aug 8, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 30,621 3,523 Updated Aug 10, 2024

Instant voice cloning by MyShell.

Python 27,881 2,722 Updated Jul 23, 2024

one-click face swap

Python 26,017 6,394 Updated Jul 5, 2024

Generative Models by Stability AI

Python 23,728 2,632 Updated Aug 4, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,167 2,014 Updated Aug 9, 2024

Python ProxyPool for web spider

Python 21,120 5,114 Updated Jun 17, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,395 2,054 Updated Jul 18, 2024

WebUI extension for ControlNet

Python 16,665 1,927 Updated Jul 25, 2024

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫

Python 15,738 5,068 Updated Aug 9, 2024

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 15,541 2,420 Updated Jul 26, 2024

Mamba SSM architecture

Python 12,073 1,016 Updated Aug 7, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,102 988 Updated Aug 5, 2024

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 10,672 778 Updated Jul 18, 2024

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,290 1,054 Updated Jun 21, 2024

Official implementation of AnimateDiff.

Python 10,034 817 Updated Jul 31, 2024

ImageBind One Embedding Space to Bind Them All

Python 8,147 741 Updated Jul 31, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,487 745 Updated Feb 11, 2024

A UI-Focused Agent for Windows OS Interaction.

Python 7,448 909 Updated Jul 25, 2024

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,064 1,189 Updated Jul 23, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,652 687 Updated Aug 9, 2024

Enjoy the magic of Diffusion models!

Python 6,108 545 Updated Aug 2, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,843 513 Updated May 31, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,748 393 Updated May 29, 2024

a research paper for generative cartoon interpolation

Python 4,986 410 Updated Jun 1, 2024

Nightly release of ControlNet 1.1

Python 4,573 370 Updated Aug 8, 2024
Next