ymzhang0319

Follow

🌴

On vacation

Yiming ymzhang0319

🌴

On vacation

Follow

PhD. candidate jointly at USTC and Shanghai AI Laboratory.

27 followers · 42 following

Achievements

Achievements

Highlights

Pro

Block or Report

Block or report ymzhang0319

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Stars

136 stars written in Python

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 65,810 7,724 Updated Aug 8, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 45,579 4,843 Updated Aug 11, 2024

LC044 / WeChatMsg

提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手

Python 32,134 3,359 Updated Jul 20, 2024

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 31,495 2,345 Updated Aug 10, 2024

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,060 4,666 Updated Aug 8, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 30,621 3,523 Updated Aug 10, 2024

myshell-ai / OpenVoice

Instant voice cloning by MyShell.

Python 27,881 2,722 Updated Jul 23, 2024

s0md3v / roop

one-click face swap

Python 26,017 6,394 Updated Jul 5, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 23,728 2,632 Updated Aug 4, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,167 2,014 Updated Aug 9, 2024

jhao104 / proxy_pool

Python ProxyPool for web spider

Python 21,120 5,114 Updated Jun 17, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,395 2,054 Updated Jul 18, 2024

Mikubill / sd-webui-controlnet

WebUI extension for ControlNet

Python 16,665 1,927 Updated Jul 25, 2024

NanmiCoder / MediaCrawler

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫、百度贴吧帖子｜百度贴吧评论回复爬虫

Python 15,738 5,068 Updated Aug 9, 2024

harry0703 / MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 15,541 2,420 Updated Jul 26, 2024

state-spaces / mamba

Mamba SSM architecture

Python 12,073 1,016 Updated Aug 7, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,102 988 Updated Aug 5, 2024

InstantID / InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 10,672 778 Updated Jul 18, 2024

magic-research / magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,290 1,054 Updated Jun 21, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,034 817 Updated Jul 31, 2024

facebookresearch / ImageBind

ImageBind One Embedding Space to Bind Them All

Python 8,147 741 Updated Jul 31, 2024

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,487 745 Updated Feb 11, 2024

microsoft / UFO

A UI-Focused Agent for Windows OS Interaction.

Python 7,448 909 Updated Jul 25, 2024

facebookresearch / mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,064 1,189 Updated Jul 23, 2024

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,652 687 Updated Aug 9, 2024

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 6,108 545 Updated Aug 2, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,843 513 Updated May 31, 2024

THUDM / CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,748 393 Updated May 29, 2024

ToonCrafter / ToonCrafter

a research paper for generative cartoon interpolation

Python 4,986 410 Updated Jun 1, 2024

lllyasviel / ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Python 4,573 370 Updated Aug 8, 2024