KevinWang676

Kevin Wang KevinWang676

Speech Synthesis, Video Generation, Diffusion Models, and LLMs

233 followers · 34 following

Achievements

Stars

skirdey / voicerestore

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

Python 56 7 Updated Oct 5, 2024

myshell-ai / DreamVoice

Python 59 6 Updated Aug 26, 2024

liutaocode / TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 235 20 Updated Oct 8, 2024

kyutai-labs / moshi

Python 6,086 454 Updated Oct 4, 2024

drc-cs / FALL24-MSAI339

Python 1 Updated Oct 3, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 233 13 Updated Sep 25, 2024

Plachtaa / seed-vc

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 303 34 Updated Oct 8, 2024

360CVGroup / FancyVideo

This is the official reproduction of FancyVideo.

Python 580 72 Updated Sep 12, 2024

zhayujie / chatgpt-on-wechat

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

Python 30,335 7,971 Updated Sep 26, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,415 980 Updated Oct 5, 2024

hustvl / EVF-SAM

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 285 13 Updated Oct 7, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 1,330 135 Updated Jun 21, 2024

HKoon / ChatTTS-OpenVoice

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

Python 320 42 Updated Jul 10, 2024

daswer123 / rvc-python

Using RVC via console or python scripts

Python 60 18 Updated Sep 16, 2024

ZhengPeng7 / BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 1,098 85 Updated Oct 7, 2024

SakanaAI / AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,735 1,057 Updated Sep 10, 2024

upstash / wikipedia-semantic-search

Semantic Search on Wikipedia with Upstash Vector

TypeScript 415 34 Updated Aug 28, 2024

MC-E / ReVideo

Python 313 8 Updated Sep 26, 2024

Vchitect / Vlogger

[CVPR2024] Make Your Dream A Vlog

Python 411 42 Updated Mar 19, 2024

AlonzoLeeeooo / awesome-video-generation

A collection of awesome video generation studies.

TeX 296 8 Updated Oct 7, 2024

fallenshock / Slicedit

Official implementation of "Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices" (ICML 2024).

Python 44 4 Updated Aug 1, 2024

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,238 193 Updated Sep 21, 2024

lightvector / KataGo

GTP engine and self-play learning in Go

C++ 3,500 564 Updated Oct 8, 2024

ComfyWorkflows / ComfyUI-Launcher

Run any ComfyUI workflow w/ ZERO setup.

TypeScript 543 61 Updated Aug 18, 2024

comfy-deploy / comfyui-deploy-gradio-demo

Gradio Demo for ComfyDeploy

Python 45 2 Updated Aug 10, 2024

ltdrdata / ComfyUI-Manager

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

JavaScript 6,307 791 Updated Oct 8, 2024

zhulu111 / ComfyUI_Bxb

SD变现宝：一键把comfyui工作流转换成小程序。

Python 1,071 127 Updated Oct 3, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,128 69 Updated Aug 13, 2024

iqiancheng / fastisslow

fast is slow. (欲速不达) 段永平投资理念

HTML 24 10 Updated Feb 4, 2021

InternLM / MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,800 478 Updated Sep 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kevin Wang KevinWang676

Achievements

Achievements

Block or report KevinWang676

Stars

skirdey / voicerestore

myshell-ai / DreamVoice

liutaocode / TTS-arxiv-daily

kyutai-labs / moshi

drc-cs / FALL24-MSAI339

FireRedTeam / FireRedTTS

Plachtaa / seed-vc

360CVGroup / FancyVideo

zhayujie / chatgpt-on-wechat

facebookresearch / sam2

hustvl / EVF-SAM

resemble-ai / resemble-enhance

HKoon / ChatTTS-OpenVoice

daswer123 / rvc-python

ZhengPeng7 / BiRefNet

SakanaAI / AI-Scientist

upstash / wikipedia-semantic-search

MC-E / ReVideo

Vchitect / Vlogger

AlonzoLeeeooo / awesome-video-generation

fallenshock / Slicedit

showlab / Awesome-Video-Diffusion

lightvector / KataGo

ComfyWorkflows / ComfyUI-Launcher

comfy-deploy / comfyui-deploy-gradio-demo

ltdrdata / ComfyUI-Manager

zhulu111 / ComfyUI_Bxb

QwenLM / Qwen2-Audio

iqiancheng / fastisslow

InternLM / MindSearch