Skip to content
View KevinWang676's full-sized avatar

Block or report KevinWang676

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration

Python 56 7 Updated Oct 5, 2024
Python 59 6 Updated Aug 26, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 235 20 Updated Oct 8, 2024
Python 6,086 454 Updated Oct 4, 2024
Python 1 Updated Oct 3, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 233 13 Updated Sep 25, 2024

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning

Python 303 34 Updated Oct 8, 2024

This is the official reproduction of FancyVideo.

Python 580 72 Updated Sep 12, 2024

基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。

Python 30,335 7,971 Updated Sep 26, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,415 980 Updated Oct 5, 2024

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 285 13 Updated Oct 7, 2024

AI powered speech denoising and enhancement

Python 1,330 135 Updated Jun 21, 2024

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

Python 320 42 Updated Jul 10, 2024

Using RVC via console or python scripts

Python 60 18 Updated Sep 16, 2024

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 1,098 85 Updated Oct 7, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,735 1,057 Updated Sep 10, 2024

Semantic Search on Wikipedia with Upstash Vector

TypeScript 415 34 Updated Aug 28, 2024
Python 313 8 Updated Sep 26, 2024

[CVPR2024] Make Your Dream A Vlog

Python 411 42 Updated Mar 19, 2024

A collection of awesome video generation studies.

TeX 296 8 Updated Oct 7, 2024

Official implementation of "Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices" (ICML 2024).

Python 44 4 Updated Aug 1, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,238 193 Updated Sep 21, 2024

GTP engine and self-play learning in Go

C++ 3,500 564 Updated Oct 8, 2024

Run any ComfyUI workflow w/ ZERO setup.

TypeScript 543 61 Updated Aug 18, 2024

Gradio Demo for ComfyDeploy

Python 45 2 Updated Aug 10, 2024

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

JavaScript 6,307 791 Updated Oct 8, 2024

SD变现宝:一键把comfyui工作流转换成小程序。

Python 1,071 127 Updated Oct 3, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,128 69 Updated Aug 13, 2024

fast is slow. (欲速不达) 段永平投资理念

HTML 24 10 Updated Feb 4, 2021

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,800 478 Updated Sep 25, 2024
Next