Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A etc.

C++ 915 141 Updated Jul 11, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 29,612 3,423 Updated Jul 23, 2024

comfyanonymous / ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Python 43,007 4,535 Updated Jul 23, 2024

HumanAIGC / AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,154 946 Updated Jun 17, 2024

ImadSaddik / DoCamp

Java 5 2 Updated Jun 10, 2024

krishnaik06 / Updated-Langchain

Jupyter Notebook 110 122 Updated May 16, 2024

objectbox / objectbox-java

Android Database - first and fast, lightweight on-device vector database

Java 4,356 301 Updated Jun 3, 2024

tursodatabase / libsql

libSQL is a fork of SQLite that is both Open Source, and Open Contributions.

C 8,807 236 Updated Jul 22, 2024

asg017 / sqlite-vss

A SQLite extension for efficient vector search, based on Faiss!

C++ 1,611 59 Updated May 5, 2024

asg017 / sqlite-vec

Work-in-progress vector search SQLite extension that runs anywhere.

C 1,100 18 Updated Jul 21, 2024

FoloToy / folotoy-server-self-hosting

Config files for self-hosting the FoloToy Server. Documents: https://docs.folotoy.com

Dockerfile 419 77 Updated Jun 26, 2024

allwefantasy / BYZER-RETRIEVAL

Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system supports both BM25 retrieval algorithm and vector retrieval al…

Java 42 7 Updated Feb 27, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 38,561 5,267 Updated Jul 23, 2024

anothermartz / Easy-Wav2Lip

Forked from GucciFlipFlops1917/wav2lip-hq-updated-ESRGAN

Colab for making Wav2Lip high quality and easy to use

Jupyter Notebook 527 79 Updated May 17, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 10,678 1,812 Updated Jul 23, 2024

Ikaros-521 / AI-Vtuber

Forked from Bluecat7417/AI-Vtuber

AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】直播中与观众实时互动或直接在本地进行聊…

Python 2,507 384 Updated Jul 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

坠飘尘 zhuipiaochen

Block or report zhuipiaochen

Stars

quic / qidk

quic / aimet-model-zoo

styler00dollar / VSGAN-tensorrt-docker

TencentARC / GFPGAN

XPixelGroup / HAT

k2-fsa / sherpa-ncnn

RVC-Boss / GPT-SoVITS

comfyanonymous / ComfyUI

HumanAIGC / AnimateAnyone

ImadSaddik / DoCamp

krishnaik06 / Updated-Langchain

objectbox / objectbox-java

tursodatabase / libsql

asg017 / sqlite-vss

asg017 / sqlite-vec

FoloToy / folotoy-server-self-hosting

allwefantasy / BYZER-RETRIEVAL

langgenius / dify

anothermartz / Easy-Wav2Lip

PaddlePaddle / PaddleSpeech

Ikaros-521 / AI-Vtuber

PeterH0323 / Streamer-Sales

EdVince / Stable-Diffusion-NCNN

agiresearch / AIOS

philipturner / metal-flash-attention

X-PLUG / MobileAgent

mnotgod96 / AppAgent

SJTU-IPADS / PowerInfer

bentoml / OpenLLM

nomic-ai / gpt4all