open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,815 256 Updated Sep 25, 2024

SJTU-DMTai / Awesome-Large-Models-for-Time-Series

Papers for LLM and foundation models for time series analytics

12 Updated Sep 30, 2024

OpenT2S / LlamaVoice

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Python 216 11 Updated Aug 26, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,346 440 Updated Sep 23, 2024

bytedance / 1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 415 16 Updated Sep 25, 2024

Avaiga / taipy

Turns Data and AI algorithms into production-ready web applications in no time.

Python 13,509 1,483 Updated Oct 12, 2024

h2oai / wave

Realtime Web Apps and Dashboards for Python and R

Python 3,991 326 Updated Oct 3, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

29,300 1,602 Updated Aug 1, 2024

mini-sora / minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,185 149 Updated Oct 8, 2024

yangdongchao / LLM-Codec

The open source code for LLM-Codec

Python 112 4 Updated Aug 18, 2024

xue-fei / uSherpaServer

uSherpaServer 给Unity提供流式语音识别的websocket服务

C# 3 Updated Jun 25, 2024

Lightning-AI / litdata

Transform datasets at scale. Optimize datasets for fast AI model training.

Python 340 39 Updated Oct 12, 2024

Zejun-Yang / AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 4,564 573 Updated Jul 2, 2024

yukara-ikemiya / friendly-stable-audio-tools

Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stability AI.

Python 122 10 Updated Jul 25, 2024

kenjihiranabe / The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 17,834 2,170 Updated Feb 4, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,424 3,412 Updated Oct 10, 2024

ItzCrazyKns / Perplexica

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 13,919 1,336 Updated Oct 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lvzhiqiang

Block or report lvzhiqiang

Lists (1)

aigc

Stars

huggingface / trl

SWivid / F5-TTS

pytorch / executorch

lucidrains / minGRU-pytorch

CaraJ7 / MMSearch

bytedance / paws_room_acoustics_simulator

xingchensong / S3Tokenizer

DrStef / MIMII-Unsupervised-classification-of-valve-sounds

FireRedTeam / FireRedTTS

google / speaker-id

kyutai-labs / moshi

feizc / FluxMusic

wjakob / nanobind

gpt-omni / mini-omni