phlahut

phl phlahut

1 follower · 9 following

www.devphlahut.com

Starred repositories

6drf21e / ChatTTS_colab

🚀 一键部署（含离线整合包）！基于 ChatTTS ，支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用，无需复杂安装。

Python 2,020 254 Updated Jul 2, 2024

libukai / Awesome-ChatTTS

官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,144 75 Updated Jul 3, 2024

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 5,115 413 Updated Oct 19, 2024

CrazyBoyM / phi3-Chinese

Phi3 中文仓库

Python 316 19 Updated Apr 25, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,423 151 Updated Sep 24, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 7,872 756 Updated Oct 16, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,413 1,032 Updated Oct 11, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 3,082 283 Updated Oct 18, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,937 262 Updated Oct 16, 2024

collabora / WhisperFusion

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.

Python 1,521 107 Updated Jul 31, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,950 238 Updated Oct 16, 2024

collabora / WhisperLive

A nearly-live implementation of OpenAI's Whisper.

Python 1,929 263 Updated Sep 20, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,954 1,002 Updated Aug 21, 2024

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,198 412 Updated Oct 12, 2024

sanowl / LSLM-Listening-while-Speaking-Language-Model

LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…

Python 37 3 Updated Oct 2, 2024

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,267 345 Updated Oct 16, 2024

0x5446 / api4sensevoice

API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.

Python 182 27 Updated Oct 14, 2024

devopvoid / webrtc-java

WebRTC for desktop platforms running Java

C++ 259 60 Updated Sep 27, 2024

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,260 84 Updated Jul 22, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,158 73 Updated Aug 13, 2024

livekit / agents

Build real-time multimodal AI applications 🤖🎙️📹

Python 3,563 349 Updated Oct 19, 2024

pipecat-ai / pipecat

Open Source framework for voice and multimodal conversational AI

Python 3,257 297 Updated Oct 20, 2024

pipecat-ai / rtvi-web-demo

Example UI implementing the RTVI web client

TypeScript 472 68 Updated Oct 10, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,619 3,442 Updated Oct 17, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,458 403 Updated Oct 20, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,766 452 Updated Sep 19, 2024

brunodev85 / winlator

Android application for running Windows applications with Wine and Box86/Box64

C 8,566 409 Updated Sep 27, 2024

zhourax / VEGA

Python 33 2 Updated Jul 9, 2024

Kedreamix / Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 1,909 312 Updated Sep 27, 2024

redoriental / JC-WEBTRANSPORT-JNI-LIB

The first fully developed Java webtransport(webTransport) server

Java 2 Updated Mar 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly