Starred repositories of phlahut

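🚀 One-click deployment (offline all-in-one package included)! Built on ChatTTS, with support for streaming output, random voice sampling, long-audio generation, and multi-role narration. Easy to use, with no complex installation required.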

Python 2,020 254 Updated Jul 2, 2024

Officially recommended ChatTTS resource collection project, gathering related resources from across the web and frequently asked questions.

1,144 75 Updated Jul 3, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 5,115 413 Updated Oct 19, 2024

Phi3 Chinese repository

Python 316 19 Updated Apr 25, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,423 151 Updated Sep 24, 2024

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)

HTML 7,872 756 Updated Oct 16, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,413 1,032 Updated Oct 11, 2024

Multilingual Voice Understanding Model

Python 3,082 283 Updated Oct 18, 2024

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 2,937 262 Updated Oct 16, 2024

WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.

Python 1,521 107 Updated Jul 31, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,950 238 Updated Oct 16, 2024

A nearly-live implementation of OpenAI's Whisper.

Python 1,929 263 Updated Sep 20, 2024

Faster Whisper transcription with CTranslate2

Python 11,954 1,002 Updated Aug 21, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,198 412 Updated Oct 12, 2024

LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…

Python 37 3 Updated Oct 2, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,267 345 Updated Oct 16, 2024

API and WebSocket server for SenseVoice. It integrates some enhanced features, such as voice activity detection (VAD), real-time streaming recognition, and speaker verification.

Python 182 27 Updated Oct 14, 2024

WebRTC for desktop platforms running Java

C++ 259 60 Updated Sep 27, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,260 84 Updated Jul 22, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,158 73 Updated Aug 13, 2024

Build real-time multimodal AI applications 🤖🎙️📹

Python 3,563 349 Updated Oct 19, 2024

Open Source framework for voice and multimodal conversational AI

Python 3,257 297 Updated Oct 20, 2024

Example UI implementing the RTVI web client

TypeScript 472 68 Updated Oct 10, 2024

A generative speech model for daily dialogue.

Python 31,619 3,442 Updated Oct 17, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,458 403 Updated Oct 20, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 5,766 452 Updated Sep 19, 2024

Android application for running Windows applications with Wine and Box86/Box64

C 8,566 409 Updated Sep 27, 2024

Python 33 2 Updated Jul 9, 2024

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 1,909 312 Updated Sep 27, 2024

The first fully developed Java WebTransport server

Java 2 Updated Mar 21, 2024