Block or Report
Block or report lukeewin
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
OpenAI Whisper ASR Webservice API
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
Voice activity detector (VAD) for the browser with a simple API
基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等
ChatGPT Java SDK支持流式输出、Gpt插件、联网。支持OpenAI官方所有接口。ChatGPT的Java客户端。OpenAI GPT-3.5-Turb GPT-4 Api Client for Java
Open AI ChatGPT流式输出。Open AI Stream output. ChatGPT Stream output.GPT-3.5
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
🦜🔗 Build context-aware reasoning applications
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
Netty project - an event-driven asynchronous network application framework
Socket.IO server implemented on Java. Realtime java framework
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
On-device wake word detection powered by deep learning
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
Tools for handling speech data in machine learning projects.
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 serve…
Real time transcription with OpenAI Whisper.
The JAVE (Java Audio Video Encoder) library is Java wrapper on the ffmpeg project
DEPREDICATED: Use https://github.com/Purfview/whisper-standalone-win instead. A method to call openai/whisper python code from the command line without using the CLI version of whisper. [Also conta…
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Deezer source separation library including pretrained models.