Starred repositories
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust)
A Python library for adversarial machine learning focusing on benchmarking adversarial robustness.
A feature-rich command-line audio/video downloader
Android-in-Docker solution with noVNC support and video recording
A high-throughput and memory-efficient inference and serving engine for LLMs
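🚀 A knowledge-base question-answering system built on large language models (LLMs). Works out of the box, is model-neutral, supports flexible orchestration, and can be quickly embedded into third-party business systems.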
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
A generative speech model for daily dialogue.
Foundational Models for State-of-the-Art Speech and Text Translation
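🚀 KIMI AI long-context LLM reverse-engineered API for free-use testing [specialty: reading and summarizing long texts]. Supports high-speed streaming output, agent conversations, web search, long-document interpretation, image OCR, and multi-turn dialogue, with zero-configuration deployment, multi-token support, and automatic cleanup of session traces.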
Robust Speech Recognition via Large-Scale Weak Supervision
🔍 AI search engine - self-host with local or cloud LLMs
drizzleDumper is an Android unpacking tool based on memory search.
🔥 Proxy is a high-performance HTTP(S), SOCKS5, WebSocket, TCP, and UDP proxy server implemented in Go. It now supports chain-style proxies, NAT forwarding across different LANs, TCP/UDP po…
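AIdea is an all-in-one app that supports GPT and Chinese large language models such as Tongyi Qianwen and Wenxin Yiyan, along with Stable Diffusion text-to-image and image-to-image generation, SDXL 1.0, super-resolution, and image colorization.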
langchain-ChatGLM: ChatGLM question answering over a local knowledge base, built with langchain
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
Building a quick conversation-based search demo with Lepton AI.
🤯 Lobe Chat - an open-source, modern-design LLM/AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity), Multi-Modals (Vision…
Local & Open Source Alternative to CharacterAI
vits2 backbone with multilingual-bert
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…