Starred repositories
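A practical interactive interface for GPT/GLM and other large language models, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons & function plugins, project analysis & self-translation for Python, C++, and other projects, PDF/LaTeX paper translation & summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…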
A natural language interface for computers
Convert PDF to markdown quickly with high accuracy
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
ChatGLM3 series: Open Bilingual Chat LLMs
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
A framework to enable multimodal models to operate a computer.
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
Superduper: build end-2-end AI applications and templates using your existing data infrastructure and tools of choice
16-bit CPU for Excel, and related files
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
General technology for enabling AI capabilities w/ LLMs and MLLMs
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Compositional Differentiable Programming Library
llama.cpp with the BakLLaVA model describes what it sees
Beyond Language Models: Byte Models are Digital World Simulators
Demo of an AI chatbot that predicts the user's message to generate a response quickly.
A demonstration of predictive text without an LLM, using permy.link