Starred repositories
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and LLaMA to correction scenarios and works out of the box.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
The OpenAIOS vGPU device plugin for Kubernetes originated in the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical ca…
JSON formatter for the standard Python logger
RAGChecker: A Fine-grained Framework For Diagnosing RAG
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
The ApolloScape Open Dataset for Autonomous Driving and its Application.
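Install a K8S cluster with Ansible scripts, with an introduction to how the components interact; simple and direct, and unaffected by network conditions in mainland China.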
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Open source annotation tool for machine learning practitioners.
Label, clean and enrich text datasets with LLMs.
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app with one click.
The all-in-one solution for RAG. Build, scale, and deploy state-of-the-art Retrieval-Augmented Generation applications.
LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath
Always know what to expect from your data.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…
LangGPT: Empowering everyone to become a prompt expert! 🚀 Structured Prompt, Language of GPT.
SGLang is a fast serving framework for large language models and vision language models.
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants