Skip to content
View shiquan1988's full-sized avatar

Block or report shiquan1988

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,LLaMA等模型应用在纠错场景,开箱即用。

Python 5,513 1,089 Updated Sep 24, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,868 213 Updated Sep 27, 2024

OpenAIOS vGPU device plugin for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory, in order to allow applications to access larger memory space than its physical ca…

Go 494 93 Updated May 21, 2024

ccache – a fast compiler cache

C++ 2,301 489 Updated Sep 22, 2024

Json Formatter for the standard python logger

Python 1,734 231 Updated Jul 3, 2024

NCCL Tests

Cuda 834 231 Updated Jul 30, 2024

RAGChecker: A Fine-grained Framework For Diagnosing RAG

Python 402 32 Updated Sep 25, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,358 465 Updated Aug 19, 2024

The ApolloScape Open Dataset for Autonomous Driving and its Application.

Jupyter Notebook 561 137 Updated May 1, 2024

使用Ansible脚本安装K8S集群,介绍组件交互原理,方便直接,不受国内网络环境影响

Jinja 10,411 3,502 Updated Aug 3, 2024

The devkit of the nuScenes dataset.

Python 2,250 623 Updated Jul 8, 2024

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 18,375 2,306 Updated Sep 27, 2024

Free RPA tool by AI Singapore

JavaScript 5,595 580 Updated Sep 6, 2024

Open source annotation tool for machine learning practitioners.

Python 9,443 1,720 Updated Sep 18, 2024

Label, clean and enrich text datasets with LLMs.

Python 2,029 139 Updated Sep 27, 2024

Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

Rust 11,856 662 Updated Sep 26, 2024

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

TypeScript 75,302 58,870 Updated Sep 27, 2024

The all-in-one solution for RAG. Build, scale, and deploy state of the art Retrieval-Augmented Generation applications

Python 3,348 247 Updated Sep 28, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,210 715 Updated Aug 5, 2024

Always know what to expect from your data.

Python 9,858 1,523 Updated Sep 27, 2024

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,261 534 Updated Sep 19, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,656 877 Updated Sep 27, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 11,587 1,047 Updated Sep 27, 2024

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Python 8,778 580 Updated Sep 27, 2024

🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your da…

Python 16,914 1,846 Updated Sep 26, 2024

LangGPT: Empowering everyone to become a prompt expert!🚀 Structured Prompt,Language of GPT, 结构化提示词,结构化Prompt

Jupyter Notebook 5,564 480 Updated Sep 15, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,323 383 Updated Sep 27, 2024

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,294 1,276 Updated Aug 31, 2024

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,587 1,384 Updated Jul 31, 2023

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

Python 18,685 4,601 Updated Sep 16, 2024
Next