Skip to content
View 34127chi's full-sized avatar
  • Ningbo University
  • Hangzhou

Block or report 34127chi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 2,818 173 Updated Nov 1, 2024

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Python 2,696 309 Updated Oct 29, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,619 1,816 Updated Nov 1, 2024

Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.

Python 14,409 2,005 Updated Oct 31, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 18,336 1,409 Updated Nov 1, 2024

Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Python 3,376 335 Updated Oct 8, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 50,292 7,204 Updated Nov 1, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,088 450 Updated Oct 10, 2024

LlamaIndex is a data framework for your LLM applications

Python 36,428 5,201 Updated Oct 31, 2024

High accuracy RAG for answering questions from scientific documents with citations

Python 6,314 596 Updated Oct 30, 2024
Python 12 1 Updated Sep 26, 2024

A list of awesome papers and resources of recommender system on large language model (LLM).

1,307 110 Updated Aug 15, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 33,370 4,107 Updated Oct 30, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

15,732 1,455 Updated Sep 19, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,661 1,966 Updated Sep 26, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 94,192 15,226 Updated Oct 31, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,283 4,088 Updated Nov 1, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,342 1,867 Updated Apr 30, 2024

中文法律对话语言模型

Python 1,050 116 Updated May 13, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,615 5,210 Updated Jun 27, 2024

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。

Python 932 108 Updated Sep 14, 2024

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,059 3,688 Updated Jul 4, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,275 1,605 Updated Oct 30, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,494 4,045 Updated Jul 17, 2024

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

52,621 13,523 Updated Jul 30, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,040 3,228 Updated Aug 17, 2024

A curated list of research papers in Sentence Reprsentation Learning and a sts leaderboard of sentence embeddings.

299 23 Updated Oct 18, 2023

MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction"

Python 503 64 Updated Jun 9, 2023

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,298 3,378 Updated Oct 31, 2024

A system for quickly generating training data with weak supervision

Python 5,806 858 Updated May 2, 2024
Next