Skip to content
View tutuDoki's full-sized avatar
Block or Report

Block or report tutuDoki

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 1,738 118 Updated Jul 12, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 1,445 173 Updated Jun 2, 2024

collection of awesome lists

Python 289 57 Updated Jul 11, 2024

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 990 115 Updated Apr 20, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 26,122 3,248 Updated Jul 12, 2024

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

712 44 Updated May 28, 2024

✯ 一个可直连访问的电视/广播图标库与相关工具项目 ✯ 🔕 永久免费 直连访问 完整开源 不断完善的台标 支持IPv4/IPv6双栈访问 🔕

JavaScript 19,650 2,882 Updated Jul 12, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

13,246 1,222 Updated Jun 8, 2024

Langchain-Chatchat(原Langchain-ChatGLM, Qwen 与 Llama 等)基于 Langchain 与 ChatGLM 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen a…

TypeScript 29,807 5,226 Updated Jul 12, 2024

an intro to retrieval augmented large language model

246 16 Updated Sep 9, 2023

开源社区第一个能下载、能运行的中文 LLaMA2 模型!

Python 2,221 201 Updated Oct 26, 2023

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 12,906 1,175 Updated Jul 12, 2024

中文法律对话语言模型

Python 989 111 Updated May 13, 2024

An open-source educational chat model from ICALK, East China Normal University. 开源中英教育对话大模型。(通用基座模型,GPU部署,数据清理) 致敬: LLaMA, MOSS, BELLE, Ziya, vLLM

Python 649 67 Updated Jun 2, 2024

一本系统地教你将深度学习模型的性能最大化的战术手册。

2,149 193 Updated May 27, 2023

Papers & Works for large languange models (ChatGPT, GPT-3, Codex etc.).

TeX 297 27 Updated Jul 7, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,000 5,155 Updated Jun 27, 2024

广州电信广东IPTV列表(IGMP/RTP组播地址)

PHP 691 152 Updated Jul 11, 2024

Collection of publicly available IPTV channels from all over the world

JavaScript 81,449 2,101 Updated Jul 12, 2024

📃 UAC 白名单小工具!

C# 487 41 Updated Mar 3, 2024

Code for ACL 2019 : Entity-Relation Extraction as Multi-Turn Question Answering

Python 73 27 Updated Jun 12, 2023

计算机自学指南

HTML 52,559 6,447 Updated Jul 3, 2024

⏰ AI conference deadline countdowns

JavaScript 5,484 935 Updated Jul 8, 2024

[ACL2020] Effective Inter-Clause Modeling for End-to-End Emotion-Cause Pair Extraction

Python 56 17 Updated Mar 16, 2022

软件工程课设_后端

2 1 Updated Jun 21, 2021

「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。准备 Java 面试,首选 JavaGuide!

Java 144,833 45,416 Updated Jul 11, 2024

Java后端知识图谱🔥 帮助Java初学者成长

18,711 3,848 Updated May 28, 2024

🥗 All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching.

TypeScript 11,818 716 Updated Apr 7, 2024
Next