weiwancheng

weiwancheng

4 followers · 8 following

Stars

dataset

alpaca大模型指令微调中文数据集

12 repositories

carbonz0 / alpaca-chinese-dataset

alpaca中文指令微调数据集

391 24 Updated Mar 26, 2023

hikariming / chat-dataset-baseline

人工精调的中文对话数据集和一段chatglm的微调代码

Jupyter Notebook 1,143 95 Updated May 6, 2024

yongzhuo / nlg-yongzhuo

中文文本生成（NLG）之文本摘要（text summarization）工具包, 语料数据(corpus data), 抽取式摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。（graph，feature，topic model，summarize to…

Python 404 53 Updated Jun 17, 2024

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,179 302 Updated Jun 11, 2023

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 7,851 753 Updated Mar 15, 2024

krystalan / SGSum

CCKS‘2021:《SGSum：一个面向体育赛事摘要的人工标注数据集》

24 6 Updated Dec 26, 2021

brightmart / nlp_chinese_corpus

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,437 1,542 Updated May 23, 2024

LC1332 / CamelBell-Chinese-LoRA

CamelBell（驼铃) is be a Chinese Language Tuning project based on LoRA. CamelBell is belongs to Project Luotuo(骆驼), an open sourced Chinese-LLM project created by 冷子昂 @ 商汤科技 & 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技

Jupyter Notebook 171 18 Updated Dec 21, 2023

lxj5957 / CLTS-Dataset

A Chinese Long Text Summarization Dataset

63 6 Updated Jul 29, 2022

CarperAI / cheese

Used for adaptive human in the loop evaluation of language and embedding models.

Python 301 24 Updated Mar 1, 2023

Yale-LILY / QMSum

Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"

Jupyter Notebook 109 20 Updated Aug 29, 2023

alibaba-damo-academy / SpokenNLP

A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

Python 104 11 Updated Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly