Stars

dataset

Chinese datasets for Alpaca-style large language model instruction fine-tuning
12 repositories

Chinese Alpaca instruction fine-tuning dataset

391 24 Updated Mar 26, 2023

A manually refined Chinese dialogue dataset, plus fine-tuning code for ChatGLM

Jupyter Notebook 1,143 95 Updated May 6, 2024

Text summarization toolkit for Chinese natural language generation (NLG), with corpus data. Extractive text summarization with Lead3, keyword, textrank, text teaser, word significance, LDA, LSI, and NMF. (graph, feature, topic model, summarize to…

Python 404 53 Updated Jun 17, 2024

Instruction Tuning with GPT-4

HTML 4,179 302 Updated Jun 11, 2023

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

HTML 7,851 753 Updated Mar 15, 2024

CCKS 2021: "SGSum: A Human-Annotated Dataset for Sports Game Summarization"

24 6 Updated Dec 26, 2021

Large Scale Chinese Corpus for NLP

9,437 1,542 Updated May 23, 2024

CamelBell (驼铃) is a Chinese language tuning project based on LoRA. CamelBell belongs to Project Luotuo (骆驼), an open-source Chinese LLM project created by 冷子昂 @ SenseTime & 陈启源 @ Central China Normal University & 李鲁鲁 @ SenseTime

Jupyter Notebook 171 18 Updated Dec 21, 2023

A Chinese Long Text Summarization Dataset

63 6 Updated Jul 29, 2022

Used for adaptive human in the loop evaluation of language and embedding models.

Python 301 24 Updated Mar 1, 2023

Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"

Jupyter Notebook 109 20 Updated Aug 29, 2023

A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

Python 104 11 Updated Feb 5, 2024