Block or Report
Block or report NinedayWang
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (6)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Medical NLP Competition, dataset, large models, paper
CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT
Home of StarCoder: fine-tuning & inference!
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
A Multi-Turn Dialogue Corpus based on Alpaca Instructions
Fast and memory-efficient exact attention
Using NLP techniques to summarize prompts for program synthesis
A multi-programming language benchmark for evaluating the performance of large language model of code.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
Retrieval and Retrieval-augmented LLMs
📚 Freely available programming books
hbh112233abc / pdfplumber
Forked from jsvine/pdfplumberPlumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
LangChain 的中文入门教程
Official inference library for Mistral models
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Codebase for Merging Language Models (ICML 2024)
Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024)
[ACL 2024 System Demonstration] An Easy-to-use Instruction Processing Framework for LLMs.