Starred repositories
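Official PyTorch tutorials in Chinese, covering the 60-minute quick-start tutorial, advanced tutorials, computer vision, natural language processing, generative adversarial networks, and reinforcement learning. Stars and forks are welcome!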
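easy-bert is a Chinese NLP tool that provides ways to call and tune many BERT variants, so you can get started quickly; its clear design and code comments also make it well suited for learning.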
Code for "SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling"
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Bi-LSTM+CRF sequence labeling model implemented in PyTorch
A framework for few-shot evaluation of language models.
Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
CMMLU: Measuring massive multitask language understanding in Chinese
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
OpenChat: Advancing Open-source Language Models with Imperfect Data
Low-level unprivileged sandboxing tool used by Flatpak and similar projects
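A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, commonly followed media accounts, GPU recommendations, and more. Continuously updated.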
A review of the top solutions from all NLP competitions, focused exclusively on NLP competitions. Continuously updated!
SMSBoom - Deprecated: Due to judicial reasons, the repository has been suspended!
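Latest 2023 roundup of technical interview questions and answers from Alibaba, Tencent, Baidu, Meituan, Toutiao, and others, with collected analysis from expert question setters.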
Latest 2021 roundup: from programmer to CTO, from professional to outstanding, sharing internal PDFs and slide decks from leading companies.
📖 A curated list of LegalNLP resources from all around the web.
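Crime assistant including crime type prediction and a crime consultation service based on NLP methods and a crime knowledge graph. A legal-intelligence project on criminal charges that includes a knowledge graph of 856 charges, charge prediction based on a training corpus of 2.8 million charge records, and 13-category question classification plus legal information Q&A based on 200,000 legal Q&A pairs.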
Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.
The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia tex…
A Multi-Turn Dialogue Corpus based on Alpaca Instructions
Repository for organizing datasets and papers used in Open LLM.
A quick guide (especially) for trending instruction finetuning datasets
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output