Skip to content
View flomok's full-sized avatar

Block or report flomok

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。

Python 52 4 Updated Sep 14, 2024

PyTorch 官方中文教程包含 60 分钟快速入门教程,强化教程,计算机视觉,自然语言处理,生成对抗网络,强化学习。欢迎 Star,Fork!

Python 2,489 701 Updated Feb 15, 2022

easy-bert是一个中文NLP工具,提供诸多bert变体调用和调参方法,极速上手;清晰的设计和代码注释,也很适合学习

Python 72 12 Updated Nov 8, 2022
Python 6 1 Updated Oct 27, 2023

Code for "SLIM: Explicit Slot-Intent Mapping with BERT for Joint Multi-Intent Detection and Slot Filling"

Python 17 4 Updated Nov 22, 2022

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 38,865 4,108 Updated Jul 28, 2024

Bi-LSTM+CRF sequence labeling model implemented in PyTorch

Python 68 21 Updated Nov 29, 2018

Official repository for ORPO

Python 420 38 Updated May 31, 2024

A framework for few-shot evaluation of language models.

Python 6,902 1,842 Updated Nov 7, 2024

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,632 78 Updated Oct 26, 2023

CMMLU: Measuring massive multitask language understanding in Chinese

Python 694 55 Updated Nov 3, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,044 3,229 Updated Aug 17, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,253 398 Updated Sep 13, 2024

Low-level unprivileged sandboxing tool used by Flatpak and similar projects

C 3,950 237 Updated Oct 30, 2024

收录NLP竞赛策略实现、各任务baseline、相关竞赛经验贴(当前赛事、往期赛事、训练赛)、NLP会议时间、常用自媒体、GPU推荐等,持续更新中

Python 2,174 250 Updated Aug 29, 2023

复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!

2,667 419 Updated Oct 9, 2024

SMSBoom - Deprecate: Due to judicial reasons, the repository has been suspended!

Python 15,357 3,677 Updated Mar 20, 2024

2023年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。

Python 36,620 9,447 Updated May 20, 2024

2021年最新总结,从程序员到CTO,从专业走向卓越,分享大牛企业内部pdf与PPT

11,128 3,035 Updated May 20, 2024

📖 A curated list of LegalNLP resources from all around the web.

240 29 Updated Jun 21, 2023

Crime assistant including crime type prediction and crime consult service based on nlp methods and crime kg,罪名法务智能项目,内容包括856项罪名知识图谱, 基于280万罪名训练库的罪名预测,基于20W法务问答对的13类问题分类与法律资讯问答功能.

Python 1,404 383 Updated Dec 5, 2023

Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.

Python 342 48 Updated Dec 8, 2022

The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia tex…

188 17 Updated May 27, 2020

A Multi-Turn Dialogue Corpus based on Alpaca Instructions

Python 164 16 Updated Jun 1, 2023

Repository for organizing datasets and papers used in Open LLM.

89 6 Updated Jul 6, 2023

A quick guide (especially) for trending instruction finetuning datasets

2,599 168 Updated Nov 28, 2023

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,833 525 Updated Oct 24, 2024

MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning

Python 86 5 Updated Aug 15, 2023

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Python 2,511 154 Updated Oct 10, 2024
Next