Stars
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive evaluation benchmark for long-context language models
A series of large language models trained from scratch by developers @01-ai
OpenChat: Advancing Open-source Language Models with Imperfect Data
ChatGLM3 series: open bilingual chat LLMs
A Chinese-language introductory tutorial for LangChain
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Firefly: a large-model training toolkit, supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Example models using DeepSpeed
A 13B large language model developed by Baichuan Intelligent Technology
ChatGLM2-6B: an open bilingual chat LLM
ParroT: a framework for enhancing and regulating translation abilities in chat, built on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) together with human-written translation and evaluation data
A preliminary evaluation of ChatGPT/GPT-4 for machine translation.
This project performs fast encoding detection and conversion on large numbers of text files, to assist data cleaning for the MNBVC corpus project
MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian" internet slang. It includes plain-text Chinese data of every kind: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki articles, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more
Chinese NLP solutions (large models, data, models, training, inference)
ChatGLM-6B: an open bilingual dialogue language model
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A benchmark for the task of translation suggestion
Optimized primitives for collective multi-GPU communication