Skip to content
View alukanlp's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report alukanlp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 347 14 Updated Jul 9, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,614 471 Updated Sep 23, 2024

Cool Papers - Immersive Paper Discovery

HTML 359 5 Updated Sep 11, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,229 399 Updated Sep 13, 2024

中国大模型

5,332 437 Updated Jun 7, 2024

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Python 13,364 1,551 Updated Jul 10, 2024
Python 293 23 Updated Apr 6, 2023

LangChain 的中文入门教程

7,354 595 Updated Aug 11, 2024

快速搭建一个自己的VPN翻墙科学上网

2,985 569 Updated Dec 30, 2018

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,616 3,891 Updated Sep 29, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,687 517 Updated Sep 19, 2024

Example models using DeepSpeed

Python 6,014 1,021 Updated Sep 17, 2024

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,982 237 Updated Sep 6, 2023

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,703 1,852 Updated Jun 27, 2024

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

Python 166 24 Updated Oct 12, 2023

A preliminary evaluation of ChatGPT/GPT-4 for machine translation.

Python 241 16 Updated Nov 3, 2023
Python 11 Updated Jan 30, 2023

天涯 kkndme 神贴聊房价

18,709 3,811 Updated Aug 27, 2023

搜索所有中文NLP数据集,附常用英文NLP数据集

Python 4,105 609 Updated Nov 21, 2022

本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作

Python 52 11 Updated Sep 29, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,411 233 Updated Sep 14, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 2,873 358 Updated Sep 26, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,452 5,193 Updated Jun 27, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,910 4,056 Updated Sep 27, 2024

A benchmark for the task of translation suggestion

Mask 59 25 Updated Jun 23, 2022
Python 22 2 Updated Nov 15, 2022
Python 14 1 Updated Aug 6, 2022

Optimized primitives for collective multi-GPU communication

C++ 3,151 793 Updated Sep 17, 2024
Python 2 Updated Aug 6, 2022
Next