Skip to content
View QingqingSun-Bao's full-sized avatar
Block or Report

Block or report QingqingSun-Bao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

文本聚类(Kmeans、DBSCAN、LDA、Single-pass)

Python 322 86 Updated May 12, 2021

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,501 264 Updated Aug 9, 2024

[ACL 2024] Progressive LLaMA with Block Expansion.

Python 457 34 Updated May 20, 2024

Official github repo for AutoDetect, an automated weakness detection framework for LLMs.

Python 35 1 Updated Jun 25, 2024

Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension

Python 165 23 Updated Apr 20, 2022

轩辕:度小满中文金融对话大模型

Python 985 86 Updated Aug 12, 2024

🩺 首个会看胸部X光片的中文多模态医学大模型 | The first Chinese Medical Multimodal Model that Chest Radiographs Summarization.

Python 866 121 Updated Sep 15, 2023

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 14,376 949 Updated Aug 17, 2024

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,577 72 Updated Oct 26, 2023

CMMLU: Measuring massive multitask language understanding in Chinese

Python 651 44 Updated Aug 14, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 2,262 139 Updated Aug 15, 2024

Mixture-of-Experts (MoE) Language Model

Python 173 38 Updated Jul 17, 2024

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 253 19 Updated Aug 14, 2024

Alpaca dataset from Stanford, cleaned and curated

Python 1,474 145 Updated Apr 14, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,793 338 Updated Aug 14, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,579 290 Updated Aug 10, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

Python 8,257 1,372 Updated Aug 17, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,256 122 Updated Aug 10, 2024

Ongoing research training transformer models at scale

Python 9,694 2,184 Updated Aug 16, 2024

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 74,700 6,286 Updated Aug 17, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,935 865 Updated Aug 14, 2024
Python 19 2 Updated Jun 20, 2019

Official code for ICLR 2022 paper: "PoNet: Pooling Network for Efficient Token Mixing in Long Sequences".

Python 31 6 Updated May 23, 2023

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,017 248 Updated Mar 18, 2024

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python 1,745 255 Updated Jun 12, 2023

This repo contains our ACL 2017 paper data and source code

Python 719 190 Updated Sep 15, 2020

A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

Python 97 11 Updated Feb 5, 2024

This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.

Python 310 50 Updated Apr 24, 2024

Zstandard - Fast real-time compression algorithm

C 23,019 2,055 Updated Aug 13, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 602 82 Updated Aug 8, 2024
Next