Skip to content
View ShomyLiu's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ShomyLiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)

Python 2,497 241 Updated Nov 7, 2024

Ring attention implementation with flash attention

Python 578 45 Updated Oct 30, 2024

The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"

Python 50 2 Updated Apr 22, 2024

使用task->push实现回复大消息的示例

C++ 1 Updated Nov 3, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 476 53 Updated Nov 5, 2024

DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financ…

Python 594 71 Updated Nov 1, 2023

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 693 46 Updated Oct 24, 2024

Robust recipes to align language models with human and AI preferences

Python 4,661 406 Updated Oct 7, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 701 47 Updated Nov 4, 2024

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 349 24 Updated Nov 1, 2024

Structured Text Generation

Python 9,230 471 Updated Nov 6, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,920 482 Updated Nov 7, 2024

Official inference library for Mistral models

Jupyter Notebook 9,703 860 Updated Oct 16, 2024

Mamba SSM architecture

Python 13,111 1,114 Updated Nov 5, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,141 67 Updated Oct 14, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,596 975 Updated Nov 6, 2024

轩辕:度小满中文金融对话大模型

Python 1,061 95 Updated Sep 26, 2024

The Bytepiece Tokenizer Implemented in Rust.

Rust 14 Updated Nov 28, 2023

Minimalist ML framework for Rust

Rust 15,789 949 Updated Nov 5, 2024
Rust 281 34 Updated Jul 25, 2024

Open source Python library for converting PDF to DOCX.

Python 2,580 376 Updated Sep 23, 2024

Code used for sourcing and cleaning the BigScience ROOTS corpus

Jupyter Notebook 305 40 Updated Mar 20, 2023

Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.

Python 30 12 Updated Jun 12, 2023

Fast and memory-efficient exact attention

Python 14,093 1,314 Updated Nov 7, 2024

Large Language Model Text Generation Inference

Python 9,012 1,060 Updated Nov 7, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 29,740 4,486 Updated Nov 7, 2024

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,149 381 Updated Sep 29, 2023

Copy large files to multiple machines

C++ 3 Updated Mar 24, 2024

Coroutine C++ Workflow based on C++ 20

C++ 59 4 Updated Oct 28, 2024

A framework for few-shot evaluation of language models.

Python 6,900 1,841 Updated Nov 7, 2024
Next