Skip to content
View BastianChen's full-sized avatar
  • Tencent
  • Beijing, China

Block or report BastianChen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Recipes to train reward model for RLHF.

Python 734 62 Updated Sep 23, 2024

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,547 273 Updated Oct 1, 2024

how to optimize some algorithm in cuda.

Cuda 1,493 122 Updated Oct 9, 2024

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,674 1,148 Updated Oct 6, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,686 204 Updated Sep 21, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,407 184 Updated Oct 10, 2024

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,444 136 Updated May 9, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,115 4,156 Updated Oct 10, 2024

A large-scale, fine-grained, diverse preference dataset (and models).

Python 304 16 Updated Dec 29, 2023

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…

Python 12,019 2,928 Updated Oct 10, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,811 398 Updated Oct 6, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 32,151 3,937 Updated Oct 8, 2024

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程

Jupyter Notebook 8,315 993 Updated Sep 29, 2024

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,209 163 Updated Jul 25, 2023

总结Prompt&LLM论文,开源数据&模型,AIGC应用

2,615 265 Updated Sep 30, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,796 1,355 Updated Sep 15, 2024
Jupyter Notebook 27 13 Updated Jan 26, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 30,840 3,595 Updated Oct 9, 2024

一个基于langchain实现RAG的简单示例

Jupyter Notebook 264 42 Updated Oct 7, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 93,277 15,005 Updated Oct 9, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

3,476 144 Updated Sep 25, 2024

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python 1,589 143 Updated Sep 23, 2024

Official code of Remote Sensing Mamba

Python 232 14 Updated Apr 25, 2024

包含程序员面试大厂面试题和面试经验

89 13 Updated Aug 20, 2024

Awesome Papers related to Mamba.

1,113 61 Updated Sep 10, 2024

Inference code for LLaMA models

Python 104 26 Updated Aug 13, 2023

Python code for handling the Clotho dataset.

Python 74 15 Updated Nov 24, 2020

Source code for the paper 'Audio Captioning Transformer'

Jupyter Notebook 48 3 Updated Jan 18, 2022

semantic segmentation pytorch 语义分割

Python 118 30 Updated May 10, 2021

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 9,494 928 Updated Sep 22, 2024
Next