Stars
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Official implementation of Half-Quadratic Quantization (HQQ)
Run Mixtral-8x7B models in Colab or consumer desktops
A framework for few-shot evaluation of language models.
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
[ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
Official PyTorch implementation of the ICLR 2024 paper "Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs".
Running large language models on a single GPU for throughput-oriented scenarios.
TensorFlow code and pre-trained models for BERT
An annotated implementation of the Transformer paper.
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama, supporting local-knowledge-based LLM question answering.
Official inference library for Mistral models
DNN quantization with outlier channel splitting
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
4-bit quantization of LLaMA using GPTQ.
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
A curated list for Efficient Large Language Models
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
Deep learning for image processing, including classification, object detection, and more.
The official PyTorch implementation of the NeurIPS 2022 (spotlight) paper "Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models".
Accessible large language models via k-bit quantization for PyTorch.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
This repository contains integer operators on GPUs for PyTorch.