ModelTC
Model Infra
Pinned
Repositories
Showing 10 of 35 repositories
- llmc Public
This is the official PyTorch implementation of "LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models", and also an efficient LLM compression tool with various advanced compression methods, supporting multiple inference backends.
ModelTC/llmc’s past year of commit activity - L2_Compression Public
ModelTC/L2_Compression’s past year of commit activity