Here are
6 public repositories
matching this topic...
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Updated
Jul 18, 2024
Python
Low Precision Arithmetic Simulation in PyTorch
Updated
May 20, 2024
Python
A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation
Updated
Jul 14, 2017
Python
Low Precision(quantized) Yolov5
Updated
Jan 28, 2024
Python
Code for DNN feature map compression paper
JAX Scalify: end-to-end scaled arithmetics
Updated
Jul 17, 2024
Python
Improve this page
Add a description, image, and links to the
low-precision
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
low-precision
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.