Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,133 503 Updated Sep 18, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,785 1,533 Updated Oct 1, 2024

An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.

Python 14,000 1,809 Updated Jul 3, 2024

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,177 252 Updated Sep 30, 2024

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools

Python 2,495 447 Updated Sep 30, 2024

Dockerfiles and scripts for ONNX container images

Jupyter Notebook 133 95 Updated Aug 17, 2022

Software Engineering for AI/ML -- An Annotated Bibliography

301 31 Updated Jul 16, 2024