A curated list of neural network pruning resources.
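As a primer for the pruning resources above, here is a minimal sketch of unstructured magnitude pruning in PyTorch (an illustrative baseline, not the method of any listed repo; the sparsity values are arbitrary):

```python
import torch
import torch.nn as nn

def magnitude_prune(model: nn.Module, sparsity: float = 0.5) -> None:
    """Zero out the smallest-magnitude weights in each Linear/Conv2d layer."""
    for module in model.modules():
        if isinstance(module, (nn.Linear, nn.Conv2d)):
            w = module.weight.data
            k = int(sparsity * w.numel())
            if k == 0:
                continue
            # Threshold is the k-th smallest absolute weight in this layer.
            threshold = w.abs().flatten().kthvalue(k).values
            w.mul_((w.abs() > threshold).float())

model = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))
magnitude_prune(model, sparsity=0.7)
```

Real pruning pipelines add fine-tuning, sparsity schedules, and structured criteria on top of this idea.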
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously improving; PRs adding works (papers, repositories) the repo has missed are welcome.
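As quick background for the quantization entries in this list, a minimal sketch of post-training affine (uniform) quantization in NumPy; the 8-bit setting and function names are illustrative assumptions:

```python
import numpy as np

def quantize_affine(x: np.ndarray, num_bits: int = 8):
    """Map float values to unsigned ints via a scale and zero point."""
    qmin, qmax = 0, 2 ** num_bits - 1
    scale = max((x.max() - x.min()) / (qmax - qmin), 1e-8)
    zero_point = int(round(qmin - x.min() / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize_affine(q, scale, zero_point):
    return scale * (q.astype(np.float32) - zero_point)

w = np.random.randn(4, 4).astype(np.float32)
q, s, z = quantize_affine(w)
print(np.abs(w - dequantize_affine(q, s, z)).max())  # max quantization error
```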
A list of high-quality, up-to-date AutoML works and lightweight models, including 1) Neural Architecture Search, 2) Lightweight Structures, 3) Model Compression, Quantization and Acceleration, 4) Hyperparameter Optimization, and 5) Automated Feature Engineering.
Papers for deep neural network compression and acceleration
MUSCO: MUlti-Stage COmpression of neural networks
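MUSCO and the rank-selection work further down compress layers via low-rank factorization. As a minimal illustration of the underlying idea (not the MUSCO implementation), a truncated-SVD factorization of a weight matrix in NumPy:

```python
import numpy as np

def low_rank_factorize(W: np.ndarray, rank: int):
    """Approximate W (out x in) by two factors, so W @ x ~= A @ (B @ x).

    Parameter count drops from out*in to (out + in)*rank.
    """
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # shape: (out, rank)
    B = Vt[:rank, :]             # shape: (rank, in)
    return A, B

W = np.random.randn(256, 512)
A, B = low_rank_factorize(W, rank=32)
x = np.random.randn(512)
print(np.linalg.norm(W @ x - A @ (B @ x)))  # approximation error
```

Choosing the rank per layer is the hard part, which is what the Bayesian rank-selection paper below addresses.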
Resources of our survey paper "A Comprehensive Survey on AI Integration at the Edge: Techniques, Applications, and Challenges"
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
📚 Collection of awesome generation acceleration resources.
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
A list of papers, docs, and code about diffusion distillation. This repo collects various distillation methods for diffusion models; PRs adding works (papers, repositories) the repo has missed are welcome.
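These methods build on the generic teacher-student distillation objective. A minimal sketch in PyTorch of that core step, using a plain MSE on outputs (actual diffusion distillation methods such as progressive or consistency distillation are considerably more involved):

```python
import torch
import torch.nn.functional as F

def distillation_step(student, teacher, x, optimizer):
    """One optimization step matching student outputs to a frozen teacher."""
    with torch.no_grad():
        target = teacher(x)  # teacher runs without gradient tracking
    loss = F.mse_loss(student(x), target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```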
Deep Learning Compression and Acceleration SDK -- deep model compression for edge and IoT embedded systems, and deep model acceleration for cloud and private servers
(NeurIPS 2019 MicroNet Challenge - 3rd Place Winner) Open-source code for "SIPA: A simple framework for efficient networks"
Bayesian Optimization-Based Global Optimal Rank Selection for Compression of Convolutional Neural Networks, IEEE Access
[IJCNN'19, IEEE JSTSP'19] Caffe code for our paper "Structured Pruning for Efficient ConvNets via Incremental Regularization"; [BMVC'18] "Structured Probabilistic Pruning for Convolutional Neural Network Acceleration"
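To make "structured pruning" concrete: it removes whole filters rather than individual weights, so the pruned network runs faster without sparse kernels. A minimal sketch in PyTorch of L1-norm filter pruning (an illustration of the general idea, not the incremental-regularization method of the papers above):

```python
import torch
import torch.nn as nn

def prune_filters_l1(conv: nn.Conv2d, keep_ratio: float = 0.5) -> nn.Conv2d:
    """Return a smaller Conv2d that keeps the filters with largest L1 norm."""
    n_keep = max(1, int(conv.out_channels * keep_ratio))
    # L1 norm of each output filter; weight shape is (out_ch, in_ch, kH, kW).
    norms = conv.weight.data.abs().sum(dim=(1, 2, 3))
    keep = torch.topk(norms, n_keep).indices.sort().values
    pruned = nn.Conv2d(conv.in_channels, n_keep, conv.kernel_size,
                       stride=conv.stride, padding=conv.padding,
                       bias=conv.bias is not None)
    pruned.weight.data = conv.weight.data[keep].clone()
    if conv.bias is not None:
        pruned.bias.data = conv.bias.data[keep].clone()
    return pruned
```

Note that the next layer's input channels must be sliced to match, which this sketch leaves out.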
Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
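A minimal sketch in PyTorch of the LoRA idea that course covers: a frozen base linear layer plus a trainable low-rank update (the class name, rank, and alpha are illustrative assumptions, not the course's code):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base weight plus a trainable low-rank update B @ A."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # only A and B are trained
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scaling = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scaling * (x @ self.A.T @ self.B.T)

layer = LoRALinear(nn.Linear(512, 512), rank=8)
y = layer(torch.randn(2, 512))
```

Because B starts at zero, the adapted layer initially behaves exactly like the base layer.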
Vision-language model example code.
This sample shows how to convert a TensorFlow model to the OpenVINO IR format and how to quantize the OpenVINO model.
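A minimal sketch of that flow, assuming OpenVINO's Python API and NNCF for post-training quantization; the SavedModel path, `calibration_items`, and `transform_fn` are placeholders you must supply:

```python
import openvino as ov
import nncf

# Convert a TensorFlow SavedModel to OpenVINO IR ("saved_model_dir" is a placeholder).
ov_model = ov.convert_model("saved_model_dir")
ov.save_model(ov_model, "model_fp32.xml")

# Post-training INT8 quantization on a small calibration set.
def transform_fn(item):
    return item["input"]  # adapt to the model's actual input layout

calib = nncf.Dataset(calibration_items, transform_fn)  # calibration_items: your data
quantized = nncf.quantize(ov_model, calib)
ov.save_model(quantized, "model_int8.xml")
```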
Reduces model complexity by 612 times and memory footprint by 19.5 times compared to the base model, while still meeting a worst-case accuracy threshold.
On Efficient Variants of Segment Anything Model
Binarized Neural Network (BNN) implementation using TensorFlow 2 and PyTorch, with inference using Numpy and C.
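The essence of BNN inference is that weights and activations collapse to {-1, +1}, so dense layers reduce to sign flips and additions. A minimal NumPy sketch of a binarized dense layer (illustrative only, not the repo's implementation):

```python
import numpy as np

def binarize(x: np.ndarray) -> np.ndarray:
    """Map values to {-1, +1} by sign (zero maps to +1 by convention)."""
    return np.where(x >= 0, 1.0, -1.0).astype(np.float32)

def bnn_dense(x: np.ndarray, w_real: np.ndarray) -> np.ndarray:
    """Dense layer with binarized inputs and weights; the real-valued
    output is typically batch-normalized before the next binarization."""
    return binarize(x) @ binarize(w_real)

x = np.random.randn(1, 64).astype(np.float32)
w = np.random.randn(64, 32).astype(np.float32)
y = bnn_dense(x, w)  # shape (1, 32)
```

During training, the non-differentiable sign function is usually handled with a straight-through estimator.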