μNAS

μNAS (micro-NAS or mu-NAS) is a neural architecture search system that specialises in finding ultra-small models suitable for deployment on microcontrollers: think < 64 KB of memory and storage. μNAS achieves this by explicitly targeting three primary resource bottlenecks: model size, latency and peak memory usage.

For a full description of methodology and experimental results, please see the accompanying paper "μNAS: Constrained Neural Architecture Search for Microcontrollers".

Changelog from arXiv v1:

  • corrected the reported number of MACs for the DS-CNN baseline on the Speech Commands dataset
  • fixed the Speech Commands hyperparameters and updated the found models
  • added a smaller CIFAR-10 model to the comparison table
  • added search times to the comparison table
  • updated the discussion on pruning, search convergence and the use of soft constraints

Usage

Setup

μNAS uses Python 3.7+ with the environment described by the Pipfile: to create an environment with all required packages preinstalled, simply run pipenv install in the cloned repository.

To run

The search is configured using Python configuration files (see configs for examples and config.py for the configuration file schema), which specify the search algorithm, how candidate models are trained (including any pruning configuration), and the resource bounds. μNAS is invoked via driver.py, which immediately delegates to the configured search algorithm.
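As a purely illustrative sketch, a configuration conceptually specifies something like the following. All field names here are hypothetical assumptions, not the actual schema; consult config.py and the files in configs/ for working configurations.

    # Illustrative sketch only: hypothetical field names, not the real schema.
    search_config = {
        "dataset": "mnist",                     # which loader from dataset/ to use
        "search_algorithm": "aging_evolution",  # or Bayesian optimisation
        "training": {
            "epochs": 30,                       # per-candidate training budget
            "pruning": {"target_sparsity": 0.75},
        },
        "bounds": {
            "peak_memory": 64 * 1024,           # bytes of RAM for activations
            "model_size": 64 * 1024,            # bytes of storage for weights
        },
    }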

For example, to search for MNIST models with Aging Evolution and structured pruning, run the following:

pipenv run python driver.py configs/cnn_mnist_struct_pru.py --name "example_mnist"

Navigating the code

  • cnn/mlp: contains a search space description for convolutional neural networks / multilayer perceptrons, together with all allowed morphisms (changes) to a candidate architecture.

  • configs: example search configurations.

  • dataset: loaders for various datasets, conforming to the interface in dataset/dataset.py.

  • dragonfly_adapters: (Bayesian optimisation only) extra code to interoperate with Dragonfly. We found that we had to rely on the framework's internal implementation to make it correctly use our customised kernel, search space and genetic-algorithm optimiser for acquisition functions, so the module contains a fair number of monkey-patches.

  • resource_models: an independent library that allows representing and computing resource usage of arbitrary computation graphs.

  • search_algorithms: implements aging evolution and Bayesian optimisation search algorithms; each search algorithm is also responsible for scheduling model training and correctly serialising & restoring the search state. Both use ray under the hood to parallelise the search.

  • teachers: a collection of teacher models for distillation.

  • test: automated sanity tests for search space implementations.

  • model_trainer.py: code for training candidate models.

  • pruning.py: implements Dynamic Model Pruning with Feedback as a Keras callback, used during training; a simplified sketch of the idea follows this list.

  • generate_tflite_models.py: generates random small models for latency benchmarking on a microcontroller.

  • search_state_processor.py: loads and visualises μNAS search state files.

  • architecture.py / config.py / search_space.py / schema_types.py: base classes for candidate architectures, search configuration and free variables of the search space.
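
As a rough illustration of the pruning mechanism mentioned above, the following sketch implements plain dynamic magnitude pruning as a Keras callback. It is not the repository's implementation (see pruning.py): the full Dynamic Model Pruning with Feedback algorithm also keeps a dense copy of the weights and lets pruned connections recover, which is omitted here for brevity.

    import numpy as np
    import tensorflow as tf

    class MagnitudePruningCallback(tf.keras.callbacks.Callback):
        """Simplified sketch: periodically zero out the smallest-magnitude weights."""

        def __init__(self, target_sparsity=0.75, update_every=100):
            super().__init__()
            self.target_sparsity = target_sparsity
            self.update_every = update_every
            self.step = 0

        def on_train_batch_end(self, batch, logs=None):
            self.step += 1
            if self.step % self.update_every:
                return
            for layer in self.model.layers:
                if not isinstance(layer, (tf.keras.layers.Conv2D, tf.keras.layers.Dense)):
                    continue
                weights = layer.kernel.numpy()
                k = int(self.target_sparsity * weights.size)
                if k == 0:
                    continue
                # Magnitude of the k-th smallest weight becomes the pruning threshold.
                threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
                mask = (np.abs(weights) > threshold).astype(weights.dtype)
                layer.kernel.assign(weights * mask)

The callback would be passed to model.fit(..., callbacks=[MagnitudePruningCallback()]) so that masks are recomputed throughout training rather than once at the end.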

Notes on deploying found models

To save storage space, μNAS does not save the final weights of discovered models (though it can be modified to do so): μNAS uses aging evolution and does not share trained weights across candidate models, which encourages finding models that can be trained to good accuracy from scratch. You can easily instantiate a Keras model from a found architecture (see the API in architecture.py), as sketched below.
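
A minimal sketch of what this looks like, assuming hypothetical attribute and method names (the real API is defined in architecture.py and search_state_processor.py):

    # Hypothetical sketch: names below are assumptions, not the actual API.
    import pickle

    with open("search_state.pickle", "rb") as f:    # assumed search-state file name
        search_state = pickle.load(f)

    best = search_state.best_architecture           # assumed: top candidate from the search
    model = best.to_keras_model(                    # assumed conversion method
        input_shape=(28, 28, 1),
        num_classes=10,
    )
    model.summary()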

μNAS assumes a runtime where each operator is executed one at a time and in full, such as TensorFlow Lite Micro. You can quantise and convert Keras models to the TFLite format using helper functions in utils.py; a conversion sketch using the plain TensorFlow Lite converter follows the notes below. Note that:

  • μNAS only calculates resource usage of a model and does not take particular framework overheads into account.

  • μNAS assumes that one of the input buffers to an Add operator can be reused as an output buffer if it is not used elsewhere (to minimise peak memory usage); this optimisation is not available in TF Lite Micro at the time of writing.

  • The operator execution order that gives the smallest peak memory usage is not recorded in the model: use tflite-tools to optimise your tflite model prior to deploying.
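
For illustration, the sketch below performs such a conversion with the standard TensorFlow Lite converter and full-integer quantisation. The representative dataset here is random and purely a placeholder (use real training samples in practice), and the helper functions in utils.py remain the intended entry point in this repository.

    import numpy as np
    import tensorflow as tf

    def representative_dataset():
        # Placeholder calibration data; replace with samples from the training set.
        for _ in range(100):
            yield [np.random.rand(1, 28, 28, 1).astype(np.float32)]

    converter = tf.lite.TFLiteConverter.from_keras_model(model)  # `model` from the sketch above
    converter.optimizations = [tf.lite.Optimize.DEFAULT]
    converter.representative_dataset = representative_dataset
    converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
    converter.inference_input_type = tf.int8
    converter.inference_output_type = tf.int8

    tflite_model = converter.convert()
    with open("model.tflite", "wb") as f:
        f.write(tflite_model)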
