`wasi-nn`

A proposed WebAssembly System Interface API for machine learning (ML), also known as neural networks.

Current Phase

wasi-nn is currently in Phase 2.

Champions

Andrew Brown
Mingqiu Sun

Phase 4 Advancement Criteria

wasi-nn must have at least two complete independent implementations.

Table of Contents

Introduction
Goals
Non-goals
API walk-through
Detailed design discussion
- Should `wasi-nn` support training models?
- Should `wasi-nn` support inspecting models?
Considered alternatives
Stakeholder Interest & Feedback
References & acknowledgements

Introduction

wasi-nn is a WASI API for performing ML inference. ML models are typically trained using a large data set, resulting in one or more files that describe the model's weights. The model is then used to compute an "inference," e.g., the probabilities of classifying an image as a set of tags. This API is concerned initially with inference, not training.

Why expose ML inference as a WASI API? Though the functionality of inference can be encoded into WebAssembly, there are two primary motivations for wasi-nn:

- ease of use: an entire ecosystem already exists to train and use models (e.g., TensorFlow, ONNX, OpenVINO, etc.); wasi-nn is designed to make it easy to use existing model formats as-is
- performance: the nature of ML inference makes it amenable to hardware acceleration of various kinds; without this hardware acceleration, inference can suffer slowdowns of several hundred times. Hardware acceleration for ML is very diverse (SIMD such as AVX512, GPUs, TPUs, FPGAs), and it is unlikely (impossible?) that all of these would be supported natively in WebAssembly

WebAssembly programs that want to use a host's ML capabilities can access these capabilities through wasi-nn's core abstractions: backends, graphs, and tensors. A user selects a backend for inference and loads a model, instantiated as a graph, to use in the backend. Then, the user passes tensor inputs to the graph, computes the inference, and retrieves the tensor outputs.
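The flow described above (pick a backend, load a model as a graph, set input tensors, compute, read output tensors) can be sketched in a few lines. This is only a toy, in-process model of the three abstractions, not the wasi-nn interface: every name here (`Tensor`, `Graph`, `load`, the `"toy"` backend, and its comma-separated "model format") is an assumption made for illustration. See [wasi-nn.wit.md] for the actual API.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Tensor:
    dimensions: List[int]  # shape, e.g. [1, 3]
    data: List[float]      # flat, row-major values


class Graph:
    """A loaded model: here, just a function from input tensors to output tensors."""

    def __init__(self, run: Callable[[Dict[int, Tensor]], Dict[int, Tensor]]):
        self._run = run
        self._inputs: Dict[int, Tensor] = {}
        self._outputs: Dict[int, Tensor] = {}

    def set_input(self, index: int, tensor: Tensor) -> None:
        self._inputs[index] = tensor

    def compute(self) -> None:
        self._outputs = self._run(self._inputs)

    def get_output(self, index: int) -> Tensor:
        return self._outputs[index]


def toy_backend_load(model_bytes: bytes) -> Graph:
    # Hypothetical "model format": comma-separated per-class weights.
    weights = [float(w) for w in model_bytes.decode("ascii").split(",")]

    def run(inputs: Dict[int, Tensor]) -> Dict[int, Tensor]:
        scores = [w * v for w, v in zip(weights, inputs[0].data)]
        peak = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        return {0: Tensor([1, len(exps)], [e / total for e in exps])}

    return Graph(run)


# The host decides which backends exist; the guest only names one.
BACKENDS = {"toy": toy_backend_load}


def load(model_bytes: bytes, backend: str) -> Graph:
    # The model is passed as opaque bytes; only the backend interprets them.
    return BACKENDS[backend](model_bytes)


graph = load(b"1.0,1.0,1.0", "toy")
graph.set_input(0, Tensor([1, 3], [0.0, 2.0, 1.0]))
graph.compute()
probabilities = graph.get_output(0).data
best = max(range(len(probabilities)), key=probabilities.__getitem__)
```

Here `best` is the index of the most probable class. The guest code never looks inside the model bytes, which is what keeps the caller independent of the framework and model format.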

wasi-nn backends correspond to existing ML frameworks, e.g., TensorFlow, ONNX, and OpenVINO. wasi-nn places no requirements on hosts to support specific backends; the API is purposefully designed to allow the largest number of ML frameworks to implement it. wasi-nn graphs can be passed as opaque byte sequences to support any number of model formats. This makes the API framework- and format-agnostic, since we expect device vendors to provide the ML backend and support for their particular graph format.

Users can find language bindings for wasi-nn at the wasi-nn bindings repository; request additional language support there. More information about wasi-nn can be found at:

- Blog post: Machine Learning in WebAssembly: Using wasi-nn in Wasmtime
- Blog post: Implementing a WASI Proposal in Wasmtime: wasi-nn
- Blog post: Neural network inferencing for PyTorch and TensorFlow with ONNX, WebAssembly System Interface, and wasi-nn
- Recorded talk: Machine Learning with Wasm (wasi-nn)
- Recorded talk: Lightning Talk: High Performance Neural Network Inferencing Using wasi-nn

Goals

The primary goal of wasi-nn is to allow users to perform ML inference from WebAssembly using existing models (i.e., ease of use) and with maximum performance. Though the primary focus is inference, we plan to leave open the possibility to perform ML training in the future (request training in an issue!).

Another design goal is to make the API framework- and model-agnostic; this allows for implementing the API with multiple ML frameworks and model formats. The `load` method returns an error when it is passed a model encoding that the host does not support. This approach is similar to how a browser deals with image or video encoding.
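A caller can treat an unsupported encoding much like a browser treats an unsupported image format: try the preferred format first and fall back to another. The sketch below illustrates that pattern under stated assumptions. `GraphEncoding`, `UnsupportedEncoding`, `HOST_SUPPORTED`, `load`, and `load_first_supported` are all hypothetical names; the set of encodings is an illustrative subset, and the returned "handle" is a placeholder rather than a real graph.

```python
from enum import Enum
from typing import Iterable, Tuple


class GraphEncoding(Enum):
    # Illustrative subset of model formats; not an authoritative list.
    OPENVINO = "openvino"
    ONNX = "onnx"
    TENSORFLOW = "tensorflow"


class UnsupportedEncoding(Exception):
    """Raised when the host has no backend for the requested encoding."""


# Hypothetical host configuration: this host only ships an ONNX backend.
HOST_SUPPORTED = {GraphEncoding.ONNX}


def load(model_bytes: bytes, encoding: GraphEncoding) -> Tuple[GraphEncoding, int]:
    """Stand-in for a load call: returns a fake graph handle or raises."""
    if encoding not in HOST_SUPPORTED:
        raise UnsupportedEncoding(f"no backend for {encoding.value!r} on this host")
    return (encoding, len(model_bytes))  # placeholder for an opaque graph handle


def load_first_supported(
    candidates: Iterable[Tuple[bytes, GraphEncoding]],
) -> Tuple[GraphEncoding, int]:
    """Try each (bytes, encoding) pair in order and return the first that loads."""
    errors = []
    for model_bytes, encoding in candidates:
        try:
            return load(model_bytes, encoding)
        except UnsupportedEncoding as err:
            errors.append(str(err))
    raise UnsupportedEncoding("; ".join(errors))


handle = load_first_supported([
    (b"<openvino model bytes>", GraphEncoding.OPENVINO),
    (b"<onnx model bytes>", GraphEncoding.ONNX),
])
```

On this hypothetical host the OpenVINO attempt fails and the ONNX model is loaded instead. If every candidate fails, the collected error messages are raised together so the caller can see which encodings were rejected.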

Non-goals

wasi-nn is not designed to provide support for individual ML operations (a "model builder" API). The ML field is still evolving rapidly, with new operations and network topologies emerging continuously. It would be a challenge to define an evolving set of operations to support in the API. Instead, our approach is to start with a "model loader" API, inspired by WebNN’s model loader proposal.

API walk-through

The following example describes how a user would use wasi-nn to classify an image.

TODO

Detailed design discussion

For the details of the API, see [wasi-nn.wit.md].

Should `wasi-nn` support training models?

Ideally, yes. In the near term, however, exposing (and implementing) the inference-focused API is sufficiently complex to postpone a training-capable API until later. Also, models are typically trained offline, prior to deployment, and it is unclear why training models using WASI would be an advantage over training them natively. (Conversely, the inference API does make sense: performing ML inference in a Wasm deployment is a known use case.) See the associated discussion here and feel free to open pull requests or issues related to this that fit within the goals above.

Should `wasi-nn` support inspecting models?

Ideally, yes. The ability to inspect models would allow users to determine, at runtime, the tensor shapes of the inputs and outputs of a model. As with ML training (above), this can be added in the future.
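To make the motivation concrete, here is a sketch of what runtime inspection could enable: checking that an input buffer matches the shape a model expects before running inference. wasi-nn does not define an inspection API, so `TensorInfo`, `GraphInfo`, and `check_input` are purely hypothetical, and the shapes are made up for the example.

```python
from dataclasses import dataclass
from math import prod
from typing import List, Tuple


@dataclass(frozen=True)
class TensorInfo:
    name: str
    dimensions: Tuple[int, ...]


@dataclass(frozen=True)
class GraphInfo:
    # Hypothetical result of inspecting a loaded model.
    inputs: List[TensorInfo]
    outputs: List[TensorInfo]


def check_input(info: GraphInfo, index: int, data: List[float]) -> None:
    """Raise ValueError if `data` does not hold as many elements as the shape requires."""
    expected = prod(info.inputs[index].dimensions)
    if len(data) != expected:
        raise ValueError(
            f"input {index} ({info.inputs[index].name}) expects "
            f"{expected} elements, got {len(data)}"
        )


# Made-up metadata for a tiny image classifier.
info = GraphInfo(
    inputs=[TensorInfo("image", (1, 3, 2, 2))],
    outputs=[TensorInfo("probabilities", (1, 4))],
)
check_input(info, 0, [0.0] * 12)  # 1 * 3 * 2 * 2 == 12 elements, so this passes
```

Without inspection, the caller has to know these shapes ahead of time, for example from documentation shipped alongside the model.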

Considered alternatives

There are other ways to perform ML inference from a WebAssembly program:

- a user could specify a custom host API for ML tasks; this is similar to the approach taken here. The advantages and disadvantages are in line with other "spec vs. custom" trade-offs: the user can precisely tailor the API to their use case, etc., but will not be able to switch easily between implementations.
- a user could compile a framework and/or model to WebAssembly; this is similar to here and here. The primary disadvantage to this approach is performance: WebAssembly, even with the recent addition of 128-bit SIMD, does not have optimized primitives for performing ML inference or accessing ML-optimized hardware. The performance loss can be several orders of magnitude.

Stakeholder Interest & Feedback

TODO before entering Phase 3.

References & acknowledgements

Many thanks for valuable feedback and advice from:
