GPU constrained! No More. Microsoft released Phi3 specially designed for memory/compute constrained environments. The model support ONXX CPU runtime which offers amazing inference speed even on mobile cpu.
-
Updated
May 31, 2024 - Jupyter Notebook
GPU constrained! No More. Microsoft released Phi3 specially designed for memory/compute constrained environments. The model support ONXX CPU runtime which offers amazing inference speed even on mobile cpu.
Add a description, image, and links to the onxx-cpu topic page so that developers can more easily learn about it.
To associate your repository with the onxx-cpu topic, visit your repo's landing page and select "manage topics."