Skip to content
View GreyZzzzzzXh's full-sized avatar

Block or report GreyZzzzzzXh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM inference in C/C++

C++ 65,674 9,423 Updated Oct 1, 2024

Official inference library for Mistral models

Jupyter Notebook 9,576 847 Updated Sep 20, 2024

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 1,533 163 Updated Oct 1, 2024

Generative AI extensions for onnxruntime

C++ 445 105 Updated Oct 1, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,669 202 Updated Sep 21, 2024

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,369 467 Updated Sep 28, 2024

Model compression for ONNX

Python 69 8 Updated Sep 23, 2024

ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.

Python 273 53 Updated Sep 30, 2024

⚡Delightful WebNN resources, curated list of awesome things around WebNN ecosystem.😎

36 3 Updated Aug 15, 2024

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

JavaScript 1,295 161 Updated Jun 29, 2024

Mamba SSM architecture

Python 12,708 1,066 Updated Sep 26, 2024
Python 1,016 91 Updated Jan 4, 2024

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Python 854 275 Updated Oct 1, 2024

Like NumPy, in JavaScript

JavaScript 2,391 183 Updated May 31, 2024

Friendly machine learning for the web! 🤖

JavaScript 6,457 902 Updated Jul 4, 2024

A collection of pre-trained, state-of-the-art models in the ONNX format

Jupyter Notebook 7,783 1,388 Updated Apr 30, 2024

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

Jupyter Notebook 2,300 432 Updated Sep 3, 2024

DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported …

C++ 2,188 290 Updated Sep 27, 2024

Simple package that makes your generator work in background thread

Python 272 22 Updated Jun 9, 2022

Use AnimeGANv3 to make your own animation works, including turning photos or videos into anime.

Python 1,698 217 Updated Aug 28, 2024
MLIR 396 69 Updated Sep 30, 2024

A C# port of shadowsocks

C# 58,325 16,392 Updated Aug 20, 2024

👾 Fast and simple video download library and CLI tool written in Go

Go 27,255 2,949 Updated Sep 27, 2024

Fast and Lightweight Observability Data Collector

C++ 1,717 386 Updated Sep 30, 2024

Arduino core for the ESP32

C++ 13,408 7,371 Updated Oct 1, 2024

Arduino IDE 1.x

Java 14,131 7,004 Updated Aug 27, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 132,807 26,467 Updated Oct 1, 2024

Prebuilt binary for TensorFlowLite's standalone installer. For RaspberryPi. A very lightweight installer. I provide a FlexDelegate, MediaPipe Custom OP and XNNPACK enabled binary.

Shell 205 35 Updated Mar 29, 2024

Prebuilt binary with Tensorflow Lite enabled. For RaspberryPi / Jetson Nano. Support for custom operations in MediaPipe. XNNPACK, XNNPACK Multi-Threads, FlexDelegate.

Shell 500 113 Updated May 7, 2024

Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.

Python 4,465 839 Updated Aug 11, 2024
Next