NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions

Python 5 Updated Apr 1, 2024

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 365 29 Updated Oct 9, 2024

PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial function learning to exploit the flexibility of the FPGA soft logic.

Python 38 4 Updated Feb 9, 2024

NN Training at the Edge with PyTorch, PYNQ and an Ultra96-V2 board

HTML 4 1 Updated Jul 22, 2024

Allo: A Programming Model for Composable Accelerator Design

Python 124 17 Updated Oct 8, 2024

Run stable-diffusion-webui with Radeon RX 580 8GB on Ubuntu 22.04.2 LTS

53 11 Updated Nov 10, 2023

VHDL module for running operations from memory, with the software also written in VHDL

VHDL 6 Updated Jul 20, 2024

Tiny inference-only implementation of LLaMA

Python 91 9 Updated Apr 3, 2024

Llama 2 inference using only Python and NumPy

Python 3 1 Updated Dec 5, 2023

This repo contains the code for the project, DIY Smart Watch using M5StickC

C++ 13 4 Updated Sep 28, 2019

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,101 505 Updated Aug 21, 2024

GPGPU-Sim enabled Turing WMMA API and its benchmark results. Undergraduate study at Yonsei Univ.

C++ 8 6 Updated Feb 21, 2021

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,395 692 Updated Jul 11, 2024

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Python 345 24 Updated Sep 26, 2023
C# 9 1 Updated Dec 28, 2022