Skip to content
View xxxxyu's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report xxxxyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Easy usage of Rockchip's NPUs found in RK3588 and similar chips

Shell 51 4 Updated Jun 26, 2024

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,448 139 Updated Jun 27, 2024

Ubuntu 22.04 and 24.04 for Rockchip RK35XX Devices

Shell 1,873 204 Updated Jul 11, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 9,919 10,719 Updated Jul 11, 2024

A reference application for a local AI assistant with LLM and RAG

Python 67 9 Updated Jul 3, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 783 69 Updated Jul 12, 2024

A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing

Java 20 2 Updated Jul 12, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 37,346 5,054 Updated Jul 12, 2024

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

C++ 275 19 Updated Jul 11, 2024

Run generative AI models in sophgo BM1684X

Python 72 13 Updated Jul 12, 2024

Run Large Language Models on RK3588 with GPU-acceleration

67 2 Updated Aug 16, 2023

Improve your Bilibili homepage by redesigning it, adding more features, and personalizing it to match your preferences. (English | 简体中文 | 正體中文 | 廣東話)

Vue 3,623 120 Updated Jul 11, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,658 407 Updated Jul 1, 2024
Python 249 25 Updated May 13, 2024

Sparsity-aware deep learning inference runtime for CPUs

Python 2,939 169 Updated Jul 5, 2024

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,010 140 Updated Jul 5, 2024

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,200 322 Updated Jul 12, 2024

Reverse engineering the rk3588 npu

C 53 3 Updated May 30, 2024

Run Neural networks on NationalChip NPU processor.

Python 12 7 Updated Sep 21, 2022

RK3399 Pro NPU support for Caffe SSD detector

Python 27 16 Updated Jun 4, 2020

DDK for Rockchip NPU

C++ 57 14 Updated Dec 29, 2020

📋 A list of open LLMs available for commercial use.

10,612 661 Updated Jul 5, 2024

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 663 84 Updated May 30, 2024

Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's

C 75 10 Updated Apr 8, 2024

Free monospaced font with programming ligatures

Clojure 76,123 3,076 Updated May 10, 2024

Stable Diffusion in NCNN with c++, supported txt2img and img2img

C++ 952 94 Updated Jul 3, 2023

使用Android手机的CPU推理stable diffusion

Java 127 26 Updated Dec 2, 2023

Grok open release

Python 49,160 8,311 Updated May 29, 2024

Stable Diffusion AI client app for Android

Kotlin 546 56 Updated Jul 11, 2024
Next