xxxxyu

Follow

🎯

Focusing

Xiangyu Li xxxxyu

🎯

Focusing

Follow

Ph.D. student at Institute for AI Industry Research (AIR), THU.

20 followers · 23 following

Tsinghua University
Beijing, China
17:09 (UTC +08:00)
https://orcid.org/0009-0001-5341-2303

Achievements

Achievements

Highlights

Pro

Block or Report

Block or report xxxxyu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Stars

Pelochus / ezrknpu

Easy usage of Rockchip's NPUs found in RK3588 and similar chips

Shell 51 4 Updated Jun 26, 2024

kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,448 139 Updated Jun 27, 2024

Joshua-Riek / ubuntu-rockchip

Ubuntu 22.04 and 24.04 for Rockchip RK35XX Devices

Shell 1,873 204 Updated Jul 11, 2024

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 9,919 10,719 Updated Jul 11, 2024

NVIDIA-AI-IOT / jetson-copilot

A reference application for a local AI assistant with LLM and RAG

Python 67 9 Updated Jul 3, 2024

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 783 69 Updated Jul 12, 2024

MobileLLM / ChainStream

A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing

Java 20 2 Updated Jul 12, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 37,346 5,054 Updated Jul 12, 2024

foldl / chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

C++ 275 19 Updated Jul 11, 2024

sophgo / LLM-TPU

Run generative AI models in sophgo BM1684X

Python 72 13 Updated Jul 12, 2024

Chrisz236 / llm-rk3588

Run Large Language Models on RK3588 with GPU-acceleration

67 2 Updated Aug 16, 2023

BewlyBewly / BewlyBewly

Improve your Bilibili homepage by redesigning it, adding more features, and personalizing it to match your preferences. (English | 简体中文 | 正體中文 | 廣東話)

Vue 3,623 120 Updated Jul 11, 2024

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,658 407 Updated Jul 1, 2024

airockchip / rknn-llm

Python 249 25 Updated May 13, 2024

neuralmagic / deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Python 2,939 169 Updated Jul 5, 2024

neuralmagic / sparseml

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python 2,010 140 Updated Jul 5, 2024

ztxz16 / fastllm

纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行

C++ 3,200 322 Updated Jul 12, 2024

airockchip / rknn-toolkit2

C 524 62 Updated Jun 27, 2024

mtx512 / rk3588-npu

Reverse engineering the rk3588 npu

C 53 3 Updated May 30, 2024

NationalChip / gxDNN

Run Neural networks on NationalChip NPU processor.

Python 12 7 Updated Sep 21, 2022

Pinnh / NPU_CaffeSSD

RK3399 Pro NPU support for Caffe SSD detector

Python 27 16 Updated Jun 4, 2020

airockchip / rknpu_ddk

DDK for Rockchip NPU

C++ 57 14 Updated Dec 29, 2020

eugeneyan / open-llms

📋 A list of open LLMs available for commercial use.

10,612 661 Updated Jul 5, 2024

IST-DASLab / sparsegpt

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 663 84 Updated May 30, 2024

intel / memory-bandwidth-benchmarks

Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's

C 75 10 Updated Apr 8, 2024

tonsky / FiraCode

Free monospaced font with programming ligatures

Clojure 76,123 3,076 Updated May 10, 2024

EdVince / Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

C++ 952 94 Updated Jul 3, 2023

ZTMIDGO / Android-Stable-diffusion-ONNX

使用Android手机的CPU推理stable diffusion

Java 127 26 Updated Dec 2, 2023

xai-org / grok-1

Grok open release

Python 49,160 8,311 Updated May 29, 2024

ShiftHackZ / Stable-Diffusion-Android

Stable Diffusion AI client app for Android

Kotlin 546 56 Updated Jul 11, 2024