Skip to content
View ckhfor's full-sized avatar
Block or Report

Block or report ckhfor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The most intelligent Siri powered by LLMs

TypeScript 381 51 Updated Jun 3, 2024

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 250 27 Updated Jul 30, 2024

SGLang is yet another fast serving framework for large language models and vision language models.

Python 4,186 273 Updated Aug 18, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,487 1,019 Updated Aug 18, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 5,917 597 Updated Aug 14, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,779 399 Updated Jul 15, 2024

LLM inference in C/C++

C++ 63,670 9,121 Updated Aug 18, 2024

Inference code for Llama models

Python 55,105 9,409 Updated Aug 18, 2024

row-major matmul optimization

C++ 575 76 Updated Sep 9, 2023

Universal LLM Deployment Engine with ML Compilation

Python 18,416 1,471 Updated Aug 17, 2024

Awesome AI Coding

596 47 Updated Jul 23, 2024

a lightweight LLM model inference framework

C++ 670 83 Updated Apr 7, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

14,227 1,301 Updated Jul 21, 2024

Python wrapper for isl, an integer set library

Python 73 18 Updated Aug 11, 2024

Decompiler of LLVM bitcode to C

C++ 77 9 Updated Mar 9, 2024

pocl - Portable Computing Language

C 907 250 Updated Aug 16, 2024

SAPFOR (System FOR Automated Parallelization)

Perl 5 5 Updated Apr 9, 2023

Unicorn CPU emulator framework (ARM, AArch64, M68K, Mips, Sparc, PowerPC, RiscV, S390x, TriCore, X86)

C 7,425 1,319 Updated Aug 8, 2024

An OpenCL device simulator and debugger

C++ 345 63 Updated Jul 25, 2023

Everything we learnt from hacking Arm Mali GPUs.

Shell 117 16 Updated Jan 20, 2021

Simple OpenCL Samples that Build with Khronos Headers and Libs

C++ 81 23 Updated Aug 12, 2024

LLaMa/RWKV onnx models, quantization and testcase

Python 337 30 Updated Jul 6, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,352 4,012 Updated Aug 18, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,254 5,481 Updated Jun 24, 2024

计图大模型推理库,具有高性能、配置要求低、中文支持好、可移植等特点

Python 2,339 180 Updated Jan 6, 2024

open source driver project for adreno GPUs

C 262 31 Updated May 19, 2022

一些阅读源码和Fuzzing 的经验,涵盖黑盒与白盒测试..

C++ 1,001 212 Updated Aug 24, 2021

Extracts static code features from opencl kernels to be used for machine learning.

C 10 5 Updated Apr 30, 2021

compiler learning resources collect.

Python 2,004 318 Updated May 27, 2024

基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.

TypeScript 23,416 1,707 Updated Aug 17, 2024
Next