zccyman

Henson zccyman

13 followers · 35 following

wondertek
Shanghai,China

Achievements

Stars

InternLM / MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,791 477 Updated Sep 25, 2024

Jittor / JittorLLMs

计图大模型推理库，具有高性能、配置要求低、中文支持好、可移植等特点

Python 2,363 182 Updated Jan 6, 2024

tensorflow / mlir-hlo

MLIR 397 70 Updated Oct 7, 2024

intel / torch-ccl

oneCCL Bindings for Pytorch*

C++ 85 23 Updated Sep 10, 2024

bytedance / byteir

A model compilation solution for various hardware

MLIR 362 38 Updated Sep 30, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

29,196 1,599 Updated Aug 1, 2024

KEKE046 / mlir-tutorial

Hands-On Practical MLIR Tutorial

C++ 301 40 Updated Oct 20, 2023

THUDM / CodeGeeX4

CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.

Python 1,293 98 Updated Aug 25, 2024

sBobHuang / mlir-tutorial

Forked from KEKE046/mlir-tutorial

Hands-On Practical MLIR Tutorial

C++ 11 Updated Jul 22, 2024

triton-lang / triton

Development repository for the Triton language and compiler

C++ 12,950 1,575 Updated Oct 8, 2024

sophgo / LLM-TPU

Run generative AI models in sophgo BM1684X

Python 105 17 Updated Oct 7, 2024

intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

C++ 345 37 Updated Aug 30, 2024

onnx / neural-compressor

Model compression for ONNX

Python 69 8 Updated Sep 23, 2024

Wsine / feishu2md

一键命令下载飞书文档为 Markdown

Go 1,138 112 Updated Aug 27, 2024

efeslab / Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Cuda 267 24 Updated Jul 2, 2024

naver-aics / lut-gemm

C++ 33 6 Updated Apr 1, 2024

openvinotoolkit / mlas

Assembly 8 9 Updated Jan 22, 2024

ggerganov / ggml

Tensor library for machine learning

C++ 10,964 1,008 Updated Oct 6, 2024

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,394 184 Updated Jul 16, 2024

wejoncy / QLLM

A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.

Python 145 14 Updated Sep 23, 2024

0voice / cpp_new_features

2021年最新整理， C++ 学习资料，含C++ 11 / 14 / 17 / 20 / 23 新特性、入门教程、推荐书籍、优质文章、学习笔记、教学视频等

C++ 4,928 1,033 Updated Jun 8, 2022

amd / ryzen-ai-documentation

Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take pretrained machine learning models in popular frameworks and …

45 18 Updated Oct 7, 2024

microsoft / Olive

Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.

Python 1,542 163 Updated Oct 7, 2024

sophgo / sophgo-mq

Forked from ModelTC/MQBench

Model Quantization Benchmark

Shell 9 5 Updated Sep 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Henson zccyman

Achievements

Achievements

Block or report zccyman

Stars

InternLM / MindSearch

Jittor / JittorLLMs

tensorflow / mlir-hlo

intel / torch-ccl

bytedance / byteir

karpathy / LLM101n

KEKE046 / mlir-tutorial

THUDM / CodeGeeX4

sBobHuang / mlir-tutorial

triton-lang / triton

sophgo / LLM-TPU

intel / neural-speed

onnx / neural-compressor

Wsine / feishu2md

efeslab / Atom

naver-aics / lut-gemm

openvinotoolkit / mlas

ggerganov / ggml

mit-han-lab / llm-awq

wejoncy / QLLM

0voice / cpp_new_features

amd / ryzen-ai-documentation

microsoft / Olive

sophgo / sophgo-mq

HuangOwen / Awesome-LLM-Compression

clevercool / ANT-Quantization

kendryte / nncase

ModelTC / QLLM

ModelTC / lightllm

ModelTC / llmc