
The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering

11 Updated Aug 11, 2024

A large-scale simulation framework for LLM inference

Python 193 19 Updated Aug 1, 2024

Various HDL (Verilog) IP Cores

Verilog 672 204 Updated Jul 1, 2021

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

974 21 Updated Jul 31, 2024

Pin based tool for simulation of rack-scale disaggregated memory systems

C++ 12 Updated Aug 5, 2024

OSDI'24 Nomad implementation

19 2 Updated Jun 10, 2024

cricket is a virtualization solution for GPUs

C 131 34 Updated Jan 7, 2024

Open source FPGA-based NIC and platform for in-network compute

Verilog 1 Updated Apr 5, 2024

Implement HNSW accelerator on FPGA for Approximate Nearest Neighbor Search

C++ 3 Updated Apr 30, 2022
C++ 9 1 Updated Feb 22, 2024

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 320 36 Updated May 28, 2024

An energy-efficient RISC-V floating-point compute cluster.

C 42 42 Updated Aug 14, 2024

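A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and other platforms. It can use GPT3.5/GPT-4o/GPT4.0/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service assistants on top of your own knowledge base.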

Python 29,157 7,732 Updated Aug 14, 2024

example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory

C 99 21 Updated Jul 30, 2024

RISC-V SystemC-TLM simulator

C 264 70 Updated Jul 31, 2024

SERV - The SErial RISC-V CPU

Verilog 1,348 179 Updated Aug 2, 2024

PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization

C++ 17 2 Updated Feb 21, 2024

GLake: optimizing GPU memory management and IO transmission.

Python 333 31 Updated Aug 3, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,774 400 Updated Jul 15, 2024

RoCE v2 hardware and software implementation

118 27 Updated Nov 2, 2023

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,664 87 Updated Jan 21, 2024

PsPIN: A RISC-V in-network accelerator for flexible high-performance low-power packet processing

SystemVerilog 91 16 Updated Feb 22, 2023

Scalable Network Stack for FPGAs (TCP/IP, RoCEv2)

C++ 724 260 Updated Apr 23, 2024

A DPDK repo with Corundum driver

C 24 13 Updated Aug 1, 2022

leaked prompts of GPTs

28,044 3,760 Updated Jul 9, 2024

corundum work on vu13p

Verilog 17 9 Updated Nov 10, 2023

Open source FPGA-based NIC and platform for in-network compute

Verilog 1,599 402 Updated Jul 5, 2024

Open Programmable Acceleration Engine

C++ 251 84 Updated Aug 8, 2024

DeepSeek Coder: Let the Code Write Itself

Python 6,327 451 Updated May 21, 2024