Skip to content
View jiangaojie's full-sized avatar

Block or report jiangaojie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Inference code for Llama models

Python 55,334 9,430 Updated Aug 18, 2024

gem5 相关中文笔记

13 4 Updated Dec 2, 2021
C++ 83 21 Updated Feb 12, 2024

FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks

C++ 42 6 Updated Apr 12, 2022

End-to-end SoC simulation: integrating the gem5 system simulator with the Aladdin accelerator simulator.

C++ 210 59 Updated Oct 6, 2022

The official repository for the gem5 computer-system architecture simulator.

C++ 1,588 1,165 Updated Sep 2, 2024

Productive, portable, and performant GPU programming in Python.

C++ 25,338 2,266 Updated Aug 22, 2024

Source code examples from the Parallel Forall Blog

HTML 1,222 633 Updated Jul 23, 2024

Development repository for the Triton language and compiler

C++ 12,436 1,506 Updated Sep 3, 2024

An open source GPU based off of the AMD Southern Islands ISA.

Verilog 1,026 235 Updated Sep 25, 2017

Material for cuda-mode lectures

Jupyter Notebook 2,294 231 Updated Aug 31, 2024

UNIX-like reverse engineering framework and command-line toolset

C 20,282 2,965 Updated Sep 2, 2024

My Design Philosophy Summary (Most of them are in Chinese)

Python 451 97 Updated Aug 23, 2024

GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

C++ 30 64 Updated Aug 21, 2024

This is the top-level repository for the Accel-Sim framework.

Python 284 110 Updated Aug 31, 2024

Assembler for NVIDIA Maxwell architecture

Sass 940 160 Updated Jan 3, 2023

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,059 4,133 Updated Sep 2, 2024

Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)

Cuda 107 17 Updated Aug 18, 2020

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 10,247 1,473 Updated Aug 18, 2024

CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.

C# 106 16 Updated Jan 17, 2023

BookSim 2.0

C++ 253 158 Updated Jun 24, 2024

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,071 501 Updated Aug 21, 2024

跟我一起写Makefile重制版

Python 3,365 571 Updated Jun 17, 2024

A flexible Python 2/3 Kconfig implementation and library

Python 447 159 Updated Sep 22, 2023

Guide for ICSPA MOOC

68 21 Updated Oct 11, 2023

A minimal, modularized, and machine-independent hardware abstraction layer

C 425 79 Updated Aug 30, 2024

NJU EMUlator, a full system x86/mips32/riscv32/riscv64 emulator for teaching

C 846 181 Updated Aug 10, 2024
Next