Skip to content
View XcodeRole's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report XcodeRole

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Ring attention implementation with flash attention

    Python Updated Sep 16, 2024
  • PyTorch bindings for CUTLASS grouped GEMM.

    Cuda Apache License 2.0 Updated Jul 18, 2024
  • pytorch Public

    Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python Other Updated Jun 14, 2024
  • chakra Public

    Forked from mlcommons/chakra

    Repository for MLCommons Chakra schema and tools

    Python Apache License 2.0 Updated Jun 14, 2024
  • This repository provides support for the paper “Efficient dual-level parallelism solutions for OpenFOAM-based discrete unified gas kinetic scheme”

    Updated Apr 2, 2024
  • VsP-PsT Public

    VsP&PsT algorithm

    C MIT License Updated Apr 2, 2024
  • PsP-VsT Public

    PsP&VsT algorithm

    C MIT License Updated Apr 2, 2024
  • OpenFOAM-6 Public

    This repository serves as the implementation for the paper "An efficient dual-level parallelism solution for OpenFOAM-based discrete unified gas kinetic scheme."

    C++ Other Updated Jan 2, 2024
  • leetcoding Public

    C++ Updated Sep 26, 2023
  • fork from https://github.com/Dev43/warp-wasm-templates.git

    JavaScript Updated Jun 11, 2023
  • go-evm Public

    clone from https://github.com/duanbing/go-evm.git

    Go 2 Updated Jun 1, 2023
  • Samples for CUDA Developers which demonstrates features in CUDA Toolkit

    C Other Updated Apr 27, 2023
  • hw08 Public

    Forked from parallel101/hw08

    高性能并行编程与优化 - 第08讲 CUDA

    C++ Updated Apr 27, 2023
  • ECE408 Public

    Forked from kevin85421/ECE408

    forked from https://github.com/aschuh703/ECE408.git learning cuda in-depth

    Cuda Updated Apr 11, 2023
  • 高性能并行编程与优化 - 课件

    C++ Other Updated Mar 20, 2023
  • hw06 Public

    Forked from parallel101/hw06

    高性能并行编程与优化 - 第06讲的回家作业

    C++ Updated Mar 16, 2023
  • hw01 Public

    Forked from parallel101/hw01

    高性能并行编程与优化 - 第01讲回家作业

    C Updated Mar 3, 2023
  • hw02 Public

    Forked from parallel101/hw02

    高性能并行编程与优化 - 第02讲的回家作业

    C++ Updated Feb 26, 2023
  • hw07 Public

    Forked from parallel101/hw07

    高性能并行编程与优化 07 访存优化

    C++ Updated Feb 23, 2023
  • npu_lecture Public

    npu lecture

    C++ Updated Feb 13, 2023
  • hw05 Public

    Forked from parallel101/hw05

    高性能并行编程与优化 05 C++11多线程

    C++ Updated Jan 30, 2023
  • hw03 Public

    Forked from parallel101/hw03

    高性能并行编程与优化 - 回家作业03

    C++ Updated Jan 26, 2023
  • HTML Updated Oct 19, 2022
  • Fork form git:https://g.csail.mit.edu/6.824-golabs-2021

    Go Updated Oct 19, 2022
  • my study demo

    C++ 1 1 Updated Sep 30, 2022
  • Some private work

    C++ Updated Aug 16, 2022
  • Optimized version of dugksFoam with hybrid parallelization strategy and conserved algorithm.

    C Other Updated Jun 20, 2022
  • Simple Implementation of Lattice Boltzmann Method in C++

    C++ Updated Jun 12, 2022
  • 健康学习到150岁 - 人体系统调优不完全指南

    Updated May 30, 2022
  • My own graduation project which is to parallelize and optimize NUFFT algorithms。

    C++ Updated May 11, 2022