-
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attentionRing attention implementation with flash attention
Python UpdatedSep 16, 2024 -
grouped_gemm Public
Forked from fanshiqing/grouped_gemmPyTorch bindings for CUTLASS grouped GEMM.
Cuda Apache License 2.0 UpdatedJul 18, 2024 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedJun 14, 2024 -
chakra Public
Forked from mlcommons/chakraRepository for MLCommons Chakra schema and tools
Python Apache License 2.0 UpdatedJun 14, 2024 -
dual-level-dugks Public
This repository provides support for the paper “Efficient dual-level parallelism solutions for OpenFOAM-based discrete unified gas kinetic scheme”
UpdatedApr 2, 2024 -
-
-
OpenFOAM-6 Public
This repository serves as the implementation for the paper "An efficient dual-level parallelism solution for OpenFOAM-based discrete unified gas kinetic scheme."
C++ Other UpdatedJan 2, 2024 -
-
warp-wasm-templates Public
fork from https://github.com/Dev43/warp-wasm-templates.git
JavaScript UpdatedJun 11, 2023 -
-
cuda-samples Public
Forked from NVIDIA/cuda-samplesSamples for CUDA Developers which demonstrates features in CUDA Toolkit
C Other UpdatedApr 27, 2023 -
-
ECE408 Public
Forked from kevin85421/ECE408forked from https://github.com/aschuh703/ECE408.git learning cuda in-depth
Cuda UpdatedApr 11, 2023 -
parallel_course Public
Forked from parallel101/course高性能并行编程与优化 - 课件
C++ Other UpdatedMar 20, 2023 -
-
-
-
-
-
-
-
-
6.824-golabs-2021 Public
Fork form git:https://g.csail.mit.edu/6.824-golabs-2021
Go UpdatedOct 19, 2022 -
-
-
paralleled_cdugksFoam Public
Forked from zzhang777/paralleled_cdugksFoamOptimized version of dugksFoam with hybrid parallelization strategy and conserved algorithm.
C Other UpdatedJun 20, 2022 -
Lattice-Boltzmann-Method Public
Simple Implementation of Lattice Boltzmann Method in C++
C++ UpdatedJun 12, 2022 -
HumanSystemOptimization Public
Forked from jxygzzy/HumanSystemOptimization健康学习到150岁 - 人体系统调优不完全指南
UpdatedMay 30, 2022 -
Graduation-Project-NUFFT Public
My own graduation project which is to parallelize and optimize NUFFT algorithms。
C++ UpdatedMay 11, 2022