Stars
Optimized primitives for collective multi-GPU communication
SHARK - High Performance Machine Learning Distribution
User space software for Intel(R) Resource Director Technology
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
OFRAK: unpack, modify, and repack binaries.
Universal LLM Deployment Engine with ML Compilation
System performance characterization tool based on Linux perf
High-performance regular expression matching library
CoreFreq: CPU monitoring and tuning software designed for 64-bit processors.
A curated list of awesome parallel computing resources
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
Apache Log4cxx is a C++ port of Apache Log4j
oneAPI Collective Communications Library (oneCCL)
a truly censorship-resistant alternative to Twitter that has a chance of working
Contains the source code examples described in the "Intel® 64 and IA-32 Architectures Optimization Reference Manual"