yapolyak

A light-weight MPI profiler.

C 78 29 Updated Jul 24, 2024

Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications …

C++ 1,310 162 Updated Aug 14, 2024

The book "Performance Analysis and Tuning on Modern CPU"

TeX 2,054 148 Updated Aug 12, 2024

An advanced benchmarking tool

Python 138 15 Updated Jul 7, 2022

Public repository for vol 2 of The Art of HPC: parallel programming

TeX 65 24 Updated Apr 16, 2024

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C 837 181 Updated Aug 17, 2024

Binder, tool for automatic generation of Python bindings

C++ 315 66 Updated Jul 29, 2024

Frame profiler

C++ 8,991 624 Updated Aug 10, 2024

CUDA on ??? GPUs

Rust 8,790 570 Updated Aug 16, 2024

Expressive Vector Engine - SIMD in C++ Goes Brrrr

C++ 919 57 Updated Aug 17, 2024

The missing CMake project initializer

CMake 1,941 74 Updated Aug 14, 2024

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

C++ 1,231 185 Updated Apr 14, 2024

Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more

Python 2,195 205 Updated Jul 20, 2024

The fastest feature-rich C++11/14/17/20/23 single-header testing framework

C++ 5,773 630 Updated Aug 16, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 25,594 2,807 Updated Aug 17, 2024

Distributed ranges is a generalization of C++ ranges for distributed data structures.

C++ 45 16 Updated Aug 16, 2024

A fast simulator for Clifford circuits

C++ 4 Updated Jan 15, 2024

Blitz++ Multi-Dimensional Array Library for C++

C++ 405 83 Updated Jul 13, 2024

The Legion Parallel Programming System

C++ 668 146 Updated Jun 27, 2024
C 14 6 Updated Aug 10, 2024

A collection of distributed algorithms for the full-state simulation of digital quantum statevectors and density matrices

C++ 14 Updated Aug 16, 2024

Tensor Contraction C++ Library

C++ 50 14 Updated Aug 22, 2019

Basic Tensor Algebra Subroutines

C++ 45 19 Updated Aug 17, 2024

Tensor Contraction Code Generator

Python 36 1 Updated Aug 14, 2017

This is a set of simple programs that can be used to explore the features of a parallel platform.

C 403 107 Updated Apr 15, 2024

Cyclops Tensor Framework: parallel arithmetic on multidimensional arrays

C++ 195 53 Updated Aug 8, 2024

Easy Boost integration in CMake projects

CMake 397 148 Updated Jun 28, 2022

An example combining scikit-build and pybind11

Python 102 29 Updated Aug 12, 2024

CMake for C++ Best Practices

CMake 1,041 114 Updated Aug 6, 2024

Source code for the TKET quantum compiler, Python bindings and utilities

C++ 244 48 Updated Aug 16, 2024