Performance-portable, length-agnostic SIMD with runtime dispatch
Updated Nov 1, 2024 - C++
Expressive Vector Engine - SIMD in C++ Goes Brrrr
PyTurboJPEG is a highly optimized Python wrapper of libjpeg-turbo (TurboJPEG API) which supports x86 and ARM architectures.
Pelemay is a native compiler for Elixir that generates SIMD instructions. GPU code generation is planned.
SIMD-based linear algebra and statistics for data science with Dart
DSL for SIMD Sorting on AVX2 & AVX512
Two-dimensional flow solver with GUI using vortex particle and boundary element methods
"Byteslice: Pushing the envelop of main memory data processing with a new storage layout" (SIGMOD'15)
Corium is a modern scripting language which combines simple, safe and efficient programming.
n-body-simulation performance test suite
A portable modern C++ primitive performance library for 3D Vision & Photo-Mechanics.
GPU-accelerated 3D vortex methods solver with easy GUI
SIMD-accelerated Vector math lib
A High Performance C# wrapper that allows you to get the benefits of SIMD Intrinsics on List<T>.
SIMD discrete Fourier transform tests and discussion
Vectorized String Helper Functions
EinsteinDB is a Hybrid memory system consisting of DRAM and Non-Volatile Memory configured to persist data fast.
System benchmarks over JVM with JMH - SIMD (superscalar processing), Branch prediction, False sharing.
This repository lists 4 problems solved using C. Each problem has its own serial and parallel implementations. For the latter, the OpenMP API was utilized.
(experiments with) pragma-based SIMD C++ types