Starred repositories

Python 12 1 Updated Nov 2, 2024

Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.

Python 40 19 Updated Nov 8, 2024

Brevitas: neural network quantization in PyTorch

Python 1,201 197 Updated Nov 18, 2024

An application-focused API for memory management on NUMA & GPU architectures

C++ 325 51 Updated Nov 14, 2024

Python 7 4 Updated Jun 16, 2023

Driving Snax with MLIR

Python 13 3 Updated Nov 17, 2024

Tenstorrent MLIR compiler

C++ 75 11 Updated Nov 19, 2024

A schematic editor for VLSI/Asic/Analog custom designs, netlist backends for VHDL, Spice and Verilog. The tool is focused on hierarchy and parametric designs, to maximize circuit reuse.

C 334 21 Updated Nov 16, 2024

The next generation of OpenLane, rewritten from scratch with a modular architecture

Python 208 38 Updated Nov 18, 2024

A PULP SoC for education, easy to understand and extend with a full flow for a physical design.

SystemVerilog 22 3 Updated Nov 18, 2024

Collaborative documentation for and from Jean Zay users. Official Jean Zay documentation: https://www.idris.fr/eng/jean-zay/

109 34 Updated Jul 26, 2024

Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications …

C++ 1,390 168 Updated Nov 16, 2024

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,754 177 Updated Nov 18, 2024

Frame profiler

C++ 10,205 684 Updated Nov 17, 2024

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)

C++ 2,211 258 Updated Nov 13, 2024

A heterogeneous accelerator-centric compute cluster

SystemVerilog 10 9 Updated Nov 14, 2024

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 2,846 615 Updated Nov 18, 2024

HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators

C++ 113 42 Updated Nov 18, 2024

Xtext project to parse CoreDSL files

Xtend 16 3 Updated Feb 19, 2024

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 163 21 Updated Sep 23, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,172 1,403 Updated Sep 5, 2024

TPP experimentation on MLIR for linear algebra

MLIR 110 31 Updated Oct 21, 2024

IREE's PyTorch Frontend, based on Torch Dynamo.

Python 55 25 Updated Nov 18, 2024

LLM inference in C/C++

C++ 68,016 9,753 Updated Nov 18, 2024

LLM training in simple, raw C/CUDA

Cuda 24,441 2,764 Updated Oct 2, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 98,336 7,828 Updated Nov 19, 2024

Summarize existing representative LLMs text datasets.

1,006 107 Updated Sep 4, 2024

Verilog 1,243 267 Updated Nov 14, 2024

DaCe - Data Centric Parallel Programming

Python 497 129 Updated Nov 18, 2024

Adding quality checks and confounds computation steps to fmriprep for stroke data.

Python 4 Updated Nov 18, 2024