Skip to content
View omutlu's full-sized avatar

Organizations

@CMU-SAFARI

Block or report omutlu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Source code & scripts for distributed machine learning training workloads on a real-world Processing-In-Memory system (i.e., UPMEM). Described in our PACT'24 paper by Rhyner et al. at https://arxiv…

C 2 Updated Oct 5, 2024
C++ 4 Updated Sep 16, 2024

A binary instrumentation tool to analyze load instructions in any off-the-shelf x86(-64) program. Described by Bera et al. in https://arxiv.org/pdf/2406.18786

C++ 17 3 Updated Jun 30, 2024

Source code & scripts for experimental characterization and demonstration of 1) simultaneous many-row activation, 2) up to nine-input majority operations and 3) copying one row's content to up 31 r…

VHDL 8 1 Updated May 17, 2024

Circuit-level model for the Capacity-Latency Reconfigurable DRAM (CLR-DRAM) architecture. This repository contains the SPICE models of the CLR-DRAM architecture and the baseline architecture used i…

AGS Script 12 7 Updated Sep 24, 2020

Source code for the architectural simulator used for modeling the PUD system proposed in our HPCA 2024 paper `MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficie…

C++ 12 4 Updated Apr 5, 2024

Source code & scripts for experimental characterization and demonstration of performing NOT and up to 16-input AND, NAND, OR, and NOR operations in real DDR4 DRAM chips. Described in our HPCA'24 pa…

VHDL 10 1 Updated Apr 9, 2024

RUBICON is a novel framework to automatically develop deep-learning-based genomic basecallers for any given architecture, as described in our Genome Biology'24 paper https://genomebiology.biomedc

Jupyter Notebook 4 Updated Feb 22, 2024

MetaTrinity is a novel metagenomic analysis tool employing efficient containment search techniques and heuristics for read mapping to achieve significant speedup while maintaining high accuracy. Th…

HTML 4 1 Updated Feb 16, 2024

CoMeT is a new low-cost RowHammer mitigation that uses Count-Min Sketch-based aggressor row tracking, as described in our HPCA'24 paper https://arxiv.org/pdf/2402.18769.pdf

C++ 7 Updated Apr 5, 2024

Virtuoso is a new simulator that focuses on modelling various memory management and virtual memory aspects.

C++ 25 6 Updated Nov 17, 2023

SequenceLab is a benchmark suite for evaluating computational methods for comparing genomic sequences, such as pre-alignment filters and pairwise sequence alignment algorithms. SequenceLab is descr…

C++ 2 Updated Dec 23, 2023

SimplePIM is the first high-level programming framework for real-world processing-in-memory (PIM) architectures. Described in the PACT 2023 paper by Chen et al. (https://arxiv.org/pdf/2310.01893.pdf).

C 20 5 Updated Oct 23, 2023

New RowHammer mitigation mechanism that is area-, performance-, and energy-efficient especially at very low (e.g., 125) RowHammer thresholds, as described in the USENIX Security'24 paper https://ar…

C++ 13 1 Updated May 2, 2024

RawAlign is a real-time raw nanopore read mapper based on the Seed-Filter-Align paradigm as described by Lindegger et al. (https://arxiv.org/abs/2310.05037)

C++ 5 1 Updated Nov 2, 2023

Utopia is a new hybrid address mapping scheme that accelerates address translation while supporting all conventional VM features as described by Kanellopoulos et al. (https://arxiv.org/abs/2211.12205)

4 Updated Nov 1, 2023

GateSeeder is the first near-memory CPU-FPGA co-design for alleviating both the compute-bound and memory-bound bottlenecks in short and long-read mapping. GateSeeder outperforms Minimap2 by up to 4…

C 8 Updated Oct 3, 2023

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 236 58 Updated Jul 10, 2024

Victima is a new software-transparent technique that greatly extends the address translation reach of modern processors by leveraging the underutilized resources of the cache hierarchy, as desribed…

C 26 12 Updated Oct 13, 2023
C++ 14 2 Updated Dec 15, 2022

An extensible and fully programable cycle-accurate Spatial Architecture simulator for Machine Learning Workloads. Described in detail in the IEEE TC 2023 paper by Orosa et al. at https://arxiv.org/…

Python 6 Updated Jul 31, 2023

Experimental study and analysis on the effect of using a wide range of different supply voltage values on the reliability, latency, and retention characteristics of DDR3L DRAM SO-DIMMs

AGS Script 7 8 Updated Jun 12, 2017

ThyNVM: Transparent hybrid NonVolatile Memory. A gem5-based persistent memory simulator that implements a DRAM+NVM hybrid memory architecture. Introduced in the MICRO 2015 paper: https://users.ece.…

C++ 5 2 Updated Dec 28, 2015

A DDR3 Controller that uses the Xilinx MIG-7 PHY to interface with DDR3 devices.

Verilog 7 3 Updated Aug 22, 2021

A cycle-accurate simulator that models a hybrid memory subsystem consisting of multiple memory technologies. Described in the CLUSTER 2017 paper by Li et al. (https://people.inf.ethz.ch/omutlu/pub/…

C# 5 4 Updated May 9, 2019
C++ 4 1 Updated Feb 6, 2017

LTRF's register-interval creation algorithm divides the control flow graph (CFG) of a GPU application into some register-intervals which have two main characteristics: 1) register-intervals have on…

C++ 4 1 Updated May 24, 2019

Latency characterization data collected from 30 real DRAM SO-DIMMs. You can find the background and analysis on the data in our SIGMETRICS'16 paper "Understanding Latency Variation in Modern DRAM C…

2 1 Updated Jun 10, 2016

This simulator models Simultaneous Multi Layer Access (SMLA) and 3D-stacked DRAM memory, based on the TACO 2016 paper https://users.ece.cmu.edu/~omutlu/pub/smla_high-bandwidth-3d-stacked-memory_tac…

C++ 5 Updated Mar 24, 2016

This is a patch on GPGPU-sim for MeDiC. MeDiC is a mechanism that reduces the negative performance impact of memory divergence and cache queuing in GPUs. It is introduced in the PACT 2015 paper by …

1 1 Updated Nov 20, 2015
Next