Skip to content
View m-atalla's full-sized avatar

Block or report m-atalla

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"

1,345 166 Updated Aug 15, 2024

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 284 10 Updated Sep 4, 2024

A hyperparameter optimization framework

Python 10,540 1,002 Updated Sep 12, 2024

Tuned OpenCL BLAS

C++ 1,046 205 Updated Jun 13, 2024

A guide to help developers get up and running quickly with the OpenCL programming framework

CMake 515 58 Updated Aug 7, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,362 904 Updated Sep 16, 2024

Python interface for MLIR - the Multi-Level Intermediate Representation

Python 210 36 Updated May 28, 2024

A microbenchmark support library

C++ 8,895 1,612 Updated Sep 13, 2024

UBGen can generate programs with undefined behaviors (e.g., buffer-overflow, use-after-free, etc.)

C 55 5 Updated Apr 7, 2024

The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures

C++ 45 21 Updated Sep 2, 2024

A Compiler Writing Journey

C 10,421 1,010 Updated Jul 30, 2024

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Python 9,082 507 Updated Sep 7, 2024

Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI

TypeScript 13,347 1,264 Updated Sep 5, 2024

A curated list of automated machine learning papers, articles, tutorials, slides and projects

3,997 696 Updated Jun 11, 2024

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

Rust 8,433 414 Updated Sep 16, 2024

A book about compiling Racket and Python to x86-64 assembly

TeX 1,280 140 Updated Aug 27, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,928 514 Updated Aug 18, 2024

Language definitions and styles for listings in LaTeX.

TeX 64 6 Updated Oct 27, 2020

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 5,950 658 Updated Sep 11, 2024

Fourier ACelerator Compiler Framework. Efforts have been taken to blind code for submission.

C 6 Updated Mar 14, 2022

PyTorch Extension Library of Optimized Scatter Operations

Python 1,540 179 Updated Aug 15, 2024

Csmith, a random generator of C programs

C++ 1,005 144 Updated Jan 26, 2024

The cling C++ interpreter

C++ 3,460 267 Updated Sep 12, 2024

NPBench - A Benchmarking Suite for High-Performance NumPy

Python 73 25 Updated Jun 13, 2024

The financial transactions database designed for mission critical safety and performance.

Zig 9,671 481 Updated Sep 17, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,911 500 Updated Sep 16, 2024

PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations

Python 992 146 Updated Aug 15, 2024
Python 9 Updated Sep 2, 2023

Implementation of IR2Vec, published in ACM TACO

LLVM 79 37 Updated Sep 9, 2024

Strategies for Pre-training Graph Neural Networks

Python 956 161 Updated Jul 29, 2023
Next