Skip to content
View jishminor's full-sized avatar
  • Arm
  • Austin, TX

Highlights

  • Pro

Organizations

@smarter-project

Block or report jishminor

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Record "perf" performance metrics for individual functions/regions of an ELF binary.

Go 69 5 Updated Jan 17, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 7,457 757 Updated Oct 10, 2024

Development repository for the Triton language and compiler

C++ 12,978 1,584 Updated Oct 10, 2024

Distribute and run LLMs with a single file.

C++ 19,579 988 Updated Oct 9, 2024

Tensor library for machine learning

C++ 10,980 1,010 Updated Oct 9, 2024
Python 14 1 Updated Apr 21, 2024

Machine-readable data describing Arm architecture and implementations. Includes JSON descriptions of implemented PMU events.

40 10 Updated Apr 4, 2024

CUDA on non-NVIDIA GPUs

Rust 9,269 618 Updated Oct 10, 2024

Userspace eBPF runtime for Observability, Network & General Extensions Framework

C++ 792 74 Updated Oct 1, 2024

Makes ARM NEON documentation accessible (with examples)

382 66 Updated Apr 13, 2024

DAMOV is a benchmark suite and a methodical framework targeting the study of data movement bottlenecks in modern applications. It is intended to study new architectures, such as near-data processin…

C++ 75 17 Updated Jul 27, 2023

A Benchmark Tool for VectorDB

Python 524 134 Updated Oct 8, 2024

Inference code for Llama models

Python 55,944 9,518 Updated Aug 18, 2024

LLM inference in C/C++

C++ 66,106 9,495 Updated Oct 10, 2024

Minimalist ML framework for Rust

Rust 15,478 916 Updated Oct 10, 2024

STREAM benchmark

C 331 133 Updated Apr 12, 2024

Simple benchmark for memory throughput and latency

C 350 96 Updated Jul 4, 2023
C 106 48 Updated Oct 10, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,637 4,191 Updated Aug 19, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,341 3,195 Updated Aug 12, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,582 2,214 Updated Jul 29, 2024

A collection of libraries to optimise AI model performances

Python 8,378 642 Updated Jul 22, 2024

proxychains - a tool that forces any TCP connection made by any given application to follow through proxy like TOR or any other SOCKS4, SOCKS5 or HTTP(S) proxy. Supported auth-types: "user/pass" fo…

C 6,579 618 Updated Jun 8, 2024

Compiler for Neural Network hardware accelerators

C++ 3,216 689 Updated May 11, 2024

Reference implementations of MLPerf™ inference benchmarks

Python 1,206 527 Updated Oct 9, 2024

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 13,436 3,005 Updated Oct 10, 2024

A playbook for systematically maximizing the performance of deep learning models.

26,709 2,221 Updated Jun 18, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 82,837 22,328 Updated Oct 10, 2024

❤️A clean, elegant but advanced blog theme for Hugo 一个简洁、优雅且高效的 Hugo 主题

JavaScript 3,385 1,075 Updated Jul 7, 2024
C 121 9 Updated Jul 16, 2024
Next