Highlights
- Pro
Stars
Record "perf" performance metrics for individual functions/regions of an ELF binary.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Development repository for the Triton language and compiler
Distribute and run LLMs with a single file.
Machine-readable data describing Arm architecture and implementations. Includes JSON descriptions of implemented PMU events.
Userspace eBPF runtime for Observability, Network & General Extensions Framework
Makes ARM NEON documentation accessible (with examples)
DAMOV is a benchmark suite and a methodical framework targeting the study of data movement bottlenecks in modern applications. It is intended to study new architectures, such as near-data processin…
🔊 Text-Prompted Generative Audio Model
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Instruct-tune LLaMA on consumer hardware
A collection of libraries to optimise AI model performances
proxychains - a tool that forces any TCP connection made by any given application to follow through proxy like TOR or any other SOCKS4, SOCKS5 or HTTP(S) proxy. Supported auth-types: "user/pass" fo…
Reference implementations of MLPerf™ inference benchmarks
Python package built to ease deep learning on graph, on top of existing DL frameworks.
A playbook for systematically maximizing the performance of deep learning models.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
❤️A clean, elegant but advanced blog theme for Hugo 一个简洁、优雅且高效的 Hugo 主题