Erland366's starred repositories
Sorted by: recently starred
🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
SOTA weight-only quantization algorithm for LLMs. The official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".
Named Tensors for Legible Deep Learning in JAX
The official PyTorch implementation of Google's Gemma models
anthonix / llm.c (forked from karpathy/llm.c): LLM training in simple, raw C/HIP for AMD GPUs
An OpenAI-compatible API for chat with image inputs and questions about those images, i.e. multimodal.
📚 Download the full collection of Paul Graham essays in EPUB, PDF & Markdown for easy reading.
The implementation of the paper "Token-wise Influential Training Data Retrieval for Large Language Models" (accepted at ACL 2024).
TORAX: Tokamak transport simulation in JAX
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
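As background for the weight-only quantization entries above, a minimal group-wise round-to-nearest (RTN) 4-bit quantizer can be sketched in a few lines of NumPy. This is a generic illustration of weight-only 4-bit quantization, not the AWQ algorithm itself (AWQ additionally rescales salient weight channels using activation statistics before quantizing), and the function names here are hypothetical:

```python
import numpy as np

def quantize_int4_groupwise(w, group_size=128):
    """Asymmetric round-to-nearest 4-bit quantization with per-group params.

    Generic weight-only RTN sketch for illustration only -- NOT the AWQ
    algorithm, which also applies activation-aware channel scaling.
    """
    groups = w.reshape(-1, group_size)
    w_min = groups.min(axis=1, keepdims=True)
    w_max = groups.max(axis=1, keepdims=True)
    scale = (w_max - w_min) / 15.0  # 16 representable levels: 0..15
    q = np.clip(np.round((groups - w_min) / scale), 0, 15).astype(np.uint8)
    return q, scale, w_min

def dequantize(q, scale, w_min):
    # Reconstruct approximate float weights from codes + per-group params
    return q.astype(np.float32) * scale + w_min

rng = np.random.default_rng(0)
w = rng.standard_normal(1024).astype(np.float32)
q, scale, w_min = quantize_int4_groupwise(w)
err = np.abs(dequantize(q, scale, w_min) - w.reshape(-1, 128)).max()
print(f"max abs reconstruction error: {err:.4f}")
```

The per-group min/max keeps the worst-case rounding error to about half a quantization step per group, which is why smaller group sizes trade memory overhead for accuracy.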
Experimental q[X]ora kernel development code
Scalable toolkit for efficient model alignment
Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in a single-threaded setting)
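For context on the BM25 entry above: the classic Okapi BM25 score is a simple function of term frequency, document frequency, and document length. The sketch below is a plain-Python illustration of that formula, not the API of the library in question:

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Score each document against the query with classic Okapi BM25.

    docs: list of token lists. Returns one float score per document.
    """
    N = len(docs)
    avgdl = sum(len(d) for d in docs) / N
    # document frequency: number of docs containing each term
    df = Counter()
    for d in docs:
        df.update(set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in query_terms:
            if t not in tf:
                continue
            idf = math.log(1 + (N - df[t] + 0.5) / (df[t] + 0.5))
            # term frequency saturation (k1) and length normalization (b)
            s += idf * tf[t] * (k1 + 1) / (
                tf[t] + k1 * (1 - b + b * len(d) / avgdl)
            )
        scores.append(s)
    return scores

docs = [["fast", "lexical", "search"],
        ["slow", "dense", "search"],
        ["fast", "search", "engine"]]
scores = bm25_scores(["fast", "search"], docs)
```

Vectorizing the term-frequency and document-length terms into a sparse matrix (e.g. with Scipy) is what makes library implementations of this formula fast.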
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frameworks.
Proof-of-concept of global switching between numpy/jax/pytorch in a library.
LeiWang1999 / vllm-bitblas (forked from vllm-project/vllm): A high-throughput and memory-efficient inference and serving engine for LLMs
MambaOut: Do We Really Need Mamba for Vision?
GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm
Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"
Collective communications library with various primitives for multi-machine training.
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization