Skip to content
View Erland366's full-sized avatar

Highlights

  • Pro

Organizations

@capstone-bangkit-c22-ky01
Block or Report

Block or report Erland366

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.

C++ 38,209 1,672 Updated Jul 1, 2024

SOTA Weight-only Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

Python 121 17 Updated Jul 1, 2024

Named Tensors for Legible Deep Learning in JAX

Python 137 9 Updated Jun 27, 2024

QUIC and HTTP/3 implementation in Python

Python 1,598 229 Updated Jul 1, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,127 487 Updated Jul 1, 2024

Animation engine for explanatory math videos

Python 60,183 5,684 Updated Jun 24, 2024

LLM training in simple, raw C/HIP for AMD GPUs

Cuda 17 1 Updated Jun 26, 2024

An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.

Python 86 7 Updated Jul 1, 2024

📚 Download the full collection of Paul Graham essays in EPUB, PDF & Markdown for easy reading.

Python 705 44 Updated Jun 19, 2024

The implementation for paper "Token-wise Influential Training Data Retrieval for Large Language Models" (Accepted on ACL 2024).

Python 5 Updated Jun 11, 2024

TORAX: Tokamak transport simulation in JAX

Python 292 23 Updated Jul 1, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,400 161 Updated Jun 30, 2024

Experimental q[X]ora kernel development code

Jupyter Notebook 3 Updated Jun 13, 2024

Scalable toolkit for efficient model alignment

Python 413 44 Updated Jul 1, 2024

Fast lexical search library implementing BM25 in Python using Scipy (on average 2x faster than Elasticsearch in single-threaded setting)

Python 535 15 Updated Jun 28, 2024

Implementation for MatMul-free LM.

Python 2,538 142 Updated Jun 27, 2024

Code for simulating vLLM

Python 2 Updated Apr 29, 2024

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

Python 282 15 Updated Jun 25, 2024

Proof-of-concept of global switching between numpy/jax/pytorch in a library.

Python 15 Updated Jun 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 2 Updated Jul 1, 2024

Latent Large Language Models

C++ 5 Updated Jul 1, 2024
Jupyter Notebook 35 1 Updated Jan 18, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 1,862 28 Updated Jun 6, 2024

Display and control your Android device

C 105,146 10,213 Updated Jul 1, 2024

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 7,669 280 Updated Jun 28, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 638 34 Updated Jun 27, 2024

Collective communications library with various primitives for multi-machine training.

C++ 1,159 293 Updated Jun 26, 2024

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Python 37 5 Updated Jun 14, 2024
Next