A lightweight suffix-sorting library

C 361 81 Updated Mar 25, 2020

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 2,471 377 Updated Jul 29, 2024

📜 Extract meaningful content from the chaos of a web page

JavaScript 5,360 438 Updated Jul 10, 2024

JAX-Toolbox

Jupyter Notebook 211 36 Updated Jul 29, 2024

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 171 22 Updated Jul 29, 2024

High-Performance Symbolic Regression in Python and Julia

Python 2,124 200 Updated Jul 29, 2024

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 5,828 495 Updated Jul 29, 2024

Game Boy emulator written in Python

Python 4,542 470 Updated Jul 21, 2024

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

Python 1,100 144 Updated Jul 29, 2024

Transform Python source code into its most compact representation

Python 546 39 Updated Jan 14, 2024

Auto configurations for Language Server for vim-lsp

Vim Script 1,271 229 Updated Jul 25, 2024

Textual Inputs is a collection of input widgets for the Textual TUI framework 🔡

Python 93 11 Updated Mar 18, 2023

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,393 491 Updated Jul 13, 2024

Weighted MinHash implementation on CUDA (multi-gpu).

C++ 112 24 Updated Nov 29, 2023

LLM verified with Monte Carlo Tree Search

Jupyter Notebook 217 26 Updated Jul 2, 2024

Uniform Manifold Approximation and Projection

Python 7,253 789 Updated Jul 25, 2024

Playing Pokemon Red with Reinforcement Learning

Jupyter Notebook 6,775 611 Updated Jul 9, 2024

A Python framework for high performance GPU simulation and graphics

Python 3,935 214 Updated Jul 29, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 5,005 854 Updated Jul 29, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,687 271 Updated Jul 29, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,842 3,423 Updated Jul 29, 2024

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 22,226 3,227 Updated Jul 26, 2024

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Python 695 47 Updated Jun 28, 2024

Harness the power of ChatGPT inside the GDB or LLDB debugger!

Python 902 31 Updated Oct 9, 2023

Little article showing how to load pytorch's models with linear memory consumption

34 1 Updated Aug 29, 2022

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 10,354 2,091 Updated Jul 18, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,326 768 Updated Jul 10, 2024

Port of OpenAI's Whisper model in C/C++

C++ 33,398 3,377 Updated Jul 27, 2024
Python 488 39 Updated Feb 5, 2024

Neural baselines for finding and fixing single token bugs in Python

Python 8 2 Updated Nov 23, 2023