jturner116

Starred repositories

Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Python 185 5 Updated Oct 11, 2024

Entropy Based Sampling and Parallel CoT Decoding

TypeScript 2,673 275 Updated Oct 16, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 6,857 623 Updated Oct 18, 2024

A golang-based data loader which can be used from Python. Focused on a VectorDB stack at the moment, fetching and processing data per sample at GB/s speeds.

Go 72 Updated Oct 12, 2024

extensible collectives library in triton

Python 56 2 Updated Sep 23, 2024

Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini, "Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations". Long Paper @ ACM SIG…

Rust 39 1 Updated Oct 18, 2024

Late Interaction Models Training & Retrieval

Python 156 7 Updated Oct 18, 2024

Materials for the Ultimate Hybrid Search Workshop

Jupyter Notebook 13 Updated Jul 19, 2024

A Rust HTTP server for Python applications

Rust 2,744 81 Updated Oct 15, 2024

Intro to leetcodes. Basic techniques, quicksort and hash structures implementation, space and time complexities.

Python 95 1 Updated Jul 26, 2024

MLX: An array framework for Apple silicon

C++ 16,812 968 Updated Oct 18, 2024

Einsum-like high-level array sharding API for JAX

Python 32 2 Updated Jul 16, 2024

Sparse autoencoders

Python 319 44 Updated Oct 10, 2024

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 823 31 Updated Oct 18, 2024

SPLADE: sparse neural search (SIGIR21, SIGIR22)

Python 758 83 Updated May 3, 2024

Rapidly build AI apps in Python

Python 5,402 259 Updated Oct 19, 2024

NLP with Rust for Python 🦀🐍

Rust 59 1 Updated Jun 2, 2024

Neural Search

Python 341 16 Updated Jun 6, 2024

A self-paced course to learn Rust, one exercise at a time.

Rust 6 Updated May 16, 2024

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 474 11 Updated Sep 16, 2024

A modern model graph visualizer and debugger

JavaScript 1,021 77 Updated Oct 17, 2024

3D Gaussian Splatting in JAX

Cuda 53 3 Updated May 30, 2024

FastKAN: Very Fast Implementation of Kolmogorov-Arnold Networks (KAN)

Jupyter Notebook 353 46 Updated Jun 20, 2024
Python 62 3 Updated Oct 18, 2024

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).

Python 3,988 353 Updated Aug 1, 2024

Tile primitives for speedy kernels

Cuda 1,540 60 Updated Oct 19, 2024

A multi-level tensor algebra superoptimizer

C++ 537 28 Updated Oct 19, 2024

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Python 1,065 100 Updated Mar 10, 2024
Python 216 19 Updated Jul 11, 2024

seqax = sequence modeling + JAX

Python 131 9 Updated Jul 17, 2024