haileyschoelkopf
Python 3 Updated Jun 2, 2024
Jupyter Notebook 125 6 Updated Jun 2, 2023
6 1 Updated Jun 28, 2024

FlagGems is an operator library for large language models implemented in Triton Language.

Python 174 10 Updated Jul 19, 2024
Jupyter Notebook 27 2 Updated Jul 2, 2024

An ML Systems Onboarding list

142 5 Updated Jul 19, 2024
Python 8 Updated Jul 18, 2024

seqax = sequence modeling + JAX

Python 125 9 Updated Jul 17, 2024

Multimodal language model benchmark, featuring challenging examples

Python 139 6 Updated May 14, 2024

A native PyTorch Library for large model training

Python 1,358 117 Updated Jul 19, 2024

PyTorch half-precision GEMM library with fused optional bias and optional ReLU/GELU

Cuda 19 Updated Apr 16, 2024

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 32 29 Updated Jul 16, 2024

Fast modular code to create and train cutting edge LLMs

Python 61 9 Updated May 16, 2024

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 802 66 Updated Jul 12, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 12,010 1,203 Updated Jul 19, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 422 18 Updated Jul 17, 2024

Simple and efficient PyTorch-native transformer training and inference (batched)

Python 45 2 Updated Apr 2, 2024

Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature

Python 78 5 Updated Jul 17, 2024

Accelerated First Order Parallel Associative Scan

Python 115 7 Updated Jun 21, 2024

A toolkit for scaling law research ⚖

Python 39 3 Updated Mar 16, 2024

Scalable and robust tree-based speculative decoding algorithm

Python 282 29 Updated Jun 7, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,628 297 Updated Jul 19, 2024

Language models scale reliably with over-training and on downstream tasks

Jupyter Notebook 87 4 Updated Apr 2, 2024

A PyTorch Native LLM Training Framework

Python 503 19 Updated May 31, 2024

Experiment of using Tangent to autodiff triton

Python 66 1 Updated Jan 22, 2024

Triton-based implementation of Sparse Mixture of Experts.

Python 151 10 Updated Jul 18, 2024

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 761 41 Updated Jul 16, 2024

Ring attention implementation with flash attention

Python 448 30 Updated May 20, 2024

Mamba SSM architecture

Python 11,779 969 Updated Jul 18, 2024