Skip to content
View Edward-Sun's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report Edward-Sun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 2,615 115 Updated Sep 2, 2024

Helpful tools and examples for working with flex-attention

Python 302 12 Updated Aug 17, 2024

The official implementation of Self-Play Preference Optimization (SPPO)

Python 438 59 Updated Aug 4, 2024
Python 448 54 Updated Jul 22, 2024

Schedule-Free Optimization in PyTorch

Python 1,776 62 Updated Aug 13, 2024

[ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Python 41 4 Updated May 31, 2024

🙌 OpenHands: Code Less, Make More

Python 30,910 3,562 Updated Sep 2, 2024

Code of NeurIPS paper: arxiv.org/abs/2302.08224

Python 147 34 Updated Jul 16, 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Python 71 8 Updated Jun 24, 2024

Simple and efficient pytorch-native transformer training and inference (batched)

Python 51 3 Updated Apr 2, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

1,044 72 Updated Sep 2, 2024
Python 1,476 129 Updated Aug 6, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!

Makefile 45 2 Updated Aug 4, 2024

Tile primitives for speedy kernels

Cuda 1,465 55 Updated Aug 31, 2024
Python 73 5 Updated Aug 17, 2024

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 107 5 Updated Aug 17, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,454 3,197 Updated Jul 23, 2024

Linear algebra foundation for the Rust programming language

Rust 1,785 57 Updated Aug 24, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 7,970 1,437 Updated Sep 1, 2024

CoreNet: A library for training deep neural networks

Python 6,906 536 Updated May 28, 2024

Mesh TensorFlow: Model Parallelism Made Easier

Python 1,578 254 Updated Nov 17, 2023

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 395 48 Updated Sep 1, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,621 50 Updated Aug 12, 2024

Turn jitted jax functions back into python source code

Python 20 Updated Jul 11, 2024

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 446 19 Updated Aug 30, 2024

The official Meta Llama 3 GitHub site

Python 25,856 2,884 Updated Aug 12, 2024

LLM training in simple, raw C/CUDA

Cuda 23,042 2,566 Updated Aug 26, 2024

Open weights language model from Google DeepMind, based on Griffin.

Python 587 23 Updated Jul 9, 2024
Next