Stars
Efficient Triton Kernels for LLM Training
Helpful tools and examples for working with flex-attention
The official implementation of Self-Play Preference Optimization (SPPO)
Schedule-Free Optimization in PyTorch
[ICML 2024] LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery
Code for the NeurIPS paper: arxiv.org/abs/2302.08224
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
Simple and efficient PyTorch-native transformer training and inference (batched)
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
HazyResearch / nanoGPT-TK
Forked from karpathy/nanoGPT. The simplest, fastest repository for training/finetuning medium-sized GPTs. Now, with kittens!
A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Linear algebra foundation for the Rust programming language
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
CoreNet: A library for training deep neural networks
Mesh TensorFlow: Model Parallelism Made Easier
Arena-Hard-Auto: An automatic LLM benchmark.
A JAX research toolkit for building, editing, and visualizing neural networks.
Turn JIT-compiled JAX functions back into Python source code
A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.
Open weights language model from Google DeepMind, based on Griffin.