Stars
Efficient Triton Kernels for LLM Training
A modern runtime for JavaScript and TypeScript.
📊 A minimalist, self-hosted WakaTime-compatible backend for coding statistics
Free, source-available, fair-code-licensed workflow automation tool. Easily automate tasks across different services.
Train transformer language models with reinforcement learning.
Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
A simple screen-parsing tool for pure-vision-based GUI agents
Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models (see the first sketch after this list).
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solutions team.
A self-paced course to learn Rust, one exercise at a time.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal AI, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
FlashInfer: Kernel Library for LLM Serving
Universal LLM Deployment Engine with ML Compilation
Running large language models on a single GPU for throughput-oriented scenarios.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
The GitButler version control client, backed by Git, powered by Tauri/Rust/Svelte
A curated list of resources on efficient Large Language Models
SGLang is a fast serving framework for large language models and vision language models.
Make your first pull request during Hacktoberfest 2024. Don't forget to spread the love, and if you like the project, give us a ⭐️
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
A high-throughput and memory-efficient inference and serving engine for LLMs (see the second sketch after this list).
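
A minimal sketch of querying Ollama from its official Python client. It assumes the Ollama server is running locally and that the `llama3.2` model has already been pulled; the prompt text is purely illustrative:

```python
# Minimal Ollama chat example using the official `ollama` Python client
# (pip install ollama). Assumes `ollama serve` is running locally and
# `ollama pull llama3.2` has been executed beforehand.
import ollama

response = ollama.chat(
    model="llama3.2",
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
)
print(response["message"]["content"])
```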
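And a minimal offline-inference sketch using vLLM's Python API; the model ID and sampling settings below are illustrative placeholders, not recommendations:

```python
# Minimal vLLM offline-inference example. The model ID and sampling
# parameters are placeholders; swap in any model vLLM supports.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")
sampling = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# generate() batches prompts and returns one RequestOutput per prompt.
outputs = llm.generate(["Hello, my name is"], sampling)
for out in outputs:
    print(out.outputs[0].text)
```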