Zhanghao Wu Michaelvll

Ph.D. student @ UC Berkeley Sky Computing Lab; Previously, RA @ MIT HAN Lab; Undergrad @ SJTU ACM Honors Class

768 followers · 175 following

Sky Computing Lab, UC Berkeley
Berkeley, CA

Stars

dstackai / dstack

dstack is an easy-to-use and flexible container orchestrator for running AI workloads in any cloud or data center.

Python 1,203 87 Updated Jul 5, 2024

lm-sys / RouteLLM

A framework for serving and evaluating LLM routers.

Python 610 43 Updated Jul 4, 2024

romilbhardwaj / romilphdthesis

My PhD thesis on resource efficient machine learning

TeX 2 Updated Dec 29, 2023

codeflash-ai / r2e

Forked from r2e-project/r2e

Python 1 Updated Jun 17, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 64,101 7,455 Updated Jul 2, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 21,408 2,326 Updated Jul 4, 2024

llm-on-gke / skypilot-gke

Python 2 Updated Jun 18, 2024

r2e-project / r2e

Python 66 9 Updated Jun 29, 2024

skypilot-org / spot-traces

Releasing the spot availability traces used in "Can't Be Late" paper.

14 Updated Mar 31, 2024

mit-han-lab / patch_conv

Patch convolution to avoid large GPU memory usage of Conv2D

Python 70 4 Updated May 26, 2024

mit-han-lab / distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 490 12 Updated Jun 28, 2024

QwenLM / Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 5,924 334 Updated Jul 4, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,193 390 Updated Jul 5, 2024

roboflow / inference

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

Python 1,124 84 Updated Jul 5, 2024

predibase / lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 1,869 126 Updated Jul 3, 2024

runpod / runpod-python

🐍 | Python library for RunPod API and serverless worker SDK.

Python 164 55 Updated Jun 10, 2024

ivy-llc / ivy

The Unified ML Representation

Python 14,024 5,820 Updated Jul 5, 2024

TabbyML / tabby

Self-hosted AI coding assistant

Rust 18,304 771 Updated Jul 5, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 11,883 1,054 Updated Jul 5, 2024

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,083 150 Updated Jun 12, 2024

zhoujt1994 / scHiCluster

Python 52 16 Updated Apr 12, 2024

DachengLi1 / LongChat

Official repository for LongChat and LongEval

Python 499 29 Updated May 24, 2024

run-house / runhouse

Like PyTorch for building ML systems. Iterable, debuggable, multi-cloud, 100% reproducible across research and production.

Python 942 37 Updated Jul 5, 2024

gorilla-llm / gorilla-cli

LLMs for your CLI

Python 1,215 73 Updated May 29, 2024

CoLearn-Dev / colink-playbook-dev

Rust 2 Updated Jun 28, 2023

CoLearn-Dev / colink-protocol-inventory

1 1 Updated Mar 17, 2024

CoLearn-Dev / colink-server-dev

Rust 5 3 Updated Jun 27, 2023

CoLearn-Dev / colink-sdk-rust-dev

Rust 2 3 Updated Jun 27, 2023

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 22,253 3,140 Updated Jul 5, 2024

skypilot-org / skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

Python 6,197 426 Updated Jul 5, 2024
