-
Sky Computing Lab, UC Berkeley
- Berkeley, CA
Stars
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
High-performance In-browser LLM Inference Engine
LlamaIndex is a data framework for your LLM applications
Efficient vision foundation models for high-resolution generation and perception.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
The data processing pipeline for the Koala chatbot language model
Code and documentation to train Stanford's Alpaca models, and generate the data.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Kubernetes Tutorial for the PS2 group meetings at UC Berkeley
A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum compu…
Examples and instructions about use LLMs (especially ChatGPT) for PhD
🦜🔗 Build context-aware reasoning applications
Running large language models on a single GPU for throughput-oriented scenarios.
A user-space file system for interacting with Google Cloud Storage
Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
[NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
A service for launching containerized processes on cloud infrastructure.
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Tutorial to get started with SkyPilot!
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…
Productive, portable, and performant GPU programming in Python.
The missing star history graph of GitHub repos - https://star-history.com