Skip to content
View yongwww's full-sized avatar
🐢
working
🐢
working
  • OctoAI
  • Seattle, WA
  • 13:49 (UTC -07:00)

Highlights

  • Pro

Organizations

@apache @octoml

Block or report yongwww

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Empowering everyone to build reliable and efficient software.

Rust 96,307 12,455 Updated Aug 26, 2024

The official Python library for the OpenAI API

Python 21,775 2,985 Updated Aug 20, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,157 141 Updated Jun 25, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,463 3,684 Updated Aug 26, 2024

TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization and sparsity. It compresses deep learning models for downstream deployment frame…

Python 400 23 Updated Aug 5, 2024

Generative Models by Stability AI

Python 23,890 2,652 Updated Aug 21, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,364 1,479 Updated Feb 29, 2024

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 11,557 3,424 Updated Aug 26, 2024

Serving multiple LoRA finetuned LLM as one

Python 932 41 Updated May 8, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,489 1,487 Updated Aug 26, 2024

High-performance In-browser LLM Inference Engine

TypeScript 12,135 766 Updated Aug 23, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,362 5,493 Updated Jun 24, 2024

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook 3,536 224 Updated Mar 12, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 131,151 26,071 Updated Aug 26, 2024

Development repository for the Triton language and compiler

C++ 12,349 1,495 Updated Aug 26, 2024

A home for the final text of all TVM RFCs.

98 79 Updated May 31, 2024
Python 193 58 Updated Mar 28, 2023

Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.

Python 5 Updated Mar 6, 2023
C++ 141 20 Updated Sep 13, 2023

Object detection, 3D detection, and pose estimation using center point detection:

Python 7,232 1,924 Updated Mar 2, 2023

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 2,544 567 Updated Aug 26, 2024

A performant and modular runtime for TensorFlow

C++ 752 122 Updated Aug 16, 2024

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Python 1,550 276 Updated Oct 31, 2019

Serve, optimize and scale PyTorch models in production

Java 4,125 833 Updated Aug 24, 2024

YoloV3 Implemented in Tensorflow 2.0

Jupyter Notebook 2,510 908 Updated Jul 30, 2024

[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

C++ 1,415 284 Updated Jul 25, 2024

a language for fast, portable data-parallel computation

C++ 5,821 1,065 Updated Aug 23, 2024

A Python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits.

Python 4,222 1,007 Updated Aug 23, 2024

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,020 4,125 Updated Aug 21, 2024
Next