Skip to content
View Yoh-Z's full-sized avatar
Block or Report

Block or report Yoh-Z

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Understand Human Behavior to Align True Needs

Python 2,778 222 Updated Jul 16, 2024

The official Meta Llama 3 GitHub site

Python 23,321 2,500 Updated Jul 17, 2024

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Python 1,151 96 Updated Jul 18, 2024

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python 2,862 186 Updated Jul 18, 2024

Universal LLM Deployment Engine with ML Compilation

Python 17,814 1,422 Updated Jul 18, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,393 303 Updated Jul 18, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 814 74 Updated Jul 18, 2024

Making large AI models cheaper, faster and more accessible

Python 38,371 4,310 Updated Jul 18, 2024

YOLOv10: Real-Time End-to-End Object Detection

Python 8,359 723 Updated Jul 18, 2024

Efficient inference of large language models.

C++ 137 7 Updated Jul 15, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,170 753 Updated Jul 10, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,510 819 Updated Jul 18, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,480 70 Updated Jul 6, 2024

The Prometheus monitoring system and time series database.

Go 53,890 8,926 Updated Jul 18, 2024

ComfyUI's ControlNet Auxiliary Preprocessors

Python 1,706 163 Updated Jul 8, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 1,889 29 Updated Jun 6, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,486 599 Updated May 20, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,130 3,283 Updated Jul 18, 2024

This repository contains the experimental PyTorch native float8 training UX

Python 194 18 Updated Jul 18, 2024

Parallel computing with task scheduling

Python 12,259 1,690 Updated Jul 18, 2024

Transparent Image Layer Diffusion using Latent Transparency

1,898 21 Updated Jun 16, 2024

Large Language Model Text Generation Inference

Python 8,411 957 Updated Jul 18, 2024

Detect CPU features with single-file

C 268 37 Updated Jul 18, 2024

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

Python 701 32 Updated Jun 27, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 4,620 303 Updated Jun 28, 2024

experimental

Python 213 13 Updated Nov 10, 2023

A model compilation solution for various hardware

MLIR 332 38 Updated Jul 18, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,049 568 Updated Jul 14, 2024

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Python 42,517 4,494 Updated Jul 17, 2024
Next