simonJJJ

Organizations

@hustvl @OFA-Sys @QwenLM

1184 results for source starred repositories

Efficient Triton Kernels for LLM Training

Python 2,668 117 Updated Sep 3, 2024

CUDA accelerated rasterization of gaussian splatting

Python 1,632 206 Updated Aug 30, 2024

Official inference repo for FLUX.1 models

Python 12,656 876 Updated Aug 29, 2024
Python 96 16 Updated Aug 27, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,072 189 Updated Sep 3, 2024

OpenBot leverages smartphones as brains for low-cost robots. We have designed a small electric vehicle that costs about $50 and serves as a robot body. Our software stack for Android smartphones su…

Swift 2,814 526 Updated Jul 15, 2024

Machine Learning Engineering Open Book

Python 10,614 641 Updated Sep 2, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,297 797 Updated Aug 21, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 682 31 Updated Aug 20, 2024

Dynamic Memory Management for Serving LLMs without PagedAttention

C 178 10 Updated Aug 3, 2024

Fast Multimodal LLM on Mobile Devices

C++ 371 44 Updated Sep 3, 2024

A small C compiler

C 9,460 858 Updated Oct 30, 2023

Extremely fast non-cryptographic hash algorithm

C 8,920 770 Updated Sep 2, 2024

Flash attention tutorial written in Python, Triton, CUDA, and CUTLASS

Cuda 147 11 Updated Jun 18, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 18,919 551 Updated Sep 3, 2024
Python 92 4 Updated Jul 8, 2024

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ 170 12 Updated Jul 25, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 1,736 107 Updated Jul 29, 2024

Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation

Python 625 35 Updated Aug 5, 2024

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Python 192 3 Updated Nov 14, 2023

Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.

TypeScript 67,796 3,992 Updated Sep 3, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 16,311 1,259 Updated Sep 1, 2024

a Hassle-Free Python Experience

Rust 13,353 457 Updated Sep 3, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,547 1,495 Updated Sep 2, 2024

Parallel Chinese-English translation of "Software Engineering at Google"

HTML 4,069 510 Updated Sep 2, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,426 148 Updated Aug 17, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,172 46 Updated Aug 15, 2024

A generative speech model for daily dialogue.

Python 30,273 3,279 Updated Sep 3, 2024

A native PyTorch Library for large model training

Python 1,503 137 Updated Sep 2, 2024