UranusSeven

🎯

Focusing

Uranus UranusSeven

🎯

Focusing

41 followers · 8 following

https://www.zhihu.com/people/840445

Achievements

x2 x3 x3

BetaSend feedback

Achievements

x2 x3 x3

BetaSend feedback

Block or Report

Block or report UranusSeven

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (9)

Sort

🚀 My stack

Stable Diffusion⭐️

2 repositories

Tools🔨

7 repositories

Training

1 repository

Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

tlc-pack / libflash_attn

Standalone Flash Attention v2 kernel without libtorch dependency

C++ 76 12 Updated May 21, 2024

bytedance / flux

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ 50 3 Updated Jun 14, 2024

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

1,808 128 Updated Jun 24, 2024

Paitesanshi / LLM-Agent-Survey

2,363 137 Updated May 5, 2024

multimodal-art-projection / MAP-NEO

Python 726 68 Updated Jun 21, 2024

KnowingNothing / MatmulTutorial

A Easy-to-understand TensorOp Matmul Tutorial

C++ 209 20 Updated Jun 15, 2024

jy-yuan / KIVI

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Python 157 14 Updated Jun 16, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 22,445 2,331 Updated Jun 25, 2024

lucidrains / ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 396 22 Updated Apr 20, 2024

microsoft / msccl

Microsoft Collective Communication Library

C++ 266 26 Updated Sep 20, 2023

kohya-ss / sd-scripts

Python 4,416 749 Updated Jun 25, 2024

bmaltais / kohya_ss

Python 8,719 1,128 Updated Jun 24, 2024

outlines-dev / outlines

Structured Text Generation

Python 6,829 348 Updated Jun 25, 2024

trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 9,812 2,842 Updated Jun 25, 2024

prestodb / presto

The official home of the Presto distributed SQL query engine for big data

Java 15,720 5,279 Updated Jun 25, 2024

hao-ai-lab / LookaheadDecoding

Python 1,024 62 Updated Feb 14, 2024

feifeibear / long-context-attention

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Python 188 7 Updated Jun 25, 2024

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 398 28 Updated May 20, 2024

shawntan / scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Python 146 9 Updated Jun 15, 2024

hao-ai-lab / Consistency_LLM

[ICML 2024] CLLMs: Consistency Large Language Models

Python 310 14 Updated Jun 24, 2024

horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python 924 70 Updated Jun 24, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,369 170 Updated Jun 6, 2024

alipay / PainlessInferenceAcceleration

Python 264 16 Updated Mar 6, 2024

lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision…

TypeScript 33,941 7,933 Updated Jun 25, 2024

ChatGPTNextWeb / ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

TypeScript 72,196 57,431 Updated Jun 25, 2024

TabbyML / tabby

Self-hosted AI coding assistant

Rust 18,203 762 Updated Jun 25, 2024

danny-avila / LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3,…

TypeScript 14,671 2,456 Updated Jun 25, 2024

ventoy / Ventoy

A new bootable USB solution.

C 60,090 3,929 Updated Jun 23, 2024

Portkey-AI / gateway

A Blazing Fast AI Gateway. Route to 200+ LLMs with 1 fast & friendly API.

Jupyter Notebook 5,019 347 Updated Jun 20, 2024

mckaywrigley / chatbot-ui

AI chat for every model.

TypeScript 27,185 7,531 Updated Jun 24, 2024

Uranus UranusSeven

Block or report UranusSeven

Lists (9)

Big data💿

Gen AI Applications

GGML

HPC💻

Local GPT🤖

🚀 My stack

Stable Diffusion⭐️

Tools🔨

Training

Starred repositories

Deep learning

C++

python3

Database