Skip to content
View crazy-dreamer's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report crazy-dreamer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 72 14 Updated Jul 31, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 9,945 1,428 Updated Jul 28, 2024

Sakana widget for Web. | 网页小组件版本的石蒜模拟器。

TypeScript 1,046 63 Updated Sep 26, 2023

AthenaOS is a next generation AI-native operating system managed by Swarms of AI Agents

Rust 16 1 Updated Jul 18, 2023

Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.

Python 24 1 Updated Apr 25, 2023

Bagua Speeds up PyTorch

Python 873 84 Updated Aug 1, 2024

[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 2…

C 768 129 Updated Jul 8, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

956 20 Updated Jul 31, 2024

Sequence Parallel Attention for Long Context LLM Model Training and Inference

Python 246 9 Updated Jun 27, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,087 5,459 Updated Jun 11, 2024

Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wikitext-103 on a single A100 in <100 seconds. Scales to large…

Python 262 24 Updated Jul 29, 2024

Implementation for MatMul-free LM.

Python 2,788 169 Updated Jun 27, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,321 64 Updated Mar 8, 2024

NCCL Tests

Cuda 776 226 Updated Jul 30, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 29,967 6,342 Updated Jul 26, 2024

awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

103 7 Updated Aug 1, 2024

Official Implementation of EAGLE-1 and EAGLE-2

Python 692 69 Updated Jul 30, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

295 12 Updated Aug 5, 2024

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 474 42 Updated Aug 7, 2024

Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个

830 31 Updated Jul 31, 2024

SGLang is yet another fast serving framework for large language models and vision language models.

Python 3,954 244 Updated Aug 7, 2024

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 260 22 Updated May 4, 2024

A "large" language model running on a microcontroller

C++ 466 33 Updated Dec 9, 2023

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,141 402 Updated Apr 24, 2023

The best way to write secure and reliable applications. Write nothing; deploy nowhere.

Dockerfile 60,171 4,719 Updated Aug 7, 2024

vendor independent TinyML deep learning library, compiler and inference framework microcomputers and micro-controllers

C++ 546 86 Updated Oct 29, 2022

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Python 116 7 Updated Jun 20, 2024

Bamboo-7B Large Language Model

88 1 Updated Mar 28, 2024
Python 24 9 Updated Oct 2, 2023

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Python 2,484 291 Updated Jun 4, 2024
Next