Skip to content
View qmdnls's full-sized avatar
👋
👋
Block or Report

Block or report qmdnls

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

llama3.np is a pure NumPy implementation for Llama 3 model.

Python 940 72 Updated Jun 2, 2024

🐚 OpenDevin: Code Less, Make More

Python 29,233 3,380 Updated Jul 31, 2024

🔧 My configuration files.

Vim Script 50 4 Updated Jul 31, 2023

Website for hosting the Open Foundation Models Cheat Sheet.

JavaScript 252 18 Updated Jun 26, 2024

speedrun implementation of dl papers throughout history

Python 27 Updated Mar 19, 2024

Zero Bubble Pipeline Parallelism

Python 234 11 Updated Jul 30, 2024

Material for cuda-mode lectures

Jupyter Notebook 2,003 196 Updated Jun 13, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 423 24 Updated Jul 12, 2024

Ring attention implementation with flash attention

Python 459 30 Updated May 20, 2024

Transformers with Arbitrarily Large Context

Python 587 43 Updated Jul 13, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,931 3,442 Updated Jul 31, 2024

A tiling window manager for macOS based on binary space partitioning

C 22,598 632 Updated Jul 9, 2024

Official release of InternLM2.5 7B base and chat models. 1M context support

Python 5,906 427 Updated Jul 23, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,357 143 Updated Jul 19, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,816 115 Updated Jul 22, 2024

Go ahead and axolotl questions

Python 7,104 775 Updated Jul 31, 2024

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 873 84 Updated Jun 12, 2024

Development repository for the Triton language and compiler

C++ 12,127 1,450 Updated Jul 31, 2024

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 934 43 Updated Jan 16, 2024

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 175 15 Updated Apr 24, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,476 178 Updated Mar 8, 2024

Why Do We Need Weight Decay in Modern Deep Learning? [arXiv, Oct 2023]

Python 36 Updated Oct 9, 2023
Jupyter Notebook 128 6 Updated Jun 2, 2023

Reformer, the efficient Transformer, in Pytorch

Python 2,082 255 Updated Jun 21, 2023
Python 1,137 160 Updated Jul 30, 2024

DeepSeek LLM: Let there be answers

Makefile 1,349 88 Updated Feb 4, 2024

Mamba SSM architecture

Python 11,968 1,002 Updated Jul 30, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,934 3,671 Updated Jul 28, 2024
Next