Skip to content
View ranonrkm's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report ranonrkm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 228 20 Updated Aug 1, 2024

Long Range Arena for Benchmarking Efficient Transformers

Python 707 78 Updated Dec 16, 2023

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,439 499 Updated Aug 1, 2024

LLM inference in C/C++

C++ 63,285 9,072 Updated Aug 10, 2024

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Python 574 38 Updated Feb 27, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,523 365 Updated Aug 9, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 302 24 Updated Jun 29, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 9,300 931 Updated Aug 9, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 24,577 2,586 Updated Aug 10, 2024

RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems

Jupyter Notebook 72 6 Updated Apr 12, 2024

Universal and Transferable Attacks on Aligned Language Models

Python 3,176 449 Updated Aug 2, 2024

LlamaIndex is a data framework for your LLM applications

Python 34,322 4,844 Updated Aug 10, 2024

GGNN: State of the Art Graph-based GPU Nearest Neighbor Search

Cuda 141 22 Updated Mar 16, 2021

CUDA implementation of Hierarchical Navigable Small World Graph algorithm

Cuda 127 19 Updated Apr 19, 2021

KDEformer can approximate the attention in sub-quadratic time with provable spectral norm bounds.

Python 3 2 Updated Jul 19, 2023

Weighted MinHash implementation on CUDA (multi-gpu).

C++ 112 24 Updated Nov 29, 2023

Fast Neural Machine Translation in C++

C++ 1,205 227 Updated Aug 25, 2023

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

Python 122 20 Updated Aug 22, 2019

A project for clustering text streams using locality-sensitive hashing (LSH) in Python

Python 27 9 Updated Sep 23, 2011

Solve puzzles. Learn CUDA.

Jupyter Notebook 5,496 319 Updated Jul 5, 2024

Try to implement pre-training and fine-tuning GPT-2 model for research and education purpose.

Jupyter Notebook 8 1 Updated Apr 15, 2024

[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links

Python 411 41 Updated Apr 5, 2022

Implementaion of our EMNLP 2023 submission "Nearest Neighbor Machine Translation is Meta-Optimizer on Output Projection Layer"

Python 6 Updated Oct 17, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 35,631 3,737 Updated Jul 28, 2024

Landmark Attention: Random-Access Infinite Context Length for Transformers

Python 402 36 Updated Dec 20, 2023

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…

Python 52,757 5,454 Updated Aug 7, 2024

Some example MPI programs

92 35 Updated Sep 9, 2011

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,018 200 Updated Jun 8, 2023

A plugin for [Obsidian](https://obsidian.md) which allows syntax highlighting for code blocks in the editor.

JavaScript 473 38 Updated Mar 25, 2024
Next