Skip to content
View 1a1a11a's full-sized avatar

Highlights

  • Pro

Organizations

@cacheMon
Block or Report

Block or report 1a1a11a

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Minimalistic large language model 3D-parallelism training

Python 1,037 99 Updated Aug 17, 2024

Inference code for Llama models

Python 55,111 9,410 Updated Aug 18, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 11,845 924 Updated May 23, 2024

NVIDIA GPUDirect Storage Driver

C 189 31 Updated May 22, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 9,591 675 Updated Aug 18, 2024

grep for words with similar meaning to the query

Go 1,068 23 Updated Aug 10, 2024

A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.

Rust 4,362 146 Updated Aug 17, 2024

Implementation of a new caching algo called SIEVE. Link to paper included in README

Python 1 Updated Jul 1, 2024

SIEVE Cache for Crystal lang

Crystal 2 Updated May 25, 2024

implement SIEVE cache eviction algorithm

1 Updated Jun 13, 2024

Cache implementation in ABAP

ABAP 4 1 Updated Jul 18, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,494 462 Updated Aug 17, 2024

A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.

Python 2,660 3,075 Updated Aug 18, 2024

Speed up fsspec data access with Alluxio distributed caching.

Python 12 6 Updated Aug 14, 2024

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

Python 1,971 155 Updated Jul 16, 2024

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 104 5 Updated Aug 17, 2024

Major CS conference publication stats (including accepted and submitted) by year.

Python 109 8 Updated Jun 19, 2024

A tool for measuring energy consumption of Intel CPUs

C 319 29 Updated Nov 2, 2023

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,181 158 Updated Jul 12, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 745 40 Updated Jul 31, 2024

Send email in Python conveniently for gmail using yagmail

Python 2,635 265 Updated Sep 28, 2022

A dump of some of our Presto logs, for use as part of ongoing Presto/HDFS research and presentations.

2 Updated Apr 23, 2024

Performance Optimizer Observation Platform

Zig 779 50 Updated Jul 18, 2024

A SIEVE cache implementation for D

D 6 Updated Jun 2, 2024

A SIEVE cache implementation for C++

C++ 3 Updated Aug 15, 2024

Java implementation of the S3 Fifo Cache

HTML 1 Updated Mar 28, 2024

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 241 22 Updated Aug 1, 2024

Inference Llama 2 in one file of pure C

C 17,029 2,002 Updated Aug 6, 2024

Llama2 transformer walkthrough with code examples

C 28 5 Updated Nov 9, 2023
Next