Skip to content
View stas00's full-sized avatar

Organizations

@bigscience-workshop

Block or report stas00

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

cgmemtime measures the high-water RSS+CACHE memory usage of a process and its descendant processes.

C 109 17 Updated Dec 15, 2022

Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.

C++ 60 3 Updated Jul 8, 2024

Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!

607 39 Updated Oct 18, 2024

Linux​ based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.

C 78 29 Updated Aug 23, 2024

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

Go 890 154 Updated Sep 19, 2024

Module, Model, and Tensor Serialization/Deserialization

Python 183 27 Updated Oct 16, 2024

All Algorithms implemented in Python

Python 192,456 45,390 Updated Oct 14, 2024

Run your own AI cluster at home with everyday devices πŸ“±πŸ’» πŸ–₯️⌚

Python 10,418 593 Updated Oct 16, 2024
Jupyter Notebook 453 22 Updated Aug 23, 2024

A python module to repair invalid JSON, commonly used to parse the output of LLMs

Python 826 48 Updated Oct 13, 2024

Efficient Triton Kernels for LLM Training

Python 3,239 173 Updated Oct 17, 2024

Gpu benchmark

Python 41 4 Updated Oct 6, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,908 1,041 Updated Oct 14, 2024
Python 29 3 Updated Jul 28, 2024

Utils for streaming large files (S3, HDFS, gzip, bz2...)

Python 3,194 385 Updated Oct 4, 2024

A hack to make MPI over Infiniband to work on Docker and Singularity containers

Shell 7 2 Updated Mar 13, 2017

Pragmatic approach to parsing import profiles for CI's

Python 11 1 Updated Jul 1, 2024
Python 260 35 Updated Aug 20, 2024

A sample pattern for running CI tests on Modal

Python 12 1 Updated Sep 20, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 23,312 669 Updated Oct 19, 2024

A sample pattern for running CI tests on Modal

Python 2 Updated Jun 10, 2024

Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.

TypeScript 260 20 Updated Oct 5, 2024

Applied AI experiments and examples for PyTorch

Python 147 12 Updated Oct 18, 2024

The official Python client for the Huggingface Hub.

Python 2,052 539 Updated Oct 18, 2024

CUDA checkpoint and restore utility

Cuda 212 10 Updated Apr 17, 2024

Checkpoint/Restore tool

C 2,926 585 Updated Oct 16, 2024

Hardware locality (hwloc)

C 572 173 Updated Oct 8, 2024

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

Python 296 48 Updated Oct 18, 2024

A PyTorch Native LLM Training Framework

Python 637 33 Updated Aug 25, 2024

πŸš€ Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python 169 29 Updated Oct 19, 2024
Next