LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 665 75 Updated Sep 18, 2024

Library for industrial alignment.

Python 64 2 Updated Sep 18, 2024

Arrays, Tensors and dynamic Neural Networks in Mojo 🔥

Mojo 193 6 Updated Sep 15, 2024

A simple, performant and scalable JAX LLM!

Python 1,455 272 Updated Sep 18, 2024

[IEEE ICIP 2024] Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy

Python 6 Updated Jun 19, 2024

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 1,195 64 Updated Sep 16, 2024

A massively parallel, high-level programming language

Rust 17,221 425 Updated Sep 17, 2024

Tile primitives for speedy kernels

Cuda 1,488 57 Updated Sep 18, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,578 1,336 Updated Sep 15, 2024

Grok open release

Python 49,420 8,327 Updated Aug 30, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,361 141 Updated Sep 10, 2024

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Python 206 13 Updated Aug 16, 2024

Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"

Python 153 4 Updated Feb 19, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,418 273 Updated Sep 16, 2024
Python 28 Updated Sep 14, 2024

Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)

74 3 Updated Jan 8, 2024

JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️

Python 185 15 Updated Aug 16, 2024

Mamba SSM architecture

Python 12,577 1,058 Updated Aug 15, 2024

Stainless neural networks in JAX

Python 30 Updated Sep 2, 2024
Python 9 Updated Oct 30, 2023

Deep learning in Rust, with shape checked tensors and neural networks

Rust 1,709 98 Updated Jul 23, 2024

⚡ A Fast, Extensible Progress Bar for Python and CLI

Python 28,351 1,348 Updated Aug 17, 2024

Transformers with Arbitrarily Large Context

Python 613 48 Updated Aug 12, 2024

Jaxpr Visualisation Tool

Python 15 1 Updated Jul 27, 2024

This is the Rust course used by the Android team at Google. It provides you with the material to quickly teach Rust.

Rust 27,519 1,636 Updated Sep 18, 2024

Inference Llama 2 in one file of pure 🔥

Mojo 2,092 143 Updated May 21, 2024
Python 322 31 Updated Apr 12, 2024
Python 98 11 Updated Aug 19, 2024

Minimalist ML framework for Rust

Rust 15,119 883 Updated Sep 15, 2024