Stars
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally, alongside its recently released LLM data-processing library datatrove and LLM training library nanotron.
Arrays, Tensors and dynamic Neural Networks in Mojo 🔥
[IEEE ICIP 2024] Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
A massively parallel, high-level programming language
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
A library for mechanistic interpretability of GPT-style language models
Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Deep learning in Rust, with shape-checked tensors and neural networks
⚡ A Fast, Extensible Progress Bar for Python and CLI
Transformers with Arbitrarily Large Context
This is the Rust course used by the Android team at Google. It provides the material you need to teach Rust quickly.