Skip to content
View kabachuha's full-sized avatar

Organizations

@wesnoth @deforum-art

Block or report kabachuha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Repository for Denoising Levy Probabilistic Models

2 Updated Jun 3, 2024

Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost

Python 4 Updated Dec 14, 2023

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Python 680 22 Updated Sep 3, 2024

Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

73 2 Updated Jul 16, 2024

Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"

22 Updated Jun 14, 2024

This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"

JavaScript 76 3 Updated Aug 15, 2024

SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining

Python 15 1 Updated Jul 1, 2024

Official code for "Block Transformer: Global-to-Local Language Modeling for Fast Inference"

Python 117 6 Updated Sep 3, 2024
Python 20 2 Updated Jul 19, 2024

This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.

Python 50 3 Updated Jul 1, 2024

Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"

Python 475 38 Updated Jun 28, 2024

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Python 516 42 Updated Sep 2, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 6 6 Updated Nov 29, 2023

nanoGPT reproduction but for llama 2

Python 3 Updated Jul 25, 2024

GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

6 Updated Jun 19, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,343 139 Updated Jun 3, 2024
Python 61 8 Updated Jul 11, 2024

A simple minimal implementation of Reversible Vision Transformers

Python 113 7 Updated Mar 14, 2024
Python 87 7 Updated Jun 28, 2024

STAR: Scale-wise Text-to-image generation via Auto-Regressive representations

107 1 Updated Jun 18, 2024

[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization

Python 73 Updated Jun 12, 2024

EasyRobust: an Easy-to-use library for state-of-the-art Robust Computer Vision Research with PyTorch.

Jupyter Notebook 319 37 Updated Jun 30, 2024

Bidirectional Autoregressive Talker from Generative Pre-trained Transformer

Python 37 1 Updated Jul 27, 2023

Tutorial for how to build BERT from scratch

Python 80 20 Updated May 22, 2024

Fast and memory-efficient exact attention

Python 13,247 1,192 Updated Sep 4, 2024

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 750 81 Updated Jan 3, 2024

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 378 15 Updated Aug 28, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,360 195 Updated Sep 4, 2024

[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”

Python 114 6 Updated Jul 8, 2024

Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagation operation to get super vision language performances. (Under Review)

Python 82 1 Updated Jun 23, 2024
Next