Stars
Factorization Vision Transformer: Modeling Long Range Dependency with Local Window Cost
Speeds up long-context LLM inference with approximate, dynamic sparse attention, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
Official code for "Block Transformer: Global-to-Local Language Modeling for Fast Inference"
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
Implementation of Rotary Embeddings, from the RoFormer paper, in Pytorch (a minimal sketch of the rotation appears after this list)
simran-arora / cs229s-nanoGPT
Forked from karpathy/nanoGPT. The simplest, fastest repository for training/finetuning medium-sized GPTs.
GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection (a rough sketch of the projection step appears after this list)
A simple minimal implementation of Reversible Vision Transformers
STAR: Scale-wise Text-to-image generation via Auto-Regressive representations
[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization
EasyRobust: an Easy-to-use library for state-of-the-art Robust Computer Vision Research with PyTorch.
Bidirectional Autoregressive Talker from Generative Pre-trained Transformer
Tutorial for how to build BERT from scratch
Fast and memory-efficient exact attention
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation
Vector (and Scalar) Quantization, in Pytorch (a minimal VQ sketch appears after this list)
[ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”
Official PyTorch implementation of Traversal of Layers (TroL), which introduces a new layer-traversal propagation operation for strong vision-language performance. (Under review)
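A few of the techniques named in the list are compact enough to sketch. First, the rotary-embedding entry: the function below is a minimal, assumed illustration of RoFormer-style rotary position embeddings, not the repository's implementation. Pairing the first half of the channels with the second half is one common convention (interleaved pairs are another), and the function name is an assumption.

```python
import torch

def apply_rotary(x, base=10000.0):
    """Minimal sketch: rotate channel pairs of x by position-dependent angles.

    x: (batch, seq_len, dim) with dim even. Names and channel layout are
    illustrative assumptions, not the repo's API.
    """
    _, n, d = x.shape
    half = d // 2
    inv_freq = base ** (-torch.arange(half, dtype=torch.float32) / half)        # (half,)
    angles = torch.arange(n, dtype=torch.float32)[:, None] * inv_freq[None, :]  # (n, half)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]
    # 2-D rotation applied to each (x1, x2) channel pair.
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

q = torch.randn(2, 16, 64)
print(apply_rotary(q).shape)  # torch.Size([2, 16, 64]); positions are now encoded as rotations
```

Because the same rotation is applied to queries and keys before the dot product, relative position falls out of the attention scores, which is the property the RoFormer paper exploits.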
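The GaLore entry refers to projecting weight gradients into a low-rank subspace so that optimizer state lives in far fewer dimensions. The step below is only a rough sketch under assumed names (low_rank_project_step, rank, lr); the actual method keeps Adam moments in the projected space and refreshes the projection periodically.

```python
import torch

def low_rank_project_step(weight, grad, rank=8, lr=1e-3):
    """Rough sketch of one gradient-low-rank-projection update (names assumed)."""
    # Take the top-`rank` left singular vectors of the gradient as the subspace.
    U, _, _ = torch.linalg.svd(grad, full_matrices=False)
    P = U[:, :rank]                 # (m, rank) projection matrix
    g_low = P.T @ grad              # (rank, n) compressed gradient; optimizer state
                                    # (e.g. Adam moments) would be kept at this size
    update = P @ g_low              # project back to the full weight shape
    return weight - lr * update

W, G = torch.randn(256, 128), torch.randn(256, 128)   # weight and a stand-in gradient
print(low_rank_project_step(W, G).shape)               # torch.Size([256, 128])
```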
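The vector-quantization entry also maps to a short sketch. The module below (class and argument names are assumptions, not the repo's API) does nearest-neighbour codebook lookup with the usual codebook and commitment losses and a straight-through gradient.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizerSketch(nn.Module):
    """Nearest-neighbour codebook lookup with a straight-through estimator (illustrative)."""
    def __init__(self, num_codes=512, dim=64, commit_weight=0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, dim)
        self.commit_weight = commit_weight

    def forward(self, z):                                   # z: (batch, n, dim)
        flat = z.reshape(-1, z.shape[-1])
        dists = torch.cdist(flat, self.codebook.weight)     # (batch*n, num_codes)
        idx = dists.argmin(dim=-1).view(z.shape[:-1])       # nearest code per vector
        z_q = self.codebook(idx)
        # Codebook loss pulls codes toward encodings; commitment loss does the reverse.
        loss = F.mse_loss(z_q, z.detach()) + self.commit_weight * F.mse_loss(z, z_q.detach())
        z_q = z + (z_q - z).detach()                        # straight-through: copy gradients to z
        return z_q, idx, loss

vq = VectorQuantizerSketch()
z = torch.randn(2, 16, 64, requires_grad=True)
z_q, idx, loss = vq(z)
loss.backward()   # gradients flow to both the encoder output and the codebook
```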