Skip to content
View dmizr's full-sized avatar

Block or report dmizr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WIP

Python 76 1 Updated Aug 13, 2024

A library for unit scaling in PyTorch

Jupyter Notebook 90 6 Updated Aug 21, 2024

Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam

Python 63 3 Updated Jul 28, 2024
Python 67 4 Updated Jul 5, 2024

LLM101n: Let's build a Storyteller

27,154 1,482 Updated Aug 1, 2024

DataComp for Language Models

HTML 1,069 94 Updated Aug 19, 2024
Python 173 8 Updated Jul 15, 2024
Python 7,053 546 Updated Aug 12, 2024

seqax = sequence modeling + JAX

Python 128 10 Updated Jul 17, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,941 298 Updated Jul 16, 2024

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Python 677 42 Updated May 2, 2024

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Python 491 32 Updated Jan 7, 2024

MLX: An array framework for Apple silicon

C++ 16,226 925 Updated Aug 22, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 784 36 Updated Mar 25, 2024

An Extensible Deep Learning Library

Python 1,736 223 Updated Aug 22, 2024
Python 2,611 297 Updated Aug 21, 2024

Task-based datasets, preprocessing, and evaluation for sequence models.

Python 549 56 Updated Aug 22, 2024

distributed trainer for LLMs

Python 513 73 Updated May 20, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,246 578 Updated Aug 22, 2024

COYO-700M: Large-scale Image-Text Pair Dataset

Python 1,136 35 Updated Nov 30, 2022

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,149 322 Updated Jul 21, 2024

Inference code for Llama models

Python 55,189 9,410 Updated Aug 18, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,498 936 Updated Aug 21, 2024

A playbook for systematically maximizing the performance of deep learning models.

26,176 2,181 Updated Jun 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,812 5,558 Updated Aug 19, 2024

Tensors, for human consumption

Jupyter Notebook 1,082 15 Updated Jul 8, 2024

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,384 248 Updated Apr 24, 2024

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,541 329 Updated Aug 7, 2024

[ICLR2024] Exploring Target Representations for Masked Autoencoders

Python 51 8 Updated Jan 17, 2024
Jupyter Notebook 3,021 283 Updated May 14, 2024
Next