Skip to content
View versae's full-sized avatar

Organizations

@CulturePlex @NbAiLab
Block or Report

Block or report versae

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-quality datasets, tools, and concepts for LLM fine-tuning.

1,143 108 Updated Jul 28, 2024

Schedule-Free Optimization in PyTorch

Python 1,731 55 Updated Jul 12, 2024

Train GEMMA on TPU/GPU! (Codebase for training Gemma-Ko Series)

Python 40 11 Updated Mar 2, 2024

Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.

Python 23 1 Updated Jul 30, 2024

Code for SaGe subword tokenizer (EACL 2023)

Python 21 2 Updated May 10, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 87 3 Updated Jan 29, 2024

Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

Assembly 522 42 Updated May 20, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,576 2,424 Updated Apr 28, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 43 12 Updated Jul 25, 2024

Converts text to speech in realtime

Python 1,507 137 Updated Jul 22, 2024

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,563 262 Updated Jun 2, 2024

Structured Text Generation

Python 7,417 379 Updated Jul 31, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,439 2,023 Updated Jul 14, 2024

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 651 41 Updated Apr 10, 2024

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,174 268 Updated Jul 3, 2024

Multipack distributed sampler for fast padding-free training of LLMs

Python 160 12 Updated Jul 8, 2023

Making large AI models cheaper, faster and more accessible

Python 38,421 4,314 Updated Jul 31, 2024

Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)

Python 184 18 Updated May 20, 2024

Make Praat Picture style plots of acoustic data

R 25 2 Updated May 22, 2024

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,785 99 Updated Nov 30, 2023

🕸️ Web apps in pure Python 🐍

Python 18,239 1,015 Updated Jul 31, 2024

JAX implementation of the Llama 2 model

Python 203 23 Updated Feb 2, 2024

Transformers with Arbitrarily Large Context

Python 587 43 Updated Jul 13, 2024

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Python 345 47 Updated Nov 7, 2023

LoRA for arbitrary JAX models and functions

Python 124 4 Updated Feb 26, 2024

Conformal classifiers, regressors and predictive systems

Python 363 28 Updated Jun 27, 2024

Annotation Tool and Image Search

JavaScript 11 3 Updated Jul 19, 2024
Jupyter Notebook 5 1 Updated Jun 9, 2024

JAX Synergistic Memory Inspector

Python 149 3 Updated Jul 16, 2024

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Python 2,062 131 Updated Jun 7, 2023
Next