Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,157 210 Updated Nov 6, 2024

TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.

Python 14 Updated Jun 24, 2024

Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry lead…

Python 456 68 Updated Oct 28, 2024

Fixes mojibake and other glitches in Unicode text, after the fact.

Python 3,810 121 Updated Oct 30, 2024

Scalable data preprocessing and curation toolkit for LLMs

Jupyter Notebook 585 78 Updated Nov 8, 2024

A Jax-based library for designing and training transformer models from scratch.

Python 275 11 Updated Aug 28, 2024

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax

Python 516 81 Updated Nov 8, 2024

A simple, performant and scalable Jax LLM!

Python 1,524 292 Updated Nov 9, 2024

Accelerate and optimize performance with streamlined training and serving options with JAX.

Python 202 25 Updated Nov 9, 2024

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating and serving LLMs in JAX/Flax.

Python 2 Updated Feb 5, 2024

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,333 73 Updated Apr 11, 2024

Convert PDF to markdown quickly with high accuracy

Python 17,595 1,008 Updated Nov 7, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,840 461 Updated May 3, 2024

Utility for behavioral and representational analyses of Language Models

Python 122 29 Updated Aug 30, 2024

LLM inference in C/C++

C++ 67,500 9,691 Updated Nov 9, 2024

Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.

Python 314 21 Updated Aug 12, 2024

ByT5 model scripts

Jupyter Notebook 2 Updated Jul 12, 2021

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,569 350 Updated Oct 17, 2024

The collection of files that contain datasets for 6 languages (Russian, Basque, Turkish, Spanish, Czech and English) with labels of different morphological complexity

Python 1 Updated Feb 29, 2024

List of papers on hallucination detection in LLMs.

669 54 Updated Nov 1, 2024

Code and data for the paper "Evaluating German Transformer Language Models with Syntactic Agreement Tests" (Zaczynska et al., 2020)

Python 7 2 Updated Jun 12, 2023

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating and serving LLMs in JAX/Flax.

Python 2,402 255 Updated Aug 13, 2024

Code for Paper: “Low-Resource” Text Classification: A Parameter-Free Classification Method with Compressors

Python 1,769 155 Updated Aug 7, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,211 5,922 Updated Aug 19, 2024

Due to the restrictions of the LLaMA license, we try to reimplement BLOOM-LoRA (under the much less restrictive BLOOM license: https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json

Jupyter Notebook 183 39 Updated Jun 18, 2023

Expanding natural instructions

Python 956 189 Updated Dec 11, 2023

LTG-Bert

Python 29 4 Updated Jan 8, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,923 153 Updated Mar 27, 2024

Reduce the size of pretrained Hugging Face models via vocabulary trimming.

Python 43 5 Updated Dec 28, 2022

Summarization Papers

TeX 985 143 Updated Jul 15, 2023