
Organizations

@Kungbib


Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,223 218 Updated Nov 13, 2024

Tips for Writing a Research Paper using LaTeX

TeX 3,131 372 Updated May 4, 2023

DSPy: The framework for programming, not prompting, language models

Python 18,885 1,445 Updated Nov 19, 2024

Code for paper "The effect of batch size on contrastive self-supervised speech representation learning"

Python 8 1 Updated Aug 29, 2024

Minimalistic large language model 3D-parallelism training

Python 7 5 Updated Nov 18, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,734 1,065 Updated Nov 13, 2024

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 899 37 Updated Nov 13, 2024
Python 102 6 Updated Jun 28, 2024

A repository for research on medium sized language models.

Python 479 69 Updated Nov 19, 2024

Structured Text Generation

Python 9,488 485 Updated Nov 18, 2024
Python 66 4 Updated Nov 17, 2024
Python 450 45 Updated Oct 28, 2024

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 807 96 Updated Nov 18, 2024

JAX-Toolbox

Jupyter Notebook 245 48 Updated Nov 18, 2024

[TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Python 209 13 Updated Nov 15, 2024
Python 256 16 Updated Jun 14, 2024

A native PyTorch Library for large model training

Python 2,621 204 Updated Nov 19, 2024

A repository for managing public, versioned releases of the Swedish Parliament Corpus.

Python 5 Updated Nov 1, 2024

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"

Jupyter Notebook 56 2 Updated Oct 11, 2024

My Arch dotfiles

Shell 8 Updated Nov 17, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,645 746 Updated Jun 24, 2024

Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️

C 1,140 43 Updated Oct 4, 2024

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

C++ 2,262 141 Updated Nov 18, 2024

PyTorch native finetuning library

Python 4,328 436 Updated Nov 18, 2024

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,869 445 Updated Nov 18, 2024

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python 193 32 Updated Nov 17, 2024

library supporting NLP and CV research on scientific papers

Python 704 55 Updated Nov 8, 2024

A pipeline to improve skills of large language models

Python 191 41 Updated Nov 19, 2024

Experiments for efforts to train a new and improved t5

Python 76 5 Updated Apr 15, 2024
Python 224 20 Updated Jul 11, 2024