Lauler

Follow

Faton Lauler

Follow

20 followers · 9 following

Sweden

Achievements

Achievements

Organizations

Stars

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,223 218 Updated Nov 13, 2024

guanyingc / latex_paper_writing_tips

Tips for Writing a Research Paper using LaTeX

TeX 3,131 372 Updated May 4, 2023

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 18,885 1,445 Updated Nov 19, 2024

nikvaessen / w2v2-batch-size

Code for paper "The effect of batch size on contrastive self-supervised speech representation learning"

Python 8 1 Updated Aug 29, 2024

swiss-ai / nanotron

Forked from huggingface/nanotron

Minimalistic large language model 3D-parallelism training

Python 7 5 Updated Nov 18, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,734 1,065 Updated Nov 13, 2024

xhluca / bm25s

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 899 37 Updated Nov 13, 2024

zh460045050 / VQGAN-LC

Python 102 6 Updated Jun 28, 2024

mlfoundations / open_lm

A repository for research on medium sized language models.

Python 479 69 Updated Nov 19, 2024

dottxt-ai / outlines

Structured Text Generation

Python 9,488 485 Updated Nov 18, 2024

AnswerDotAI / bert24

Python 66 4 Updated Nov 17, 2024

huggingface / cosmopedia

Python 450 45 Updated Oct 28, 2024

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 807 96 Updated Nov 18, 2024

NVIDIA / JAX-Toolbox

JAX-Toolbox

Jupyter Notebook 245 48 Updated Nov 18, 2024

ymy-k / Hi-SAM

[TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation

Python 209 13 Updated Nov 15, 2024

huggingface / diarizers

Python 256 16 Updated Jun 14, 2024

pytorch / torchtitan

A native PyTorch Library for large model training

Python 2,621 204 Updated Nov 19, 2024

swerik-project / the-swedish-parliament-corpus

A repository for managing public, versioned releases of the Swedish Parliament Corpus.

Python 5 Updated Nov 1, 2024

siyan-zhao / prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"

Jupyter Notebook 56 2 Updated Oct 11, 2024

h4pZ / h4rch

My Arch dotfiles

Shell 8 Updated Nov 17, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,645 746 Updated Jun 24, 2024

unum-cloud / ucall

Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️

C 1,140 43 Updated Oct 4, 2024

unum-cloud / usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

C++ 2,262 141 Updated Nov 18, 2024

pytorch / torchtune

PyTorch native finetuning library

Python 4,328 436 Updated Nov 18, 2024

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,869 445 Updated Nov 18, 2024

foundation-model-stack / fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Python 193 32 Updated Nov 17, 2024

allenai / papermage

library supporting NLP and CV research on scientific papers

Python 704 55 Updated Nov 8, 2024

Kipok / NeMo-Skills

A pipeline to improve skills of large language models

Python 191 41 Updated Nov 19, 2024

EleutherAI / improved-t5

Experiments for efforts to train a new and improved t5

Python 76 5 Updated Apr 15, 2024

rwitten / HighPerfLLMs2024

Python 224 20 Updated Jul 11, 2024