moreymat

Mathieu Morey moreymat

Achievements

Stars

etalab-ia / franceservices-backend

Backend ressources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents questions.

Python 116 8 Updated Oct 30, 2024

rongstat / SMAI

Alignability testing and integration of single-cell data

R 21 3 Updated Feb 29, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,162 856 Updated Jul 1, 2024

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,024 797 Updated Nov 1, 2024

huggingface / datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,016 143 Updated Oct 31, 2024

huggingface / nanotron

Minimalistic large language model 3D-parallelism training

Python 1,198 118 Updated Nov 1, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 13,588 855 Updated Oct 30, 2024

Chrats-Melkonian / mi_cheese

Repository with code & data for the publication Microbial interactions shape cheese flavour formation

R 3 Updated Jan 5, 2024

maestro-os / maestro

Unix-like kernel written in Rust

Rust 2,961 95 Updated Nov 1, 2024

davisidarta / topometry

Systematically learn and evaluate manifolds from high-dimensional data

Python 94 4 Updated Jul 6, 2023

emilienschultz / history_of_jupyter

Collecting archives and analysis on Jupyter's history

2 Updated Feb 1, 2024

OpenLLM-France / Lit-Claire

Continual pretraining of foundation LLM using ⚡ Lightning Fabric

Python 33 1 Updated Sep 10, 2024

pydantic / FastUI

Build better UIs faster.

Python 8,223 316 Updated Aug 22, 2024

abstractqqq / polars_ds_extension

Polars extension for general data science use cases

Rust 368 24 Updated Nov 1, 2024

functime-org / functime

Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.

Python 1,035 56 Updated Jun 29, 2024

pemistahl / lingua-rs

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Rust 890 41 Updated Oct 30, 2024

pemistahl / lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,143 45 Updated Oct 31, 2024

tifhair / tifhair-website

HTML 8 2 Updated Oct 23, 2023

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 4,636 403 Updated Oct 7, 2024

aangelopoulos / ppi_py

A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.

Python 204 15 Updated Oct 27, 2024

scikit-learn-contrib / MAPIE

A scikit-learn-compatible module to estimate prediction intervals and control risks based on conformal predictions.

Jupyter Notebook 1,293 110 Updated Oct 31, 2024

MilesCranmer / awesome-ml-demos

Curated list of interactive ML demos

337 12 Updated Nov 21, 2023

taylorai / galactic

data cleaning and curation for unstructured text

Python 325 15 Updated Aug 6, 2024

norvig / pytudes

Python programs, usually short, of considerable difficulty, to perfect particular skills.

Jupyter Notebook 23,090 2,430 Updated Oct 28, 2024

mindsdb / mindsdb

The platform for building AI from enterprise data

Python 26,731 4,868 Updated Nov 1, 2024

nlp-uoregon / trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 734 101 Updated Oct 13, 2024

cgpotts / cs224u

Code for Stanford CS224u

Jupyter Notebook 2,110 910 Updated Sep 17, 2024

prefix-dev / pixi

Package management made easy

Rust 3,227 177 Updated Nov 1, 2024

mkirchhof / url

Uncertainty-aware representation learning (URL) benchmark

Python 97 2 Updated Feb 27, 2024

rasbt / scipy2023-deeplearning

Jupyter Notebook 601 105 Updated Sep 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mathieu Morey moreymat

Achievements

Achievements

Block or report moreymat

Stars

etalab-ia / franceservices-backend

rongstat / SMAI

karpathy / minbpe

huggingface / tokenizers

huggingface / datatrove

huggingface / nanotron

VikParuchuri / surya

Chrats-Melkonian / mi_cheese

maestro-os / maestro

davisidarta / topometry

emilienschultz / history_of_jupyter

OpenLLM-France / Lit-Claire

pydantic / FastUI

abstractqqq / polars_ds_extension

functime-org / functime

pemistahl / lingua-rs

pemistahl / lingua-py

tifhair / tifhair-website

huggingface / alignment-handbook

aangelopoulos / ppi_py

scikit-learn-contrib / MAPIE

MilesCranmer / awesome-ml-demos

taylorai / galactic

norvig / pytudes

mindsdb / mindsdb

nlp-uoregon / trankit

cgpotts / cs224u

prefix-dev / pixi

mkirchhof / url

rasbt / scipy2023-deeplearning