View moreymat's full-sized avatar

Backend resources for Albert. Albert is a conversational agent that uses official French data sources to answer administrative agents' questions.

Python 116 8 Updated Oct 30, 2024

Alignability testing and integration of single-cell data

R 21 3 Updated Feb 29, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,162 856 Updated Jul 1, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 9,024 797 Updated Nov 1, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,016 143 Updated Oct 31, 2024

Minimalistic large language model 3D-parallelism training

Python 1,198 118 Updated Nov 1, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 13,588 855 Updated Oct 30, 2024

Repository with code & data for the publication Microbial interactions shape cheese flavour formation

R 3 Updated Jan 5, 2024

Unix-like kernel written in Rust

Rust 2,961 95 Updated Nov 1, 2024

Systematically learn and evaluate manifolds from high-dimensional data

Python 94 4 Updated Jul 6, 2023

Collecting archives and analysis on Jupyter's history

2 Updated Feb 1, 2024

Continual pretraining of foundation LLM using ⚡ Lightning Fabric

Python 33 1 Updated Sep 10, 2024

Build better UIs faster.

Python 8,223 316 Updated Aug 22, 2024

Polars extension for general data science use cases

Rust 368 24 Updated Nov 1, 2024

Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.

Python 1,035 56 Updated Jun 29, 2024

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

Rust 890 41 Updated Oct 30, 2024

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,143 45 Updated Oct 31, 2024
HTML 8 2 Updated Oct 23, 2023

Robust recipes to align language models with human and AI preferences

Python 4,636 403 Updated Oct 7, 2024

A package for statistically rigorous scientific discovery using machine learning. Implements prediction-powered inference.

Python 204 15 Updated Oct 27, 2024

A scikit-learn-compatible module to estimate prediction intervals and control risks based on conformal predictions.

Jupyter Notebook 1,293 110 Updated Oct 31, 2024

Curated list of interactive ML demos

337 12 Updated Nov 21, 2023

Data cleaning and curation for unstructured text

Python 325 15 Updated Aug 6, 2024

Python programs, usually short, of considerable difficulty, to perfect particular skills.

Jupyter Notebook 23,090 2,430 Updated Oct 28, 2024

The platform for building AI from enterprise data

Python 26,731 4,868 Updated Nov 1, 2024

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 734 101 Updated Oct 13, 2024

Code for Stanford CS224u

Jupyter Notebook 2,110 910 Updated Sep 17, 2024

Package management made easy

Rust 3,227 177 Updated Nov 1, 2024

Uncertainty-aware representation learning (URL) benchmark

Python 97 2 Updated Feb 27, 2024
Jupyter Notebook 601 105 Updated Sep 17, 2023