Stars
Record matching and entity resolution at scale in Spark
This is a repo with links to everything you'd ever want to learn about data engineering
Large Language Model Text Generation Inference
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🦜🔗 Build context-aware reasoning applications
mercury-explainability is a library with implementations of different state-of-the-art methods in the field of explainability. They are designed to work efficiently and to be easily integrated with…
mercury-robust is a framework to perform robust testing on ML models and datasets. It provides a collection of test that are easy to configure and helpful to guarantee robustness in your ML processes.
A Python 3 library developed in C++ that enables efficient storage and querying of sets of sets. It can be used to perform fast document search. Uses the Settrie algorithm: https://osebje.famnit.up…
Reels is a library for analyzing sequences of events from transactional data to predict when related target events may occur in the future.
mercury-monitoring is a library to monitor data and model drift
Utility package that, given a Pandas DataFrame, it uses the DataSchema class which auto-infers feature types and automatically calculates different statistics depending on the types.
What's in your data? Extract schema, statistics and entities from datasets
Blue Brain Nexus - A knowledge graph for data-driven science