Skip to content
View GuillaumeDD's full-sized avatar

Block or report GuillaumeDD

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 8,599 703 Updated Sep 27, 2024

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Jupyter Notebook 10,017 6,744 Updated Sep 25, 2024

Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

Jupyter Notebook 5,133 641 Updated Apr 23, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 2,933 376 Updated Sep 4, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,972 508 Updated Sep 26, 2024

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 1,290 110 Updated Sep 23, 2024

Source code and data for Like a Good Nearest Neighbor

Python 28 Updated Jan 30, 2024

🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

Python 520 25 Updated Aug 28, 2024

Easily embed, cluster and semantically label text datasets

Python 442 33 Updated Mar 28, 2024

🗺️ Data Cleaning and Textual Data Visualization 🗺️

Python 133 13 Updated Jun 18, 2024

A benchmark to evaluate language models on questions I've previously asked them to solve.

Python 875 64 Updated Sep 13, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,959 139 Updated Sep 27, 2024

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

Python 3,825 359 Updated Sep 27, 2024

Python packaging and dependency management made easy

Python 31,195 2,253 Updated Sep 23, 2024

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 7,397 679 Updated Sep 9, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 17,284 1,324 Updated Sep 27, 2024

Generative Agents: Interactive Simulacra of Human Behavior

16,287 2,091 Updated Aug 5, 2024

Rift: an AI-native language server for your personal AI software engineer

Python 3,083 149 Updated Nov 18, 2023

Track and predict the energy consumption and carbon footprint of training deep learning models.

Python 369 27 Updated Sep 20, 2024

LlamaIndex is a data framework for your LLM applications

Python 35,716 5,050 Updated Sep 27, 2024

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 1,977 255 Updated Sep 17, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,352 374 Updated Jul 16, 2023

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,015 366 Updated Sep 27, 2024

An autoregressive character-level language model for making more things

Python 2,483 655 Updated Jun 4, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 132,646 26,431 Updated Sep 27, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 166,980 44,170 Updated Sep 27, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 10,097 1,449 Updated Aug 8, 2024

A curated, but incomplete, list of data-centric AI resources.

1,031 72 Updated Jun 26, 2024
Next