Scripts supporting the development and serving of the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search

Jupyter Notebook · 10 stars · 7 forks · Updated Mar 10, 2023

Pipeline for pulling and processing online language model pretraining data from the web

Python · 173 stars · 23 forks · Updated Jul 31, 2023

Apache Lucene open-source search software

Java · 2,606 stars · 1,016 forks · Updated Sep 26, 2024

Web-scale retrieval for knowledge-intensive NLP

Python · 553 stars · 27 forks · Updated Dec 6, 2022

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

309 stars · 16 forks · Updated Nov 21, 2022

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python · 1,640 stars · 364 forks · Updated Sep 25, 2024

The AI Knowledge Editor

Python · 181 stars · 9 forks · Updated Jul 12, 2022

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python · 1,976 stars · 255 forks · Updated Sep 17, 2024

A library for building and serving multi-node distributed faiss indices.

Python · 252 stars · 18 forks · Updated Nov 1, 2023

LAnguage Model Analysis

Python · 1,345 stars · 181 forks · Updated Jul 7, 2024

Misspelling Oblivious Word Embeddings

202 stars · 22 forks · Updated Aug 6, 2019

Library for fast text representation and classification.

HTML · 25,863 stars · 4,711 forks · Updated Mar 22, 2024