Stars
A playbook for systematically maximizing the performance of deep learning models.
SciRepEval benchmark training and evaluation scripts
ScienceNOW: Topic Modelling with Tweets, Arxiv, Reddit & Mendeley
The official tool for transforming doccano format into common dataset formats.
A collection of notebooks that implement algorithms introduced in "Learning from positive and unlabeled data: a survey"
MTEB: Massive Text Embedding Benchmark
Examples and guides for using the OpenAI API
SGPT: GPT Sentence Embeddings for Semantic Search
A browser extension that enhance search engines with ChatGPT
LlamaIndex is a data framework for your LLM applications
Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to compute clusters of semantically similar documents at diffe…
The official Python library for the OpenAI API
Python example app from the OpenAI API quickstart tutorial
A concise but complete full-attention transformer with a set of promising experimental features from various papers
An implementation of masked language modeling for Pytorch, made as concise and simple as possible
Jupyter notebooks for the Natural Language Processing with Transformers book
State-of-the-Art Text Embeddings
TensorFlow code and pre-trained models for BERT
Template for using Sphinx-Gallery to document a package
A set of scripts to grab public datasets from resources related to arXiv
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Code and examples for "Learning on Knowledge Graph Dynamics Provides Early Warning of Impactful Research".
🎓 Sharing machine learning course / lecture notes.