Highlights
- Pro
Stars
A Chrome extension to help quickly go through arxiv papers by allowing hiding papers containing specific keywords
A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Modeling, training, eval, and inference code for OLMo
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Aligning pretrained language models with instruction data generated by themselves.
Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW(🏄).
Cereal Bar is a two-player web game designed for studying language understanding agents in collaborative interactions. This repository contains code for the game, a webapp hosting the game, the age…
A framework for few-shot evaluation of language models.
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization
A curated list of programmatic weak supervision papers and resources
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
An awesome list of events and fellowship opportunities for Computer Science students
A curated list of fellowships for graduate students in Computer Science and related fields.
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.