- Amherst, MA, USA
- https://orcid.org/0009-0002-3246-5198
Stars
Collection of LeetCode company-tag problems. Updated periodically.
⏰ AI conference deadline countdowns
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
The Center for Data Science repository for the International Hate Observatory Project and analyzing Reddit. This produces the models used in RedditMap.social.
An Open-Source Package for Knowledge Embedding (KE)
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
A playbook for systematically maximizing the performance of deep learning models.
StableLM: Stability AI Language Models
ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the u…
ACL 2021: Question Answering over Temporal Knowledge Graphs
Community builds using source code from OpenJDK project
An implementation of TransE and its extended models for Knowledge Representation Learning on TensorFlow
First Order Inductive Learner (FOIL) algorithm implemented in Python
Natural Language Processing Best Practices & Examples
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Toolbox of models, callbacks, and datasets for AI/ML researchers.
Google USE (Universal Sentence Encoder) for spaCy
Sentence-transformers models for spaCy
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
A topic-centric list of HQ open datasets.
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
📝 A text file containing 479k English words for all your dictionary/word-based projects, e.g. auto-completion / auto-suggestion
LankyCyril / pyvenn
Forked from tctianchi/pyvenn
Python module for plotting Venn diagrams of 2..6 sets
An ultra fast cross-platform multiple screenshots module in pure Python using ctypes.