Stars
🚪✊ Knock Knock: Get notified when your training ends with only two additional lines of code
A library for efficient similarity search and clustering of dense vectors.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
EMNLP 2021 - Pre-training architectures for dense retrieval
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
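Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyphrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing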
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning
EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)
Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
Facebook AI Research Sequence-to-Sequence Toolkit
Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain
A tutorial on how to implement models for part-of-speech tagging using PyTorch and TorchText.
PyTorch implementation for the Text Normalization Challenge
Google & Kaggle text normalization challenge with LSTM encoder/decoder.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
Well-tested, multi-language evaluation framework for text summarization.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL