Data augmentation for NLP, presented at EMNLP 2019
-
Updated
Mar 19, 2023 - Python
Data augmentation for NLP, presented at EMNLP 2019
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
📃Language Model based sentences scoring library
ICLR 2018 Quick-Thought vectors
Python API & command-line tool to easily transcribe speech-based video files into clean text
🙊 Stop repeating yourself
Extract Information from web corpus using Open Information Extraction.
10,000 sentences: an Android app to help you learn new words in foreign languages
Tensorflow Implementation of Variational Attention for Sequence to Sequence Models (COLING 2018)
Apache OpenNLP wrapper for Nodejs
Russian language support for NLTK's PunktSentenceTokenizer
A sentence segmentation library with wide language support optimized for speed and utility.
Port of PragmaticSegmenter for sentence boundary detection
Join all elements of an array and create a human-readable string
Add a description, image, and links to the sentence topic page so that developers can more easily learn about it.
To associate your repository with the sentence topic, visit your repo's landing page and select "manage topics."