Stars
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the d…
The UC Davis Corpus of Written Spanish, L2 and Heritage Speakers
Unannotated Spanish 3 Billion Words Corpora
A Spanish Reddit dialogue corpus, constructed from Reddit comments posted in 2019.
TensorFlow TextCNN and TextRNN: text classification using TextCNN and TextRNN.
SeqGAN for paraphrase generation.
TensorFlow Neural Machine Translation Tutorial
🍡 SeqGAN implementation for generating text using an RNN.
TextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
Implementation of Sequence Generative Adversarial Nets with Policy Gradient
Library to scrape and clean web pages to create massive datasets.
Split text files into sentences with Python and the NLTK API.
SeqGAN TensorFlow implementation