Stars
PyTorch implementation of the paper "Hyperbolic Interaction Model For Hierarchical Multi-Label Classification"
The UC Davis Corpus of Written Spanish, L2 and Heritage Speakers
This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences from C4 using a tagged corruption model. The approach and the d…
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Sharpness-Aware Minimization for Efficiently Improving Generalization
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Feel free to fine tune large BERT models with Multi-GPU and FP16 support.
keras implement of transformers for humans
A tensorflow implementation of Fairseq Convolutional Sequence to Sequence Learning(Gehring et al. 2017)
YSDA course in Natural Language Processing