devtools
A PyTorch Library for Multi-Task Learning
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Open source annotation tool for machine learning practitioners.
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
Mimix: A Text Generation Tool and Pretrained Chinese Models
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
A general-purpose neural semantic parser for mapping natural language queries into machine executable code
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。
CRF, Partial CRF and Marginal CRF in PyTorch
Automated Phrase Mining from Massive Text Corpora in Python.
PyScript is an open source platform for Python in the browser. Try PyScript: https://pyscript.com Examples: https://tinyurl.com/pyscript-examples Community: https://discord.gg/HxvBtukrg2
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
ModelScope: bring the notion of Model-as-a-Service to life.
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Streamlit — A faster way to build and share data apps.
Unsupervised text tokenizer for Neural Network-based text generation.
High-accuracy NLP parser with models for 11 languages.