Block or Report
Block or report ZhiYuanZeng
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Some preliminary explorations of Mamba's context scaling.
Official release of InternLM2.5 7B base and chat models. 1M context support
Collaborative Training of Large Language Models in an Efficient Way
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
An open-source tool-augmented conversational language model from Fudan University
Instruct-tune LLaMA on consumer hardware
Codebase for multilingual neural machine translation
Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation
A retrieval augmented sequence modeling toolkit implemented based on Fairseq
Awesome papers on Language-Model-as-a-Service (LMaaS)
The entmax mapping and its loss, a family of sparse softmax alternatives.
PDFs and Codelabs for the Efficient Deep Learning book.
Boosting your Web Services of Deep Learning Applications.
A curated reading list of research in Mixture-of-Experts(MoE).
Tutel MoE: An Optimized Mixture-of-Experts Implementation
Making large AI models cheaper, faster and more accessible
Ongoing research training transformer models at scale
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Methods and Implements of Deep Clustering
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
如果可以,你最想穿越到哪部电影,小说里?利用 paddlenlp 中提供的 GPT2 和 wechaty 库展开对话故事续写,与 AI 互动共同创造剧情
A Word Sense Disambiguation system integrating implicit and explicit external knowledge.
TextBox 2.0 is a text generation library with pre-trained language models
Open Source Neural Machine Translation and (Large) Language Models in PyTorch