Stars
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Mastering Diverse Domains through World Models
bdashore3 / flash-attention
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
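Chinese-language repository for Llama3 and Llama3.1 (accompanying book in progress... interesting weights from community and vendor fine-tunes and modified versions, plus tutorial videos and documentation on training, inference, evaluation, and deployment)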
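This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2B model. The project includes both model and training code.
A repository for pretraining from scratch and then SFT of a small-parameter Chinese LLaMa2; a single 24G GPU is enough to obtain a chat-llama2 with basic Chinese question-answering ability.
A repository for individuals to experiment with and reproduce the pre-training process of LLMs.
A project for one-click role-play with small-parameter large language models; both data construction and training are included in the project.
This project aims to quickly detect and convert the encodings of large numbers of text files, to support data cleaning for the mnbvc corpus project.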
🌊 Numerically solving and backpropagating through the wave equation
A high-level toolbox for using complex valued neural networks in PyTorch
A Gradio web UI for Large Language Models.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Chinese NLP solutions (large models, data, models, training, inference)
ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An RWKV management and startup tool, fully automated, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…