Block or Report
Block or report Edwardus2022
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
bdashore3 / flash-attention
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Llama3 中文仓库(聚合资料,各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and training code.
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作
🌊 Numerically solving and backpropagating through the wave equation
A high-level toolbox for using complex valued neural networks in PyTorch
A Gradio web UI for Large Language Models.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
中文nlp解决方案(大模型、数据、模型、训练、推理)
ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…