Tsinghua University
Beijing, China

Stars
Tools for merging pretrained large language models.
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
Lightning Training strategy for HiveMind
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
DSIR large-scale data selection framework for language model training
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
This is a list of peer-reviewed representative papers on deep learning dynamics (the optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and …
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
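The core of BPE training is simple enough to sketch in a few lines: start from raw bytes, repeatedly find the most frequent adjacent token pair, and merge it into a new token. The snippet below is a toy illustration of that idea under simplified assumptions (greedy left-to-right merging, `train_bpe` is a name made up here), not the code from the repository above.

```python
from collections import Counter

def train_bpe(text: str, num_merges: int):
    """Toy BPE trainer: repeatedly merge the most frequent adjacent pair.
    A simplified sketch for illustration only."""
    ids = list(text.encode("utf-8"))  # start from raw bytes, as byte-level LLM tokenizers do
    merges = {}                       # (left_id, right_id) -> new token id
    next_id = 256                     # ids 0..255 are reserved for raw bytes
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)  # most frequent adjacent pair
        merges[pair] = next_id
        # replace every occurrence of the pair with the new token id
        out, i = [], 0
        while i < len(ids):
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
        next_id += 1
    return ids, merges

# classic example: "aaabdaaabac" compresses as merges accumulate
ids, merges = train_bpe("aaabdaaabac", 2)
```

After two merges, the 11-byte input is represented by 7 tokens; the `merges` table is what a tokenizer would later replay to encode new text.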
Making large AI models cheaper, faster and more accessible
LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers
A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com
newspaper3k is a news, full-text, and article metadata extraction library in Python 3.
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Instruct-tune LLaMA on consumer hardware
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…