
Improve Conversion Utilities #1124

Merged · 24 commits merged into main from update-conversion-utils · Feb 8, 2024

Conversation

@haileyschoelkopf (Contributor) commented Jan 17, 2024

[Draft for now]

This PR seeks to improve the conversion and checkpoint import/export tools in GPT-NeoX.

Intended changes are:
NeoX to HF:

  • Unify the previously separate PipelineModule and Sequential conversion scripts
  • Add some NeoX-to-HF quality-of-life features: manually override the checkpoint's save precision, and allow skipping the tokenizer upload (if one is using Tiktoken, say); see the precision sketch after this list
  • Support Mistral and Llama models, including GQA in both
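
Roughly, the precision override amounts to something like the following (a sketch against the Hugging Face API with a hypothetical path; the script's actual flags and internals may differ):

```python
import torch
from transformers import AutoModelForCausalLM

# Hypothetical post-conversion step: cast the exported HF model to a chosen
# dtype before saving, rather than keeping whatever precision training used.
model = AutoModelForCausalLM.from_pretrained("path/to/converted-hf-model")
model = model.to(dtype=torch.float16)  # or torch.bfloat16 / torch.float32
model.save_pretrained("path/to/converted-hf-model-fp16")
```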

Meta Llama / Mistral to NeoX:

  • Enable and test GQA (see the shape sketch after this list)
  • Add Llama 2 loading instructions
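
For context on what GQA changes for conversion, here is a minimal shape-level sketch (illustrative dimensions only; not the converter's actual weight layout):

```python
import torch

# Illustrative GQA dimensions (e.g., Llama-2-70B-style: 64 query heads, 8 KV heads).
hidden_size = 8192
num_q_heads = 64
num_kv_heads = 8                      # GQA: fewer key/value heads than query heads
head_dim = hidden_size // num_q_heads

# Per-layer HF Llama projection weights (random placeholders, shapes only).
q_proj = torch.randn(num_q_heads * head_dim, hidden_size)
k_proj = torch.randn(num_kv_heads * head_dim, hidden_size)
v_proj = torch.randn(num_kv_heads * head_dim, hidden_size)

# Naive fused QKV: with GQA the fused matrix is no longer 3 * hidden_size tall,
# so a converter must carry num_kv_heads through the mapping instead of assuming
# equal Q/K/V sizes (and split it back consistently per tensor-parallel rank).
qkv = torch.cat([q_proj, k_proj, v_proj], dim=0)
assert qkv.shape == ((num_q_heads + 2 * num_kv_heads) * head_dim, hidden_size)
```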

Tests:

  • convert_module_to_hf.py and convert_neox_to_hf.py output precisely the same files, even under tensor parallelism (NeoX)
  • convert_sequential_to_hf.py and convert_neox_to_hf.py output precisely the same files, even under tensor parallelism (NeoX); see the comparison sketch after this list
  • Llama model conversion works correctly in the absence of GQA
  • Llama model conversion works correctly with GQA
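
The "precisely the same files" checks can be approximated with a file-hash comparison like the one below (hypothetical output paths; not necessarily how the tests are implemented):

```python
import hashlib
from pathlib import Path

def dir_hashes(root: str) -> dict[str, str]:
    """Map each file's relative path to its SHA-256 digest."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(root.rglob("*")) if p.is_file()
    }

# Hypothetical output directories from two conversion scripts.
assert dir_hashes("out/convert_sequential_to_hf") == dir_hashes("out/convert_neox_to_hf"), \
    "converted checkpoints differ"
```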

@haileyschoelkopf marked this pull request as ready for review February 6, 2024 01:40
@Quentin-Anthony (Member) previously approved these changes Feb 8, 2024 and left a comment:


All comments addressed over DM. Great work!

@Quentin-Anthony merged commit f7373f8 into main Feb 8, 2024
2 checks passed
@Quentin-Anthony deleted the update-conversion-utils branch February 8, 2024 00:25
Successfully merging this pull request may close these issues.

  • Convert HF format or raw weights of Llama2 to NEOX format
  • Add Instructions for Loading Llama2 Models