Remove unnecessary fp32/bf16 conversion #1169

DayOfThePenguin · 2024-03-03T23:14:25Z

No torch.distributed operation that would require a bf16 -> fp32 conversion is performed in the _split function, so it's safe to remove this.

Since fp32_allreduce=True is necessary for combining zero 1 & bf16, this should result in less memory allocation for common zero 1 + MP + PP use cases (saw ~10% drop on 125M model, albeit with batch size 4)

…erformed

Quentin-Anthony

Oh nice catch!

feat: remove unnecessary bf16 conversions since no collective op is p…

6abad8b

…erformed

DayOfThePenguin requested a review from Quentin-Anthony as a code owner March 3, 2024 23:14

pre-commit

00d473e

Quentin-Anthony approved these changes Mar 4, 2024

View reviewed changes

Quentin-Anthony merged commit 7b8187a into EleutherAI:main Mar 4, 2024
2 of 5 checks passed

DayOfThePenguin deleted the split-mappings-bf16-fix branch March 27, 2024 15:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove unnecessary fp32/bf16 conversion #1169

Remove unnecessary fp32/bf16 conversion #1169

DayOfThePenguin commented Mar 3, 2024 •

edited

Quentin-Anthony left a comment

Remove unnecessary fp32/bf16 conversion #1169

Remove unnecessary fp32/bf16 conversion #1169

Conversation

DayOfThePenguin commented Mar 3, 2024 • edited

Quentin-Anthony left a comment

Choose a reason for hiding this comment

DayOfThePenguin commented Mar 3, 2024 •

edited