Recent LR Scheduler change does not account for inference/evaluation #1059

Closed
dashstander opened this issue Oct 17, 2023 · 0 comments · Fixed by #1060
Labels
bug Something isn't working

@dashstander (Contributor)
The function setup_model_and_optimizer is used for evaluation and inference as a hack to initialize DeepSpeed properly. However, a recent change, made to ensure the LR scheduler is properly updated after resuming training from a checkpoint, assumes there will always be an lr_scheduler object. Right now, attempting to run evaluate.py from NeoX main gives:

File "/mnt/ssd-1/dashiell/gpt-neox/megatron/utils.py", line 448, in setup_for_inference_or_eval
        lr_scheduler.optimizer = model.optimizerlr_scheduler.optimizer = model.optimizer

    lr_scheduler.optimizer = model.optimizerAttributeError
AttributeError: : 'NoneType' object has no attribute 'optimizer''NoneType' object has no attribute 'optimizer'
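A minimal sketch of the kind of guard that would avoid the crash (paraphrased for illustration, not the exact code from the repo; the actual fix is in #1060, and the neox_args / use_cache argument names here are assumptions):

```python
# Sketch of the relevant part of setup_for_inference_or_eval in
# megatron/utils.py (paraphrased). In the inference/evaluation path no LR
# scheduler is constructed, so setup_model_and_optimizer returns
# lr_scheduler=None.
model, optimizer, lr_scheduler = setup_model_and_optimizer(
    neox_args=neox_args, use_cache=use_cache
)

# The recent checkpoint-resume change re-points the scheduler at the
# DeepSpeed-wrapped optimizer unconditionally, which is what raises
# AttributeError when lr_scheduler is None:
#     lr_scheduler.optimizer = model.optimizer

# Guarding the re-wiring keeps the checkpoint-resume behavior for training
# while letting evaluation/inference proceed without a scheduler.
if lr_scheduler is not None:
    lr_scheduler.optimizer = model.optimizer
```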
@dashstander dashstander added the bug Something isn't working label Oct 17, 2023
@dashstander dashstander self-assigned this Oct 17, 2023
@dashstander dashstander linked a pull request Oct 17, 2023 that will close this issue