Commit

Do not set the address of failed PS into PS hosts (#399)
workingloong committed May 9, 2023
1 parent 6ac2e11 commit edc076b
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions dlrover/python/master/node/ps.py
@@ -286,6 +286,7 @@ def get_ps_addrs(self):
                 and not ps.is_released
                 and ps.status
                 in [NodeStatus.INITIAL, NodeStatus.PENDING, NodeStatus.RUNNING]
+                and not ps.timeout(_dlrover_ctx.seconds_to_wait_failed_ps)
             ):
                 ps_addrs[ps.rank_index] = ps.service_addr
         ps_addrs = collections.OrderedDict(sorted(ps_addrs.items()))
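
To illustrate the effect of the added condition, here is a minimal, self-contained sketch. It does not use DLRover's real classes: `FakePS`, the status strings, and the free function `get_ps_addrs(nodes, seconds_to_wait_failed_ps)` are hypothetical stand-ins, and the behavior of `timeout()` (true when a PS has not started running within the given number of seconds) is an assumption, since the diff does not show its implementation.

```python
import collections
import time
from dataclasses import dataclass, field

# Hypothetical status constants; stand-ins for NodeStatus in the diff.
INITIAL, PENDING, RUNNING = "Initial", "Pending", "Running"


@dataclass
class FakePS:
    """Hypothetical stand-in for a PS node; not DLRover's real class."""

    rank_index: int
    service_addr: str
    status: str = RUNNING
    is_released: bool = False
    create_time: float = field(default_factory=time.time)

    def timeout(self, seconds: float) -> bool:
        # Assumed semantics: a PS that has still not started running
        # after `seconds` is treated as timed out.
        not_started = self.status in (INITIAL, PENDING)
        return not_started and time.time() - self.create_time > seconds


def get_ps_addrs(nodes, seconds_to_wait_failed_ps):
    """Collect addresses of usable PS nodes, ordered by rank index."""
    ps_addrs = {}
    for ps in nodes:
        if (
            not ps.is_released
            and ps.status in (INITIAL, PENDING, RUNNING)
            # The condition added by this commit: skip timed-out PS nodes.
            and not ps.timeout(seconds_to_wait_failed_ps)
        ):
            ps_addrs[ps.rank_index] = ps.service_addr
    return collections.OrderedDict(sorted(ps_addrs.items()))


nodes = [
    FakePS(1, "ps-1:2222"),
    FakePS(0, "ps-0:2222"),
    # Pending for an hour: excluded once the timeout check is in place.
    FakePS(2, "ps-2:2222", status=PENDING, create_time=time.time() - 3600),
]
addrs = get_ps_addrs(nodes, seconds_to_wait_failed_ps=600)
print(list(addrs.values()))
```

Under these assumptions, the long-pending PS with rank 2 is left out of the returned addresses. Without the `timeout` check it would still be included, because its status is in the accepted list.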