Skip to content

Actions: intelligent-machine-learning/dlrover

Actions

CI

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,599 workflow runs
2,599 workflow runs

Filter by Event

Loading

Filter by Status

Loading

Filter by Branch

Loading

Filter by Actor

Loading
Check wether AUTO_MONITOR_RESOURCE env is True (#452)
CI #1342: Commit 29367b9 pushed by workingloong
June 25, 2023 02:03 5m 42s master
June 25, 2023 02:03 5m 42s
June 21, 2023 12:03 6m 32s
Add args to enable relaunch error pods. (#449)
CI #1336: Commit 3e05e98 pushed by hxdtest
June 19, 2023 10:24 5m 29s master
June 19, 2023 10:24 5m 29s
Randomly select a free port for DLRover master. (#448)
CI #1333: Commit 326ffa8 pushed by workingloong
June 19, 2023 08:07 5m 37s master
June 19, 2023 08:07 5m 37s
Only local rank 0 process reports node status (#447)
CI #1330: Commit 532173d pushed by workingloong
June 16, 2023 07:47 7m 2s master
June 16, 2023 07:47 7m 2s
Log the worker name when added to rendezvous (#446)
CI #1328: Commit 44f7f6e pushed by hxdtest
June 16, 2023 02:32 6m 25s master
June 16, 2023 02:32 6m 25s
June 14, 2023 09:11 6m 0s
add auto_accelerate API (#441)
CI #1321: Commit 5751c13 pushed by nash635
June 14, 2023 03:54 5m 33s master
June 14, 2023 03:54 5m 33s
Overwrite torchrun to register dlrover_master (#442)
CI #1319: Commit 697db48 pushed by workingloong
June 14, 2023 03:25 5m 11s master
June 14, 2023 03:25 5m 11s
ProTip! You can narrow down the results and go further in time using created:<2023-06-14 or the other filters available.