Skip to content

Luo-Liang/OSDP-public

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OSDP-public

Composable + Tunable = Optimal

OSDP =

  1. A learned communication performance (GBDT) model based on real world measurements
  2. A profiled compute performance model based on memory and latency profile for the module
  3. A Simulator that computes latency and peak memory usage given these two performance models' The sim is needed because launching distributed jobs are too slow - this surrogate model allows the tuner to explore a larger space
  4. A sequential model optimizer that explores the exploration space.

The core of the OSDP performance model is similar to the one used in Srifty:

Luo, L., West, P., Patel, P., Krishnamurthy, A. and Ceze, L., 2022. SRIFTY: Swift and Thrifty Distributed Neural Network Training on the Cloud. Proceedings of Machine Learning and Systems, 4, pp.833-847.

Output = A list of ShardingStrategy which corresponds to the optimal FSDP shardingstrategy given a list of modules (abstracted as a serialized list of execution information).

About

Composable + Tunable = Optimal

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages