Is PP = 1 faster than Sequential? #574

StellaAthena · 2022-02-25T13:43:17Z

As described in #573, there is a note in the code that if one sets PP = 1 the code will actually build a sequential model rather than a pipeline parallel model with one stage. It makes intuitive sense to me that a sequential model would be faster than a PP = 1 model because the pipelining has some overhead, but the authors of that issue report the opposite.

We should systematically test whether pipelining with one stage is faster than using a sequential model. If it isn’t, then we should consider changing the code so that PP = 1 sequentializes the model. If it is faster, it may still be worth keeping PP = 0 as a sequential model, because there are times when the pipeline engine is difficult to use.

Either way, we should clearly document the functionality and make the comments correct.
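
For reference, the comparison proposed above could be sketched as a small timing harness. This is only a rough sketch under stated assumptions: `time_step` and `compare` are hypothetical helpers, not part of the codebase, and the two workloads in the demo are placeholders that would need to be replaced with one real training step per configuration.

```python
import statistics
import time
from typing import Callable, Dict


def time_step(step_fn: Callable[[], None], warmup: int = 3, iters: int = 10) -> float:
    """Return the median wall-clock seconds taken by one call to step_fn."""
    for _ in range(warmup):
        # Warm-up calls are run but not timed (caches, lazy initialization).
        step_fn()
    samples = []
    for _ in range(iters):
        start = time.perf_counter()
        step_fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def compare(steps: Dict[str, Callable[[], None]], **kwargs) -> Dict[str, float]:
    """Time each named step function with identical warmup/iters settings."""
    return {name: time_step(fn, **kwargs) for name, fn in steps.items()}


if __name__ == "__main__":
    # Placeholder workloads (assumption): swap in a real training step
    # for the PP = 0 and PP = 1 configurations.
    def fake_sequential_step() -> None:
        sum(i * i for i in range(10_000))

    def fake_pipeline_step() -> None:
        sum(i * i for i in range(10_000))

    results = compare(
        {"PP=0 (sequential)": fake_sequential_step, "PP=1 (pipeline)": fake_pipeline_step}
    )
    for name, seconds in results.items():
        print(f"{name}: {seconds * 1e3:.3f} ms per step")
```

The median is used rather than the mean so that a single slow iteration does not dominate the result. For GPU workloads, each step function would also need to wait for the device to finish before returning, otherwise the timer only measures kernel launch time.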

sdtblck · 2022-03-01T13:51:26Z

I tested this ages ago. There is not much noticeable difference; the overhead is quite small.

Also, a slight mistake. You said:

"there is a note in the code that if one sets PP = 1 the code will actually build a sequential model rather than a pipeline parallel model with one stage."

It's the other way round: PP = 0 builds a sequential model, and PP = 1 builds a pipe parallel module.
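
To make the mapping concrete, here is a minimal sketch of the behaviour as described in this comment. The function name `model_kind` and the string labels are hypothetical and do not come from the codebase; rejecting negative values is also an assumption added for the example.

```python
def model_kind(pipe_parallel_size: int) -> str:
    """Map a PP setting to the kind of model that gets built (illustrative only)."""
    if pipe_parallel_size < 0:
        # Assumption: negative values are treated as invalid in this sketch.
        raise ValueError("pipe_parallel_size must be non-negative")
    # PP = 0  -> plain sequential model, no pipeline engine involved.
    # PP >= 1 -> pipeline parallel module, even when there is only one stage.
    return "sequential" if pipe_parallel_size == 0 else "pipeline"
```

So `model_kind(0)` gives `"sequential"`, while `model_kind(1)` gives `"pipeline"`, which is the opposite of what the issue description assumed.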

sdtblck · 2022-03-01T13:52:42Z

See PR #269 for details. Going to close this, as I think the linked thread should answer your questions.

StellaAthena added the "experiments" label (Experiments we wish to perform on the codebase) Feb 25, 2022

sdtblck closed this as completed Mar 1, 2022
