3d Composability #129290

Closed
wants to merge 26 commits into from

Contributor

@mori360 mori360 commented Jun 22, 2024

pytorch (fsdp, tp, pp) -> pytorch (composable)
Move (fsdp, tp, pp) tests under pytorch into a composable folder

TP:
test/distributed/tensor/parallel/test_ddp_2d_parallel.py
test/distributed/tensor/parallel/test_fsdp_2d_parallel.py
PP:
test/distributed/pipelining/test_composability.py
FSDP:
test/distributed/_composable/fsdp/test_fully_shard_training.py
-TestFullyShard2DTraining
-TestFullyShardHSDPTraining

=>
distributed/_composable/test_composability/test_noncontinuous.py
distributed/_composable/test_composability/test_continuous.py
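
To run the two new test files locally, a minimal sketch follows. It assumes the `python test/run_test.py --verbose -i <path>` invocation form quoted in the review thread of this PR. The helper name `build_run_commands` is hypothetical and not part of PyTorch. It only builds and prints the command lines; it does not execute them.

```python
import shlex

# Test files named in this PR description (paths relative to test/).
NEW_TEST_FILES = [
    "distributed/_composable/test_composability/test_noncontinuous.py",
    "distributed/_composable/test_composability/test_continuous.py",
]


def build_run_commands(test_files):
    """Return one run_test.py command line per test file.

    Hypothetical helper for illustration; the flag form mirrors the
    commands quoted in this PR's review thread.
    """
    return [
        shlex.join(["python", "test/run_test.py", "--verbose", "-i", path])
        for path in test_files
    ]


# Print the commands so they can be copied into a shell at the repo root.
for command in build_run_commands(NEW_TEST_FILES):
    print(command)
```

The printed lines are meant to be run from the root of a PyTorch checkout.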

cc @mrshenli @pritamdamania87 @zhaojuanmao @satgera @gqchen @aazzolini @osalpekar @jiayisuse @H-Huang @kwen2501 @awgu @penguinwu @fegin @XilunWu @wanchaol @fduwjj @wz337 @tianyu-l @wconstab @yf225 @chauhang @d4l3k

pytorch-bot bot commented Jun 22, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/129290

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 96e413e with merge base 7b1988f:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the oncall: distributed Add this issue/PR to distributed oncall triage queue label Jun 22, 2024
@mori360 mori360 added the ciflow/trunk Trigger trunk jobs on your pull request label Jun 25, 2024
@pytorch-bot pytorch-bot bot added the release notes: distributed (ddp) release notes category label Jun 26, 2024
@mori360 mori360 requested a review from wconstab June 27, 2024 04:47
@mori360 mori360 marked this pull request as ready for review June 27, 2024 04:47
@mori360 mori360 requested a review from a team as a code owner June 27, 2024 04:47
time python test/run_test.py --verbose -i distributed/pipelining/test_composability.py
# 3D composability tests
time python test/run_test.py --verbose -i distributed/_composable/test_composability/test_noncontinuous.py
time python test/run_test.py --verbose -i distributed/_composable/test_composability/test_continuous.py
Contributor

Could I know why the new tests are called test_noncontinuous and test_continuous?

Contributor Author

The tests in test_continuous are built on MultiProcContinousTest, hence the file name test_continuous.
The other tests follow the old test style. test_noncontinuous is not a suitable name; what do you think of test_composability_fsdp_tp?

@weifengpy
Contributor

weifengpy commented Jun 27, 2024

It might be more actionable if the PR is broken down into 4 PRs, so each POC can comfortably review and accept their part of the change. Are you interested in learning ghstack for this purpose? If it's too much trouble, we can discuss whether we need to collect stamps from each POC before landing.

  • moving FSDP tests
  • moving TP tests
  • moving PP tests
  • moving DDP tests

@mori360 mori360 marked this pull request as draft June 27, 2024 06:22
@Dawnrena01

Could you possibly help me with this 3D face app?

@mori360 mori360 closed this Jul 25, 2024