writing restart files on write grid comp #1779

Closed
junwang-noaa opened this issue Jun 2, 2023 · 17 comments

@junwang-noaa
Collaborator

Description

When the option to write restart files on the write grid component is turned on in a C768 coupled run, writing is currently slow (~90 s) because ESMFproto_FieldBundleWrite is used. With sequential netCDF, the writing time (~20 s) is comparable to FMS writing the restart files. Some code changes are required to use sequential netCDF on the write grid component.

To Reproduce:

  1. Check out the latest UFS Weather Model develop branch and compile the code with FV3_GFS_v17_coupled_p8.
  2. Run the C768 coupled test (the test case is available at /lfs/h2/emc/ptmp/jun.wang/nco_benchmark/c768coupledDev).
  3. In model_configure, turn on "quilting_restart".
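
Step 3 can be scripted when the test has to be repeated. The following is only a minimal sketch, not part of the UFS Weather Model: it assumes model_configure uses one `key: value` setting per line with Fortran-style logicals such as `.true.`, and the helper name `set_model_configure_option` is hypothetical.

```python
import re


def set_model_configure_option(text: str, key: str, value: str) -> str:
    """Return `text` with `key` set to `value`; append the line if the key is absent.

    Assumes a "key: value" layout, one setting per line (an assumption about
    the model_configure format, not something stated in this issue).
    """
    # Match the whole key exactly, then the colon and padding, then the old value.
    pattern = re.compile(
        rf"^([ \t]*{re.escape(key)}[ \t]*:[ \t]*)\S.*$", re.MULTILINE
    )
    if pattern.search(text):
        # Keep the original key and padding, swap only the value.
        return pattern.sub(lambda m: m.group(1) + value, text, count=1)
    sep = "" if (not text or text.endswith("\n")) else "\n"
    return f"{text}{sep}{key}: {value}\n"


# Illustrative input only; a real model_configure has many more settings.
sample = "quilting:                .true.\nquilting_restart:        .false.\n"
updated = set_model_configure_option(sample, "quilting_restart", ".true.")
print(updated)
```

To apply it to a run directory, read model_configure into a string, pass it through the helper, and write the result back before launching the run.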

Additional context

Output

junwang-noaa added the bug label on Jun 2, 2023
@junwang-noaa
Collaborator Author

@DusanJovic-NOAA Is this issue fixed?

@DusanJovic-NOAA
Collaborator

> @DusanJovic-NOAA Is this issue fixed?

I do not remember. What's the simplest way to test this again?

@junwang-noaa
Collaborator Author

@DusanJovic-NOAA Can you try the HR3a test case on wcoss2 to see if there is slowness when the quilting_restart is turned on? Thanks

C768 on WCOSS2
/lfs/h2/emc/couple/noscrub/jessica.meixner/HR3/scoutruns/hr3scoutC768

@DusanJovic-NOAA
Collaborator

> @DusanJovic-NOAA Can you try the HR3a test case on wcoss2 to see if there is slowness when the quilting_restart is turned on? Thanks
>
> C768 on WCOSS2 /lfs/h2/emc/couple/noscrub/jessica.meixner/HR3/scoutruns/hr3scoutC768

I do not see a model run directory in this location, only logs and output GRIB files.

@junwang-noaa
Collaborator Author

Sorry, the test run is at:

/lfs/h2/emc/couple/noscrub/jessica.meixner/HR3/scoutruns/t02C768/RUNDIRS/t02C768/fcst.55567

@DusanJovic-NOAA
Collaborator

Is this on Cactus or Dogwood?

dusan.jovic@clogin03: ~
$ cd /lfs/h2/emc/couple/noscrub/jessica.meixner/HR3/scoutruns/t02C768/RUNDIRS/t02C768/fcst.55567
-bash: cd: /lfs/h2/emc/couple/noscrub/jessica.meixner/HR3/scoutruns/t02C768/RUNDIRS/t02C768/fcst.55567: No such file or directory

@junwang-noaa
Collaborator Author

@JessicaMeixner-NOAA May I ask if you can provide a GFSv17 HR3 test case? Thanks

@JessicaMeixner-NOAA
Collaborator

@GeorgeVandenberghe-NOAA has a canned case for the scout runs of HR3 running on various machines.

@GeorgeVandenberghe-NOAA
Collaborator

@junwang-noaa
Collaborator Author

@GeorgeVandenberghe-NOAA can you provide the location of the run directory you have on wcoss2? Thanks

@GeorgeVandenberghe-NOAA
Collaborator

@junwang-noaa
Collaborator Author

@DusanJovic-NOAA Do you have a chance to test HR3? Thanks

@DusanJovic-NOAA
Collaborator

> @DusanJovic-NOAA Do you have a chance to test HR3? Thanks

This commit should improve the timing of cubed-sphere restart file writing:

NOAA-EMC/fv3atm@d59e80e

@junwang-noaa
Collaborator Author

@DusanJovic-NOAA Just want to check whether the code updates above are in any of the FV3ATM PRs. If not, can you add them to your no-warning PR? Thanks

@DusanJovic-NOAA
Collaborator

> @DusanJovic-NOAA Just want to check whether the code updates above are in any of the FV3ATM PRs. If not, can you add them to your no-warning PR? Thanks

It's in this PR: NOAA-EMC/fv3atm#775

@junwang-noaa
Collaborator Author

@DusanJovic-NOAA The FV3 PR#775 was committed, can we close the issue?

@DusanJovic-NOAA
Collaborator

> @DusanJovic-NOAA The FV3 PR#775 was committed, can we close the issue?

Yes. Closing.
