Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-work testing verification, fail file naming consistency, remove lsf support, simplify RT logs. #2061

Merged

Conversation

BrianCurtis-NOAA
Copy link
Collaborator

@BrianCurtis-NOAA BrianCurtis-NOAA commented Dec 22, 2023

Commit Queue Requirements:

  • Fill out all sections of this template.
  • All sub component pull requests have been reviewed by their code managers.
  • Run the full Intel+GNU RT suite (compared to current baselines) on either Hera/Derecho/Hercules
  • Commit 'test_changes.list' from previous step

Description

This PR updates the rt.sh script:

  • Makes fail_test and fail_compile files consistent in naming
  • Created a new generate_log() function in rt.sh to allow for better control over the information logged after the full RT suite is finished.
    • Log outputs minimized to information that's important to track, and easier to read for developers.
    • Adds test_changes.list file for repository tracking of what tests are changed due to the changes in the PR.
    • Adds test_changes.list file for easy creation of new baselines specific to the PR using rt.sh -b test_changes.out.
  • Ends support for lsf (queue system from old WCOSS systems)

Commit Message

* UFSWM
  * Re-work test verification in rt.sh for developers to use in PR process. (Closes #2058)
  * fail_<test_number> file name upgraded to use compile/test name instead of number.
  * Regression test logs do not clearly indicate when a test fails to run. (Closes #1821)
  * New test_changes.list file:
    * Verification of what tests change for each PR.
    * Used with './rt.sh -c -b test_changes.list` to help create new baselines for each PR.

Priority

  • Normal

Git Tracking

UFSWM

Sub component Pull Requests

  • None

UFSWM Blocking Dependencies

  • None

Changes

Regression Test Changes (Please commit test_changes.list):

  • None Expected

Input data Changes

  • None.

Library Changes/Upgrades

  • No Updates

Testing Log:

  • RDHPCS
    • Hera
    • Orion
    • Hercules
    • Jet
    • Gaea
    • Derecho
  • WCOSS2
    • Dogwood/Cactus
    • Acorn
  • CI
  • opnReqTest (complete task if unnecessary)

@BrianCurtis-NOAA BrianCurtis-NOAA changed the title Rt pretest Add pre-test option to rt.sh (-p), re-work testing verification, fail file naming consistency. Dec 22, 2023
@BrianCurtis-NOAA BrianCurtis-NOAA changed the title Add pre-test option to rt.sh (-p), re-work testing verification, fail file naming consistency. Add pre-test option to rt.sh (-p), re-work testing verification, fail file naming consistency, remove lfs support. Dec 22, 2023
@BrianCurtis-NOAA
Copy link
Collaborator Author

@zach1221 it looks like the Derecho issue would block any PR at this point, should we proceed with testing? Have the ORT's been successful?

I don't think there's another PR ready at the moment. We could continue testing against this PR today, on the other machines, while Derecho is worked on. I've got ORTs running on Hera now. @jkbk2004 do you want to wait until Derecho is ready?

@zach1221 can you submit jobs in sequential mode even if rocoto/eclfow are not functional ?

This would take an extremely long time.

@zach1221
Copy link
Collaborator

@zach1221 it looks like the Derecho issue would block any PR at this point, should we proceed with testing? Have the ORT's been successful?

I don't think there's another PR ready at the moment. We could continue testing against this PR today, on the other machines, while Derecho is worked on. I've got ORTs running on Hera now. @jkbk2004 do you want to wait until Derecho is ready?

@zach1221 can you submit jobs in sequential mode even if rocoto/eclfow are not functional ?

This would take an extremely long time.

It's still running since last night. Hopefully not too much longer.

@BrianCurtis-NOAA BrianCurtis-NOAA added No Baseline Change No Baseline Change Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked. labels Feb 20, 2024
@zach1221
Copy link
Collaborator

Jet is mostly done. Just had one test timeout that I'm rerunning.

@BrianCurtis-NOAA
Copy link
Collaborator Author

What changes were made in Derecho to allow these tests to pass? I didn't see any Derecho changes in that regard.

@zach1221
Copy link
Collaborator

What changes were made in Derecho to allow these tests to pass? I didn't see any Derecho changes in that regard.

I'm about to commit them.

@grantfirl grantfirl mentioned this pull request Feb 21, 2024
14 tasks
@zach1221
Copy link
Collaborator

Ok we should be ready to proceed with final review here. @jkbk2004 fyi

@BrianCurtis-NOAA
Copy link
Collaborator Author

BrianCurtis-NOAA commented Feb 21, 2024

Should be good to go for review/merge.
@jkbk2004 @SadeghTabas-NOAA

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
No Baseline Change No Baseline Change Ready for Commit Queue The PR is ready for the Commit Queue. All checkboxes in PR template have been checked.
Projects
None yet
7 participants