Improve `BENCHMARKING.md` with more detailed info. #619

tcojean · 2020-08-10T14:21:08Z

Adds more detailed information on how to benchmark Ginkgo.

Thanks @adam-m-jcbs for his useful review process for the JOSS paper which pointed to a lack of documentation on how to benchmark Ginkgo. I hope that with this PR we can fix this issue. See #597.

Summary:

Detail a bit how to use ssget and what to watch out for.
Add a benchmark overview section with the most important options.
Optionally detail a little how to interact with the GPE after obtaining benchmark results as well as what to watch out for.
Detail a little how to obtain more detailed information as well as how to debug Ginkgo through loggers.
Update the available options in the script.
In addition, I try to change the way we generate the run_all_benchmarks.sh script in order to give the proper rights to this file by default (execution x access).

codecov · 2020-08-10T17:58:36Z

Codecov Report

Merging #619 into develop will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff            @@
##           develop     #619   +/-   ##
========================================
  Coverage    92.89%   92.89%           
========================================
  Files          303      303           
  Lines        21331    21331           
========================================
  Hits         19815    19815           
  Misses        1516     1516

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update bc432c4...3ea7874. Read the comment docs.

pratikvn

LGTM! Pretty comprehensive. It would also be nice to have something like a best-practice guide that explains what settings of Ginkgo the benchmarks should be run for. There we could mention some generic aspects such as:

Compile your code in Release
Make sure the machine has no competing jobs.
The recommended number of warmup iterations (which might be different for solvers, preconds and SpMV)

And Ginkgo specific ones such as:

For adaptive block jacobi, enable the GINKGO_JACOBI_FULL_OPTIMIZATIONS flag with a warning that this uses a lot more memory.
Mention that we also have a overhead benchmarking setup.

and any other ones that I have forgotten

BENCHMARKING.md

tcojean · 2020-08-12T16:52:55Z

Thanks for your comments @pratikvn, I integrated your suggestions.

yhmtsai

LGTM

BENCHMARKING.md

upsj

LGTM! A general proposal: If we have interdependent pieces of documentation (BENCHMARKING.md + benchmark commandline options, ...), do you think it would make sense to add a note to the source code to check the dependent documentation? An example would be adding a new preconditioner, spmv algorithm or solver to the benchmarks

Co-authored-by: Pratik Nayak <[email protected]>

Co-authored-by: Yuhsiang M. Tsai <[email protected]>

tcojean · 2020-08-24T07:57:45Z

@upsj how do you suggest to do this? I don't think there is a central place in the source code where we could do this, so would you add a comment in every source file?

tcojean · 2020-08-24T08:55:11Z

I opened #623 to discuss the split documentation issue.

sonarcloud · 2020-08-24T13:07:59Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities (and 0 Security Hotspots to review)
0 Code Smells

No Coverage information
No Duplication information

The version of Java (1.8.0_121) you have used to run this analysis is deprecated and we will stop accepting it from October 2020. Please update to at least Java 11.
Read more here

Release 1.3.0 of Ginkgo. The Ginkgo team is proud to announce the new minor release of Ginkgo version 1.3.0. This release brings CUDA 11 support, changes the default C++ standard to be C++14 instead of C++11, adds a new Diagonal matrix format and capacity for diagonal extraction, significantly improves the CMake configuration output format, adds the Ginkgo paper which got accepted into the Journal of Open Source Software (JOSS), and fixes multiple issues. Supported systems and requirements: + For all platforms, cmake 3.9+ + Linux and MacOS + gcc: 5.3+, 6.3+, 7.3+, all versions after 8.1+ + clang: 3.9+ + Intel compiler: 2017+ + Apple LLVM: 8.0+ + CUDA module: CUDA 9.0+ + HIP module: ROCm 2.8+ + Windows + MinGW and Cygwin: gcc 5.3+, 6.3+, 7.3+, all versions after 8.1+ + Microsoft Visual Studio: VS 2017 15.7+ + CUDA module: CUDA 9.0+, Microsoft Visual Studio + OpenMP module: MinGW or Cygwin. The current known issues can be found in the [known issues page](https://github.com/ginkgo-project/ginkgo/wiki/Known-Issues). Additions: + Add paper for Journal of Open Source Software (JOSS). [#479](#479) + Add a DiagonalExtractable interface. [#563](#563) + Add a new diagonal Matrix Format. [#580](#580) + Add Cuda11 support. [#603](#603) + Add information output after CMake configuration. [#610](#610) + Add a new preconditioner export example. [#595](#595) + Add a new cuda-memcheck CI job. [#592](#592) Changes: + Use unified memory in CUDA debug builds. [#621](#621) + Improve `BENCHMARKING.md` with more detailed info. [#619](#619) + Use C++14 standard instead of C++11. [#611](#611) + Update the Ampere sm information and CudaArchitectureSelector. [#588](#588) Fixes: + Fix documentation warnings and errors. [#624](#624) + Fix warnings for diagonal matrix format. [#622](#622) + Fix criterion factory parameters in CUDA. [#586](#586) + Fix the norm-type in the examples. [#612](#612) + Fix the WAW race in OpenMP is_sorted_by_column_index. [#617](#617) + Fix the example's exec_map by creating the executor only if requested. [#602](#602) + Fix some CMake warnings. [#614](#614) + Fix Windows building documentation. [#601](#601) + Warn when CXX and CUDA host compiler do not match. [#607](#607) + Fix reduce_add, prefix_sum, and doc-build. [#593](#593) + Fix find_library(cublas) issue on machines installing multiple cuda. [#591](#591) + Fix allocator in sellp read. [#589](#589) + Fix the CAS with HIP and NVIDIA backends. [#585](#585) Deletions: + Remove unused preconditioner parameter in LowerTrs. [#587](#587) Related PR: #625

The Ginkgo team is proud to announce the new minor release of Ginkgo version 1.3.0. This release brings CUDA 11 support, changes the default C++ standard to be C++14 instead of C++11, adds a new Diagonal matrix format and capacity for diagonal extraction, significantly improves the CMake configuration output format, adds the Ginkgo paper which got accepted into the Journal of Open Source Software (JOSS), and fixes multiple issues. Supported systems and requirements: + For all platforms, cmake 3.9+ + Linux and MacOS + gcc: 5.3+, 6.3+, 7.3+, all versions after 8.1+ + clang: 3.9+ + Intel compiler: 2017+ + Apple LLVM: 8.0+ + CUDA module: CUDA 9.0+ + HIP module: ROCm 2.8+ + Windows + MinGW and Cygwin: gcc 5.3+, 6.3+, 7.3+, all versions after 8.1+ + Microsoft Visual Studio: VS 2017 15.7+ + CUDA module: CUDA 9.0+, Microsoft Visual Studio + OpenMP module: MinGW or Cygwin. The current known issues can be found in the [known issues page](https://github.com/ginkgo-project/ginkgo/wiki/Known-Issues). Additions: + Add paper for Journal of Open Source Software (JOSS). [#479](#479) + Add a DiagonalExtractable interface. [#563](#563) + Add a new diagonal Matrix Format. [#580](#580) + Add Cuda11 support. [#603](#603) + Add information output after CMake configuration. [#610](#610) + Add a new preconditioner export example. [#595](#595) + Add a new cuda-memcheck CI job. [#592](#592) Changes: + Use unified memory in CUDA debug builds. [#621](#621) + Improve `BENCHMARKING.md` with more detailed info. [#619](#619) + Use C++14 standard instead of C++11. [#611](#611) + Update the Ampere sm information and CudaArchitectureSelector. [#588](#588) Fixes: + Fix documentation warnings and errors. [#624](#624) + Fix warnings for diagonal matrix format. [#622](#622) + Fix criterion factory parameters in CUDA. [#586](#586) + Fix the norm-type in the examples. [#612](#612) + Fix the WAW race in OpenMP is_sorted_by_column_index. [#617](#617) + Fix the example's exec_map by creating the executor only if requested. [#602](#602) + Fix some CMake warnings. [#614](#614) + Fix Windows building documentation. [#601](#601) + Warn when CXX and CUDA host compiler do not match. [#607](#607) + Fix reduce_add, prefix_sum, and doc-build. [#593](#593) + Fix find_library(cublas) issue on machines installing multiple cuda. [#591](#591) + Fix allocator in sellp read. [#589](#589) + Fix the CAS with HIP and NVIDIA backends. [#585](#585) Deletions: + Remove unused preconditioner parameter in LowerTrs. [#587](#587) Related PR: #627

tcojean added is:enhancement An improvement of an existing feature. reg:build This is related to the build system. reg:documentation This is related to documentation. reg:benchmarking This is related to benchmarking. 1:ST:ready-for-review This PR is ready for review labels Aug 10, 2020

tcojean self-assigned this Aug 10, 2020

tcojean force-pushed the improve_benchmarking_doc branch from 13befed to 9aed0d9 Compare August 10, 2020 14:22

tcojean requested review from fritzgoebel, pratikvn, thoasm, upsj and yhmtsai August 10, 2020 14:45

pratikvn approved these changes Aug 12, 2020

View reviewed changes

BENCHMARKING.md Outdated Show resolved Hide resolved

BENCHMARKING.md Outdated Show resolved Hide resolved

BENCHMARKING.md Outdated Show resolved Hide resolved

BENCHMARKING.md Outdated Show resolved Hide resolved

tcojean force-pushed the improve_benchmarking_doc branch from 9aed0d9 to 409acef Compare August 12, 2020 16:52

yhmtsai approved these changes Aug 13, 2020

View reviewed changes

BENCHMARKING.md Show resolved Hide resolved

BENCHMARKING.md Show resolved Hide resolved

BENCHMARKING.md Outdated Show resolved Hide resolved

tcojean force-pushed the improve_benchmarking_doc branch from 409acef to 2c2185d Compare August 13, 2020 18:41

upsj approved these changes Aug 17, 2020

View reviewed changes

upsj added 1:ST:ready-to-merge This PR is ready to merge. and removed 1:ST:ready-for-review This PR is ready for review labels Aug 17, 2020

tcojean and others added 3 commits August 24, 2020 09:55

Improve BENCHMARKING.md with more detailed info.

999d0ec

Add a best practice guideline and fix some typos.

2728fd1

Co-authored-by: Pratik Nayak <[email protected]>

Improve the data commit detailed steps and format

3ea7874

Co-authored-by: Yuhsiang M. Tsai <[email protected]>

tcojean force-pushed the improve_benchmarking_doc branch from 2c2185d to 3ea7874 Compare August 24, 2020 07:56

tcojean merged commit f54f421 into develop Aug 24, 2020

tcojean deleted the improve_benchmarking_doc branch August 24, 2020 14:04

tcojean mentioned this pull request Aug 25, 2020

Fix documentation warnings and errors. #624

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve `BENCHMARKING.md` with more detailed info. #619

Improve `BENCHMARKING.md` with more detailed info. #619

tcojean commented Aug 10, 2020

codecov bot commented Aug 10, 2020 •

edited

Loading

pratikvn left a comment

tcojean commented Aug 12, 2020

yhmtsai left a comment

upsj left a comment

tcojean commented Aug 24, 2020

tcojean commented Aug 24, 2020

sonarcloud bot commented Aug 24, 2020

Improve BENCHMARKING.md with more detailed info. #619

Improve BENCHMARKING.md with more detailed info. #619

Conversation

tcojean commented Aug 10, 2020

codecov bot commented Aug 10, 2020 • edited Loading

Codecov Report

pratikvn left a comment

Choose a reason for hiding this comment

tcojean commented Aug 12, 2020

yhmtsai left a comment

Choose a reason for hiding this comment

upsj left a comment

Choose a reason for hiding this comment

tcojean commented Aug 24, 2020

tcojean commented Aug 24, 2020

sonarcloud bot commented Aug 24, 2020

Improve `BENCHMARKING.md` with more detailed info. #619

Improve `BENCHMARKING.md` with more detailed info. #619

codecov bot commented Aug 10, 2020 •

edited

Loading