Improve runtime statistics #298

Closed
4 tasks
stgeke opened this issue May 24, 2021 · 2 comments

stgeke commented May 24, 2021

  • print min, max, avg for time per timestep
  • print fraction relative to total solve time for all timers
  • add timestep (makes it easier to grep)
  • add time spent in BC kernels
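
A minimal sketch of the statistics requested above, in Python for brevity. This is not nekRS code: the function names, the timer names (`bcKernels`, `pressureSolve`), and the output layout are all hypothetical, chosen only to illustrate min/max/avg per timestep, the fraction of total solve time per timer, and prefixing every line with the timestep so it can be grepped.

```python
def summarize_timesteps(step_times):
    """Return (min, max, avg) of per-timestep wall times in seconds."""
    if not step_times:
        raise ValueError("need at least one timestep")
    return min(step_times), max(step_times), sum(step_times) / len(step_times)


def timer_fractions(timers, total_solve):
    """Map each timer name to its fraction of the total solve time."""
    if total_solve <= 0.0:
        raise ValueError("total solve time must be positive")
    return {name: elapsed / total_solve for name, elapsed in timers.items()}


def format_report(step, step_times, timers, total_solve):
    """Build a report in which every line starts with the timestep."""
    tmin, tmax, tavg = summarize_timesteps(step_times)
    lines = [
        f"step={step} timePerStep min/max/avg: "
        f"{tmin:.3e}s {tmax:.3e}s {tavg:.3e}s"
    ]
    for name, frac in timer_fractions(timers, total_solve).items():
        # Hypothetical layout: elapsed seconds, then fraction of solve time.
        lines.append(f"step={step} {name}: {timers[name]:.3e}s {frac:.2f}")
    return "\n".join(lines)


if __name__ == "__main__":
    # Made-up sample data, including a hypothetical BC-kernel timer.
    print(format_report(
        step=10,
        step_times=[1.0, 2.0, 3.0],
        timers={"bcKernels": 0.5, "pressureSolve": 1.5},
        total_solve=2.0,
    ))
```

With the timestep on every line, a command such as `grep "step=10 "` would pull out all statistics for a single step.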
stgeke added this to the v21.1 milestone May 25, 2021
RonRahaman added a commit to RonRahaman/nekRS that referenced this issue May 26, 2021
977e71b9d CHANGELOG adjustments
0ca69b7ed Update CHANGELOG
1b1ca7baf Fixed typo in ref manual
540a72b9f Sorted user manual references
0a82c6d41 One more update to CHANGELOG
d380ba124 amg parameter numbers
c05d97a62 more on GPU AMG
9d1a9d410 boomerAMG on GPUs
57d1288bd bug fix
dfceea17d Merge branch 'master' of github.com:hypre-space/hypre
0bb67902b GPU build
3b65190b2 A few updates to the CHANGELOG
0fb7956ea Update version number for new release
426b83b64 Updated CHANGELOG for 2.21.0 release
3bc7d267e Gpu default (Nek5000#336)
8a41a42c8 Cmake cuda updates (Nek5000#349)
995969c9a fix reordering option for diagonal of square matrix transpose (Nek5000#370)
ae362727b Change HYPRE_RELEASE_NUMBER to be int in CMake Config.h file (Nek5000#372)
3894857b1 Adding a new HYPRE_RELEASE_NUMBER macro (Nek5000#366)
391f4fd6e Fix hypre_CSRMatrixAddHost (Nek5000#369)
08b8cdbf7 bug fix
74482aef4 This branch fixes some of the profiling code in nvtx. All Push/Pop ranges never explicitly use PushColor version around device kernels. We allow the general method to determine the correct color. Next, there was a missing PopRange() in par_mod_lr_interp.c. Finally, I changed the profiling names in par_mod_lr_interp.c as they conflicted with names in another file. (Nek5000#368)
c7175a987 Cuda11.2 (Nek5000#362)
1c979f338 Fix so this compiles with hip (Nek5000#365)
c38527c45 Add OMP support to Mat/Mat add functions (Nek5000#341)
91e82e699 hybridSetAggInterpType (Nek5000#358)
fbaf2b75d Fix a corner-case divide-by-zero in PFMG (see github issue Nek5000#332) (Nek5000#355)
c272f6980 Fix MGR setup on GPU and MGR bugfixes.  (#260)
3f12d4765 Add support for matrices with many zero rows (Nek5000#300)
521ef12ff Fix colmap (Nek5000#328)
c5110654c add include in fei makefile (Nek5000#334)
4718207e9 hypre_strcpy (Nek5000#340)
9a28cc622 Config without args (Nek5000#338)
25646da90 Mat descr (Nek5000#331)
34b2f43ba Merge branch 'master' of github.com:hypre-space/hypre
0d730f374 Fixed seg-fault problem
3f6c0abc4 Merge branch 'master' of https://github.com/hypre-space/hypre
e79d05d0b removed some missed col_offd_S_to_A
d2ef5cf18 Fixed ExtInterpDevice
0a0ddd072 Fix s (Nek5000#312)
25d0498a3 Roctx (Nek5000#319)
b3a4a76a5 Roc sparse (Nek5000#316)
366b80f89 2 stage gs (Nek5000#314)
8002200aa Revert "2 stage gs (Nek5000#308)" (Nek5000#313)
f7fb2853f 2 stage gs (Nek5000#308)
fcd2b2b4b Add missing utilities source file to CMakeList
ce2a43821 removing device bindings in HYPRE_Init (Nek5000#296)
8223465ea More hip support (Nek5000#302)
67a0bdeab Hip support (Nek5000#298)
4630047ad HYPRE_USING_GPU (Nek5000#295)
9d45b1934 Remove unnecessary including of superlu_ddefs in ij test
2e97ffcc9 bug fix recvbufsize (Nek5000#294)
c9848a5ed Fixed a small mistake in configure/configure.in
b5070c14e configure fix for last merged PR
8c00f18a9 Hip build (Nek5000#293)
7859aeee3 Changed the rhs in several TEST_sstruct/periodic runs (Nek5000#291)
512219634 Adding filter to check-license test to ignore runtests-* files
68f510c11 Test jobs for enable-mixedint (Nek5000#280)
277b35785 Removing 'memory.h' include since it is not ISO standard (Nek5000#279)
c4ca3e105 fixed wrong pointer checking; error handler in debug mode (Nek5000#275)
33da39f68 Fixed a typo in the sstruct.in.README file comment
51e5a4c6d SStruct interface without UVM (Nek5000#170)
3438132e1 GPU examples (Nek5000#268)
6eb66f869 Fixed a small mistake in the sludist.sh test
be18e595a Remove the global partition code from hypre (Nek5000#273)
6f9260b67 Add saved-file extension to runtest (Nek5000#271)
414fa671b Umpire (Nek5000#243)
8462f60dc Hypre warp bitshift (Nek5000#267)
2186a8fb3 triangular solve on GPUs; runcheck (Nek5000#256)
bd76daf12 Updated saved files to reflect change in NSH solve on ILU Schur system -- See PR#251. (Nek5000#254)
6a1caf899 Modification to fix error code warning for coarse level solver (#251)
a6c852be5 fixed syntax error with --enable-gpu-aware-mpi (Nek5000#250)
950f9f250 fix GPU SpMV for zero matrices (#246)
b49727f16 Cuda triangular smoothers (Nek5000#240)
763ea8a5a cmake: Optionally accept path to BLAS/LAPACK libraries (Nek5000#230)
804609b6c Reorg relax (Nek5000#237)
9c24f006a ILU updates (#239)
eae9be29b bug fix cycle_param (Nek5000#236)
56012897e Ilu dev 2019 (Nek5000#160)
2bc4228ec Changed sludist.saved file to correct new superlu-dist autotest errors
796ab0af4 Use basename when checking compiler in configure (Nek5000#225)
22f4d3f8c Cuda 11 API (Nek5000#163)
641f7a4e3 Minor spelling fix (Nek5000#222)
2e1ccee24 Euclid fix (Nek5000#218)
5ac2b3a54 Improve portability for update-release script (for LINUX, UNIX and macOS). (Nek5000#227)
9fb1b351c MS-Windows OMP pragma (Nek5000#223)
636706acd Fixing compile issues --with-caliper (Nek5000#216)
dd4ddba0f Added a filter to runtest for 'lrun warning'
ff45ecef3 Set default convergence tolerance to 1.0e-6 (#206)
7b2379c0d optimization in hypre_CSRMatrixBigJtoJ and JtoBigJ (Nek5000#204)
1ddd69f27 Fixed problems when calling HYPRE_Finalize() multiple times (#207)
5988a506b Update CHANGELOG
54190a846 Update CHANGELOG
4a5c5aca4 Update CHANGELOG
d257887cd Another CHANGELOG update for 2.20.0
00b826e84 Update version number and date for release 2.20.0
2fe718e11 Update CHANGELOG for release 2.20.0
aaf5aa564 Aggressive coarsening and 2- stage MM-ext Interpolations on GPUs (Nek5000#195)
0fcb67054 Fix AMGDD (Nek5000#190)
37f7a0a3f Epe gpu (Nek5000#187)
5f3141a64 Change issue reporting to use github's issue tracker (Nek5000#189)
d5e4eb4bd Fixed a few minor autotest errors
36d0bfba4 Fixed a compile error.
0b80656ce AMG-DD implementation (Nek5000#145)
7f9d222ed run headers
2b2ea3920 should run `headers' to make sure _hypre_parcsr_mv.h is not directly changed
3ae6c7fec Merge pull request Nek5000#172 from hypre-space/PETScFix
1c0598626 Merge branch 'master' of https://github.com/hypre-space/hypre into PETScFix
8833bed15 add compile flags in GPU regression test scripts
ffe35407a format change
f6f98cb36 bug fix (hopefully...)
66b653a5e Merge pull request Nek5000#184 from hypre-space/fix-issue183
26e2f4f4d Add implementation of ffs for MSVC
f4e128cab Merge pull request Nek5000#178 from hypre-space/fix-hopscotch
9a73fbbae Merge branch 'master' into fix-hopscotch
88c7a2d73 Add regression test for concurrent hopscotch
0b2f113c5 fixed leak
5da38fea6 Merge branch 'master' of https://github.com/hypre-space/hypre
b599ea759 kokkos + gcc warnings
057b1fe0f Merge pull request Nek5000#179 from hypre-space/fix-regtest-error
fd9d861c9 Complete renaming of superlu.* to dsuperlu.*.
a96a763c4 Renamed superlu.* to dsuperlu.*.
f31b8b7aa Merge branch 'master' into fix-regtest-error
7c663ef6c Fixes for check-header errors.
6b42e64d4 Bug fix in header file
d2f566e5d Include superlu.h
b222e7aea Move DSLUData structure to its own header file.
d03a93b3e Add missing include guards
8a7617c6c Init Kokkos in the driver
7c0da1bee Edited to fix regression test errors.
e42ab96f9 Move ifdefs to proper places
079402c27 Update private header file
fba878903 Remove unnecessary ifdefs
2d9a37f9a Merge pull request Nek5000#176 from hypre-space/mgr-dev
e9166cdaf Revert accidental deletion of EOL
9a268728e Truncate the coarse grid at each reduction level using a threshold.
e75fb7bb2 Add new functions prototypes to protos.h.
07487b84c Merge branch 'master' into mgr-dev
7e6683786 Fixing another slurm filter issue in 'runtest.sh'
9b5f2730c changed some assertions; offset in matvec
6906e8e0f Merge branch 'master' of https://github.com/hypre-space/hypre into PETScFix
efa4c2c85 minor changes, white spaces, comments, etc
dfa31b5e0 Merge pull request Nek5000#157 from pbauman/gpu-fixes
cea946c51 CXXFLAGS
ba717c7ee Update example test.
13e2cad48 Merge pull request Nek5000#165 from hypre-space/fix-mixed-int
e98ac7777 fixed a mixed-int in GPU code
d8f683d11 Merge pull request Nek5000#141 from hypre-space/gpu_doc
4431a4acd Minor clean up.
079e99515 First implementation of direct solver option for coarse grid solve in MGR.
b5d162e80 fixed some mixed-int issues.
6846fc28d Added a comment associated with previous --enable-mixedint commit
3ff4454aa Fixing a problem with --enable-mixedint on a Mac
d20cef846 fixed compile flag issue for gpu
2370d7d68 Merge branch 'master' of https://github.com/hypre-space/hypre into PETScFix
ffeed7229 fix cusparse matvec for ncol==0
26a1a8a97 result_type needed by clang
a5594908f Parentheses for better readability
2e5ccb684 Critical portability fix in warp_prefix_sum
2f51e7501 Use HYPRE_WARP_SIZE for shared memory alloc in blockReduceSum
950a747ae Use HYPRE_WARP_SIZE instead of hardcoded numbers
b075d6425 Merge pull request Nek5000#155 from hypre-space/fix-caliper
d8a270c3f Merge branch 'master' into fix-caliper
5fca3a5f3 Annotate Setup levels of BoomerAMG
163ef941b Annotate ILU
d2c4df7f6 Annotate MGR
33f423213 Merge pull request Nek5000#147 from hypre-space/fix-fortint
1079429f7 Merge branch 'master' into fix-fortint
175dd5912 description of GPU options of AMG
42448bf06 Adding filter for 'HYPRE_config.h.in' in AUTOTEST/check-license.sh
f0b7c705e Merge pull request Nek5000#150 from pbauman/use-autoheader
2e4bc39f5 Updated configure from the bootstrap
24250fe92 Run bootstrap, regenerate HYPRE_config.h.in
5b7ca434c run autoheader after autoconf
dd8e7ba37 Add descriptions to AC_DEFINE
ca195740f Merge pull request Nek5000#144 from hypre-space/fix-bool
84423d95d Merge branch 'master' into fix-bool
2ba72ed41 Merge pull request Nek5000#149 from pbauman/fix-nvtx-build
c10b129ce Compile hypre_nvtx.c with nvcc
9ff066b52 Move some preprocessor directives before extern C
14c9b050d Merge branch 'fix-bool' of https://github.com/hypre-space/hypre into fix-bool
4b8bb0f5f Erase ifdef statement
8dd210957 Moved the autotest run filter to 'runtest.sh' where it belongs
c94ba5842 Merge branch 'fix-bool' of https://github.com/hypre-space/hypre into fix-bool
0be3e041f Use HYPRE_REAL_MIN
de87515b0 add a warning msg in headers
f7673f1a3 Set atol to correct value when HYPRE_SINGLE is defined
343e31c84 Merge pull request Nek5000#148 from hypre-space/mgr-dev
04c4e0601 Minor cleanup
f145db623 Fix annotation for single level MG
af38cf66f Merge branch 'master' into mgr-dev
24fb670c0 Remove SparseMSG
c989919d8 Update struct fortran driver
c9216c4f7 Update SetPrecond routines
105dae6b9 - Change option for using full AMG V-cycle for F-relaxation from 99 -> 2
96cd74c43 - Set tolerance of global smoother, F-relaxation, and coarse grid solvers to 0 to avoid non-convergence error return.
735df4bc3 Adding a run.filters file to AUTOTEST
93d011a3d Annotate PFMG
1277fbcb5 Annotate BoomerAMG levels
7e5b059b7 Support annotation of multigrid levels
61ebb5cee Annotate krylov solvers
b71a8ad5c Update caliper calls according to new API
89095db97 Update Euclid private header
1738f970f removed option listings but added explanations
27d263927 Fix compiler warnings
1f1ec406c Use stdbool in Euclid
2ce9617b6 GPU build doc
3b22847de Merge pull request Nek5000#138 from hypre-space/headers
d9d040ded minor changes
3a8c7f96f fixed one more typo .....
837a40c95 fixed a typo
b0a79cfbb further clean-up
5eb6e965e changes per Rob's advice
3f8ff325b bug fixes after regression tests
aad671be8 separation of C/C++ headers, and all the adjustment from it
ebfc63324 Merge pull request Nek5000#136 from hypre-space/Interp
870d530de fixed issue with hypre_BoomerAMGBuildModExtPIInterpHost and hypre_BoomerAMGBuildModExtHost
7fd7dfca5 added some regression tests for new 2-stage-ee interpolation
0f8aafcb8 Merge branch 'master' into Interp
7baf13204 Merge pull request Nek5000#135 from hypre-space/RL_FIX
b0c0c9591 added fix for large number of threads
febafc15b Removed experimental option
e6a3b3512 fixed another issue with missed special fine points
082f4c4dc fixed some bugs caused by lacking inclusion of special fine points
9edf9d274 Merge branch 'master' of https://github.com/hypre-space/hypre into RL_FIX
fb8e844f3 Merge branch 'master' into Interp
f875c2a38 Merge pull request Nek5000#134 from hypre-space/RL_FIX_INTERP
59836adad uploaded a wrong file....
6440948e6 bug fix (omp with large #threads)
933ceed52 Merge pull request Nek5000#128 from hypre-space/SpGeMMDist
5d5b75bc0 GPU regression tests
3d1a44167 a minor change in check-headers.sh
1d35b0f7d Merge branch 'master' of https://github.com/hypre-space/hypre into SpGeMMDist
4ab72fd3a Merge pull request Nek5000#129 from hypre-space/fix-headers
b39a8455a minor bug fix
6f34c5a8e fixes for PETSc
0f3a1c613 cuda build/compile error fix
e121789fc Adding header check to machine-tux.sh autotest script
0b322c996 Fixed a few additional header file issues in examples
4542bf094 Fixed header usage problems and added regression test
aeb23b458 Removed lower-case header dependencies in examples
ce7ef0849 new sparse mat-mat-dist, triple-mat-dist
8bc028e5a fixed a few bugs
2a76685fc added a new 2-stage interpolation routine and auxiliary routines
60212ab6f Minor changes to CHANGELOG
28a7abe9b Merge branch 'master' into Interp
72c8dc25a added changes
708f58b21 Updates for new release 2.19.0
5bf6396a2 added new 2 stage interpolation numbers
cc7063a70 added support file
b84f132be updated reference manual added support file
2e9e8747f fixed a bug when using rlx 7 with OpenMP
e76dced04 Merge pull request Nek5000#122 from hypre-space/spack-autotest-update
cc0dde722 Small changes to spack autotest script
d08a0f35c Uninstall spack packages in hypre+superlu-dist test.
a2090abe5 fixed EXT interp seg fault issue with non-GPU coarsenings
b70859897 added Rob's interpolation (18) and some new 2stage versiona built on it (7)
9791056f0 Merge pull request Nek5000#121 from hypre-space/gpudev-bjorn
b94555398 updated regression tests on lassen
0c37e65d5 a minor change
636725e4e regression tests for the new functionalities
23b0e1226 Merge branch 'gpudev-bjorn' of https://github.com/hypre-space/hypre into gpudev-bjorn
3e6aa4390 some good format changes were lost (how come?)
fc8305e05 fix after regression tests
fe5e811b9 do not change Makefile in test
7336eab09 Fixed a bug in autotest 'make.sh' script
5cebc9383 bug fix
d632e6a8a ext/ext+i interp. device
6c0e7d241 Merge pull request Nek5000#119 from hypre-space/spack-autotest
1d014c27c Adding a Spack autotest script
62b61918c Merge pull request Nek5000#118 from hypre-space/superlu-dist-update
35386791f Updating autotest for SuperLU-dist version 6.3.1
506fc0a35 headers
a129c4257 Merge branch 'master' of https://github.com/hypre-space/hypre into gpudev-bjorn
249383ad6 removed a duplicate header
0ea3f8da1 Merge pull request Nek5000#117 from hypre-space/parcsr_header
d31f28b82 Merge branch 'master' of https://github.com/hypre-space/hypre into parcsr_header
8e9c5cebe Merge pull request Nek5000#111 from hypre-space/RL_GPU
535d01e01 Merge pull request Nek5000#114 from hypre-space/RAJA
8bf373b0e Merge pull request Nek5000#115 from hypre-space/fix-2s-interp
751925751 fixed a bug in two-stage interpolation
53912e72f Merge branch 'master' of https://github.com/hypre-space/hypre into parcsr_header
3c6cb66fb changes according to the latest RAJA
d3cc09b0c Merge pull request Nek5000#113 from hypre-space/vm-fix
6f8f51316 added protos.h; bug fix
ad7fb8700 header scripts in parcsr_mv/_ls and seq_mv
96bc109ec Build recv_rboxes in SStructAMRInterCommunication
b718ebfda Merge branch 'master' of https://github.com/hypre-space/hypre into RL_GPU
435eab61c bug fix
0d4089bdf FFFC on device
ff79fb0d7 optimization for fffc: squeeze out unused offd columns
15737275b added extended+i interpolation on GPU
bd1f981c6 Merge pull request Nek5000#110 from hypre-space/fix-issue108
176b84eda Fix periodic.out
41df3db1d Merge branch 'master' into fix-issue108
786e25cd4 Add new regression test.
84ebcd0b0 Map RecvRBoxes to coarse grid
2f7349400 Check if rboxes are empty before send/recv
b4c779c36 Merge pull request Nek5000#109 from hypre-space/mgr-dev
74d16716b Minor clean up.
83ff765c4 Cleanup to remove unused variables
7ab476182 Fixed dividing by 0 computing inv(a_diag) in building interpolation.
5ff2630da Merge with master.
4182f90cd Merge pull request Nek5000#106 from hypre-space/dslu_bugfix
5ee53e36c Edited comment to pass regression test.
02c04fff4 Merge branch 'master' into dslu_bugfix
f289a8b5b Modified superlu.c to use preprocessor definitions to decide when to copy data for MixedInt case
9f4e1b273 Add option for setting local reordering in ILU
9d1fbbc4e Merge pull request Nek5000#104 from hypre-space/new-interp
1ae1c7778 added another file to CMakeLists.txt
4d8fe2042 added new files to CMakeLists.txt
6888e05b6 changed regression test and a few small edits
29cb50203 Fixed some memory bugs and updated hypre to use latest version of superlu_dist
212182668 fixed memory locations for GPU
035f2a7d9 added new extended(interptype 16), extended+i(iinterptype 17) and i 2stage extended (iagg_interptype 5) interpolation and supporting routines.
84df86f9b updated extended interpolation on device, fixed issue in extractsubmatrixFC
952a1477c Merge pull request Nek5000#102 from hypre-space/hypre_Handle_wo_HYPRE_Init
dbc45fcb1 bug fix
3869b3196 minor changes
55475ea61 changed function arg `l1_norms' back to HYPRE_Real *
71f420863 minor changes after regression tests
445732424 access global var _hypre_handle via hypre_handle()
2e843c512 fixed typos
25eacb1d5 Merge pull request Nek5000#101 from jedbrown/jed/fix-complex
85c80d00a Fix for inclusion with complex.h
e9bcf42db Merge pull request Nek5000#93 from hypre-space/amg-setup
e8efde802 Merge branch 'master' of https://github.com/hypre-space/hypre
51f7834b0 remove license from cub
28f60b924 Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
7b7aa2d0a Merge pull request Nek5000#98 from hypre-space/fix-issue96
aed4a3fdc Add test to treat S2 with zero size
62b15c742 Do not compute avg_nnzrow when S has global size 0
0bd0c5b15 Enable additional tests
b921cd2b8 Add SetOffProc test
eab7b7ea2 IJSetValues: tried to save some malloc, free, and memcpy, but the overall time seems not being improved
1e4dd38c4 ij_assembly
23fb795ca Merge branch 'RL_FIX' of https://github.com/hypre-space/hypre into amg-setup
97631a505 fixed compiler warning
432be1611 ij_assembly
0f102bd3e assembly driver
b0000ee0b Merge branch 'RL_FIX' of https://github.com/hypre-space/hypre into amg-setup
3327f4fc9 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
761e014ed Update with new memory model and test_Set
51f085b23 minor changes
02dad90b9 add exec_policy in more drivers
f87ec504f new file  in CMakefile
c4e6cdc08 for running regression tests with GPU setup/assembly
3982a1e89 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
4f8352e70 bug fix
2e4c8145e remove cub from check license filters
b58c1f2f4 more bug fixes regarding CUB
cf4d9b78b bug fix in ILU
fbb703400 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
43ad3d670 cub allocator
5b1d72cb2 bug fix
0f20a71d7 fix after pull in master
f491d130f Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
4ffbb00db Fix indentation and minor clean up.
84423747c New interface for setting Cpoints using a point marker array. Updated test for compositional flow with wells.
022d476ed bug fix
22b18ab98 bug fix in ams
c9b0f47a3 bug fix
9ef162da5 HYPRE_Init in examples; cmake; ams
9e83173a8 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
1afd4a09a memory model, exec policy, etc
eb71948ed Merge branch 'master' into mgr-dev
9e2e1491a Merge pull request Nek5000#80 from hypre-space/docs
4bfbba62a added par_lr_interp_device.c
f68cb7bdb Added copyright header to readthedocs.yml file
0b1f1f244 Fixed a few remaining issues in HYPRE_parcsr_ls.h
1e70d39cd Adding copyright header to docs/usr-manual/requirements.txt file
716ddf63d Updated ParCSR solver header file to conform to new documentation approach
96a23ae14 first version of extended interpolation on device
766504983 Merge branch 'master' into docs
10a4cd53f Merge pull request Nek5000#86 from hypre-space/lobpcg
086c56e5d Merge branch 'master' into lobpcg
3d6424fa5 Merge pull request Nek5000#72 from hypre-space/mgr-dev
f3b79f23f Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
1147d72e5  Added hypre_ILU solver info to user manual.
68c064eac Minor change to fix compile warning and error
33bb597ac Modified to remove clang warning and potential bug.
aac486403 Merge branch 'master' into mgr-dev
3d0cd5064 Some benign edits to revert some recent changes.
0b0e66fae Merge branch 'gpu-assembly' of https://github.com/hypre-space/hypre into amg-setup
271d55ed7 First version of an interface to set Cpoints by a marker array.
e22ae7f6a Add driver for testing IJMatrix assembly
7253e38ad Add missing include guards
687232e11 gpu regression test [clean regression tests on ray]
79aa36fc7 bug fix
9b9e44d83 bug fix
93261e436 bug fix
c7b29020d Merge pull request Nek5000#91 from hypre-space/gpu-assembly
79b5d7bc1 wrong configure pushed...
fc9e301a9 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into gpu-assembly
eaecaf173 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into gpu-assembly
d2e5108f3 cuda stream sync, prefetch stream, etc
039dc2833 Merge pull request Nek5000#90 from hypre-space/gpu-interp-new
94c0834c6 bug fix
2b8fd590b bug fix
3945f82e4 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into gpu-interp-new
60101610d 1. put l1-norms inside container hypre_vector 2. code restructure
d160f09a9 commented out all CudaPrefetch
74ae74d52 cub name space --> hypre::cub
794a1fc12 Merge pull request Nek5000#87 from hypre-space/fix-pcg
336a7bee9 Merge branch 'master' into fix-pcg
0fa087949 Fix memory leak, a block mode check in BoomerAMG, and string literal type
a88591509 pcg fix from Rob
03688671a new impl of direct interp and a new interp formula
dd9a724f1 CMakelist change corresponding to the previous name change
11f56bde6 Added an extra check for subnormal values in pcg
9492af96d bug fix
b239a5c57 change Makefile after the previous name change
2f412591b rename a file with a better name that describes
34a21aca5 Merge branch 'improve-coarsen'
acac8fd66 All tux tests are passing now
33475ea51 Merge branch 'master' into improve-coarsen
39e4f836a nvcc flag -lineinfo
2cb1b8389 minor changes
4775c5ff9 Merge branch 'improve-coarsen'
bae22bc5f Remaining fixes for two memory leaks
fcab2e6c5 Merge branch 'master' into improve-coarsen
b2f6d96e7 tux tests are passing
7fbe167e9 bug fix
e59109ad1 bug fix
641a018a3 Fix memory leak
5134773fb Merge branch 'master' into mgr-dev
fc4ccf513 Minor changes to docs readme file
b9fb9345d Adding a README file for building documentation
97e981469 Simplified the docs makefiles a bit
adde096bd Added a dependence to documentation makefile
45588b218 Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
4e1185331 1. IJMatrix Set/Add values, allow ncols==NULL 2. changes in ij.c for non-um
f4f2a54de Merge pull request Nek5000#85 from koning/bugfix/koning/fei_memory_bigint
2ec1031b8 Fixing a small include file problem in examples/vis.c
c69a3cbb0 more changes on drivers etc
2e3d60e34 Cleaned up some lobpcg-related header file stuff
013f111eb Change AMSData structure to use HYPRE_BigInt , HYPRE_Int and HYPRE_Real data types. Use hypre_CTAlloc and hypre_TFree for allocations sent to the ams solver. Remove one AMSData free due to double free in hypre_AMSFEIDestroy.
16961497a Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into gpu-assembly
0b47276cd IJ vector assembly
0a0aaf472 Merge pull request Nek5000#83 from hypre-space/cmake-install-prefix-fix
f454c6e96 cmake: Use default location to install if CMAKE_INSTALL_PREFIX is initialized to default value.
8f873ab25 Merge pull request Nek5000#82 from hypre-space/RL-FIX
85fecd552 fixed gpu memory leak
216cbba91 bug fix - part 2
7bfebfe7a bug fix
7ed934afd Adding a .readthedocs.yml file for readthedocs
69451499e Adding a requirements.txt file for readthedocs
10f4317ed Trying to get this to run on readthedocs
2a2c8107b Adding some readthedocs stuff to usr-manual/conf.py
5e9dda399 Updating a few informational items in user-manual zREADME file
93fe7d5aa First complete version of API in User Manual
015643413 Adding system interface reference to new API tab in User Manual
c4f019bd7 add off-proc values
692fb1d4d Adding license header to new cmake file
94036101b Merge pull request Nek5000#78 from ZedThree/cmake-update
c7b290ab8 changes on IJ set sizes
d34f9f7b4 local assembly should work now
a5072811d save this version
4bb6d3d7c CMake: Remove unused HYPRE_SOURCES
a708f8b33 CMake: first pass at using target-based approach
023e10571 CMake: Install basic CMake config files
81b8f8abf CMake: Don't clobber standard CMake flags
5644ce88e CMake: Require more recent version of CMake
7eb73531a Add remaining BigInts
82d208c30 Change global indices to HYPRE_BigInt
ef6e3b042 Merge branch 'master' into improve-coarsen
79c70b44c Bug fix when using aggressive coarsening.
f0275d9eb Add support to set F/C points to the various solvers.
1ef19f2f6 Drop ToKeep from SetCPoints function and C_points variables. HYPRE_BoomerAMGSetCpointsToKeep is deprecated.
4409ee92e Merge pull request Nek5000#77 from hypre-space/RL-FIX
c1175f50a Fixed memory issue for 2nd time solves
e03a335f0 Merge branch 'master' into mgr-dev
776122b97 Passes GPU regression test
67376c1c7 hypre_IJMatrixSetAddValues (*not done*) save this impl
7fb119959 Fixes for mixed-int build.
01a462ad5 Merge pull request Nek5000#66 from hypre-space/pgi-gpu-fix
1f89f7702 bug fix
aac4d2567 Merge branch 'master' into mgr-dev
885fe5aa4 Some bug fixes for BigInt and Global_Partition
b8c05a5c3 Added some new sort routines
97b3a9652 cub allocator
b25be1a83 Merge pull request Nek5000#71 from hypre-space/mempool
9395a69f1 fixes after regression tests
f80db4bc3 cub
9fadef736 Merge branch 'mempool' of https://github.com/hypre-space/hypre into mempool
41d5a9a19 ij.c
4f3b23f65  fixed conflicts after merge
aa61cc43d Merge branch 'mempool' of https://github.com/hypre-space/hypre into mempool
9be64bd29 added calls to the cub memory pool
213d2390c Merge branch 'gpu-assembly' of https://github.com/hypre-space/hypre into mempool
8c636c0df minor changes
3e111ca88 initial update
52d3c26ad some changes
5f9a2c513 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into mempool
0a51805aa ij driver option
2a1c2d8e3 cub allocator
5d0ffec33 Do not restrict error messages to proc 0
5923a6a3b Remove gitignore file
db237b821 Throw warning message by using the proper API
6c51a90ee memory tracker; do not save the size in device malloc anymore; prototype of TRealloc changed!
d2765b551 Merge branch 'pgi-gpu-fix' of https://github.com/hypre-space/hypre into pgi-gpu-fix
564b469a0 Merge branch 'master' into improve-coarsen
18eaa531a Bug fix - invalid free
ef8a25258 Clean debug statements
d5f8142a4 Add gitignore file
5b06704f6 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into mempool
b1d56dcac Reserved coarse points are BigInt
452fd1f46 Pass global coarse indices to SetCpointsToKeep
3f88e69d8 Initialize CF_marker to NULL
04bb19ab7 Minor fix + valgrind clean
bdad4e3e2 Add option to set a list of C points to keep in the AMG hierarchy
197a822f1 HYPRE_BoomerAMGSetCpointsToKeep works with global indices
a7ba73679 Fix file permissions
2995470c0 Fix file permissons
385e935fe Improve support to pgi compilers
df469267e Use deprecated-register flag only with xlc
0adffeead Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
d533ee4f8 bug fix in ams, and ij driver to read rhs and x0
81c876e87 Add options to read fine and isolated fine points from file
d8ff006e6 Allocate CF_marker if it is a null pointer
61af69b37 Add BoomerAMGSetFPoints function
37aef4347 Move CF_marker allocation to par_amg_setup.
c3dc01761 Merge pull request Nek5000#61 from hypre-space/HYPRE_DEBUG
2d3064277 Add BoomerAMGSetIsolatedFPoints for seting a list of points as SF_PT in the finest level
7d9bd0947 Fix typo
6351dd1c8 Fix minor style issues
d50750314 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
ff6c49af1 hybrid timer
f0e28b79f Add SetCoarsenCutFactor function
d0723ca24 Add strategy to automatically treat dense rows as SF_PT
81d5200e7 Fix some memory leaks
0906874b1 Bug fixes
f3b9891d9 passed tuxbox regression tests
bae521e55 nothing changed
3e190f283 Merge pull request Nek5000#63 from hypre-space/lessUM
1168e7481 passed gpu regression tests
c2d875ffc Add suport to read vectors from a single file and distribute them to multiple processes in the case of no_global_partition
80cf3ca14 Extend VectorToParVector to no_global_partition case
fb543f356 Use GetGlobalRowPartitioning in BuildFuncsFromFile
b85fbc895 Add function that computes a global row partitioning of a ParCSRMatrix
b029be434 Extend HYPRE_CSRMatrixToParCSRMatrix and BuildParFromOneFile to no_global_partition case
3322ffff3 Extend BuildFuncsFromOneFile to no_global_partition case
9da1274f0 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into lessUM
f2751c61b Merge branch 'master' of https://github.com/hypre-space/hypre into lessUM
c9755f5d0 less UM
a647d7208 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
973acba80 sstruct driver
0333171fb bug fix
ccbc5a7d7 Updates to fix valgrind errors, and some minor cleanup
05fa1b500 removed .swp file
6f713accf Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into lessUM
145c1359d change memory copy behavior from/to UM
39b4e0393 nvtx
37899993f change default spgemm to cusparse
c54fab8c8 Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
95c48458e util header, comment in hypre_memory
8b422d61f Merge branch 'HYPRE_DEBUG' of https://github.com/hypre-space/hypre into amg-setup
11b510828 oversimple memory pool [proof of concept]
e0a7ecf22 some changes in sstruct driver, hybrid solver setup time, and nvtx
ca08a9668 ptr casting to int problem in `hypre_error_assert`
cdf51f96d hypre_printf.h
8843d7a83 bug fix
9029876ed Merge branch 'HYPRE_DEBUG' of https://github.com/hypre-space/hypre into amg-setup
17dc86f84 bug fix
bf25c6ed6 Merge pull request Nek5000#62 from hypre-space/hybrid-fix
6246b9486 bug fix hybrid amg
00616067b Merge branch 'HYPRE_DEBUG' of https://github.com/hypre-space/hypre into amg-setup
ad5ff7940 bug fix (need to use assert in device code)
ce2fe70fd changed all assert to hypre_assert
4c15015b8 changes after regression tests
1f2a3171b Incorporate Quan's edits to fix possible memory leak.
e1a600b6e bug fix
1d97ab76e Changed 'strings.h' to 'string.h' in hopscotch stuff.
f0fc40255 Merge branch 'HYPRE_DEBUG' of https://github.com/hypre-space/hypre into amg-setup
b98bf115e Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
dda5db244 bug fix
db024c782 Fix to CMakeLists (elseif -> else).
76d581ea2 bug fixes
c1a45d643 a small change
69b73a038 nvcc c++11 does not like cast void (really a pain)
b117db0fb Casting to void is used to suppress compiler warnings.
61db186de fixes after regression test
684820607 Bug fix by Quan
1f07247da Cleanup to resolve issues in regression test
8cbdca2b7 HYPRE Debug mode
b56fb2d51 bug fix in hypre_ParCSRMatrixSetDNumNonzeros
e195422be a "better" but trickier impl of ptr2ind
2a2f992e7 bug fix
e01dd0ba3 stream sync for mpi-gpu
3d9f90e8c revoke all format changes
0c564e148 format changes
78f5ccf5d Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
f053ea13a more format changes
a667cdbe4 Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
abee12a09 only format changes [typos, trailing whitespaces, tabs]
d46fae80c Commented out unused variable to avoid compile warnings
94fe8cebc Minor edits to fix potential bug in ILU when residual vectors are not initialized to zero
2191feedb - Add the option to reduce the reserved Cpoints after a specified level. - Remove building A_ff matrix if an external F-relaxation solver is given.
93baaa8c9 Fixing a couple of PETSc-related valgrind errors
0a00f18bb Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
c758e4c26 Fixed bug in updating coarse indices on each level for more than 3 levels when setNonCpointToF = 1.
d56485b3e - Add option to set F-relax solver for the first level. - Rearrange codes for building restriction and non-Galerkin coarse grid more efficiently.
9a621e346 Edits to cleanup build warnings
09ebccbc2 rebuilt `_hypre_utilties.h`
d21f6dd2d Some cleanup after merge from master
ab0024110 Merge branch 'master' into mgr-dev
d509a088b Merge branch 'master' of https://github.com/hypre-space/hypre into amg-setup
e35d6268e need a sync stream for gpu-mpi [temporary fix]
ee84ffa23 Add mgr and ILU to CMakeLists
8da315a4a Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
0bf20cd66 Merge branch 'amg-setup' of https://github.com/hypre-space/hypre into amg-setup
d3aa485dd commit after pull
3b38dbaa1 fixed conflicts after git pull
06e77adf0 minor changes to have less conflicts between device memory and host memory
94b70cff6 Merge branch 'master' into mgr-dev
41cc6f3ec Some edits to BoomerAMGGetGridHierarchy function
e7ecd8054 Added code to print coarse grid hierarchy for AMG
4b9681b9e Updated saved files for ILU tests
0d7aa599e Updated tests to include ILU testing. Also added RCM local reordering.
adf7962ea Edited to fix compilation issues after merging in master
139c08fb1 Merge branch 'master' into mgr-dev
b7bc2151b Fixed bug for GPU compilers
6e9651149 Merge branch 'master' into mgr-dev
78f8d23bb Fix parallel bugs in build sub-blocks of a matrix.
203ef956f Fix bug for getting sub-blocks of a matrix. Clean up merge.
12647b851 Merge with BigInt changes from master
6b4acd98c Merge branch 'master' into mgr-dev
e4e2560bf Merged BigInt changes and minor cleanup of unused variables
62fcc11b1 Merge branch 'mgr-dev' of github.com:hypre-space/hypre into mgr-dev
b00f87412 Merged changes to use hypre ILU and approximate inverse smoothers and interpolation operators
45ece2bdc Merge branch 'master' into mgr-dev
7805fecaa Change the routine for computing the non-Galerkin coarse grid in MGR to ensure correct sorting using hypre_qsort2_abs (hypre_qsort2_abs now sorts descendingly instead of ascendingly).
80aa9dba9 Merge with origin/mgr-dev. Add non-Galerkin coarse grid option. Also add approximate inverse for building interpolation and coarse grid.
4db1092e1 Minor cleanup and updates to header file to remove compile warnings
af58accdb Modified saved files to reflect code updates and bug fixes
8fcad73a8 Fixed bug for multilevel with vcycle smoother
6b32feb05 Some minor bug fixes and code cleanup.
5a06c8dd8 Added functionality to perform dual truncation for parcsr (utilizing existing code). Also did some minor cleanup to use routines that already exist without having to rewrite them.
53087be58 Changes made in this version include - Added block Gauss-Seidel relaxation and change the function name   from 'hypre_block_jacobi' to 'hypre_blockRelax_solve'. - Remove 'last_level' argument from hypre_MGRBuildInterp. Now users   can specify the interpolation type at all levels. - Fixed some memory leaks when using full AMG for F-relaxation.
90da2ceea Merge branch 'master' into mgr-dev
9ef6800b3 Comment edit. testing push to hypre-space
0a9dba42a Merge branch 'ilu-dev' into mgr-dev
93d2e4e3d Merge branch 'master' into mgr-dev
8cc2eef72 Merge branch 'master' into ilu-dev
f4cb34489 Merge branch 'master' into ilu-dev
ea047fdf4 Fixed misleading indentation.
50f358389 Cleaned up some print statements.
bbf55a077 Merge branch 'master' into mgr-dev
015d56e06 Changes included in this version: - Separate the functions for specifying interpolation,   restriction, and F-relaxation methods using an integer (for all the levels)   and an array (for each level) to keep the previous interface intact. - Revert the name of the function for setting block data to the previous   version and rename the new function for setting block data (when   the physical variables are ordered contiguously). - Set default values for the parameters to prevent the code from   crashing even if the user does not specify any parameters.
89a35de55 Make functions that set parameters for MGR more consistent. Fixed some memory leaks.
377a5b79f New code from Tianshi Xu
5735f5192 Put back option for doing a full AMG V-cycle for F-relaxation for testing purposes.
6d087582a Fixed the routine building the strength connection matrix for a sub-matrix block. The previous version does not account for the off diagonal part.
574fc2a9a Do some clean up and fix indentation to make the code more readable. We use 2 spaces (not tab) for indentation.
a7bc1f9b3 Changes for this update include: - Allow for different interpolation and restriction options for each MGR level. - Add the number of functions for the Frelax V-cycle. - Fixed a bug for Frelax V-cycle when used as a preconditioner. The RHS should   be obtained from the Solve phase, not the Setup phase. - Optimize sparsity pattern of interpolation operator. Injection does not need   non-zero mapping for zero block. - Set some default values for using AMG for F-relaxation to prevent crashes.
142dd5912 This update contains the following changes: - New interface for setting C-F splitting for matrices with block structure, i.e., the same variables are ordered contiguously (s_1,s_2,...,s_n,p_1,p_2,...,p_n,...) - Allow different methods for F-relaxation at different levels. - Added a test file for MGR to test flow matrices coming from geocentric.
660cce2df Add a new qsort function to utilities.
3701a7f2e Final push before leave. Add ddPQ-GMRES-ILU into HYPRE. Add ParILU as a smoother options for AMG.
48ee6a098 Add local ddPQ permutation into ilu-dev.
5c8b8278f Sort U part to have better scaling result for high number of processors.
3c9365e52 Add RAS-ILU solve to ilu-dev.
e154ba332 Add NSH-ILU solve/precond to ilu-dev
1ab5433aa Update operator_complexity, now ILU-GMRES also consider the memory for the preconditioner on Schur Complement.
d4891d10f Add iluk and ilut for Parallel ILU solver and preconditioner Add ILU-GMRES solve for Parallel ILU solver and preconditioner Update ij.c to support more tests
55087f868 Updated ij driver to include solver options for ILU. Currently tested with ILU(0) and everything is working fine.
c98edc4d2 Finalized initial implementation of ILUSetup and ILUSolve routines. Also added setup for ILU(0).
13bde64ce Updated setup routine. Still incomplete.
dba5d829b Added setup and solve functions for ILU. These functions are currently place holders and are empty for now.
ae03e79ae Added more functions to set ILU preconditioner.
8829a2731 Added AIR option for restriction
de0656db9 Merge branch 'master' into mgr-dev
c24f9abfa Initial commit for new ilu-dev branch. Added ILU data struct and create/destroy functions.

git-subtree-dir: 3rd_party/hypre
git-subtree-split: 0b09a2238c307411bb4eb8a42f6f1870b5285c02
@stgeke stgeke self-assigned this Jul 8, 2021
@MalachiTimothyPhillips
Collaborator

Note: we should also have a way to distinguish between, e.g., velocity residuals and pressure residuals when running in verbose mode.

@stgeke
Collaborator Author

stgeke commented Sep 22, 2021

Added in 7a55f45

@stgeke stgeke closed this as completed Sep 22, 2021