[REVIEW] Split out cudf::distinct_count from drop_duplicates.cu #6822

davidwendt · 2020-11-20T19:08:00Z

One of our longest compile times is for drop_duplicates.cu, currently measured at about 15 minutes. This file contains two cudf APIs: cudf::drop_duplicates and cudf::distinct_count. They share no code, so it is reasonable to move distinct_count to its own source file. After the split, each file individually compiles in around 7 minutes.

Also, while analyzing the drop_duplicates source code, I found that the table row-comparator is used in two thrust::copy_if() calls, each of which inlines this large comparator function many times. In addition, the lambda in the first copy_if calls the comparator twice, so the comparator is inlined twice for every time copy_if inlines the lambda. This is the main cause of the increased compile time and code size. I found a way to combine the logic of both copy_if lambdas and to reduce the resulting lambda so that it contains only one copy of the comparator. copy_if itself will still inline the comparator many times, but the number of inlined copies should drop by a factor of 3.
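The idea of merging two predicates into one with a single comparator call site can be sketched on the host as follows. This is only an illustration under stated assumptions, not the code in this PR: std::copy_if stands in for thrust::copy_if, and row_equal, keep_option and unique_indices are hypothetical names invented for the example. The sketch covers only the keep-first and keep-last cases, where choosing the neighbour offset up front lets a single lambda (and a single comparator call) serve both.

```cpp
#include <algorithm>
#include <cassert>
#include <iterator>
#include <numeric>
#include <vector>

// Hypothetical stand-in for the large row comparator:
// returns true when rows lhs and rhs of `keys` hold equal values.
struct row_equal {
  std::vector<int> const* keys;
  bool operator()(int lhs, int rhs) const { return (*keys)[lhs] == (*keys)[rhs]; }
};

// Hypothetical option type; these are not cudf's names.
enum class keep_option { first, last };

// Returns the indices of the rows to keep from already-sorted `keys`,
// keeping either the first or the last row of each run of duplicates.
// There is exactly one copy_if call and one comparator call site.
std::vector<int> unique_indices(std::vector<int> const& keys, keep_option keep)
{
  int const n = static_cast<int>(keys.size());
  std::vector<int> indices(n);
  std::iota(indices.begin(), indices.end(), 0);

  // keep first: compare each row with the previous one; row 0 is always kept.
  // keep last:  compare each row with the next one; row n-1 is always kept.
  int const offset = (keep == keep_option::first) ? -1 : 1;
  int const edge   = (keep == keep_option::first) ? 0 : n - 1;
  assert(offset == 1 || offset == -1);

  row_equal const comp{&keys};
  std::vector<int> out;
  std::copy_if(indices.begin(), indices.end(), std::back_inserter(out),
               [=](int i) { return i == edge || !comp(i, i + offset); });
  return out;
}
```

Because the neighbour is selected by data (offset and edge) rather than by separate lambdas, the comparator body is instantiated once per predicate instead of once per branch.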

drop_duplicates did not have a gbenchmark, so I added one in this PR to measure runtime before and after this change. No measurable difference was found, so I'm including the change in this PR.

The distinct_count API passes the row-comparator to thrust::count_if, but only once, so the same improvement does not apply there.
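For comparison, a counting predicate with a single comparator call can be sketched on the host like this. It is a rough illustration only: std::count_if stands in for thrust::count_if, the function name distinct_count_sorted is hypothetical, and the input is assumed to be already sorted, which may differ from how the real implementation orders its rows.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Counts the distinct values in `sorted_keys` (assumed sorted ascending).
// A row starts a new distinct value when it differs from the previous row,
// so the predicate needs only one comparator call.
std::ptrdiff_t distinct_count_sorted(std::vector<int> const& sorted_keys)
{
  assert(std::is_sorted(sorted_keys.begin(), sorted_keys.end()));

  std::vector<std::size_t> indices(sorted_keys.size());
  std::iota(indices.begin(), indices.end(), std::size_t{0});

  // Stand-in for the row comparator: true when the two rows are equal.
  auto const equal = [&](std::size_t lhs, std::size_t rhs) {
    return sorted_keys[lhs] == sorted_keys[rhs];
  };

  return std::count_if(indices.begin(), indices.end(),
                       [&](std::size_t i) { return i == 0 || !equal(i, i - 1); });
}
```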

Note that no functions have been added or removed. This is simply an improvement to the compile time and code size of this source file. The compile time for drop_duplicates.cu is now less than 5 minutes.

davidwendt · 2020-11-20T19:09:42Z

cpp/src/stream_compaction/distinct_count.cu

@@ -0,0 +1,196 @@
+/*


This is not new code; it was moved here from the drop_duplicates.cu source file into its own source file.

davidwendt · 2020-11-20T19:10:16Z

cpp/src/stream_compaction/drop_duplicates.cu

@@ -151,44 +173,6 @@ column_view get_unique_ordered_indices(cudf::table_view const& keys,
 }
 }

-cudf::size_type distinct_count(table_view const& keys,


This code was moved to the new distinct_count.cu source file.

kkraus14 · 2020-11-20T19:24:37Z

@davidwendt did you do any experimentation with removing pragmas from thrust to see what happens if things aren't inlined in this situation and the performance implications?

davidwendt · 2020-11-20T21:46:17Z

> @davidwendt did you do any experimentation with removing pragmas from thrust to see what happens if things aren't inlined in this situation and the performance implications?

Unrolling may have implications for thrust::copy_if (https://godbolt.org/z/asffbo), which is used by code in this PR, but I have not tried disabling it here.

codecov · 2020-11-20T23:04:23Z

Codecov Report

Merging #6822 (4636246) into branch-0.17 (632ac54) will not change coverage.
The diff coverage is n/a.

@@             Coverage Diff              @@
##           branch-0.17    #6822   +/-   ##
============================================
  Coverage        81.94%   81.94%           
============================================
  Files               96       96           
  Lines            16164    16164           
============================================
  Hits             13246    13246           
  Misses            2918     2918

davidwendt · 2020-11-21T00:52:00Z

rerun tests

kkraus14

cmake lgtm

davidwendt added 3 commits November 20, 2020 12:55

Split up drop_duplicates.cu

09b7147

fix merge conflict

5309e74

update changelog

61367cd

davidwendt requested review from a team as code owners November 20, 2020 19:08

davidwendt requested review from karthikeyann and jrhemstad November 20, 2020 19:08

davidwendt self-assigned this Nov 20, 2020

davidwendt added 3 - Ready for Review Ready for review by team libcudf Affects libcudf (C++/CUDA) code. labels Nov 20, 2020

davidwendt commented Nov 20, 2020

kkraus14 approved these changes Nov 22, 2020

harrism approved these changes Nov 22, 2020

Merge branch 'branch-0.17' into split-up-drop-duplicate

7c64e06

davidwendt mentioned this pull request Nov 23, 2020

[REVIEW] Move template param to member var to improve compile of hash/groupby.cu #6835

Merged

Merge branch 'branch-0.17' into split-up-drop-duplicate

7774bef

karthikeyann approved these changes Nov 24, 2020

kkraus14 added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Nov 24, 2020

Merge branch 'branch-0.17' into split-up-drop-duplicate

4636246

davidwendt merged commit e1e3047 into rapidsai:branch-0.17 Nov 25, 2020

davidwendt deleted the split-up-drop-duplicate branch November 25, 2020 04:33
