Rework available query processing #3278

doitsujin · 2023-03-03T18:39:13Z

Supersedes #3273.

@CFSworks please let me know if this looks alright to you. I made the proposed changes yesterday and put you as a co-author since it's the same idea.

And do so when adding additional query handles, in order to avoid allocating queries indefinitely if End is never called, which Halo:MCC supposedly does. Co-authored-by: Sam Edwards <[email protected]>

CFSworks

This approach is a lot cleaner than mine, and thank you for the co-author tag!

I have had a very busy day today so I'll have to test this change sometime this weekend, but it does look like it does the job quite nicely.

My only code comment on this PR is related to whether m_ended really needs to be atomic after all: It may even be good to do away with it, or use it only in debug builds for an assert. But either way, I don't feel very strongly about it. Do as you like.

CFSworks · 2023-03-04T05:59:54Z

src/dxvk/dxvk_gpu_query.cpp

@@ -34,15 +31,14 @@ namespace dxvk {
  DxvkGpuQueryStatus DxvkGpuQuery::getData(DxvkQueryData& queryData) const {
    queryData = DxvkQueryData();

-    if (!m_ended)
+    // Callers must ensure that no begin call is pending when


I decided to check just to make sure that this assurance holds; here's my sketch of a proof for it:

Given:

The D3D11Query implementation uses a simple, lock-protected FSM to guarantee that it alternates between starting Begin and End operations: an out-of-order Begin is discarded, and an out-of-order End has a Begin injected before it.

Operations are sent to the CS thread and dispatched in strict FIFO order (I am not 100% on this)

D3D11Query also has an atomic counter that accurately tracks the number of in-flight End operations.

Therefore:

The D3D GetData() implementation will only attempt to call the underlying query's getData() if it is in the "ended" state (meaning the most recent operation started was an End, per given 1) and there are no in-flight End operations (per given 3). And, if the most recently started operation is no longer in-flight, then no other operations are in-flight (per given 2) at all. Also, the atomic-sub has release semantics, and the GetData()'s implied atomic-load defaults to seq_cst, so data is also properly synchronized.

Unless I am missing something, this means that there was no race between DxvkGpuQuery::begin()/DxvkGpuQuery::end() and DxvkGpuQuery::getData() in the first place! So making m_ended atomic may be redundant: perhaps you decide that's a good thing for robustness, but if not it's an unnecessary performance cost.

In general I prefer an all-or-nothing approach to making objects guarantee thread safety: DxvkGpuQuery should be responsible for all of its synchronization guarantees or none of them. That is, if we're going to require that callers guarantee that no begin() call is pending when calling getData(), we may as well also require that callers guarantee that no end() call is pending either. (And we are changing the getData() contract anyway by making it non-const, so it might be good to codify these requirements at the same time.)

CFSworks · 2023-03-05T19:18:09Z

Just tested and confirmed that this PR does indeed fix the problem in Halo: MCC (both tests done on the pause menu):

Commit 11f76306: memory usage stays fairly constant
Reverting to commit c5b7a669: At 60 FPS, RSS increases at ~123 MiB/min. with query allocation at ~36 pools/min.

doitsujin and others added 2 commits March 2, 2023 17:13

[dxvk] Use small_vector to store query handles.

c5b7a66

[dxvk] Accumulate query data into query object

11f7630

And do so when adding additional query handles, in order to avoid allocating queries indefinitely if End is never called, which Halo:MCC supposedly does. Co-authored-by: Sam Edwards <[email protected]>

doitsujin force-pushed the query-rework branch from 095d883 to 11f7630 Compare March 3, 2023 18:43

CFSworks reviewed Mar 4, 2023

View reviewed changes

doitsujin merged commit 7f21a6c into master Mar 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rework available query processing #3278

Rework available query processing #3278

doitsujin commented Mar 3, 2023

CFSworks left a comment

CFSworks Mar 4, 2023

CFSworks commented Mar 5, 2023

Rework available query processing #3278

Rework available query processing #3278

Conversation

doitsujin commented Mar 3, 2023

CFSworks left a comment

Choose a reason for hiding this comment

CFSworks Mar 4, 2023

Choose a reason for hiding this comment

CFSworks commented Mar 5, 2023