Reduce storage required for indexing - stop writing sp_name, res_type, and sp_updated to hfj_spidx_* tables #5941

volodymyr-korzh · 2024-05-15T17:05:51Z

Migration:

migrated all HFJ_SPIDX tables to allow SP_NAME and RES_TYPE columns to be nullable.
Migration runs with failureAllowed(), as both SP_NAME and RES_TYPE could be included in custom indexes, and SQL Server won't let us change nullability on columns with indexes pointing to them.

Optimization changes:

added IndexStorageOptimizationListener - it is applied only to HFJ_SPIDX tables - nulling SP_NAME, RES_TYPE SP_UPDATED if this feature is enabled.
IndexStorageOptimizationListener uses JpaSearchParamCache to recover SP_NAME, RES_TYPE from hash_identity after loading from DB.
HASH_IDENTITY field was added to BaseResourceIndexedSearchParam entity and removed from all inheritor entities.
Equals and hashCode methods for all ResourceIndexedSearchParam now using HASH_IDENTITY instead of sp_name and res_type. This is required to make ResourceIndexedSearchParam objects with and without optimization to be equal - to not cause unnecessary ResourceIndexedSearchParams updates. (as we are comparing db version of entities with in-memory built Search params)
Updated DaoSearchParamSynchronizer logic to check whether it is needed to update existing search parameters after isIndexStorageOptimized change
Exception is thrown during the startup if isIncludePartitionInSearchHashes and isIndexStorageOptimized are enabled on server. (isIncludePartitionInSearchHashes is not supported if isIndexStorageOptimized is set to true)
InMemoryResourceMatcher now uses hashIdentity to filter SearchParams instead of sp_name. (only if optimization is enabled)
BaseSearchParamPredicateBuilder now uses hashIdentity instead of SP_NAME, RES_TYPE to build a query. This way new optimization could work in pair with Enabled IndexMissingFields setting. (only if optimization is enabled)
Added new tests and documentation.

…_type and sp_updated columns of HFJ_SPIDX tables nullable

github-actions · 2024-05-15T17:06:58Z

Formatting check succeeded!

codecov · 2024-05-15T18:31:22Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.51%. Comparing base (497b9f2) to head (263d02f).
Report is 106 commits behind head on master.

Additional details and impacted files

@@             Coverage Diff              @@
##             master    #5941      +/-   ##
============================================
+ Coverage     83.39%   83.51%   +0.11%     
- Complexity    26927    27324     +397     
============================================
  Files          1681     1701      +20     
  Lines        103965   105751    +1786     
  Branches      13189    13351     +162     
============================================
+ Hits          86702    88315    +1613     
- Misses        11613    11729     +116     
- Partials       5650     5707      +57

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

...ver-model/src/main/java/ca/uhn/fhir/jpa/model/listener/IndexStorageOptimizationListener.java

… RES_TYPE after update/load

…-required-for-indexing-tables # Conflicts: # hapi-fhir-jpaserver-base/src/main/java/ca/uhn/fhir/jpa/migrate/tasks/HapiFhirJpaMigrationTasks.java

…eters if IndexMissingFields and optimizeIndexStorage are both enabled

…arameters

…rect configuration handling

… SP recovery, documentation updates

…-required-for-indexing-tables

… default

…-required-for-indexing-tables

tadgh

Approved pending various comments.

...main/resources/ca/uhn/hapi/fhir/changelog/7_4_0/5937-reduce-storage-for-sp-index-tables.yaml

hapi-fhir-jpaserver-base/src/main/java/ca/uhn/fhir/jpa/config/SearchConfig.java

...ir-jpaserver-base/src/main/java/ca/uhn/fhir/jpa/migrate/tasks/HapiFhirJpaMigrationTasks.java

.../src/main/java/ca/uhn/fhir/jpa/search/builder/predicate/BaseSearchParamPredicateBuilder.java

...ver-model/src/main/java/ca/uhn/fhir/jpa/model/listener/IndexStorageOptimizationListener.java

hapi-fhir-jpaserver-model/src/main/java/ca/uhn/fhir/jpa/model/util/SearchParamHash.java

…-required-for-indexing-tables

…-required-for-indexing-tables # Conflicts: # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamCoordsTest.java # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamDateTest.java # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamQuantityNormalizedTest.java # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamQuantityTest.java # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamStringTest.java # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamTokenTest.java # hapi-fhir-jpaserver-model/src/test/java/ca/uhn/fhir/jpa/model/entity/ResourceIndexedSearchParamUriTest.java

michaelabuckley

Good work.

Reading your work exposed some missing tests around the entity equals/hashCode work.
Made some minor suggestions.
Remember to restore settings changed Spring contexts after the test.

.../src/main/java/ca/uhn/fhir/jpa/search/builder/predicate/BaseSearchParamPredicateBuilder.java

hapi-fhir-docs/src/main/resources/ca/uhn/hapi/fhir/docs/server_jpa/performance.md

...aserver-model/src/main/java/ca/uhn/fhir/jpa/model/entity/BaseResourceIndexedSearchParam.java

hapi-fhir-jpaserver-model/src/main/java/ca/uhn/fhir/jpa/model/util/SearchParamHash.java

...ver-model/src/main/java/ca/uhn/fhir/jpa/model/listener/IndexStorageOptimizationListener.java

hapi-fhir-server/src/main/java/ca/uhn/fhir/rest/server/util/FhirContextSearchParamRegistry.java

...test-r5/src/test/java/ca/uhn/fhir/jpa/dao/r5/FhirResourceDaoR5IndexStorageOptimizedTest.java

.../ca/uhn/fhir/jpa/searchparam/matcher/InMemoryResourceMatcherR5IndexStorageOptimizedTest.java

...test-r4/src/test/java/ca/uhn/fhir/jpa/dao/r4/FhirResourceDaoR4IndexStorageOptimizedTest.java

...-jpaserver-test-r4/src/test/java/ca/uhn/fhir/jpa/dao/r4/FhirResourceDaoR4SearchNoFtTest.java

Co-authored-by: Michael Buckley <[email protected]>

…-required-for-indexing-tables # Conflicts: # hapi-fhir-jpaserver-model/src/main/java/ca/uhn/fhir/jpa/model/entity/StorageSettings.java # hapi-fhir-jpaserver-test-r4/src/test/java/ca/uhn/fhir/jpa/dao/r4/FhirResourceDaoR4SearchNoFtTest.java

hapi-fhir-docs/src/main/resources/ca/uhn/hapi/fhir/docs/server_jpa/performance.md

…-required-for-indexing-tables # Conflicts: # hapi-fhir-jpaserver-base/src/main/java/ca/uhn/fhir/jpa/migrate/tasks/HapiFhirJpaMigrationTasks.java

…-required-for-indexing-tables

Reduce storage required for indexing - migration to make sp_name, res…

2b7d735

…_type and sp_updated columns of HFJ_SPIDX tables nullable

volodymyr-korzh self-assigned this May 15, 2024

volodymyr-korzh linked an issue May 15, 2024 that may be closed by this pull request

Reduce storage required for indexing - stop writing sp_name, res_type, and sp_updated to hfj_spidx_* tables #5937

Closed

volodymyr-korzh added 3 commits May 15, 2024 14:29

Reduce storage required for indexing - new setting and unit tests

3e8073a

Reduce storage required for indexing - implementation part 1

ff523ab

Reduce storage required for indexing - implementation part 1 fix

658d5d4

jamesagnew reviewed May 17, 2024

View reviewed changes

...ver-model/src/main/java/ca/uhn/fhir/jpa/model/listener/IndexStorageOptimizationListener.java Outdated Show resolved Hide resolved

volodymyr-korzh added 11 commits May 17, 2024 11:23

Reduce storage required for indexing - fixes

c18de2c

Reduce storage required for indexing - added restoring of SP_NAME and…

c411b12

… RES_TYPE after update/load

Merge remote-tracking branch 'origin/master' into 5937-reduce-storage…

f305e1d

…-required-for-indexing-tables # Conflicts: # hapi-fhir-jpaserver-base/src/main/java/ca/uhn/fhir/jpa/migrate/tasks/HapiFhirJpaMigrationTasks.java

Reduce storage required for indexing - Fixed search for missing param…

71483dd

…eters if IndexMissingFields and optimizeIndexStorage are both enabled

Reduce storage required for indexing - Fixed search refchain search p…

4ae9c13

…arameters

Reduce storage required for indexing - Migration updated, added incor…

886be87

…rect configuration handling

Reduce storage required for indexing - Tests update, updated logic of…

32c5555

… SP recovery, documentation updates

Merge remote-tracking branch 'origin/master' into 5937-reduce-storage…

18ed7fd

…-required-for-indexing-tables

Reduce storage required for indexing - IndexStorageOptimized false by…

8b836e1

… default

Reduce storage required for indexing - fixes

551ad25

Reduce storage required for indexing - docs update and changelog

fa81205

volodymyr-korzh marked this pull request as ready for review May 30, 2024 17:41

volodymyr-korzh requested a review from a team as a code owner May 30, 2024 17:41

volodymyr-korzh added 2 commits May 30, 2024 18:42

Reduce storage required for indexing - docs update

0b393fa

Merge remote-tracking branch 'origin/master' into 5937-reduce-storage…

5932478

…-required-for-indexing-tables

tadgh approved these changes Jun 5, 2024

View reviewed changes

volodymyr-korzh added 5 commits June 5, 2024 14:20

Reduce storage required for indexing - minor fixes

ee5e9c7

Merge remote-tracking branch 'origin/master' into 5937-reduce-storage…

f9402a3

…-required-for-indexing-tables

Reduce storage required for indexing - assertj migration

5d53fbf

Reduce storage required for indexing - javadoc update

0e40bf1

michaelabuckley approved these changes Jun 12, 2024

View reviewed changes

volodymyr-korzh and others added 8 commits June 13, 2024 09:40

Update BaseResourceIndexedSearchParam.java - HASH_IDENTITY javadoc

3e1c14e

Co-authored-by: Michael Buckley <[email protected]>

Update performance.md

21fe247

Co-authored-by: Michael Buckley <[email protected]>

Reduce storage required for indexing - added more tests

ff278d9

Reduce storage required for indexing - added tests and fixes

596f217

Reduce storage required for indexing - more tests

f5476b4

Reduce storage required for indexing - added upgrade notes

ef0d72c

Reduce storage required for indexing - checkstyle fix

f4e226f

jamesagnew reviewed Jun 18, 2024

View reviewed changes

hapi-fhir-docs/src/main/resources/ca/uhn/hapi/fhir/docs/server_jpa/performance.md Outdated Show resolved Hide resolved

volodymyr-korzh added 4 commits June 18, 2024 11:30

Merge remote-tracking branch 'origin/master' into 5937-reduce-storage…

4d4065d

…-required-for-indexing-tables # Conflicts: # hapi-fhir-jpaserver-base/src/main/java/ca/uhn/fhir/jpa/migrate/tasks/HapiFhirJpaMigrationTasks.java

Reduce storage required for indexing - updated HapiFhirJpaMigrationTasks

32ff4f8

Reduce storage required for indexing - minor fixes

00f149e

Merge remote-tracking branch 'origin/master' into 5937-reduce-storage…

263d02f

…-required-for-indexing-tables

volodymyr-korzh merged commit 0397b9d into master Jun 20, 2024
66 checks passed

volodymyr-korzh deleted the 5937-reduce-storage-required-for-indexing-tables branch June 20, 2024 20:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce storage required for indexing - stop writing sp_name, res_type, and sp_updated to hfj_spidx_* tables #5941

Reduce storage required for indexing - stop writing sp_name, res_type, and sp_updated to hfj_spidx_* tables #5941

volodymyr-korzh commented May 15, 2024 •

edited

Loading

github-actions bot commented May 15, 2024 •

edited

Loading

codecov bot commented May 15, 2024 •

edited

Loading

tadgh left a comment

michaelabuckley left a comment •

edited

Loading

Reduce storage required for indexing - stop writing sp_name, res_type, and sp_updated to hfj_spidx_* tables #5941

Reduce storage required for indexing - stop writing sp_name, res_type, and sp_updated to hfj_spidx_* tables #5941

Conversation

volodymyr-korzh commented May 15, 2024 • edited Loading

github-actions bot commented May 15, 2024 • edited Loading

codecov bot commented May 15, 2024 • edited Loading

Codecov Report

tadgh left a comment

Choose a reason for hiding this comment

michaelabuckley left a comment • edited Loading

Choose a reason for hiding this comment

volodymyr-korzh commented May 15, 2024 •

edited

Loading

github-actions bot commented May 15, 2024 •

edited

Loading

codecov bot commented May 15, 2024 •

edited

Loading

michaelabuckley left a comment •

edited

Loading