
Convert LFU to LRU cache for blob storage #107130

Open · jdconrad wants to merge 15 commits into main

Conversation

@jdconrad (Contributor) commented Apr 4, 2024

This changes our approach to caching for blob storage: it converts the least frequently used (LFU) cache to a much simpler least recently used (LRU) cache. The new cache has two lists, called front and middle. New entries are inserted into the middle list to avoid thrashing; any entry accessed a second time is moved to the head of the front list. When the cache becomes full, only the last entry of the middle list is checked for eviction. This avoids the degenerate case of walking through the entirety of the lists, since it is very unlikely that anything currently being written to will make it to the end of the middle list.
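A minimal sketch of the two-list structure described above (names are illustrative only, not the actual SharedBlobCacheService code):

// Hypothetical sketch of the two-list LRU described in this PR;
// names are illustrative, not the real implementation.
class TwoListLru {
    static final class Entry {
        Entry prev, next;
        boolean inFront; // false while the entry is still in the middle list
    }

    private Entry frontHead, frontTail;   // entries that were accessed again
    private Entry middleHead, middleTail; // new entries land here first

    // New entries go to the head of the middle list so one-shot reads
    // (or writes in progress) cannot thrash the front list.
    void insert(Entry e) {
        e.inFront = false;
        e.next = middleHead;
        if (middleHead != null) middleHead.prev = e;
        middleHead = e;
        if (middleTail == null) middleTail = e;
    }

    // A repeat access promotes the entry to the head of the front list.
    void touch(Entry e) {
        unlink(e);
        e.inFront = true;
        e.next = frontHead;
        if (frontHead != null) frontHead.prev = e;
        frontHead = e;
        if (frontTail == null) frontTail = e;
    }

    // Only the middle tail is ever considered, so eviction never has to
    // walk either list.
    Entry evictionCandidate() {
        return middleTail;
    }

    private void unlink(Entry e) {
        if (e.prev != null) e.prev.next = e.next;
        if (e.next != null) e.next.prev = e.prev;
        if (e.inFront) {
            if (frontHead == e) frontHead = e.next;
            if (frontTail == e) frontTail = e.prev;
        } else {
            if (middleHead == e) middleHead = e.next;
            if (middleTail == e) middleTail = e.prev;
        }
        e.prev = e.next = null;
    }
}

Tracking which list an entry belongs to (here via an inFront flag) lets unlink() fix up the correct head/tail pointers without searching either list.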

@jdconrad jdconrad added :Search/Search Search-related issues that do not fall into other categories >refactoring v8.14.0 labels Apr 4, 2024
@elasticsearchmachine (Collaborator)

Pinging @elastic/es-search (Team:Search)

@elasticsearchmachine elasticsearchmachine added the Team:Search Meta label for search team label Apr 4, 2024
@jdconrad (Contributor, Author) commented Apr 4, 2024

@elasticmachine run elasticsearch-ci/part-3

@JVerwolf (Contributor) left a comment


Nice work! At a high level, this looks good - though I'm quite interested in what others say. I'm not qualified to approve, so my comments should not block - feel free to take them as ideas/suggestions :)

A few ideas came to mind that are well into the realm of over-optimization. I'd be interested to see how this performs in the micro-benchmarks to see whether further optimization is even warranted.

On that topic, it seems like many of the operations are conceptually concurrent. For instance, we should be able to force-evict swaths of entries concurrently while also adding to and removing from the queues. I'm not sure the coordination costs are worth it, though (not to mention the code complexity).

I also wonder if there are things that could be done to improve reading/writing back and forth between main memory and the CPU caches, either by storing the parts of the data structure that change most often in such a way that they can stay cached, or by bulk updating (waiting for 100 writes to the "front" list before moving 100 entries to the back list all at once with a single set of pointer updates, etc.); a rough sketch of the bulk-update idea follows below.

But like I said, without benchmarks to show that any of this needs improving, I think it looks great.
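For concreteness, one hypothetical shape of the bulk-update idea (BatchedPromotions, recordAccess, and the threshold are made-up names; this is not part of the PR): reads record the touched entry in a lock-free queue, and the list pointers are only rewritten when the buffer is drained in a single critical section.

import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Consumer;

// Hypothetical sketch of the batched-promotion idea above; not part of
// this PR. Readers enqueue the touched entry cheaply and lock-free; the
// linked-list pointers are only rewritten when the buffer is drained.
class BatchedPromotions<E> {
    private static final int DRAIN_THRESHOLD = 100;
    private final Queue<E> buffer = new ConcurrentLinkedQueue<>();
    private final AtomicInteger buffered = new AtomicInteger();
    private final Consumer<E> pushToFront; // e.g. the cache's pushEntryToFront

    BatchedPromotions(Consumer<E> pushToFront) {
        this.pushToFront = pushToFront;
    }

    // Called on every cache hit; cheap and lock-free.
    void recordAccess(E entry) {
        buffer.add(entry);
        if (buffered.incrementAndGet() >= DRAIN_THRESHOLD) {
            drain();
        }
    }

    // One lock acquisition amortized over up to DRAIN_THRESHOLD updates.
    private synchronized void drain() {
        E e;
        int drained = 0;
        while ((e = buffer.poll()) != null) {
            pushToFront.accept(e);
            drained++;
        }
        buffered.addAndGet(-drained);
    }
}

This resembles the read-buffer technique used by caches such as Caffeine to keep hits cheap while amortizing lock traffic.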

}
}
}

-    private void unlink(final LFUCacheEntry entry) {
+    private void unlink(final LRUCacheEntry entry) {
Contributor:

WDYT about testing this explicitly under the following conditions (a sketch follows below)?

  • entry.list has one entry, which we unlink()
  • entry.list has 2 entries, and we unlink() the first
  • entry.list has 2 entries, and we unlink() the last
  • entry.list has 3 entries, and we unlink() the middle

Maybe this is already done via the other tests in a transitive manner, as I didn't read through all the tests in detail. However, I noticed that neither this function nor any of the methods that call it is directly under test. (I'm a bit paranoid, but trust me, I've earned it...)
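For illustration, the three-entry case could look roughly like this (newList() and addEntry() are hypothetical helpers, and prev/next are assumed field names):

// Hypothetical test sketch; helper and field names are illustrative.
public void testUnlinkMiddleOfThree() {
    var list = newList();
    var a = addEntry(list);
    var b = addEntry(list);
    var c = addEntry(list); // new entries at the head, so: c <-> b <-> a
    list.unlink(b);
    assertSame(a, c.next);  // neighbours are re-linked around the removed entry
    assertSame(c, a.prev);
    assertNull(b.prev);     // the removed entry is fully detached
    assertNull(b.next);
}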

Contributor Author:

Replied below.

assert invariant(entry, true);
}

private void pushEntryToFront(final LRUCacheEntry entry) {
Contributor:

Along with pushEntryToMiddle/Back(), this seems like the kind of thing that would be good to explicitly test. (Maybe this is already satisfactorily tested through transitive method calls; I confess I didn't read the entirety of the test files.)

I was thinking of something along the lines of starting with an empty list, adding up to two entries, and asserting for each added entry that the pointers are correct (see the sketch below).

We'd also want a test that sets the size to a low value and checks that the entries are pushed to the middle list.

WDYT? Is that overkill? Is this behaviour already tested?
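A sketch of the empty-list and two-entry cases, with the same hypothetical helpers and field names as the unlink sketch above:

// Hypothetical test sketch; names are illustrative.
public void testPushEntryToFrontOnEmptyList() {
    var list = newList();
    var a = newEntry();
    list.pushEntryToFront(a);
    assertSame(a, list.frontHead());
    assertSame(a, list.frontTail());
    assertNull(a.prev);
    assertNull(a.next);
}

public void testPushEntryToFrontPrepends() {
    var list = newList();
    var a = newEntry();
    var b = newEntry();
    list.pushEntryToFront(a);
    list.pushEntryToFront(b); // list is now b <-> a
    assertSame(b, list.frontHead());
    assertSame(a, list.frontTail());
    assertSame(a, b.next);
    assertSame(b, a.prev);
    assertNull(b.prev);
    assertNull(a.next);
}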

Contributor Author:

AFAIK this should be tested transitively. I think it's preferable to keep the LRUCache classes private if possible. They also happen to be non-static so are tied to an instance of SharedBlobCacheService.

@jdconrad (Contributor, Author) commented

@original-brownbear I added back the LFU cache. It's possible to switch between LFU and LRU via a new setting; LRU is the default. I also separated the tests for LFU and LRU into separate files, which accounts for the dramatic change in line count.

@original-brownbear (Member) left a comment


This looks quite good, thanks Jack! I'm wondering if we shouldn't go a little further here, though, and move to a more lock-free implementation if possible.
If any promotion to the head of the queue requires acquiring a global lock, that's not ideal. Maybe we could just make the moving of an item to the head of the queue lock-free for now, since that's what really matters for read performance, and leave the rest as is for maybe looking into later? WDYT?

assert invariant(entry, true);
}

private void pushEntryToFront(final LRUCacheEntry entry) {
Member:

I think there should be a way for us to not need this kind of lock any longer. Can't we make the insert lock-free?
It's basically: point the new node at the current head, then set the head to the new node, but only if the current head is still what we just pointed at (CAS). No need for locking, is there?
And then basically precede that whole operation by evicting the last element in the cache in case we are inserting a new page rather than just cycling a page to the front?
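A sketch of that CAS loop, assuming (hypothetically) that the head were held in an AtomicReference; note this covers only the head pointer, not the prev links or the eviction step:

import java.util.concurrent.atomic.AtomicReference;

// Sketch of the lock-free head insertion described above; assumes a
// hypothetical AtomicReference head, which is not how the PR stores it.
final AtomicReference<LRUCacheEntry> frontHead = new AtomicReference<>();

void pushEntryToFrontLockFree(LRUCacheEntry entry) {
    LRUCacheEntry current;
    do {
        current = frontHead.get();
        entry.next = current; // point the new node at the current head...
        // ...then set the head to the new node, but only if the head is
        // still the node we just pointed at:
    } while (frontHead.compareAndSet(current, entry) == false);
}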

Contributor Author:

Let's chat about this again, and whether it's still relevant, once we start talking about using time.

Contributor:

Thinking back to past discussions, it seems lock-freedom was a key motivation for going to an LRU, so it would be good to prove out that this works as part of this (POC?).

@jdconrad (Contributor, Author) commented

@original-brownbear @JVerwolf I have removed the middle list from this PR.

@javanna javanna added :Search Foundations/Search Catch all for Search Foundations and removed :Search/Search Search-related issues that do not fall into other categories labels Jul 17, 2024
@elasticsearchmachine (Collaborator)

Pinging @elastic/es-search-foundations (Team:Search Foundations)

@elasticsearchmachine elasticsearchmachine added Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch and removed Team:Search Meta label for search team labels Jul 17, 2024
Labels
>refactoring :Search Foundations/Search Catch all for Search Foundations Team:Search Foundations Meta label for the Search Foundations team in Elasticsearch v9.0.0
7 participants