Use our own logic for index management #5088

Merged: rnewson merged 1 commit into main on Jun 18, 2024

Conversation

@rnewson (Member) commented Jun 12, 2024

Overview

Use our own logic for index management

Caffeine complains when eviction is slow (because we might commit the index). Build our own logic for opening and closing indexes, which allows opening and closing to be very slow while ensuring we don't close an index that's in use.

Testing recommendations

covered by tests

Related Issues or Pull Requests

N/A

Checklist

  • Code is written and works correctly
  • Changes are covered by tests
  • Any new configurable parameters are documented in rel/overlay/etc/default.ini
  • Documentation changes were made in the src/docs folder
  • Documentation changes were backported (separated PR) to affected branches

holder.state = HolderState.LOADED;
}
} finally {
holder.lock.readLock().lock();
Contributor:
If we hold the write lock we're allowed to take a read lock and now we hold both?

I wonder if holder.lock.readLock().lock(); should be before the } finally { instead of after; in other words, finally should only handle unlocking the write lock we took above on line 141 with holder.lock.writeLock()?

@rnewson (Member Author) commented Jun 13, 2024:

Yes,

Lock downgrading

Reentrancy also allows downgrading from the write lock to a read lock, by acquiring the write lock, then the read lock and then releasing the write lock. However, upgrading from a read lock to the write lock is not possible.

https://docs.oracle.com/javase/8/docs/api/java/util/concurrent/locks/ReentrantReadWriteLock.html
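
For reference, a condensed version of that downgrading pattern, roughly following the javadoc's example; the Holder class, loaded field, and use() method below are illustrative stand-ins rather than the PR's actual code.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Illustrative sketch of write-to-read lock downgrading; not the PR's code.
class Holder {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private boolean loaded;

    void use() {
        lock.readLock().lock();
        if (!loaded) {
            // Upgrading read -> write is not possible, so drop the read lock first.
            lock.readLock().unlock();
            lock.writeLock().lock();
            try {
                if (!loaded) {      // re-check: another thread may have loaded it
                    loaded = true;  // e.g. open the index here
                }
                // Downgrade: take the read lock while still holding the write lock.
                lock.readLock().lock();
            } finally {
                lock.writeLock().unlock(); // we continue under the read lock only
            }
        }
        try {
            // read the loaded state under the read lock
        } finally {
            lock.readLock().unlock();
        }
    }
}
```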

@rnewson (Member Author):

the purpose here is to ensure we have a read lock or write lock at all times.

Contributor:

I was thinking that if readLock().lock() could fail (does it throw or get interrupted?) then the next statement, holder.lock.writeLock().unlock();, won't run and we'd be stuck with the write lock held.

@rnewson (Member Author):

ah, I see what you mean. will ponder. the proper spell for this is in those javadocs, will double check

@rnewson (Member Author):

updated. agree that the read lock() should be before the finally for the reason stated. pattern now matches the javadoc example more closely.

Comment on lines +201 to +203
if (forceDelete) {
IOUtils.rm(indexRootPath(name));
}
Contributor:

Are there any assertions we can make about presence in the cache in these NOT_LOADED/UNLOADED states? For instance, would it make sense to assert we're not in the cache at this point?

@rnewson (Member Author):

hm, perhaps. I am also pondering whether we need three states. It is important not to block reading the cache while we load or unload (as these can take many seconds), hence I let you retrieve from the cache under just the cache lock, though it might return a Holder in the NOT_LOADED or UNLOADED state. The first thread to acquire the write lock gets the chore of opening the index.

Perhaps I really don't need a distinct UNLOADED state? It could go back to NOT_LOADED, and then we'd try to load it again, but only if someone has acquired the Holder before it is removed from the cache map itself... Hm, but no, because then we'd have to reinsert into the map. I think a boring lifecycle is better, NOT_LOADED -> LOADED -> UNLOADED (and removed from the cache), and repeat. What do you think?
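
For clarity, the lifecycle under discussion sketched as an enum; the names follow the conversation and may not match the PR's exact identifiers.

```java
// Sketch of the "boring" lifecycle discussed above; names follow the
// conversation, not necessarily the PR's code.
enum HolderState {
    NOT_LOADED, // holder sits in the cache map, index not yet opened; the first
                // thread to take the write lock gets the chore of opening it
    LOADED,     // index is open; readers proceed under the read lock
    UNLOADED    // index has been closed; the holder is removed from the cache
                // map and never transitions back
}
```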

@ben-manes commented Jun 14, 2024

Did you intend to have IndexEvictionListener run atomically within the map operation that removes the entry? In your new logic you defer that work to be asynchronous, so I believe you could use Caffeine.removalListener with cause.wasEvicted() or dispatch it yourself for the same effect. The naming is a little unobvious, unfortunately; that's a characteristic of feature evolution and of keeping familiarity with Guava Cache for straightforward migrations.

Edit:
oops, I missed that removeEldestEntry always returns false to evict the entry asynchronously.

I suppose another alternative would be to atomically evict into a victim cache that the AsyncIndexLoader consults. The AsyncIndexLoader could load from that first if the entry was found, either removing it or optionally having the IndexWeigher give the entry a weight of zero while unloading. When the IndexEvictionListener evicts it to the victim cache it could do your unloading procedure to discard it fully. I don't know whether that dance is any better than your own code that you can more easily understand and control, so I'm just providing options in case it's a helpful thought exercise.
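
For illustration, a minimal sketch of the Caffeine.removalListener route mentioned above; the Index type and the maximum size are assumptions, not part of the PR.

```java
import com.github.benmanes.caffeine.cache.Cache;
import com.github.benmanes.caffeine.cache.Caffeine;
import com.github.benmanes.caffeine.cache.RemovalCause;

// Hypothetical Index handle standing in for the real per-index state.
interface Index {
    void close();
}

class CaffeineListenerSketch {
    Cache<String, Index> build() {
        return Caffeine.newBuilder()
                .maximumSize(100)
                // The removal listener runs asynchronously, after the entry has
                // already left the map, so a slow close() does not block other
                // cache operations.
                .removalListener((String name, Index index, RemovalCause cause) -> {
                    if (cause.wasEvicted()) {
                        index.close(); // size/expiry eviction, not an explicit invalidate
                    }
                })
                .build();
    }
}
```

Caffeine's evictionListener, by contrast, runs atomically with the map operation that discards the entry, which is the behaviour the original IndexEvictionListener name suggests.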

@rnewson (Member Author) commented Jun 14, 2024

@ben-manes Hi, and thanks for jumping in (and for developing Caffeine). Yes, I need the eviction to happen atomically (or, at least, no client should try to use the item while it is unloading), and it can be very slow (potentially minutes). I concluded that I was using Caffeine inappropriately; caching stateful objects that have a mutex external to the JVM, a file-based lock, where loading and evicting can be very slow.

I need to ensure that any given cache entry is loaded and not unloading at the time of any interaction. I also can't load the same item twice (Lucene's write.lock is preventing that for very good reasons), which made the race between an eviction and a concurrent get() somewhat fun.

As you've seen I'm just using LinkedHashMap as a way to trigger awareness that we're at capacity, and I perform the unloading later. Happy to just accept that we might go a little over capacity, the trade-off in code simplicity is worth it to me.

Your suggestions are very helpful though, and I might be pulled back to Caffeine for this as I get more results from real world performance (I am not too thrilled about the synchronized(cache) bits, especially when reading the cache). What I like about this PR though is it pulls all of the concurrency logic into IndexManager directly, and there's not too much of it. Time will tell if it holds up, of course.
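
A rough sketch of the LinkedHashMap capacity trigger described above; the class, field, and method names here are hypothetical, and the real IndexManager differs in its details (state checks, locking, retries).

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class IndexManagerSketch {

    static class Holder { /* HolderState + ReentrantReadWriteLock, as in the PR */ }

    private final int maxSize = 100;
    private final ExecutorService unloader = Executors.newSingleThreadExecutor();

    // Access-ordered so the eldest entry is the least recently used one.
    private final LinkedHashMap<String, Holder> cache =
            new LinkedHashMap<String, Holder>(16, 0.75f, true) {
                @Override
                protected boolean removeEldestEntry(Map.Entry<String, Holder> eldest) {
                    if (size() > maxSize) {
                        // Unloading can take minutes, so hand it off rather than
                        // doing it inside the map operation.
                        unloader.execute(() -> unload(eldest.getKey(), eldest.getValue()));
                    }
                    // Always keep the entry for now; it is removed explicitly once
                    // the unload completes, so the map may briefly exceed maxSize.
                    return false;
                }
            };

    private void unload(String name, Holder holder) {
        // take holder's write lock, commit and close the index, mark it UNLOADED...
        synchronized (cache) {
            cache.remove(name);
        }
    }
}
```

The trade-off, as noted above, is that the map can temporarily exceed its capacity while unloads are in flight.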

@ben-manes:

Since the mapping is discarded asynchronously, I suppose removeEldestEntry could try to evict the same one multiple times during a slow unloading? In those cases then once that completes the cache would still exceed its capacity and be stuck there, since the call only happens once on insertion. I think you would need to prune the cache yourself after unloading to avoid a memory leak.

If you need a quick-and-dirty concurrent cache, the simple and classic and earliest approaches are to use a Clock (aka SecondChance) or a random sampling based strategy. These are popular, easy to write and maintain, can have good hit rates, are lock-free on reads, and the longer lock hold time on a write is typically okay for small caches. The exact workloads where they are good and bad varies, so it can require algorithm tuning to maximize hit rates, whereas Caffeine does that automatically.

It sounds like the key insight of your approach is to evict the entry from the eviction policy while asynchronously removing it from the data map. Caffeine works the other way by keeping the data map linearizable (as user-visible) and the policy eventually consistent. If you decided to keep using Caffeine then you could have it act as the policy but store the data in another hash table and use the loader + listener for coordination. You can think of that as an L1 / L2 inclusive cache where L1 pulls from L2 and L2 invalidations are sent to L1. That's not uncommon for local + remote cache layers, but here both would be on-heap.

However you go about it, it sounds like a fun but tricky little problem.
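
For comparison, a bare-bones Clock (second-chance) cache along the lines suggested above; all names are hypothetical, reads only flip a reference bit, a single lock covers writes, and it deliberately ignores the slow-unload problem that motivates this PR.

```java
import java.util.ArrayDeque;
import java.util.Queue;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicBoolean;

// Minimal Clock (second-chance) cache sketch; hypothetical, not the PR's code.
final class ClockCache<K, V> {

    private static final class Node<V> {
        final V value;
        final AtomicBoolean referenced = new AtomicBoolean(true);
        Node(V value) { this.value = value; }
    }

    private final int capacity;
    private final ConcurrentHashMap<K, Node<V>> map = new ConcurrentHashMap<>();
    private final Queue<K> ring = new ArrayDeque<>(); // the clock hand sweeps this

    ClockCache(int capacity) { this.capacity = capacity; }

    V get(K key) {
        Node<V> node = map.get(key);
        if (node == null) {
            return null;
        }
        node.referenced.set(true); // lock-free read: grant a second chance
        return node.value;
    }

    synchronized void put(K key, V value) {
        if (map.put(key, new Node<>(value)) == null) {
            ring.add(key); // new keys join the ring; updates keep their slot
        }
        while (map.size() > capacity) {
            K candidate = ring.poll();
            if (candidate == null) {
                break; // defensive: nothing left to sweep
            }
            Node<V> node = map.get(candidate);
            if (node == null) {
                continue; // stale ring entry, already evicted
            }
            if (node.referenced.compareAndSet(true, false)) {
                ring.add(candidate); // referenced since the last sweep: spare it
            } else {
                map.remove(candidate); // not referenced: evict
            }
        }
    }
}
```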

@rnewson (Member Author) commented Jun 14, 2024

hmm, maybe. concurrent attempts to cache.get() while we're unloading should all get the IndexHolder while it is either write-locked (currently unloading) or not locked but in the UN_LOADED state (in which event we retry, as the entry should be removed from the cache and the first thread that gets the lock on cache will create a new IndexHolder in NOT_LOADED state). However, it's exactly all these kinds of cases that I wanted to delegate to something else, like Caffeine, so I will be holding off merging this PR while I ponder it some more.

You've given me a lot to think about, and, yes, it is a surprisingly fun little problem to solve. You'd think on my third "full text for couchdb" codebase that I'd have a solid solution to it by now. alas...

@nickva (Contributor) left a comment:

+1

The general idea seems sound. I lost most of my Java knowledge a while back, so may not have caught all the subtle details.

@rnewson (Member Author) commented Jun 18, 2024

thanks Nick. I expect we'll find a corner case but at least we have all the logic in one place, and it's quite small. Time will tell, and we clearly have some other options to try from what Ben has contributed.

@rnewson merged commit 3dee4e7 into main on Jun 18, 2024
17 checks passed
@ben-manes:

@rnewson did you review whether there is a memory leak, as I suspected? The scenario is:

  1. Insert N items
  2. Insert key N+1 to trigger an eviction for key K0 (removeEldestEntry => false)
  3. Unload starts
  4. Insert K entries (removeEldestEntry => false, still selects key K0)
  5. Unload finished
  6. Insert key X => what is evicted?

I believe since removeEldestEntry returns false it will be stuck growing to higher thresholds based on the number of observed concurrent evictions. Since unloading is slow, over time the cache could greatly exceed your threshold. That excess won't be unloaded since only the eldest in-flight unload is observable.
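
Sketching the fix against the hypothetical IndexManagerSketch above: once an unload finishes and its entry is removed, check whether the map is still over capacity and schedule the next eldest entry too, so any excess built up during a slow unload eventually drains.

```java
// Hypothetical extension of unload() from the earlier sketch: after removing
// the finished entry, keep draining while the map is still over capacity.
private void unload(String name, Holder holder) {
    // ... take holder's write lock, commit and close the index, mark it UNLOADED ...
    Map.Entry<String, Holder> next = null;
    synchronized (cache) {
        cache.remove(name);
        if (cache.size() > maxSize) {
            next = cache.entrySet().iterator().next(); // eldest in access order
        }
    }
    if (next != null) {
        Map.Entry<String, Holder> eldest = next; // effectively final for the lambda
        // with a single-threaded unloader this cannot double-schedule an in-flight entry
        unloader.execute(() -> unload(eldest.getKey(), eldest.getValue()));
    }
}
```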

@rnewson (Member Author) commented Jun 18, 2024

ah, oops. will check, and fix in subsequent PR. thanks.

@rnewson deleted the decaffeinated branch June 20, 2024 13:54