perf(core.js): introduce promise ring #9979
Conversation
Breaching ring sectors in batches doesn't seem to yield any performance gains ultimately, so I've dropped references to ring sectors; this should otherwise be good to merge, if it's correct.
…g bounds Given that nextPromiseId is an increment ahead of the cursor last written to
I see about a 3% real-world improvement from this optimization in the HTTP throughput benchmark. However, to my eye, this optimization seems rather complicated for such minimal benefit. If you don't mind, I'd rather leave this one queued up for the future and search for lower-hanging fruit for now.
Though it's not directly related to this PR, it seems that when promiseId reaches Number.MAX_SAFE_INTEGER, the next asyncOp might fail.
Yeah, I thought of that, and it would be relatively easy to handle, but it needs to be coordinated on the Rust side. We could easily wrap around or fall back to BigInts. In practice, it should be hard to hit that limit: a Deno program doing 10M opcalls/s non-stop would take ~28.5 years to reach it.
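The ~28.5-year figure can be checked with a quick back-of-envelope calculation (the 10M opcalls/s rate is the hypothetical sustained load from the comment above):

```javascript
// Promise IDs are plain JS numbers, so the hard ceiling before IDs lose
// integer precision is Number.MAX_SAFE_INTEGER (2^53 - 1).
const OPS_PER_SECOND = 10_000_000; // hypothetical sustained 10M opcalls/s
const SECONDS_PER_YEAR = 60 * 60 * 24 * 365;

const yearsToExhaust =
  Number.MAX_SAFE_INTEGER / OPS_PER_SECOND / SECONDS_PER_YEAR;
console.log(yearsToExhaust.toFixed(1)); // roughly 28.5, matching the estimate
```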
I'm working on other async op improvements, but this improvement is pretty significant: it's a 10% reduction in async-op baseline at current values, and it substantially helps improve tail latencies and close the tail-latency gap with Node on our TCP latency benches. These improvements come at a relatively small cost: ~20 lines of moderately complex JS. Ring buffers are a relatively common and well-understood data structure, nothing too exotic. A promise ring is naturally less straightforward than a plain Map or Object, but it significantly reduces the GC pressure of promises whilst being faster than both. Unless we have a better alternative for holding promises, I would be in favor of landing this.
The promise Map has a better op baseline than Object but has higher tail latencies due to the GC pressure; the ring is better than both.
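The GC-pressure argument can be illustrated with a minimal sketch contrasting the two storage strategies (illustrative names, not the actual core.js code):

```javascript
const RING_SIZE = 4 * 1024;
// Preallocated once; storing a promise just overwrites a slot in place,
// so the hot path does no allocation and creates no garbage.
const ring = new Array(RING_SIZE).fill(null);
// A Map allocates internal entry storage on every set and churns it on
// every delete, which is what drives the GC pressure.
const table = new Map();

function storeInRing(promiseId, promise) {
  ring[promiseId % RING_SIZE] = promise; // allocation-free slot reuse
}

function storeInMap(promiseId, promise) {
  table.set(promiseId, promise); // allocates a hash-table entry per promise
}
```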
Performance improvements from this PR, both the throughput increase and the tail latency decrease, are too good to let it wait. LGTM!
const promiseTable = new Map();
const promiseMap = new Map();
const RING_SIZE = 4 * 1024;
const NO_PROMISE = null; // Alias to null is faster than plain nulls
What, really??!
Yeah, it appears to save ~10ns/opcall in the benches I ran.
I haven't double-checked the V8-generated bytecode, but I assume it could be faster due to checking via referential equality instead of value equality.
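For reference, the aliased-sentinel pattern under discussion looks like this (a sketch, not the exact core.js code; `NO_PROMISE` is just a named reference to `null`, so `===` against it is still a strict reference comparison):

```javascript
// Aliased sentinel: a named constant bound to null.
const NO_PROMISE = null;

function isSlotFree(slot) {
  return slot === NO_PROMISE; // semantically identical to `slot === null`
}
```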
I will definitely need to double-check this and narrow it down, to see if there is a true performance difference or if it was just a statistical fluke in my local benches.
But it's easy to change either way and doesn't negatively impact performance or correctness in its current form.
@piscisaureus I've dumped the V8 bytecode and diffed the bytecode implementations of getPromise() and setPromise().
This is another optimization to help improve the baseline overhead of async ops. It shaves off ~55ns/op or ~7% of the current total async op overhead.
Though it's only 7% of the total async-op overhead, by my estimates it reduces the overhead of promise storing/lookup by somewhere between 33-50%.
It achieves these gains by taking advantage of the sequential nature of promise IDs: it optimistically stores them sequentially in a pre-allocated circular buffer and falls back to the promise Map for slow-to-resolve promises.
The promise ring adds some constant memory overhead: ~4 B per slot, so ~16 KB for 4K slots, etc.
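A rough self-contained sketch of the mechanism described above (hypothetical helper names and a simplified `deferred`; the actual core.js implementation may differ in details):

```javascript
const RING_SIZE = 4 * 1024;
const NO_PROMISE = null;
const promiseRing = new Array(RING_SIZE).fill(NO_PROMISE);
const promiseMap = new Map(); // fallback for slow-to-resolve promises
let nextPromiseId = 0;

// Minimal deferred helper so the sketch runs on its own.
function deferred() {
  let methods;
  const promise = new Promise((resolve, reject) => {
    methods = { resolve, reject };
  });
  return Object.assign(promise, methods);
}

function setPromise() {
  const promiseId = nextPromiseId++;
  const idx = promiseId % RING_SIZE;
  const oldPromise = promiseRing[idx];
  if (oldPromise !== NO_PROMISE) {
    // The promise stored here one full lap ago is still pending:
    // evict it to the fallback Map so it isn't clobbered.
    promiseMap.set(promiseId - RING_SIZE, oldPromise);
  }
  const promise = deferred();
  promiseRing[idx] = promise;
  return promiseId;
}

function getPromise(promiseId) {
  // Slow path: the id fell more than one lap behind the write cursor,
  // so the promise (if any) now lives in the fallback Map.
  if (promiseId < nextPromiseId - RING_SIZE) {
    const promise = promiseMap.get(promiseId);
    promiseMap.delete(promiseId);
    return promise;
  }
  // Fast path: a plain array read, then clear the slot for reuse.
  const idx = promiseId % RING_SIZE;
  const promise = promiseRing[idx];
  promiseRing[idx] = NO_PROMISE;
  return promise;
}
```

With sequential IDs and promises that resolve within one lap of the ring, every lookup stays on the allocation-free array fast path; only long-lived promises pay the Map cost.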
Benches
Todo
- setPromise