
Bump up MAX_HP_FOR_GC from 1GB to 3GB #2848

Merged: 4 commits into dfinity:master on Nov 1, 2021

Conversation

@ulan (Contributor) commented Oct 21, 2021

I think the plan was to get the initial version in with 1GB, then bump it up to 3GB, and then implement a more advanced heuristic.

The motivation: there is a spike of messages that dirty between 256MB and 512MB of memory. One theory is that these messages are from Motoko canisters that crossed the 1GB threshold.

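A minimal sketch of the kind of scheduling rule under discussion. The constant names, the growth rule used below the ceiling, and the example numbers are assumptions for illustration, not the actual motoko-rts code; only the behaviour at the ceiling (GC after every update message once the heap passes MAX_HP_FOR_GC) is taken from this thread.

```rust
// Illustrative sketch only -- not the actual motoko-rts scheduling code.
const KB: usize = 1024;
const MB: usize = 1024 * KB;
const GB: usize = 1024 * MB;

/// Ceiling above which a GC is scheduled after every update message;
/// this PR bumps it from 1 GiB to 3 GiB.
const MAX_HP_FOR_GC: usize = 3 * GB;

/// Hypothetical growth-based rule used below the ceiling (assumption).
const GROWTH_FACTOR_PERCENT: usize = 150;

/// Decide whether to run the GC at the end of an update message.
fn should_gc(heap_size: usize, heap_size_at_last_gc: usize) -> bool {
    if heap_size >= MAX_HP_FOR_GC {
        // Past the ceiling: always collect, so the next message still
        // has some allocation area left in the 4 GiB address space.
        return true;
    }
    // Below the ceiling: collect only if the heap has grown "enough"
    // since the last collection (the exact rule is an assumption here).
    heap_size * 100 >= heap_size_at_last_gc * GROWTH_FACTOR_PERCENT
}

fn main() {
    // With the old 1 GiB ceiling a 1.2 GiB heap would collect after
    // every message; with a 3 GiB ceiling it collects only on growth.
    assert!(should_gc(3 * GB + MB, 3 * GB));
    assert!(!should_gc(GB + 200 * MB, GB + 100 * MB));
    println!("ok");
}
```
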
@ulan ulan requested review from osa1 and crusso October 21, 2021 20:03
@crusso (Contributor) commented Oct 21, 2021

I'll let @osa1 approve this one - unless you are looking for a rubber stamp.

@osa1 (Contributor) commented Oct 22, 2021

Canisters that allocate at a slow rate but allocate large amounts every once in a while rely on this parameter to have enough allocation area for the calls with large allocations. A larger MAX_HP_FOR_GC will make it more likely for those canisters to get stuck. I have no objections if we're OK with this.

Perhaps in the long run it would make sense to allow configuring these parameters. We could even do it at runtime, maybe with a compiler-generated upgrade method (which would also help with getting a canister unstuck), or provide a prim and let the user define the endpoint if they feel the need.

Since a larger MAX_HP_FOR_GC is more risky (may cause some canisters to get stuck), I wonder if we should think about possible recovery strategies before merging?
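A hypothetical sketch of the "configurable at runtime" idea from the comment above. Nothing like this exists in the RTS today; the names and the atomic-variable approach are made up for illustration of how a compiler-generated method or prim could update the threshold that the post-message GC check reads.

```rust
// Hypothetical sketch only -- not existing motoko-rts code.
use std::sync::atomic::{AtomicUsize, Ordering};

const GIB: usize = 1 << 30;

// Default matches the value this PR proposes.
static MAX_HP_FOR_GC: AtomicUsize = AtomicUsize::new(3 * GIB);

/// Would be called from a (hypothetical) compiler-generated endpoint,
/// e.g. to lower the threshold and un-stick a canister before upgrading.
fn set_max_hp_for_gc(bytes: usize) {
    MAX_HP_FOR_GC.store(bytes, Ordering::Relaxed);
}

/// The post-message check reads the current (possibly updated) value.
fn should_force_gc(heap_size: usize) -> bool {
    heap_size >= MAX_HP_FOR_GC.load(Ordering::Relaxed)
}

fn main() {
    assert!(!should_force_gc(2 * GIB));
    set_max_hp_for_gc(GIB); // operator dials the ceiling back down
    assert!(should_force_gc(2 * GIB));
    println!("ok");
}
```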

The motivation: there is a spike of messages that dirty between 256MB and 512MB of memory. One theory is that these messages are from Motoko canisters that crossed the 1GB threshold.

Is the spike for just one message? If a canister passes 1 GiB threshold and dirties 256-512 MiB of memory, that means half of the space is reclaimed, and for the next few messages it shouldn't do GC (assuming it allocates at the same rate as before).

@osa1 (Contributor) commented Oct 22, 2021

(Restarted failing CI job)

@ulan (Contributor, Author) commented Oct 22, 2021

Canisters that allocate at a slow rate but allocate large amounts every once in a while rely on this parameter to have enough allocation area for the calls with large allocations. A larger MAX_HP_FOR_GC will make it more likely for those canisters to get stuck. I have no objections if we're OK with this.

I didn't get why the canister would get stuck depending on the value of this parameter. Don't we force a GC when allocation fails anyway, independent of the GC schedule?

@ulan (Contributor, Author) commented Oct 22, 2021

Is the spike for just one message? If a canister passes 1 GiB threshold and dirties 256-512 MiB of memory, that means half of the space is reclaimed, and for the next few messages it shouldn't do GC (assuming it allocates at the same rate as before).

I am not sure. If all objects survive, would we dirty 0 pages or all pages? I wonder if most of the objects at the beginning are surviving and only the objects at the end are moved.

@osa1 (Contributor) commented Oct 22, 2021

I didn't get why the canister would get stuck depending on the value of this parameter. Don't we force a GC when allocation fails anyway, independent of the GC schedule?

We only do GC after an update message.

@osa1 (Contributor) commented Oct 22, 2021

If all objects survive, would we dirty 0 pages or all pages?

  • With copying GC: all pages
  • With mark-compact: only a few pages for the mark stack and bitmap
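Back-of-the-envelope arithmetic for the all-objects-survive case, assuming 64 KiB Wasm pages and 4-byte (wasm32) heap words, with the one-bit-per-word mark bitmap mentioned later in the thread. The point is the relative difference: the mark-compact write traffic is tiny compared to copying. Illustrative only, not measured data.

```rust
// Illustrative arithmetic, not RTS code or benchmark results.
const WASM_PAGE: u64 = 64 * 1024;
const GIB: u64 = 1 << 30;
const MIB: u64 = 1 << 20;

fn pages(bytes: u64) -> u64 {
    (bytes + WASM_PAGE - 1) / WASM_PAGE
}

fn main() {
    let heap = GIB; // example heap in which every object survives

    // Copying GC: the whole heap is written into to-space and then
    // copied back into from-space, so roughly 2x the heap is dirtied.
    let copying = pages(2 * heap);

    // Mark-compact with nothing to move: mostly the mark bitmap
    // (one bit per 4-byte word) is written, plus a small mark stack
    // (ignored here).
    let bitmap = heap / 4 / 8;
    let compacting = pages(bitmap);

    println!("copying:      ~{} pages dirtied ({} MiB)", copying, 2 * heap / MIB);
    println!("mark-compact: ~{} pages dirtied ({} MiB)", compacting, bitmap / MIB);
}
```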

@ulan (Contributor, Author) commented Oct 22, 2021

I didn't get why would the canister get stuck depending the value of this parameter? Don't we force GC when allocation fails anyway independent of the GC schedule?

We only do GC after an upgrade message.

I see, thanks! Is it due to missing implementation or was it intentionally removed? In general it seems very important to do GC if allocation fails, because the GC schedule cannot be perfect.
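A minimal sketch of the allocation-time fallback being asked about here; this is not how the RTS works today (GC only runs between update messages). As a later comment notes, adding this amounts to a large "stack support" project, presumably because collecting in the middle of a message needs the GC roots held on the Wasm stack.

```rust
// Hypothetical sketch of a GC-on-allocation-failure fallback.
const GIB: u64 = 1 << 30;
const HEAP_LIMIT: u64 = 4 * GIB - 1; // roughly the wasm32 address space

struct Heap {
    used: u64,
}

impl Heap {
    /// Try to allocate; on failure, run a GC once and retry.
    fn alloc(&mut self, bytes: u64) -> Result<(), &'static str> {
        if self.used + bytes > HEAP_LIMIT {
            self.collect(); // would need the GC roots held on the Wasm stack
            if self.used + bytes > HEAP_LIMIT {
                return Err("out of memory");
            }
        }
        self.used += bytes;
        Ok(())
    }

    fn collect(&mut self) {
        // Placeholder: pretend half of the heap turns out to be garbage.
        self.used /= 2;
    }
}

fn main() {
    let mut heap = Heap { used: 3 * GIB };
    assert!(heap.alloc(2 * GIB).is_ok()); // succeeds only via the fallback GC
    println!("ok");
}
```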

@osa1 (Contributor) commented Oct 22, 2021

Is it due to missing implementation or was it intentionally removed?

It was never implemented. See also #2033.

@ulan (Contributor, Author) commented Oct 22, 2021

Is it due to missing implementation or was it intentionally removed?

It was never implemented. See also #2033.

Thanks. That's what I feared. Adding stack support is a large project.

Perhaps in the long run it would make sense to allow configuring these parameters. We could even do it at runtime, maybe with a compiler-generated upgrade method (which would also help with getting a canister unstuck), or provide a prim and let the user define the endpoint if they feel the need.

That sounds reasonable as a mitigation. Another idea: can we detect that we are running in the context of the pre/post upgrade hooks in Motoko? If so, we could define one hard limit for ordinary messages (e.g. 3GB) after which we trap with OOM, but increase the limit to 4GB when running in the upgrade hooks, so that the developer can get the canister unstuck.
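A hypothetical sketch of that idea; the names and the exact limits are assumptions, not existing RTS code: a lower hard limit for ordinary messages and a higher one inside pre/post-upgrade hooks, so a stuck canister can still be upgraded.

```rust
// Hypothetical sketch -- names and limits are illustrative assumptions.
const GIB: u64 = 1 << 30;
const MESSAGE_HEAP_LIMIT: u64 = 3 * GIB;     // ordinary update/query messages
const UPGRADE_HEAP_LIMIT: u64 = 4 * GIB - 1; // pre/post-upgrade hooks

/// Would an allocation that grows the heap to `new_heap_size` be allowed?
fn allocation_allowed(new_heap_size: u64, in_upgrade_hook: bool) -> bool {
    let limit = if in_upgrade_hook {
        UPGRADE_HEAP_LIMIT
    } else {
        MESSAGE_HEAP_LIMIT
    };
    new_heap_size <= limit // otherwise the canister would trap with OOM
}

fn main() {
    let heap = 3 * GIB + GIB / 2; // 3.5 GiB
    assert!(!allocation_allowed(heap, false)); // ordinary message: trap
    assert!(allocation_allowed(heap, true));   // upgrade hook: still allowed
    println!("ok");
}
```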

@nomeata (Collaborator) commented Oct 22, 2021

can we detect that we are running in the context of the pre/post upgrade hooks in Motoko?

Yes, see Lifecycle module in compile.ml.

@ggreif ggreif added the automerge-squash When ready, merge (using squash) label Oct 22, 2021
@crusso (Contributor) commented Oct 25, 2021

I didn't get why the canister would get stuck depending on the value of this parameter. Don't we force a GC when allocation fails anyway, independent of the GC schedule?

We only do GC after an upgrade message.

You mean update (not upgrade) message, right? Just in case someone got confused...

@osa1 (Contributor) commented Oct 25, 2021

You mean update (not upgrade) message, right? Just in case someone got confused...

Yes, sorry. Edited my original message.

@osa1 (Contributor) commented Oct 29, 2021

Created #2864 for the CI issue.

@mergify mergify bot merged commit c9d4d08 into dfinity:master Nov 1, 2021
@mergify mergify bot removed the automerge-squash When ready, merge (using squash) label Nov 1, 2021
@osa1 (Contributor) commented Nov 1, 2021

I wasn't aware that this was labeled as "automerge-squash". I think we should revert this, since it will make it more likely for canisters to get stuck, and we have no way of recovering such canisters.

Any thoughts on this? @crusso @ggreif @ulan

@osa1 (Contributor) commented Nov 1, 2021

@ulan and I discussed this. Here's my summary of the problem and what we want to do:

  • MAX_HP_FOR_GC = 1 GiB is too low. It is common for canisters to have more than 1 GiB of live data, and in those cases doing a GC after every update message causes performance problems (mainly on the replica, but it may use too many cycles too).

  • This PR increases MAX_HP_FOR_GC to 3 GiB, which makes it more likely for canisters to get stuck. With this change, when a canister reaches a 3 GiB heap it will have 1 GiB of allocation area for the next query message, and less than that for the next update message, because it will want to do a GC after the update message.

  • We are OK with giving canisters 1 GiB of allocation space when their heap size reaches 3 GiB. (I'm not sure what the reasoning here is; perhaps @ulan can say more.)

  • However, the current default GC (copying) copies all live data to a new space and then back. So with this change, a canister with a 3 GiB heap can have at most 1 GiB of live data (less, depending on how much it allocates in the last message; rough arithmetic is sketched after this comment). This case is probably not too uncommon, so we want to revert this PR for now.

One possible way forward here is making the compacting GC the default. With compacting GC, we need to allocate a bitmap (one bit per heap word), and a growable mark stack. This makes it less likely for canisters with 3 GiB heap size to get stuck.

Compacting GC uses more cycles (~30% more the last time I benchmarked it), but it halves the number of dirtied pages, and a large number of dirtied pages is what causes problems on the replica side. So we're OK with trading cycles for fewer dirtied pages.

Before making compacting GC the default we want to test it more. One idea is to backport random heap generation in #2706.

For now we revert this; we should re-land it together with the PR that makes compacting GC the default.
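Rough space accounting behind the "at most 1 GiB of live data" bullet above, assuming the 4 GiB wasm32 address space and ignoring everything besides the heap and the to-space copy. Illustrative arithmetic only, not RTS code.

```rust
// Illustrative arithmetic for the copying-GC headroom concern above.
const GIB: u64 = 1 << 30;
const ADDRESS_SPACE: u64 = 4 * GIB; // wasm32 limit

/// Copying GC needs room for a to-space holding the live data on top of
/// the existing heap: heap + live <= address space (roughly, ignoring
/// code, static data and the Wasm stack).
fn max_live_for_copying_gc(heap_size: u64) -> u64 {
    ADDRESS_SPACE.saturating_sub(heap_size)
}

fn main() {
    // Old ceiling: GC runs before the heap passes 1 GiB,
    // leaving ~3 GiB of headroom for the copy.
    println!("heap 1 GiB -> at most ~{} GiB live", max_live_for_copying_gc(GIB) / GIB);
    // New ceiling: GC may be deferred until 3 GiB, so a collection at
    // that point only succeeds if at most ~1 GiB is live.
    println!("heap 3 GiB -> at most ~{} GiB live", max_live_for_copying_gc(3 * GIB) / GIB);
}
```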

osa1 added a commit that referenced this pull request Nov 1, 2021
mergify bot pushed a commit that referenced this pull request Nov 1, 2021
@osa1 (Contributor) commented Nov 2, 2021

In addition to my comment above: we also discussed the actual costs of the GC algorithms for the platform. Ulan says one dirtied page costs the replica the equivalent of roughly 10,000 to 100,000 cycles. If we take that into account in GC benchmarks, I think compacting GC may be significantly faster than copying GC.

I don't have the raw data for the latest versions of the compacting and copying GCs, so I will have to repeat the benchmarks for this.
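A toy cost model combining the observations in this thread (~30% more executed cycles for compacting, ~half the dirtied pages, and 10k-100k cycles of replica cost per dirtied page). All concrete numbers below are placeholders, not benchmark data; the point is that the comparison can flip once dirtied pages are priced in.

```rust
// Toy cost model with placeholder numbers -- not benchmark data.
fn total_cost(exec_cycles: u64, dirtied_pages: u64, cycles_per_dirty_page: u64) -> u64 {
    exec_cycles + dirtied_pages * cycles_per_dirty_page
}

fn main() {
    // Hypothetical copying-GC run.
    let copy_exec: u64 = 1_000_000_000;
    let copy_pages: u64 = 32_768; // e.g. 2 GiB of 64 KiB pages

    // Compacting: ~30% more executed cycles, ~half the dirtied pages.
    let compact_exec = copy_exec * 130 / 100;
    let compact_pages = copy_pages / 2;

    for per_page in [10_000u64, 100_000] {
        println!(
            "per-page cost {:>6}: copying {:>13}, compacting {:>13}",
            per_page,
            total_cost(copy_exec, copy_pages, per_page),
            total_cost(compact_exec, compact_pages, per_page),
        );
    }
}
```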

@rossberg (Contributor) commented Nov 2, 2021

I agree that we should make compacting GC the default as soon as we think it's safe to do so.

In the meantime, would it make sense to bump max HP to 2GiB (or something slightly less)? Would that avoid the risk of getting stuck?

@nomeata (Collaborator) commented Nov 2, 2021

Compacting might be better than copying, but even that is not really a viable solution; didn't we conclude that only a non-moving incremental GC actually works on the IC? Has work in that direction started? Or at least, do we know which design to work towards?

@rossberg (Contributor) commented Nov 2, 2021

Yes, we definitely want an incremental generational collector, and Ömer is busy working towards that. I don't know that we concluded non-moving is required (or desired), but Ömer is currently implementing a page-based heap as a prerequisite for the next steps. With that, the GC would mostly operate on one page at a time in the end.

@nomeata (Collaborator) commented Nov 2, 2021

Ok, fair enough, incremental might be good enough even if moving.
