Improve hydration by reordering optimally #6395

hbirler · 2021-06-09T21:58:32Z

(Hopefully) Closes #4308.

I am new to working with Svelte so I apologize for any errors beforehand.

Explanation

I tried to improve #6204 based on the thread #4308. I personally had problems with elements with CSS animations flickering due to hydration detaching and reattaching nodes (implicitly by re-inserting them) as described in the thread. The page that I was having problems with was almost entirely static, so seeing all the static nodes refresh went against my intuition as to how hydrate would work. So I looked into how this situation could be improved.

Old algorithm

Edit: The algorithm has been improved significantly. See #6395 (comment) for the new algorithm

During hydration, this new implementation tries to pick a sequence of nodes that we are able to not detach: the first common subsequence of the original Nodes and the claim_* requests.

For example, if the original nodes are:
B C D E F D G

and the claim_element function calls are as follows:
A B C D D E F G H

the first common subsequence would look like
B C D D G

You can find this subsequence by iterating through the claim_elements and keeping a pointer on the original nodes that you are only able to move forward. For every claim_element, if you are able to move the pointer to a matching node on the right, do it. Otherwise, do not move the pointer.

The algorithm chooses to preserve the nodes within this subsequence (i.e. not detach them). The rest of the nodes that are requested by claim_* are handled as before: they are either found among existing nodes to be detached or created anew.

Normally, after the claims are done, append and insert are called with every single child, since the caller assumes that all the children are detached. Since this is not the case anymore (the shared subsequence was not detached), we need to transparently make sure that append does what is expected, which in this case might be to insert a node in the middle of existing non-detached children. During append, the algorithm tracks the last appended node, i.e., what the caller thinks is actually the last child of the target. Thus, we can deduce where the caller actually wants to append the node. The insert function works similarly.

Possible optimizations

Note that the first common subsequence is not the longest common subsequence:
B C D E F G

We would prefer to not detach as many nodes as possible, so the longest common subsequence would theoretically be preferable. However, since the claim_* functions (if I understand them correctly) are expected to return Nodes directly (and greedily), I am not sure a change can be implemented without changing the compiler side.

Even though finding the optimal (longest) subsequence to not detach is problematic, it should be possible to further improve the greedy approach based on heuristics. In the example above, the second D node that is claimed results in the two nodes E and F getting detached. If the D node is just Text and E/F are large Elements, that would be quite a sad occurrence. This could be optimized in the future handling Text Nodes differently. I did not implement such an optimization since I was not sure my assumptions are correct and would love to receive feedback.

One might also look into what virtual DOM front-end frameworks are doing to find minimal differences https://github.com/localvoid/ivi/blob/2c81ead934b9128e092cc2a5ef2d3cabc73cb5dd/packages/ivi/src/vdom/implementation.ts#L1367 . But I am not sure how applicable those will be to our situation.

There may be many more possible optimizations! I am relatively inexperienced with front-end development so I might be missing many optimization opportunities.

Possible issues

Within the code, I added an additional property actual_end_child to Node objects during append and insert calls to make tracking last appended nodes easier. If this is a problem, we can switch to using a Map<Node, Node>.

Tests

The code passed all the tests but I did not add any new tests since I wasn't sure how to test for whether nodes are being detached. Maybe a total number of nodes detached counter could be used to ensure that the algorithm is working and further optimizations do not cause regressions. Any feedback on this is appreciated.

hbirler · 2021-06-10T00:18:17Z

From what I understand #6392 is similar in that it implements a similar insert/append logic, but it does not change the claim_* functions. The original claim_* functions always assign the first possible matching node to a claimed element, if such a node exists.

In the example that I gave with the original nodes:
B C D E F D G

and the claim_element function calls:
A B C D D E F G H

that pull request seems to do the following mutations (step-by-step states):

B C D E F D G    p:0 (create A)
A B C D E F D G    p:1 (nothing B)
A B C D E F D G    p:2 (nothing C)
A B C D E F D G    p:3 (nothing D)
A B C D D E F G    p:4 (move D)
A B C D D E F G    p:5 (nothing E)
A B C D D E F G    p:6 (nothing F)
A B C D D E F G    p:7 (nothing G)
A B C D D E F G H    p:8 (create H)

So B C D E F G is not detached which is better than my implementation in this example.
But on the actual page that I was testing on that pull request causes a refresh of most elements while this pull request does not.

A worst case for this pull request is when an early element is matched very late (since the first common subsequence will be very short).

A worst case for #6392 is when a late element is matched very early (since all the other elements have to move in front of that element).

hbirler · 2021-06-10T00:51:15Z

From what I know & understand, given arbitrary claim_* functions, the minimum amount of reorderings that assign & insert have to do can be computed via the longest increasing subsequence.

So, in a sense, we could potentially implement the claim_* functions such that they make the longest increasing subsequence as long as possible (using heuristics).

Then, we can theoretically implement insert & append in a way that they do the minimum number of moves given the claims.

hbirler · 2021-06-10T11:31:07Z

I have implemented optimal reordering and changed the claim_* logic somewhat. Here is the new algorithm which is also influenced by #6392:

New algorithm

The new algorithm computes the optimal amount of reorderings for a given set of claims. The claims are made in such a way that the optimal amount of reorderings is likely small.

There are two steps in hydration that are relevant to us. The first step is the claim step where existing nodes are picked to be reused one by one (or created when no corresponding node already exists). The second step is the modification step where claimed nodes are reordered and newly created nodes are inserted.

Let's say that our existing nodes are:
B C D X F G E H

And we claim:
A B C D E F G H I

During the modification step, there are three operations that we do modify our existing nodes to match the claims.

Detach: We remove nodes we have not claimed (X):
B C D F G E H
Reorder: We reorder the nodes to match the claim order (reorder E to the left). This step is executed at the first append/insert:
B C D E F G H
Insert: We insert newly created nodes into their corresponding spots:
A B C D E F G H I

What we can optimize in this approach is to minimize the total number of reorderings. There are two factors that effect the total number of reorderings:

The first factor is the reordering operations that we actually choose to do.
For example, the sequence B C D A can be sorted in multiple ways.
Either we move A to the left and do only one operation.
Or we move B, C, and D one-by-one to the right and do three operations.

Luckily the optimal amount of reorderings can be computed algorithmically in O(n log n) time, which I have implemented in the latest commit. The optimal amount of reorderings is given by (number_of_nodes - longest_ordered_subsequence_of_claimed_nodes). So if we maximize the longest ordered subsequence of claimed nodes, we minimize the amount of reordering, since only the nodes that are not in the longest ordered subsequence are reordered.

The second factor is, of course, the claiming order. If there are multiple elements with the same tag we are trying to claim, we can choose any one of those elements. This choice will effect the total number of reordering. What I do is, I remember the position of the last claim, and try to claim a node to its right side. If I cannot do it, I claim a node to the left (or create a new node when no matching node exists). I am hoping that this heuristic will maximize the longest ordered subsequence. (Additionally, if I am only claiming a text node, I do not update the position of the last claim, since I do not want text nodes to break a sequence of real elements. My notes under possible optimizations in the original post contain an example for this #6395 (comment))

New possible issues

More data is now stored in nodes. We can choose to use a Map<Node, Data> instead.
I am not sure about the following check in the code https://github.com/sveltejs/svelte/pull/6395/files#diff-7ad300a4efc821b2576200a7bfc99205d558792b81c037fbc20f9236f8b6f1bbR114 .
The ((target.actual_end_child !== null) && (target.actual_end_child.parentElement !== target)) is to satisfy an error I received in head-detached-in-dynamic-component where a title node was stored as the actual_end_child but was later missing.

hbirler · 2021-06-10T11:34:17Z

I updated the title to better reflect what the algorithm is doing now.

ehrencrona · 2021-06-11T13:09:09Z

This is very clever. I ran some benchmarks on my site and this seems significantly faster than my PR #6392; maybe 15% faster or so. So I'd support this PR over mine. I'll try to add some detailed comments on the code later.

benmccann · 2021-06-19T12:57:25Z

@hbirler we just checked in some new tests for hydration. Can you rebase this PR against master so that we can make sure the tests still pass?

Conduitry · 2021-06-19T13:16:08Z

I just tried this locally, and all of the new tests pass with these changes.

During hydration, greedily pick nodes that exist in the original HTML that should not be detached. Detach the rest.

During hydration we track the order in which children are claimed. Afterwards, rather than reordering them greedily one-by-one, we reorder all claimed children during the first append optimally. The optimal reordering first finds the longest subsequence of children that have been claimed in order. These children will not be moved. The rest of the children are reordered to where they have to go. This algorithm is guaranteed to be optimal in the number of reorderings. The hydration/head-meta-hydrate-duplicate test sample has been modified slightly. The order in which the <title> tag is being generated changed, which does not affect correctness.

Not sorting children before executing the `insertBefore` calls in `init_hydrate` potentially caused extra `insertBefore` calls in `append`

hbirler · 2021-06-19T14:02:44Z

I rebased the branch. The tests pass locally for me as well.

nickreese · 2021-06-22T19:10:06Z

@hbirler Thank you soo much for your effort on this.

benmccann · 2021-06-24T22:35:00Z

@hbirler thanks for this. Hydration performance has been a long standing issue, so it's great to see improvements!

I just updated a SvelteKit site of mine to use it. I see in the Chrome performance tab screenshots that the logo on my site (https://c3.ventures/) momentarily disappears and then is rerendered. I set hydration: false and this flash no longer occurs. I'm curious why the flicker is occurring given this change. The content on my site is all static and the page is prerendered, so there should be nothing like a timestamp that might differ between server and client. I'd thus expect the common longest subsequence to be the entire DOM. Do you have any idea why this optimization might not be working for me? If you'd be interested or willing to take a look, I can add you to the GitHub repo for my SvelteKit project so that you can take a peek.

hbirler · 2021-06-25T09:12:51Z

@benmccann I think I might have found out what the issue is.
To update an image's src attribute, svelte generates the following code:
if (img.src !== (img_src_value = srcVariable)) attr(img, "src", img_src_value);

The idea is, if the source is already set correctly, do not update it.
In your website, I have found a matching line in compiled index.svelte:
he.src !== (me = L) && m(he, "src", me)

However, if I look at the actual values:
he.src === https://c3.ventures/_app/assets/c3-logo-alpha-7c844443.webp
me === /_app/assets/c3-logo-alpha-7c844443.webp

So even though the values are theoretically equivalent, we still end up changing the src attribute to something else. I believe this might be the issue.

The img node pre-hydration looks like:
<img src="/_app/assets/c3-logo-alpha-7c844443.webp" width="262" height="68" alt="C3 Ventures">

So the issue is not that the generated ssr html is wrong. Reading the src attribute client side somehow returns the full url. Maybe someone more experienced with web development than me can comment on this? Maybe a more proper equivalence check can be done rather than just ===?

hbirler · 2021-06-25T09:14:19Z

Whatever the solution, the srcset attribute may need a similar correction.

hbirler · 2021-06-25T09:31:37Z

(new URL(relativePath, location)).href seems to be a good way to normalize a URL.
https://stackoverflow.com/a/67509097/4415649

hbirler · 2021-06-25T09:49:48Z

@benmccann Does https://github.com/hbirler/svelte/tree/better-hydration-src fix your issue?

ankitstarski · 2021-06-25T10:33:13Z

In my case, I have a stylesheet linked in head. That link tag is getting removed and added back again, giving me a Flash of unstyled content. Also, increasing my LCP by a lot.

ankitstarski · 2021-06-25T10:33:39Z

I'll check with your branch if that fixes it.

ankitstarski · 2021-06-25T10:40:11Z

Tried your branch, does not work for me.

hbirler · 2021-06-25T10:40:55Z

I only tried to fix the src issue in the branch :( .

hbirler · 2021-06-25T11:12:11Z

@ankitstarski It might be that the header tags generated by the SSR somehow do not match the order the client side expects them to be in, but I do not know how this might happen.

Edit: claiming elements from the header seems to work differently than normal nodes in Svelte. I am not sure what kinds of changes would be required to properly handle nodes under <head>

hbirler · 2021-06-25T12:24:35Z

@ankitstarski Does the branch solve your problem now? I have committed another change.

ankitstarski · 2021-06-25T12:57:47Z

It could be that I'm using sapper with some tags in head inside template.html.

I'll check with your branch and update here.

benmccann · 2021-06-25T15:23:22Z

Thanks @hbirler. Unfortunately I get the same behavior on that branch

hbirler · 2021-06-25T15:37:11Z

The branch does improve a local test of mine for a similar situation. It's weird that it did not work for you, maybe your website hits another corner case.

ankitstarski · 2021-06-26T14:04:46Z

@hbirler It does fix the issue for me. But high LCP and high Time to Interactive are still there compared to a custom build of svelte that I'm using based on Avi's patch .

Also, thanks for your contribution! 🥇

hbirler · 2021-06-27T06:03:01Z

@ankitstarski You can try the branch again to see if there are any improvements. The claim_text change improved a local test of mine.

ankitstarski · 2021-06-29T00:17:23Z

No noticeable change.

non25 · 2021-07-08T09:24:52Z

I believe this PR is causing #6444
@hbirler can you test by wrapping each in div with and w/o your PR?
Thanks.

hbirler · 2021-07-08T09:48:22Z

@non25 I believe #6445 fixes that issue

hbirler force-pushed the better-hydration branch 2 times, most recently from 37b552f to 84ec1b4 Compare June 10, 2021 11:04

hbirler changed the title ~~Improve hydration by detaching less~~ Improve hydration by reordering optimally Jun 10, 2021

benmccann mentioned this pull request Jun 11, 2021

Improved hydration performance by less reordering of nodes #6392

Closed

4 tasks

hbirler force-pushed the better-hydration branch 2 times, most recently from d10b9ec to 4f161fb Compare June 12, 2021 15:10

hbirler added 4 commits June 19, 2021 15:53

Implement new hydration optimization

8e1183b

During hydration, greedily pick nodes that exist in the original HTML that should not be detached. Detach the rest.

Fix issue potentially causing extra reorders

1c80db0

Not sorting children before executing the `insertBefore` calls in `init_hydrate` potentially caused extra `insertBefore` calls in `append`

Simplifyinit_hydrate sorting logic

8baf905

hbirler force-pushed the better-hydration branch from 79e5e1d to 8baf905 Compare June 19, 2021 14:01

Conduitry merged commit 04bc37d into sveltejs:master Jun 22, 2021

hbirler mentioned this pull request Jun 25, 2021

[fix] hydration improvements #6449

Merged

This was referenced Jul 27, 2021

Starting with v3.39.0 sorting a keyed each block messes up the DOM #6561

Closed

[idea] Minimal modifications in update_keyed_each #6603

Open

hbirler mentioned this pull request Aug 18, 2021

Better hydration #4308

Closed

deklanw mentioned this pull request Jan 28, 2022

Partial hydration sveltejs/kit#1390

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve hydration by reordering optimally #6395

Improve hydration by reordering optimally #6395

hbirler commented Jun 9, 2021 •

edited

Loading

hbirler commented Jun 10, 2021

hbirler commented Jun 10, 2021

hbirler commented Jun 10, 2021 •

edited

Loading

hbirler commented Jun 10, 2021

ehrencrona commented Jun 11, 2021 •

edited

Loading

benmccann commented Jun 19, 2021

Conduitry commented Jun 19, 2021

hbirler commented Jun 19, 2021

nickreese commented Jun 22, 2021

benmccann commented Jun 24, 2021

hbirler commented Jun 25, 2021 •

edited

Loading

hbirler commented Jun 25, 2021

hbirler commented Jun 25, 2021

hbirler commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

hbirler commented Jun 25, 2021 •

edited

Loading

hbirler commented Jun 25, 2021 •

edited

Loading

hbirler commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

benmccann commented Jun 25, 2021

hbirler commented Jun 25, 2021

ankitstarski commented Jun 26, 2021

hbirler commented Jun 27, 2021

ankitstarski commented Jun 29, 2021

non25 commented Jul 8, 2021

hbirler commented Jul 8, 2021 •

edited

Loading

Improve hydration by reordering optimally #6395

Improve hydration by reordering optimally #6395

Conversation

hbirler commented Jun 9, 2021 • edited Loading

Explanation

Old algorithm

Possible optimizations

Possible issues

Tests

hbirler commented Jun 10, 2021

hbirler commented Jun 10, 2021

hbirler commented Jun 10, 2021 • edited Loading

New algorithm

New possible issues

hbirler commented Jun 10, 2021

ehrencrona commented Jun 11, 2021 • edited Loading

benmccann commented Jun 19, 2021

Conduitry commented Jun 19, 2021

hbirler commented Jun 19, 2021

nickreese commented Jun 22, 2021

benmccann commented Jun 24, 2021

hbirler commented Jun 25, 2021 • edited Loading

hbirler commented Jun 25, 2021

hbirler commented Jun 25, 2021

hbirler commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

hbirler commented Jun 25, 2021 • edited Loading

hbirler commented Jun 25, 2021 • edited Loading

hbirler commented Jun 25, 2021

ankitstarski commented Jun 25, 2021

benmccann commented Jun 25, 2021

hbirler commented Jun 25, 2021

ankitstarski commented Jun 26, 2021

hbirler commented Jun 27, 2021

ankitstarski commented Jun 29, 2021

non25 commented Jul 8, 2021

hbirler commented Jul 8, 2021 • edited Loading

hbirler commented Jun 9, 2021 •

edited

Loading

hbirler commented Jun 10, 2021 •

edited

Loading

ehrencrona commented Jun 11, 2021 •

edited

Loading

hbirler commented Jun 25, 2021 •

edited

Loading

hbirler commented Jun 25, 2021 •

edited

Loading

hbirler commented Jun 25, 2021 •

edited

Loading

hbirler commented Jul 8, 2021 •

edited

Loading