
feat(core): initialize SQLite off-main-thread #18401

Merged (7 commits) on Mar 27, 2023

Conversation

@mmastrac (Contributor) commented Mar 23, 2023

This gets SQLite off the flamegraph and reduces initialization time by somewhere between 0.2ms and 0.5ms. In addition, I took the opportunity to move all the cache management code to a single place and reduce duplication. While the PR has a net gain of lines, much of that is just being a bit more deliberate with how we're recovering from errors.
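As a rough illustration of what "off-main-thread" means here (a sketch only; the names and the actual `CacheDB` plumbing in this PR differ), the connection open and schema setup are kicked off on a background thread, and the main thread only joins that thread on first use:

```rust
use std::path::PathBuf;
use std::thread;

use rusqlite::Connection;

/// Sketch: SQLite setup runs on a background thread; the main thread
/// only blocks if it needs the cache before setup has finished.
struct DeferredConnection {
  handle: Option<thread::JoinHandle<rusqlite::Result<Connection>>>,
  conn: Option<rusqlite::Result<Connection>>,
}

impl DeferredConnection {
  fn new(path: PathBuf) -> Self {
    let handle = thread::spawn(move || {
      let conn = Connection::open(path)?;
      // Schema setup and pragmas also happen off the main thread.
      conn.execute_batch(
        "CREATE TABLE IF NOT EXISTS cache (key TEXT PRIMARY KEY, value BLOB);",
      )?;
      Ok(conn)
    });
    Self { handle: Some(handle), conn: None }
  }

  /// First use joins the background thread (usually already finished).
  fn get(&mut self) -> &rusqlite::Result<Connection> {
    if self.conn.is_none() {
      let handle = self.handle.take().expect("already joined");
      self.conn = Some(handle.join().expect("init thread panicked"));
    }
    self.conn.as_ref().unwrap()
  }
}
```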

The existing caches had various policies for dealing with cache corruption, so I've unified them and tried to isolate the decisions we make for recovery in a single place (see `open_connection` in `CacheDB`). The policy I chose was:

  1. Retry twice to open on-disk caches
  2. If that fails, try to delete the file and recreate it on-disk
  3. If we fail to delete the file or re-create a new cache, use a fallback strategy that can be chosen per-cache: InMemory (temporary cache for the process run), BlackHole (ignore writes, return empty reads), or Error (fail on every operation).

The caches all use the same general code now, and share the cache failure recovery policy.
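In rough sketch form, the policy looks something like this (the `CacheFailure` and `open_with_recovery` names are illustrative, not the actual identifiers in `cli/cache/cache_db.rs`):

```rust
use std::path::Path;

use rusqlite::Connection;

/// Per-cache fallback used when the on-disk database can't be opened.
enum CacheFailure {
  /// Temporary in-memory cache for this process run.
  InMemory,
  /// Ignore writes, return empty reads.
  Blackhole,
  /// Fail on every operation.
  Error,
}

enum CacheConn {
  Open(Connection),
  Blackhole,
  Error,
}

fn open_with_recovery(path: &Path, on_failure: CacheFailure) -> CacheConn {
  // 1. Retry twice to open the on-disk cache.
  for _ in 0..2 {
    if let Ok(conn) = Connection::open(path) {
      return CacheConn::Open(conn);
    }
  }
  // 2. If that fails, try to delete the file and recreate it on disk.
  if std::fs::remove_file(path).is_ok() {
    if let Ok(conn) = Connection::open(path) {
      return CacheConn::Open(conn);
    }
  }
  // 3. Otherwise, fall back to the per-cache strategy.
  match on_failure {
    CacheFailure::InMemory => match Connection::open_in_memory() {
      Ok(conn) => CacheConn::Open(conn),
      Err(_) => CacheConn::Error,
    },
    CacheFailure::Blackhole => CacheConn::Blackhole,
    CacheFailure::Error => CacheConn::Error,
  }
}
```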

In addition, it cleans up a TODO in the NodeAnalysisCache.

@mmastrac force-pushed the sqlite_off_thread branch 8 times, most recently from 6dc1a14 to c0619c4 on March 24, 2023 16:23
@mmastrac marked this pull request as ready for review on March 24, 2023 17:08
@mmastrac changed the title from "[WIP] Move SQLite cache initialization off main thread" to "feat(core): initialize SQLite off-main-thread" on Mar 24, 2023
@dsherret (Member) left a comment

Looks really good! I just have a few small comments.

Review threads on cli/cache/cache_db.rs (5, resolved) and cli/cache/caches.rs (1, resolved)
Another reviewer (Member) left a comment

Very nice 👍

Review threads on cli/cache/cache_db.rs (2, resolved) and cli/proc_state.rs (1, resolved)
@bartlomieju (Member) commented

@mmastrac could you post the latest benchmark results against v1.32.1?

@mmastrac (Contributor, Author) commented

I ran some benchmarks with different journal modes, as I believe we're going to have to move from `journal_mode=OFF` back to either `journal_mode=TRUNCATE` or `journal_mode=WAL`. Based on these numbers, I think we'll want to use `TRUNCATE`.

There's another optimization that wins us a few milliseconds: just dropping the SQLite connection without running cleanup. I want to hold that one back until I can confirm that we're not going to risk corruption with it, but there's no reason it should be any different from crashing, which is already safe with SQLite.

Summary:

Scenario                   Time       Commit
Base                       12.5ms     9eb
SQLite off-thread (SOT)    12.4ms     514
Base + WAL                 12.7ms     6c0
Base + TRUNCATE            12.6ms     ec4
SOT + WAL                  12.5ms     ef9
SOT + TRUNCATE             12.5ms     d8c

Raw results:

Benchmark 1: ./deno-9ebce6e725dd0b33aea20025995fb1e790b92df5 run empty.js
  Time (mean ± σ):      12.5 ms ±   0.4 ms    [User: 8.8 ms, System: 2.5 ms]
  Range (min … max):    11.9 ms …  14.3 ms    100 runs

Benchmark 2: ./deno-514313cc5d925015f3fd9e8916c04e9e5524782f run empty.js
  Time (mean ± σ):      12.4 ms ±   0.4 ms    [User: 9.0 ms, System: 3.0 ms]
  Range (min … max):    11.9 ms …  15.0 ms    100 runs

Benchmark 3: ./deno-6c0b5239a run empty.js
  Time (mean ± σ):      12.7 ms ±   0.3 ms    [User: 8.8 ms, System: 2.6 ms]
  Range (min … max):    12.0 ms …  14.6 ms    100 runs

Benchmark 4: ./deno-ec48252fc run empty.js
  Time (mean ± σ):      12.6 ms ±   0.7 ms    [User: 8.8 ms, System: 2.6 ms]
  Range (min … max):    11.8 ms …  14.6 ms    100 runs

Benchmark 5: ./deno-d8c2f5bfe run empty.js
  Time (mean ± σ):      12.5 ms ±   0.5 ms    [User: 9.0 ms, System: 3.1 ms]
  Range (min … max):    11.7 ms …  15.1 ms    100 runs

Benchmark 6: ./deno-ef9a5c1c9 run empty.js
  Time (mean ± σ):      12.5 ms ±   0.3 ms    [User: 8.9 ms, System: 3.4 ms]
  Range (min … max):    12.0 ms …  13.9 ms    100 runs
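For context, the journal mode is a per-connection SQLite pragma; in rusqlite it can be applied roughly like this (the `apply_journal_mode` helper is illustrative, not code from this PR):

```rust
use rusqlite::Connection;

// `PRAGMA journal_mode=...` returns a single row containing the mode that
// actually took effect, so it has to be read as a query rather than a plain
// execute. Pragma values can't be bound as parameters, hence the format!.
fn apply_journal_mode(conn: &Connection, mode: &str) -> rusqlite::Result<String> {
  conn.query_row(&format!("PRAGMA journal_mode={mode}"), [], |row| row.get(0))
}
```

With `OFF` there is no rollback journal at all, which is the corruption risk alluded to above; `TRUNCATE` and `WAL` both preserve crash safety, with `TRUNCATE` zeroing the rollback journal instead of deleting it after each transaction.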

@dsherret disabled auto-merge on March 27, 2023 17:48
@dsherret (Member) left a comment

Looks great!

Review thread on cli/cache/cache_db.rs (resolved)
@bartlomieju (Member) commented

A couple of observations from the benchmark page:

  1. Thread count jumped from 8 to 10, which is kind of unexpected; I thought we were only going to use 1 more thread:
    [screenshot: thread count graph]
  2. Memory usage increased by about 4 MB:
    [screenshot: memory usage graph]

I'm not suggesting we revert this change; I just wanted to put it on people's radar.

mmastrac added a commit that referenced this pull request Mar 28, 2023
…t in CacheDB (#18469)

Fast-follow on #18401 -- the reason that some tests were panicking in
the `CacheDB` `impl Drop` was that the cache itself was being dropped
during panic and the runtime may or may not still exist at that point.
We can reduce the actual tokio runtime testing to where it's needed.

In addition, we return the journal mode to `TRUNCATE` to avoid the risk
of data corruption.
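One way to make a `Drop` impl tolerant of that situation is to skip cleanup when the thread is already unwinding from a panic; the following is a rough sketch of that idea under that assumption, not the actual fix in #18469:

```rust
use rusqlite::Connection;

struct CacheConn {
  conn: Option<Connection>,
}

impl Drop for CacheConn {
  fn drop(&mut self) {
    // If we're already unwinding from a panic, the surrounding runtime may
    // no longer exist, so skip cleanup entirely. SQLite is designed to
    // survive an abrupt process exit, so skipping the close is safe.
    if std::thread::panicking() {
      return;
    }
    if let Some(conn) = self.conn.take() {
      // Close explicitly and ignore close-time errors rather than letting
      // them panic inside `drop`.
      let _ = conn.close();
    }
  }
}
```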