Validation is extremely slow for large queries with fragment spreads due to lack of `implementers_map` caching #862

xuorig · 2024-06-01T01:48:24Z

I've been investigating slow parsing times in our application. The cause appears to be `validate_fragment_spread_type` recomputing a large map of implementers every time it is called. For large schemas, each call builds a very large hashmap. Combined with larger queries, this has a big effect: we're seeing parse times in the seconds.

Instead, the implementers_map can be computed once either through a SchemaWithCache structure, which is used elsewhere already, or by passing along a reference to the pre-computed map during validation. It would be great if this map could be computed only once per schema rather than once per query.
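
To illustrate the difference, here is a minimal sketch using toy types. `ToySchema`, `ImplementersMap`, and both `spread_ok_*` functions are hypothetical stand-ins, not apollo-compiler APIs, and the check is simplified to "does this object implement this interface":

```rust
use std::collections::{HashMap, HashSet};

/// Interface name -> names of the object types that implement it.
pub type ImplementersMap = HashMap<String, HashSet<String>>;

/// Toy stand-in for a schema (hypothetical, not the apollo-compiler `Schema`):
/// object type name -> interfaces it implements.
pub struct ToySchema {
    pub objects: HashMap<String, Vec<String>>,
}

impl ToySchema {
    /// Walks every object type, so the cost grows with schema size.
    pub fn implementers_map(&self) -> ImplementersMap {
        let mut map = ImplementersMap::new();
        for (object, interfaces) in &self.objects {
            for interface in interfaces {
                map.entry(interface.clone())
                    .or_default()
                    .insert(object.clone());
            }
        }
        map
    }
}

/// Slow shape: rebuilds the whole map for a single spread check.
pub fn spread_ok_recomputing(schema: &ToySchema, interface: &str, object: &str) -> bool {
    let map = schema.implementers_map();
    map.get(interface).is_some_and(|set| set.contains(object))
}

/// Fast shape: the caller builds the map once and passes a reference.
pub fn spread_ok_with_map(map: &ImplementersMap, interface: &str, object: &str) -> bool {
    map.get(interface).is_some_and(|set| set.contains(object))
}

/// Small example schema: `User` and `Post` both implement `Node`.
pub fn sample_schema() -> ToySchema {
    let mut objects = HashMap::new();
    objects.insert("User".to_string(), vec!["Node".to_string()]);
    objects.insert("Post".to_string(), vec!["Node".to_string()]);
    ToySchema { objects }
}
```

With N fragment spreads in a query, the first function builds the map N times, while the second builds it once no matter how many spreads are checked.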

I'm happy to contribute a fix if you are open to it!

goto-bus-stop · 2024-06-03T07:44:54Z

That's a not-great oversight. Thanks for the report!

One approach we could consider here is to add a cache to Schema itself and provide access to the cache through Valid<Schema> only. Inside schema validation we can manually pass a reference even if the cache is not populated yet (if we need it). Operation validation requires a valid schema, so we can update the function signatures and reuse the cache between different queries.

goto-bus-stop · 2024-06-03T10:29:25Z

I think using the SchemaWithCache solution is also acceptable, though it would still require recomputing the map once for every query. If you'd like to contribute that then I'd be totally happy 😄

As a more concrete idea for caching on the schema itself so that it's reusable between different queries, we might have something like the below. If we cache something inside Schema, we have to guarantee that it is only cached when the schema is immutable through Valid<>, and we have to clear the cache when going back to a mutable schema through Valid::into_inner. That could be achieved with a trait.

struct Schema {
    // .. all the current stuff ..
    /// Implementers cache: this must only be accessed through Valid<Schema>.
    implementers_map: OnceLock<HashMap<Name, Implementers>>,
}

impl Valid<Schema> {
    // This could have the API that SchemaWithCache has today, that populates the cache on demand.
    fn implementers_of(&self, name: Name) {}
}

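// And then the pieces needed to support invalidating the cache:
struct Valid<T: Invalidate>(T);
// The impl has to repeat the bound declared on the struct.
impl<T: Invalidate> Valid<T> {
    fn into_inner(self) -> T {
        self.0.invalidate()
    }
}
// `Sized` is required because the default method takes and returns `Self` by value.
trait Invalidate: Sized {
    // Default implementation for types that can be used with Valid<> but do not require invalidation.
    fn invalidate(self) -> Self { self }
}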

impl Invalidate for Schema {
    fn invalidate(mut self) -> Self {
        self.implementers_map.take();
        self
    }
}
// this can use the default implementation.
impl Invalidate for ExecutableDocument {}

xuorig · 2024-06-03T15:09:00Z

@goto-bus-stop I was thinking about this over the weekend. Another option is to not cache it, but instead have it as a living thing in the schema that gets built along with the schema.

Of course it's a bit more complex, again because of mutable schemas and updating this mapping as it is built / modified.

But it's a very common structure to access in GraphQL schemas in general, so it's tempting to just make it a "first class" kind of thing. Thoughts?

goto-bus-stop · 2024-06-03T15:19:30Z

I'm not sure if that would work with the current design, which encourages directly mutating the schema structs. Wouldn't users have to manually update the implementers map, or apollo-compiler to provide specific methods to modify the schema?
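
A minimal sketch of that concern, using a toy type. `LiveSchema`, `add_object`, and `is_in_sync` are hypothetical names for illustration, not apollo-compiler APIs. The map stays correct only when changes go through the dedicated method:

```rust
use std::collections::{HashMap, HashSet};

/// Toy schema that keeps a "living" implementers map next to its types.
/// (Hypothetical; both fields are public to mirror directly mutable structs.)
#[derive(Default)]
pub struct LiveSchema {
    /// Object type name -> interfaces it implements.
    pub objects: HashMap<String, Vec<String>>,
    /// Interface name -> object types implementing it.
    pub implementers: HashMap<String, HashSet<String>>,
}

impl LiveSchema {
    /// Adds an object type and updates the implementers map in the same step.
    /// Replacing an already-registered object is not handled in this sketch.
    pub fn add_object(&mut self, name: &str, interfaces: &[&str]) {
        for interface in interfaces {
            self.implementers
                .entry(interface.to_string())
                .or_default()
                .insert(name.to_string());
        }
        self.objects.insert(
            name.to_string(),
            interfaces.iter().map(|i| i.to_string()).collect(),
        );
    }

    /// Recomputes the map from `objects` and compares it with the stored one.
    pub fn is_in_sync(&self) -> bool {
        let mut expected: HashMap<String, HashSet<String>> = HashMap::new();
        for (object, interfaces) in &self.objects {
            for interface in interfaces {
                expected
                    .entry(interface.clone())
                    .or_default()
                    .insert(object.clone());
            }
        }
        expected == self.implementers
    }
}
```

After `schema.add_object("User", &["Node"])` the two fields agree, but a direct `schema.objects.insert(...)` leaves `implementers` stale, which is the failure mode a design based on direct mutation would have to guard against.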

xuorig · 2024-06-03T15:20:26Z

> which encourages directly mutating the schema structs

Gah, good point. I thought this was happening only through add_type and add_document kinds of APIs. Never mind!

goto-bus-stop · 2024-06-04T11:00:24Z

I'll work on this Invalidate idea today so we can reuse the cache between validation runs.

xuorig · 2024-06-04T12:14:42Z

Thank you @goto-bus-stop, sounds great 🙇

Fixes #862. I had proposed a different approach that would let us cache the implementers map for as long as the schema is immutable, so it could be reused for different query validations. That approach had an issue that made `Valid::assume_valid_ref` very, very subtle to use, so we are not doing that right now. This is a less ideal version but it does solve the immediate problem. We already pass around an `OperationValidationConfig` structure and this seems like a nice, non-invasive place to put the implementers cache.
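
A rough sketch of what caching on a per-document config could look like. `ToySchema`, `ToyValidationConfig`, and `validate_document` are hypothetical stand-ins and do not reflect the actual fields of `OperationValidationConfig`; the `builds` counter exists only to make the number of map builds observable:

```rust
use std::cell::Cell;
use std::collections::{HashMap, HashSet};
use std::sync::OnceLock;

type ImplementersMap = HashMap<String, HashSet<String>>;

/// Toy schema: object type name -> interfaces it implements.
pub struct ToySchema {
    pub objects: HashMap<String, Vec<String>>,
    /// Counts how many times the implementers map has been built.
    pub builds: Cell<usize>,
}

impl ToySchema {
    fn implementers_map(&self) -> ImplementersMap {
        self.builds.set(self.builds.get() + 1);
        let mut map = ImplementersMap::new();
        for (object, interfaces) in &self.objects {
            for interface in interfaces {
                map.entry(interface.clone())
                    .or_default()
                    .insert(object.clone());
            }
        }
        map
    }
}

/// Hypothetical config created once per validated document.
pub struct ToyValidationConfig<'a> {
    schema: &'a ToySchema,
    implementers: OnceLock<ImplementersMap>,
}

impl<'a> ToyValidationConfig<'a> {
    pub fn new(schema: &'a ToySchema) -> Self {
        Self { schema, implementers: OnceLock::new() }
    }

    /// Builds the map on first use, then returns the cached copy.
    pub fn implementers(&self) -> &ImplementersMap {
        self.implementers.get_or_init(|| self.schema.implementers_map())
    }
}

/// Validates one "document", given as (interface, object) spread pairs,
/// and returns how many spreads are invalid.
pub fn validate_document(schema: &ToySchema, spreads: &[(&str, &str)]) -> usize {
    let config = ToyValidationConfig::new(schema);
    spreads
        .iter()
        .filter(|(interface, object)| {
            !config
                .implementers()
                .get(*interface)
                .is_some_and(|set| set.contains(*object))
        })
        .count()
}

/// Small example schema: `User` and `Post` both implement `Node`.
pub fn sample_schema() -> ToySchema {
    let mut objects = HashMap::new();
    objects.insert("User".to_string(), vec!["Node".to_string()]);
    objects.insert("Post".to_string(), vec!["Node".to_string()]);
    ToySchema { objects, builds: Cell::new(0) }
}
```

In this sketch the map is built at most once per call to `validate_document`, however many spreads the document contains, and is built again for the next document. That matches the trade-off described above: the cost is paid per document rather than per spread, but not yet once per schema.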

xuorig added the bug and triage labels Jun 1, 2024

xuorig mentioned this issue Jun 1, 2024

Extremely slow parse times can cause the router to stop serving requests apollographql/router#5313

Closed

goto-bus-stop added the apollo-compiler label and removed the triage label Jun 3, 2024

goto-bus-stop mentioned this issue Jun 5, 2024

fix(compiler): cache the implementers map per executable document #863

Merged

goto-bus-stop added a commit that referenced this issue Jun 5, 2024

fix(compiler): add a benchmark for #862

5b9875b

goto-bus-stop closed this as completed in 09eb68c Jun 5, 2024

goto-bus-stop closed this as completed in #863 Jun 5, 2024

goto-bus-stop mentioned this issue Jun 5, 2024

[email protected] #864

Merged
