rfc(decision): Replay Issue Creation #92
Conversation
Force-pushed from 784b61c to 185ca61
2. Provide actionable feedback to developers that could not otherwise be captured in a performance span or error event.
3. Increase product awareness in free-tier customers and encourage adoption.

# Options Considered
I'm trying to think of a way to make sure we're looking at an exhaustive list of options, even if some are bad.
here's where i'm starting from:
SDK | Network data | Relay & Ingest | Result
---|---|---|---
@sentry/javascript | error payload (call captureException() in browser) | normal processing | error quota risk
@sentry/javascript | click detection (same payload as in experiment) | server-side detector needed | -
@sentry/replay | existing breadcrumbs + rrweb events | normal relay processing | server-side detector needed
@sentry/replay | click detection breadcrumb (same payload as in experiment) | server-side detector needed | replay must be sampled
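The "click detection breadcrumb" rows above imply the SDK packaging its detection result as a breadcrumb for a server-side detector to evaluate later. A minimal sketch of what that payload could look like — the category name, field names, and timeout semantics are all illustrative assumptions, not a shipped schema:

```javascript
// Hypothetical breadcrumb payload for a suspected dead click, as in the
// "click detection breadcrumb" rows of the table. The category name and
// field layout are invented for illustration.
function makeDeadClickBreadcrumb(selector, clickedAtMs, timeoutMs) {
  return {
    category: "ui.deadClick",       // illustrative, not a shipped category
    timestamp: clickedAtMs / 1000,  // Sentry breadcrumb timestamps are in seconds
    data: {
      selector,                     // CSS selector of the clicked element
      timeoutAfterMs: timeoutMs,    // how long the SDK waited for a page reaction
    },
  };
}

// Example: a click that produced no visible reaction within 7 seconds.
const crumb = makeDeadClickBreadcrumb("button#checkout", 1700000000000, 7000);
```

A server-side detector would then group such breadcrumbs into an issue; whether that grouping happens in normal Relay processing or in a dedicated detector is exactly what the table compares.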
@ryan953 @cmanallen Could we add a column here that is "Product Experience" or "Product Implications" so we can visualize what this means to our users?
**Cons:**

1. Poor rollout could impact service availability during testing period.
2. Requires coordination between the SDK and Ingest to create new issue types.
Do we know how the perf/profiling teams have been handling their issue types?
IIRC perf issues have been rolled out behind a getsentry option. I haven't heard of anything going wrong on their end, and I imagine with DS they're also at risk of sampling on the SDK side.
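The option-gated rollout mentioned above can be sketched roughly like this; the `getOption` helper and the option key are hypothetical, not the actual getsentry mechanism:

```javascript
// Hedged sketch of an option-gated rollout: the new issue type is only
// created while a server-side option is enabled, so a bad rollout can be
// switched off without a deploy. `getOption` and the option key are
// invented for illustration.
function maybeCreateReplayIssue(event, getOption) {
  if (!getOption("replay.issue-creation.enabled")) {
    return null; // feature off: ingest the event, but create no issue
  }
  return {
    type: "replay_issue",                      // illustrative issue type
    fingerprint: [event.kind, event.selector], // dedupe similar events
  };
}
```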
Profiling does it in the back end. For Performance I'm not sure exactly; I know they have a lot of back-end detectors, so at least some portion is handled there.
text/0092-replay-issue-creation.md (Outdated)

**Cons:**

1. Uses quota.
I assume you mean errors quota here. It would be good to state that specifically in the revised RFC content.
As an aside, is it possible to do Option 1 and exclude it from consuming errors quota?
^ If the answer is 'yes' to the above question, we'll have to modify the pros/cons list
There's an experimental REST API (that would not use error quota), but it's not stable as of this RFC. We would need to coordinate with the Issues team. I don't believe there is another alternative that would not use error quota.
text/0092-replay-issue-creation.md (Outdated)

# Summary
We want to detect certain categories of issues only available through the Session Replay product. These issues can only be detected in the SDK; the Replay back end will never have enough data to find them. For that reason, this is primarily an SDK-driven workload. The question is: what role should the Replay back end have in Replay issue creation? Should the Replay SDK use the Replay back end to generate new issues, or should the SDK generate those issues through a generic, non-replay-specific interface?
When we first worked on performance issues, the initial spike did them via the SDK. But there was a lot of pushback to move things to the back end, since implementing this logic across N SDKs isn't really a scalable solution.
Maybe this isn't such a big deal if this only applies to the JS world, though? Either way, it might be worth checking that upper management isn't going to veto this approach later on.
> Maybe this isn't such a big deal if this only applies to the JS world though?

Yeah, I'm hoping that's the case.
Unfortunately this work has to be done on the SDK due to the large sizes of replays and the large time differences between segments (segments can refer back to any previous segment). This is probably something that should be documented in the RFC :P
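The segment-reference problem described above can be made concrete with a toy model (the field name is assumed for illustration): the back end receives segments one at a time and cannot retain them all, yet evaluating a single event may require a segment it no longer holds, while the SDK still has the full local context.

```javascript
// Toy model of why back-end detection is hard: an incremental replay
// event can refer back to any previous segment, but the back end only
// sees segments as they arrive. The "refersToSegment" field is
// illustrative, not the real replay schema.
function canResolve(event, retainedSegmentIds) {
  // The event is only interpretable if the segment it refers back to
  // is still available to the detector.
  return retainedSegmentIds.has(event.refersToSegment);
}

// The back end retained only the two most recent segments...
const retained = new Set([41, 42]);
// ...but this event refers back to segment 3, sent long ago.
canResolve({ refersToSegment: 3 }, retained); // false: the context is gone
```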
Co-authored-by: Jasmin <[email protected]>
The team decided to reopen this RFC in order to expand on the options further. Some of the goals:
This RFC proposes two possible paths for creating Replay issues.
Rendered RFC