proposal: runtime/debug: add SetCrashHeader function #64590

EtiennePerot · 2023-12-06T22:48:26Z

Proposal Details

Add a function like SetCrashHeader(header []byte) to runtime/debug. When called, this copies the first K bytes of header into a region of memory owned by the Go runtime. If the program later panics, the contents of this buffer are printed out before any of the usual panic information.

This buffer would have a tight maximum byte size smaller or equal to the size of a page of memory. This ensures the crashing procedure remains fast and that this feature doesn't introduce more unpredictability when a process is already crashing. This header is global, not per-goroutine. The SetCrashHeader function may be called with nil as argument in order to clear out the runtime-owned buffer.

The idea is to capture useful information about a program that's crashing if crashing does happen, and to print it in a convenient place where an out-of-program system can easily consume it. This information could include basic program information (start time, version number, build options, architecture, etc.) or structured debugging information that isn't always easily obtainable from means outside of the Go program itself.

To disambiguate between this header and the rest of the panic message, the panic output also prints an extra \n--------------- backtrace ---------------\n after the header. This is not printed when the header isn't set (or has been reset by calling SetCrashHeader(nil)).

This pairs well with proposal #42888 (runtime/debug: add SetCrashOutput(file *os.File)). With these two proposals, this allows an out-of-program crash logging system consuming the output of panic logs (which are separate from other logs when using SetCrashOutput(file *os.File)) to obtain the information in the header from the same file handle as the one it already has to receive panic data.

Example

package main
import "runtime/debug"

func init() {
	debug.SetCrashHeader([]byte("MyCrashyProgram version 0.1\nfoo bar"))
}

func main() {
	panic("oops I crashed")
}

... would output to stderr:

MyCrashyProgram version 0.1
foo bar
--------------- backtrace ---------------
panic: oops I crashed

goroutine 1 [running]:
main.main()
	/path/to/prog.go:8 +0x...

The text was updated successfully, but these errors were encountered:

apparentlymart · 2023-12-07T01:12:10Z

I would love to replace some hacky stuff here with something like what's proposed:

https://github.com/hashicorp/terraform/blob/58a51bffd8565b1fe8b9e7a19227ae8eeb85c62d/internal/logging/panic.go#L36-L64
https://github.com/hashicorp/terraform/blob/58a51bffd8565b1fe8b9e7a19227ae8eeb85c62d/main.go#L68
https://github.com/hashicorp/terraform/blob/58a51bffd8565b1fe8b9e7a19227ae8eeb85c62d/internal/terraform/graph.go#L52

Currently our extra messaging for panics appears only if we've remembered to write defer logging.PanicHandler() near the start of each new goroutine, which is impossible to do if a goroutine is started by a library that isn't aware of our application.

I'm assuming that the intention of the proposal is for debug.SetCrashHeader to change the panic reporting behavior for all goroutines in the current program, rather than just one which called debug.SetCrashHeader, in which case we could avoid this special extra work for each goroutine.

Our current approach also allows us to change which exit code gets reported when the application panics, which would be nice to preserve since the runtime's default (2) accidentally collides with a different meaning of that exit code in this program, which we chose before knowing that the runtime was using it. However, I expect we'd be willing to live with the inability to change the exit code if it meant having this more robust way to add additional messaging to guide a user toward reporting the panic as a bug.

adonovan · 2023-12-07T21:09:39Z

The output should properly frame the two parts (header, backtrace) so they can be unambiguously parsed by the consumer.

EtiennePerot · 2023-12-08T00:45:25Z

I'm assuming that the intention of the proposal is for debug.SetCrashHeader to change the panic reporting behavior for all goroutines in the current program, rather than just one which called debug.SetCrashHeader

Correct. Clarified in proposal.

The output should properly frame the two parts (header, backtrace) so they can be unambiguously parsed by the consumer.

OK, updated proposal, though I'm not sure about how to best do that. Maybe just a --------------- backtrace --------------- between the header and the panic? Feel free to suggest alternatives.

apparentlymart · 2023-12-08T01:15:53Z

I would personally prefer that this not introduce any more involuntary content or delimiters into the output. In my case, the additional content is for the benefit of a human reader, not for machine consumption.

If I wanted to delimit the leading messages from the backtrace, I could presumably include a delimiter at the start and end of the buffer passed to debug.SetCrashHeader. Perhaps there's justification for an additional debug.SetCrashFooter to allow also delimiting the end of the backtrace; I have no use for that myself, so I'm only suggesting it to see if it would help satisfy the framing requirement without imposing any mandatory additional content on all callers.

adonovan · 2023-12-08T15:20:57Z

A comment such as "--- backtrace ---" does not constitute reliable framing because this string can potentially appear within the string form of the panic value (and, in theory, within the name of a file in the stack trace, though that's very unlikely).

This could potentially be a security vulnerability: for privacy reasons, the Go telemetry system reports the stack but not the panic value, as the stack consists only of strings from the executable, whereas the panic value may contain arbitrary user data. (We disregard the potential for encoding information in the sequence of stack frames as a side channel, but it is real.) An attacker that is able to influence the panic value could prepend the string --- backtrace ---, causing the crash reporting system that discards the portion before that string to leak the rest of it.

For me, the primary motivation to add the SetCrashOutput function is to enable automated crash reporting, for which it is desirable to record the panic value and the stack dump with as much structure as possible. Indeed, I wonder whether we should add an option to record stacks in some form (e.g. JSON) that makes them easier to parse.

adonovan · 2024-01-25T15:41:40Z

It dawned on me this morning that, so long as all the information we get out of the stack trace in the crash report is corroborated with the executable's symbol table, there's no way for the panic value to inject arbitrary strings into the telemetry counter name.

rsc · 2024-02-07T18:55:47Z

An alternative to make the crash dumps unambiguous would be to make the panic value printer insert a tab after every newline, like t.Log does. Then a "goroutine stack" inside a panic value won't look like an ordinary goroutine stack, because it will be indented.

seankhliao · 2024-02-07T19:22:14Z

I've wanted a fixed header/footer for go panic / crash(fault?) messages.
It would make writing log parsers less ambiguous since most give us the option for matching multiline output against start/end sequences.

rsc · 2024-02-07T19:31:26Z

What if something prints the header in another context?

seankhliao · 2024-02-07T19:40:12Z

Usually this is in the context of server applications with (semi) structured logging as the only expected output, so it's easy to match against a fixed string from the start of a line.
But if something else prints it i would also expect it to be captured as a panic, e.g. net/http automatically recovers panics and prints a stacktrace, i also want that to be captured.

rsc · 2024-02-08T23:58:14Z

The prints from net/http are separate from anything done by debug.SetCrashHeader, which would only be about crashes. So I think we still can just make the final crash messages unambiguous and not introduce what amounts to MIME framing around Go crash messages.

rsc · 2024-02-09T00:00:12Z

This proposal has been added to the active column of the proposals project
and will now be reviewed at the weekly proposal review meetings.
— rsc for the proposal review group

aclements · 2024-02-28T18:31:45Z

Given SetCrashOutput(file *os.File), why can't programs that want to provide extra metadata to a crash handler write to the crash output os.File ahead of time to provide such context?

rsc · 2024-03-01T17:44:33Z

Based on the discussion above, this proposal seems like a likely decline.
— rsc for the proposal review group

rsc · 2024-03-08T04:00:55Z

No change in consensus, so declined.
— rsc for the proposal review group

gopherbot · 2024-04-23T17:05:10Z

Change https://go.dev/cl/581215 mentions this issue: runtime: properly frame panic values in tracebacks

This CL causes the printing of panic values to ensure that all newlines in the output are immediately followed by a tab, so that there is no way for a maliciously crafted panic value to fool a program attempting to parse the traceback into thinking that the panic value is in fact a goroutine stack. See #64590 (comment) + release note Updates #64590 Updates #63455 Change-Id: I5142acb777383c0c122779d984e73879567dc627 Reviewed-on: https://go-review.googlesource.com/c/go/+/581215 Auto-Submit: Alan Donovan <[email protected]> LUCI-TryBot-Result: Go LUCI <[email protected]> Reviewed-by: Michael Pratt <[email protected]>

EtiennePerot added the Proposal label Dec 6, 2023

gopherbot added this to the Proposal milestone Dec 6, 2023

ianlancetaylor added this to Proposals Dec 14, 2023

ianlancetaylor moved this to Incoming in Proposals Dec 14, 2023

rsc changed the title ~~proposal: runtime/debug: add SetCrashHeader(header []byte)~~ proposal: runtime/debug: add SetCrashHeader function Feb 8, 2024

rsc moved this from Incoming to Active in Proposals Feb 9, 2024

rsc moved this from Active to Likely Decline in Proposals Mar 1, 2024

rsc added the Proposal-FinalCommentPeriod label Mar 1, 2024

rsc moved this from Likely Decline to Declined in Proposals Mar 8, 2024

rsc closed this as completed Mar 8, 2024

rsc removed the Proposal-FinalCommentPeriod label Mar 8, 2024

thanm mentioned this issue Mar 20, 2024

Go compiler and runtime meeting notes #43930

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

proposal: runtime/debug: add SetCrashHeader function #64590

proposal: runtime/debug: add SetCrashHeader function #64590

EtiennePerot commented Dec 6, 2023 •

edited

Loading

apparentlymart commented Dec 7, 2023

adonovan commented Dec 7, 2023

EtiennePerot commented Dec 8, 2023 •

edited

Loading

apparentlymart commented Dec 8, 2023

adonovan commented Dec 8, 2023 •

edited

Loading

adonovan commented Jan 25, 2024

rsc commented Feb 7, 2024

seankhliao commented Feb 7, 2024

rsc commented Feb 7, 2024

seankhliao commented Feb 7, 2024

rsc commented Feb 8, 2024

rsc commented Feb 9, 2024

aclements commented Feb 28, 2024

rsc commented Mar 1, 2024

rsc commented Mar 8, 2024

gopherbot commented Apr 23, 2024

proposal: runtime/debug: add SetCrashHeader function #64590

proposal: runtime/debug: add SetCrashHeader function #64590

Comments

EtiennePerot commented Dec 6, 2023 • edited Loading

Proposal Details

Example

apparentlymart commented Dec 7, 2023

adonovan commented Dec 7, 2023

EtiennePerot commented Dec 8, 2023 • edited Loading

apparentlymart commented Dec 8, 2023

adonovan commented Dec 8, 2023 • edited Loading

adonovan commented Jan 25, 2024

rsc commented Feb 7, 2024

seankhliao commented Feb 7, 2024

rsc commented Feb 7, 2024

seankhliao commented Feb 7, 2024

rsc commented Feb 8, 2024

rsc commented Feb 9, 2024

aclements commented Feb 28, 2024

rsc commented Mar 1, 2024

rsc commented Mar 8, 2024

gopherbot commented Apr 23, 2024

EtiennePerot commented Dec 6, 2023 •

edited

Loading

EtiennePerot commented Dec 8, 2023 •

edited

Loading

adonovan commented Dec 8, 2023 •

edited

Loading