Proposal: Add Column field to Line message #687

mar-kolya · 2022-03-03T20:37:18Z

It would be really useful to have a column field alongside the line number field in the Line message. This is especially useful for runtimes whose source code is commonly compacted or obfuscated, such as JavaScript.

In profile.proto this would look like:

index ee0391f..92f9fef 100644
--- a/proto/profile.proto
+++ b/proto/profile.proto
@@ -195,6 +195,8 @@ message Line {
   uint64 function_id = 1;
   // Line number in source code.
   int64 line = 2;
+  // Column number in the source code.
+  int64 column = 3;
 }

 message Function {

Additional changes to the pprof tool:

- Output column information, if present, when -lines is used.
- Where pprof currently outputs <path>:<line>, make it output <path>:<line>:<column> when a column is available.
  - Columns are 1-indexed, so column 0 doesn't exist.
- merge.go needs updating to take column numbers into account, as do (*Profile).Aggregate() and aggregate() in driver.go. We would need to check all the places where Line.line is used.
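
To make the proposed output rule concrete, here is a minimal sketch of how a location string could be built. The helper name `formatLocation` is hypothetical and is not part of pprof; the only assumption carried over from the proposal is that columns are 1-indexed, so a value of 0 means "no column recorded".

```go
package main

import "fmt"

// formatLocation is a hypothetical helper, not a pprof function.
// It appends the column only when one is available (column > 0),
// because the proposal treats columns as 1-indexed.
func formatLocation(path string, line, column int64) string {
	if column > 0 {
		return fmt.Sprintf("%s:%d:%d", path, line, column)
	}
	return fmt.Sprintf("%s:%d", path, line)
}

func main() {
	fmt.Println(formatLocation("app.min.js", 1, 20431)) // app.min.js:1:20431
	fmt.Println(formatLocation("main.go", 42, 0))       // main.go:42
}
```

Profiles written before the field existed decode with a column of 0, so under this rule their output would stay unchanged.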

Please let me know what you think.

Thanks!

felixge · 2022-09-27T15:45:54Z

@aalexand we're currently thinking about potential improvements to profile.proto at Datadog. Before submitting any bigger proposals, what do you think about this small one that my colleague @mar-kolya made a while ago? He just updated the title/text to make the proposal clearer. If accepted, we'd be happy to send a PR for the profile pkg to support encoding/decoding this new field.

aalexand · 2022-10-02T03:03:21Z

@felixge @mar-kolya No conceptual objections but it would be good to include in the description how this will integrate with pprof as a tool. In particular (not an exhaustive list):

- How will this affect the granularity option? Today there is a -lines choice; will there also be -linecolumns (similar to -filefunctions), or will we simply include the column when the lines granularity is requested?
- How will each output format support the new field?
- Which places beyond encoding/decoding will be updated? For example, merge.go will need to be updated. What else?

I basically want to make sure that if we add this field then we do our best to identify what full support for the new field looks like in pprof and have this support added as part of the PR that adds the field, including tests.
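
As an illustration of the merge point above, the sketch below shows why any key used to combine line entries would need to include the column. It is a simplified stand-in, not the code in merge.go: the type `lineEntry`, the function `sumByLine`, and the `keepColumns` switch are all hypothetical names introduced for this example.

```go
package main

import "fmt"

// lineEntry is a simplified stand-in for the Line message,
// not pprof's actual type.
type lineEntry struct {
	FunctionID uint64
	Line       int64
	Column     int64
}

// sumByLine is a hypothetical helper that sums values per distinct entry.
// When keepColumns is false the column is zeroed first, so entries that
// differ only by column collapse into one.
func sumByLine(entries []lineEntry, values []int64, keepColumns bool) map[lineEntry]int64 {
	out := make(map[lineEntry]int64)
	for i, e := range entries {
		if !keepColumns {
			e.Column = 0
		}
		out[e] += values[i]
	}
	return out
}

func main() {
	entries := []lineEntry{{1, 10, 3}, {1, 10, 27}}
	values := []int64{10, 20}
	fmt.Println(len(sumByLine(entries, values, true)))  // 2
	fmt.Println(len(sumByLine(entries, values, false))) // 1
}
```

If the key ignored the column, two statements on the same minified line would be merged silently even at the finest granularity.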

mar-kolya · 2022-10-06T14:43:41Z

@aalexand I've updated the proposal. Please let me know what you think.
Thanks!

aalexand · 2022-10-11T07:31:29Z

I wonder whether it's possible to avoid adding a -linescolumns granularity and simply output the column information if it's present. Basically, treat it as the "fractional" part of the line number.

Similarly I would prefer to avoid adding has_column_numbers. These has_* fields are typically populated based on the detail level of debug info and "has column number" doesn't quite fit that.
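
One way to read the "fractional part" idea is as an ordering rule: compare by line first and fall back to the column only to break ties. The sketch below assumes that reading; the type `pos` and the function `sortPositions` are hypothetical and do not exist in pprof.

```go
package main

import (
	"fmt"
	"sort"
)

// pos is a hypothetical (line, column) pair used only for this example.
type pos struct {
	Line   int64
	Column int64
}

// sortPositions orders positions by line, then by column, so the column
// behaves like a fractional refinement of the line number.
func sortPositions(ps []pos) {
	sort.Slice(ps, func(i, j int) bool {
		if ps[i].Line != ps[j].Line {
			return ps[i].Line < ps[j].Line
		}
		return ps[i].Column < ps[j].Column
	})
}

func main() {
	ps := []pos{{2, 1}, {1, 9}, {1, 0}, {1, 3}}
	sortPositions(ps)
	fmt.Println(ps)
}
```

An entry without a column (column 0) sorts before every entry on the same line that has one, which keeps older profiles in their existing order.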

felixge · 2022-10-11T10:35:19Z

@aalexand I'll let @mar-kolya confirm, but AFAIK we're totally happy with following your suggestions. Our current workaround for minified JS has been to put both column and line numbers into the line field (using 32 bit for each). This works for our use case, but since we're invested in pprof we'd love to contribute back and make the format better for everybody as well.
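
For readers unfamiliar with this kind of workaround, the sketch below shows one way two 32-bit values can share the 64-bit line field. The comment does not say which half holds which value, so the layout here (column in the upper 32 bits, line in the lower 32 bits) is an assumption, and `pack` and `unpack` are hypothetical names.

```go
package main

import "fmt"

// pack stores the column in the upper 32 bits and the line in the lower
// 32 bits of a single int64. This layout is assumed for illustration; the
// workaround described above may order the halves differently.
func pack(line, column uint32) int64 {
	return int64(uint64(column)<<32 | uint64(line))
}

// unpack reverses pack and returns the line and column.
func unpack(v int64) (line, column uint32) {
	u := uint64(v)
	return uint32(u), uint32(u >> 32)
}

func main() {
	v := pack(1, 20431)
	line, column := unpack(v)
	fmt.Println(line, column) // 1 20431
}
```

With this layout a packed value whose column is 0 equals the plain line number, but any tool unaware of the convention would display a very large line number for other entries, which is part of why a dedicated field is cleaner.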

mar-kolya · 2022-10-11T11:09:33Z

Sounds good to me, updated original proposal.
Thanks!

aalexand · 2022-10-17T20:54:28Z

One question I have on this: rendering performance data on a long source line is not very usable, so I would expect that at some point in the profile data processing the data needs to be remapped to the original source lines via source maps?

mar-kolya · 2022-10-17T21:22:24Z

Yes, that indeed happens. But this happens on the pprof data outside of the process that created the pprof file.

That being said, this was just the original use case that inspired this ticket. I'm sure there are languages where knowing the column number is useful because there are multiple statements on the same line, even though those languages do not put everything onto one line.

aalexand · 2022-10-18T03:23:43Z

I think the Function message would also need to be updated: there is a start_line field, and if we introduce the ability to specify a column in the Line message then we should consistently update the "precision" in other places.

aalexand · 2023-11-13T20:16:20Z

FTR, #818 is in review for this feature.

* Add column numbers to profile.proto.

  This change adds column numbers to the Line message in profile.proto. This allows users to distinguish between source code locations on the same line. Only the LLVM-based addr2line is capable of reading column number information from DWARF and PE debug information.

  Other changes include:
  * Add "columns" option for output granularity.
  * Account for column numbers during profile merge.
  * Update the encoder and decoder.
  * Update golden test files for legacy profiles.

  Fixes #687

* Update the expectations in the test to match testdata.
* Update report generation and driver for column numbers.
* Address comments

---------

Co-authored-by: Alexey Alexandrov <[email protected]>

mar-kolya changed the title ~~Pprof format should support 'column' (in addition to the line number)~~ Proposal: Add Column field to Line message Sep 27, 2022

prattmic mentioned this issue Apr 13, 2023

Proposal: add Discriminator field to Line message #768

Closed

Louis-Ye added the "type: feat" and "Priority: p3" labels Jun 1, 2023

snehasish mentioned this issue Nov 13, 2023

Add column numbers to profile.proto. #818

Merged

aalexand closed this as completed in #818 Jan 17, 2024
