Fully underline parse nodes in diagnostics. #3442

domisterwoozy · 2023-11-30T20:59:04Z

Another incremental change to diagnostic formatting. I simply recurse over all the tokens in the subtree of a parse node and construct a DiagnosticLocation that covers all of the tokens.

I believe it's nicer for the user to be directed at the entire chunk of source where the error is occurring rather then just pointing at the bracketing/terminator tokens, but let me know if you all agree.

…nderline_parse

jonmeow

@zygoloid What do you think of this? I think I'm hesitant on this due to several cases:

For missing return statements, it tries to highlight the full function (including the signature).
- e.g., "ERROR: Missing return at end of function with declared return type."
For cases where an open paren is the target, it highlights everything up to and including the open paren.
- But not including the close paren.
- e.g., "ERROR: addr self method cannot be invoked on a value."
For something like &foo where & is invalid, it points at the entire expression instead of &.
- e.g., "ERROR: Cannot take the address of non-reference expression."
- Similar can be seen of addr, etc.
In "Cannot implicitly convert from i32 to f64, the + seems slightly more clear than highlighting all of 12 + 3.4 (this is also a case where more clear associations of what's what would be useful).

Some cases seem more likely improvements, such as the highlight when a returned expression is of the incorrect type, or "Cannot initialize tuple of N element(s) from tuple with M element(s)."

Other cases look mostly like we're just not associating the right places. e.g., saying something can't be indexed seems like it should highlight the [, not necessarily the full a[b] expression. Right now it gets the latter because of the association with ]; with this change, associating with [ instead would result in a[ being highlighted.

This also makes me think about more complex cases. e.g., if we had a.b.c.d.e[x] where e cannot be indexed, I believe this would minimally highlight a.b.c.d.e[.

I think we'd want to keep iterating on highlight ranges, so it's not clear to me we should take this step. But this is where maybe zygoloid will have different thoughts about the parse tree structure associations for diagnostics.

zygoloid

@zygoloid What do you think of this?

I think some of these changes are improvements, others are regressions. But I think we do want functionality along these lines.

I think when we pass a location to a diagnostic builder, we should distinguish between (at least) two cases:

Passing a Parse::NodeId and wanting that subtree to be highlighted
Passing a Parse::NodeId and wanting just the corresponding token to be highlighted.

For example, we could change NodeLocationTranslator to take a new class NodeLocation as its location type, where NodeLocation can either be constructed from a NodeId (to highlight the range) or from

auto TokenLoc(Parse::NodeId node_id) -> NodeLocation;

(to point the caret at the token location for the parse node).

Then we'd need to go through the diagnostic changes here and switch to using emitter.Emit(TokenLoc(parse_node), MyDiag); for the ones we don't like.

I think highlighting the whole parse subtree is doing the right thing most of the time here, though, so that's probably the better default.

This also makes me think about more complex cases. e.g., if we had a.b.c.d.e[x] where e cannot be indexed, I believe this would minimally highlight a.b.c.d.e[.

Yeah, a few of our diagnostics have bad locations due to the structure of the parse tree. I think in this case using the TokenLoc of the [ is probably good enough for now, but ideally I think I'd want

  a.b.c.d.e[x]
  ~~~~~~~~~^

... for which I think we want to pass a highlighted range to the diagnostic infrastructure separately from the diagnostic location.

I think we'd want to keep iterating on highlight ranges, so it's not clear to me we should take this step. But this is where maybe zygoloid will have different thoughts about the parse tree structure associations for diagnostics.

I think this is an iterative step forward, especially if we add control over whether to highlight the subtree or just point at the one node, and gives us some useful functionality for adding more general highlighting.

toolchain/parse/tree_node_location_translator.h

domisterwoozy · 2023-12-01T21:22:13Z

I think when we pass a location to a diagnostic builder, we should distinguish between (at least) two cases:

Passing a Parse::NodeId and wanting that subtree to be highlighted

Passing a Parse::NodeId and wanting just the corresponding token to be highlighted.

For example, we could change NodeLocationTranslator to take a new class NodeLocation as its location type, where NodeLocation can either be constructed from a NodeId (to highlight the range) or from
auto TokenLoc(Parse::NodeId node_id) -> NodeLocation;
(to point the caret at the token location for the parse node).

Then we'd need to go through the diagnostic changes here and switch to using emitter.Emit(TokenLoc(parse_node), MyDiag); for the ones we don't like.

Let me know if I implemented this correctly. My question now is should we go through and add TokenOnly where we deem fit now or do that in a follow up? If we want to do it now can we enumerate each case where we only want the token? I can start with each instance in @jonmeow's comment and we can go from there?

zygoloid · 2023-12-02T01:49:27Z

Let me know if I implemented this correctly.

Yes, looks great, thanks!

My question now is should we go through and add TokenOnly where we deem fit now or do that in a follow up? If we want to do it now can we enumerate each case where we only want the token? I can start with each instance in @jonmeow's comment and we can go from there?

Normally I'd be in favor of doing this incrementally, but I think it'll be a lot easier to see where we want to make this change if we do it now, because we can see the effect of the combined patch on the test suite. So let's start by fixing the things that @jonmeow pointed out (plus optionally anything else that stands out to you), then I can do another pass over the diagnostic changes and point out any other ones where a point diagnostic would be better.

…nderline_parse

zygoloid

Thanks! I think there's only one class of highlighting here that still looks a bit off to me: diagnostics for a function call are highlighting from the start of the callee to the (, inclusive. I'm not sure what's best here -- I'd be happy with highlighting until the ) or just pointing at the ( token, but switching that case to TokenOnly is probably simplest.

zygoloid · 2023-12-04T02:52:21Z

toolchain/check/testdata/class/fail_addr_self.carbon

 // CHECK:STDERR: c.F();
- // CHECK:STDERR:  ^
+ // CHECK:STDERR: ^~~~


This is highlighting the ( but not the ), which seems a bit surprising. I think any of these would be OK:

c.F(); ^~~ c.F(); ^~~~~ c.F(); ^

(Same for other calls in this file.)

…nderline_parse

zygoloid

I think this is a significant improvement. Can you resolve the merge conflicts?

…nderline_parse

domisterwoozy · 2023-12-13T05:09:09Z

Sorry about the broken tests, I believe it should be good now.

@domisterwoozy

Builds upon @domisterwoozy 's excellent #3442 . Removes the need to store the first node of a declaration in the declaration state stack.

domisterwoozy added 4 commits November 27, 2023 18:43

fully underline parse nodes in diagnostics

bc7c467

Merge branch 'trunk' of github.com:carbon-language/carbon-lang into u…

846c56a

…nderline_parse

update tests

dc6e46f

fix issue with ranges ending with multichar token

f7a0041

github-actions bot requested a review from jonmeow November 30, 2023 20:59

github-actions bot added the toolchain label Nov 30, 2023

jonmeow reviewed Nov 30, 2023

View reviewed changes

jonmeow requested a review from zygoloid November 30, 2023 23:53

zygoloid reviewed Dec 1, 2023

View reviewed changes

toolchain/parse/tree_node_location_translator.h Outdated Show resolved Hide resolved

simplify range construction and allow specifying token only

e0dfb12

domisterwoozy added 2 commits December 3, 2023 13:40

add to some diagnostics

b2da381

Merge branch 'trunk' of github.com:carbon-language/carbon-lang into u…

2d9f3a0

…nderline_parse

zygoloid reviewed Dec 4, 2023

View reviewed changes

domisterwoozy added 3 commits December 4, 2023 11:29

only highlight token for addr self method on a value

ce1258f

Merge branch 'trunk' of github.com:carbon-language/carbon-lang into u…

37c229a

…nderline_parse

Merge branch 'trunk' of github.com:carbon-language/carbon-lang into u…

0fee6c3

…nderline_parse

zygoloid approved these changes Dec 7, 2023

View reviewed changes

Merge branch 'trunk' of github.com:carbon-language/carbon-lang into u…

61c8f09

…nderline_parse

zygoloid added this pull request to the merge queue Dec 8, 2023

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 8, 2023

domisterwoozy added 2 commits December 12, 2023 20:47

Merge branch 'trunk' of github.com:carbon-language/carbon-lang into u…

5852df0

…nderline_parse

update tests

4631831

zygoloid added this pull request to the merge queue Dec 13, 2023

Merged via the queue into carbon-language:trunk with commit 6419568 Dec 13, 2023
6 checks passed

josh11b mentioned this pull request Dec 14, 2023

Underline the complete declaration in diagnostics #3508

Merged

github-merge-queue bot pushed a commit that referenced this pull request Dec 14, 2023

Underline the complete declaration in diagnostics (#3508)

23c7d7d

Builds upon @domisterwoozy 's excellent #3442 . Removes the need to store the first node of a declaration in the declaration state stack.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fully underline parse nodes in diagnostics. #3442

Fully underline parse nodes in diagnostics. #3442

domisterwoozy commented Nov 30, 2023

jonmeow left a comment

zygoloid left a comment

domisterwoozy commented Dec 1, 2023

zygoloid commented Dec 2, 2023

zygoloid left a comment

zygoloid Dec 4, 2023

zygoloid left a comment

domisterwoozy commented Dec 13, 2023

Fully underline parse nodes in diagnostics. #3442

Fully underline parse nodes in diagnostics. #3442

Conversation

domisterwoozy commented Nov 30, 2023

jonmeow left a comment

Choose a reason for hiding this comment

zygoloid left a comment

Choose a reason for hiding this comment

domisterwoozy commented Dec 1, 2023

zygoloid commented Dec 2, 2023

zygoloid left a comment

Choose a reason for hiding this comment

zygoloid Dec 4, 2023

Choose a reason for hiding this comment

zygoloid left a comment

Choose a reason for hiding this comment

domisterwoozy commented Dec 13, 2023