Make the formatter keep comments #17

AucaCoyan · 2023-06-06T14:35:01Z

In the current status, nufmt strips comments, wherever they are from the code.
Because of parsing with .parse()
I would be cool if once the vec<u8> is loaded, split the comments and format "sections of code" inbetween comments, so we can then collect all the comments-code together afterwards.
Also, there is a test skipped with this very feature: ignore_comments

The text was updated successfully, but these errors were encountered:

fdncred · 2023-06-06T15:01:33Z

I wonder if you can infer where comments are by using offsets?

For instance, if you have this test.nu script

# function comment
def fun1 [text] {
    echo "fun1: $text"
}

# this is a 
# multi-line comment
# before a function
def fun2 [text]] {
    echo "fun2: $text"
}

Then this command reveals where the first part of code is, it starts at 19.

❯ nu --ide-ast test.nu | from json | table -e
╭────┬───────────────┬────────────────────┬─────────────────┬──────╮
│  # │    content    │       shape        │      span       │ type │
├────┼───────────────┼────────────────────┼─────────────────┼──────┤
│  0 │ def           │ shape_internalcall │ ╭───────┬────╮  │ ast  │
│    │               │                    │ │ end   │ 22 │  │      │
│    │               │                    │ │ start │ 19 │  │      │
│    │               │                    │ ╰───────┴────╯  │      │

So, that means 0 - 19 may be comments.

❯ open test.nu | str substring 0..19
# function comment

Then we can see this section

│  6 │               │ shape_closure      │ ╭───────┬────╮  │ ast  │
│    │ }             │                    │ │ end   │ 61 │  │      │
│    │               │                    │ │ start │ 59 │  │      │
│    │               │                    │ ╰───────┴────╯  │      │
│  7 │ def           │ shape_internalcall │ ╭───────┬─────╮ │ ast  │
│    │               │                    │ │ end   │ 120 │ │      │
│    │               │                    │ │ start │ 117 │ │      │
│    │               │                    │ ╰───────┴─────╯ │      │

Then we could do this

❯ open test.nu | str substring 61..117


# this is a
# multi-line comment
# before a function

Kind of a long heuristic way to go, but it may be work-able?

AucaCoyan · 2023-06-06T15:20:21Z

That is a different (and I believe better) algorithm.
I'll try it! thanks!

The algorithm I first thought is:
the CLI sends Vec<u8>
the file comments.rs use lex() to read every TokenContents and checks if there are these 2 tokens concatenated: TokenContents::Comment and TokenContents::Eol (end of line, the last character which usually is \n)

After it found them, split in between. So you will have:

code
# some comment
more code
# another comment

Split the sections in yes-code and no-code,
format them,
and collect them for the output

(a much larger process than your logic)

fdncred · 2023-06-06T16:33:56Z

I saw your code, if it works I'm fine with yours too. I was just suggesting an alternative method. I think it would be much better if we could keep track of all comments during parsing so we're not guessing.

This was referenced Jun 6, 2023

make nufmt keep the comments #18

Merged

nufmt roadmap #11

Open

amtoine linked a pull request Jun 6, 2023 that will close this issue

make nufmt keep the comments #18

Merged

fdncred closed this as completed in #18 Jun 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make the formatter keep comments #17

Make the formatter keep comments #17

AucaCoyan commented Jun 6, 2023

fdncred commented Jun 6, 2023

AucaCoyan commented Jun 6, 2023 •

edited

Loading

fdncred commented Jun 6, 2023

Make the formatter keep comments #17

Make the formatter keep comments #17

Comments

AucaCoyan commented Jun 6, 2023

fdncred commented Jun 6, 2023

AucaCoyan commented Jun 6, 2023 • edited Loading

fdncred commented Jun 6, 2023

AucaCoyan commented Jun 6, 2023 •

edited

Loading