add tests, deal with pipes, newlines, tabs for `to nuon` #6391

merelymyself · 2022-08-23T15:00:05Z

Description

Related: #6379

Tests

Make sure you've done the following:

Add tests that cover your changes, either in the command examples, the crate/tests folder, or in the /tests folder.
Try to think about corner cases and various ways how your changes could break. Cover them with tests.
If adding tests is not possible, please document in the PR body a minimal example with steps on how to reproduce so one can verify your change works.

Make sure you've run and fixed any issues with these commands:

cargo fmt --all -- --check to check standard code formatting (cargo fmt --all applies these changes)
cargo clippy --workspace --features=extra -- -D warnings -D clippy::unwrap_used -A clippy::needless_collect to check that you're using the standard code style
cargo test --workspace --features=extra to check that all the tests pass

sholderbach · 2022-08-23T15:15:17Z

crates/nu-command/src/formats/to/nuon.rs

+ if val.contains('\t') {
+ return Err(ShellError::UnsupportedInput(
+ "tab character cannot be converted".to_string(),
+ span,
+ ));
 }
- quoted
- }),
+ if val.contains('\n') {
+ return Err(ShellError::UnsupportedInput(
+ "newline character cannot be converted".to_string(),
+ span,
+ ));


Do we have code already to escape them to \t and \n respectively?

I agree, the allowed / disallowed characters are already handled by the parser. If we want to disallow some characters in column names, let's do it in the parser. Here, we should just check for characters that break the to nuon | from nuon roundtrip and wrap them in quotes.

kubouch · 2022-08-23T17:03:25Z

I'd propose the following methodology:

Disable quoting entirely (for now)
Loop through all ASCII characters and check for
a) [ [ $'foo($char)bar' ]; [ 'test' ] ] | to nuon | from nuon
b) { $'foo($char)bar': 'test' } | to nuon | from nuon
Collect the characters that cause 2a) or 2b) to fail
Enable quoting for the collected characters

The step 2. can be put into the test suite (excluding obvious working characters like 0-9, a-z or A-Z).

merelymyself · 2022-08-24T02:09:57Z

I used proptest, as suggested in the other PR. It's a dev-dependency, so that should be fine, right?

sholderbach · 2022-08-24T08:45:55Z

I used proptest, as suggested in the other PR. It's a dev-dependency, so that should be fine, right?

Does it execute growing and shrinking in the CI or do you trigger the expensive operation manually?
(It has been a while that I used it)

sholderbach · 2022-08-24T08:50:23Z

2. Loop through all ASCII characters and check for
   a) `[ [ $'foo($char)bar' ]; [ 'test' ] ] | to nuon | from nuon`
   b) `{ $'foo($char)bar': 'test' } | to nuon | from nuon`

I would add
a) [ [ $'($char)' ]; [ 'test' ] ] | to nuon | from nuon
b) { $'($char)': 'test' } | to nuon | from nuon

for the particularly pathological cases

Also with proptest doing strings should be fine and that will discover some other pathologies like

{"(1)": one} that will round trip to {1: one} if there are no quotes

kubouch · 2022-08-24T09:11:19Z

Yeah, we can also add the single-char cases @sholderbach suggested. I've never used proptest -- does it test all the characters? I don't see the logic from this PR's commits.

merelymyself · 2022-08-24T13:11:29Z

@kubouch - I don't think proptest tests every character, but it does cast a rather wide net over often-problematic characters and strings.

@sholderbach - I've added the cases, thanks for the suggestion! {"(1)": one} doesn't roundtrip because anything with brackets is quoted automatically. As for your first question, sorry, I don't quite understand what you mean.

merelymyself · 2022-08-24T14:45:14Z

I also found an unrelated parser issue, #6401

sholderbach · 2022-08-29T11:13:01Z

Looking good to me. Thanks @merelymyself

I was wondering if it will sample new tests during each call to CI or if it will only check the previous failures. I didn't notice a big impact on the CI times so that should probably not be an issue (as long as we don't discover random failuers over time in unrelated PR's)
Do we have the machinery to escape/quote the caught cases already or do we error out in those case for now?

Other than that would be happy to land that.

merelymyself · 2022-08-31T11:59:37Z

I believe it will sample new tests, but I've ran it as much as I could, so I can only hope that some new issue won't be suddenly discovered.

The caught cases are already quoted - for weird cases where " or the null byte is chosen as a string, it errors out, and the test has built-in catches for that.

sholderbach · 2022-08-31T14:33:31Z

OK yeah let's land it. We can probably deal with a bit of fuzzing happening in the background ^^
I would consider the null byte outside of the realm of things we should handle gracefully when most programs will terminate strings on it...
Escaping the double quotation mark should probably be on the list of things we deal with properly but if we for now fail safe with to nuon I wouldn't consider that a stopper for this PR

* remove unnecessary FlatShape * add proptest * remove files that belonged in another PR * more tests, more chars * add exception for parser error unrelated ot PR

rgwood · 2022-11-07T04:26:43Z

Proptest is very cool but I have noticed that these tests are much slower than others.

cargo nextest run --all runs about 2000 individual tests, and on a fast desktop it takes ~12s with the 2 proptests and ~6s without. 2 out of 2000 tests are responsible for consuming 50% of test time!

I imagine the slowdown is worse on slower machines like the test runners. Not sure what the best way forward is (but also it's not especially urgent).

sholderbach · 2022-11-07T05:25:31Z

Yeah, running the extra iterations on proptest or quickcheck in CI doesn't seem to catch anything new. Either we restrict the iterations (at least with proptest that's configurable), ignore the tests and move the reproducibles to regular tests, or do feature crimes... (with the second approach you would have to name the test to run sampling)
I had a few other things I would like to stress test. But yeah that shouldn't run on every PR.
We could think about a cron workflow with just the fuzzing and some longer iteration time.

(what is the state on switching nextest back on?)

rgwood · 2022-11-07T05:34:34Z

(what is the state on switching nextest back on?)

Not sure. IIRC there were 2 main issues with nextest in CI:

Didn't notice a huge speedup in CI with nextest (maybe because measuring CI is hard, maybe because our CI runners only have 2 cores?)
nextest doesn't support doctests so we still needed to run cargo test for a few tests

We could probably turn it on again, but the cost-benefit analysis is kinda ¯\_(ツ)_/¯

remove unnecessary FlatShape

7f9ee7d

sholderbach reviewed Aug 23, 2022

View reviewed changes

add proptest

52266e4

merelymyself force-pushed the patch-to-nuon branch from 66e0df5 to 52266e4 Compare August 24, 2022 02:06

remove files that belonged in another PR

502d904

more tests, more chars

b5fcb8e

add exception for parser error unrelated ot PR

1a51485

sholderbach approved these changes Sep 1, 2022

View reviewed changes

sholderbach merged commit 34e58bc into nushell:main Sep 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add tests, deal with pipes, newlines, tabs for `to nuon` #6391

add tests, deal with pipes, newlines, tabs for `to nuon` #6391

merelymyself commented Aug 23, 2022

sholderbach Aug 23, 2022

kubouch Aug 23, 2022

kubouch commented Aug 23, 2022

merelymyself commented Aug 24, 2022

sholderbach commented Aug 24, 2022

sholderbach commented Aug 24, 2022

kubouch commented Aug 24, 2022

merelymyself commented Aug 24, 2022

merelymyself commented Aug 24, 2022

sholderbach commented Aug 29, 2022

merelymyself commented Aug 31, 2022

sholderbach commented Aug 31, 2022

rgwood commented Nov 7, 2022 •

edited

Loading

sholderbach commented Nov 7, 2022

rgwood commented Nov 7, 2022

add tests, deal with pipes, newlines, tabs for to nuon #6391

add tests, deal with pipes, newlines, tabs for to nuon #6391

Conversation

merelymyself commented Aug 23, 2022

Description

Tests

sholderbach Aug 23, 2022

Choose a reason for hiding this comment

kubouch Aug 23, 2022

Choose a reason for hiding this comment

kubouch commented Aug 23, 2022

merelymyself commented Aug 24, 2022

sholderbach commented Aug 24, 2022

sholderbach commented Aug 24, 2022

kubouch commented Aug 24, 2022

merelymyself commented Aug 24, 2022

merelymyself commented Aug 24, 2022

sholderbach commented Aug 29, 2022

merelymyself commented Aug 31, 2022

sholderbach commented Aug 31, 2022

rgwood commented Nov 7, 2022 • edited Loading

sholderbach commented Nov 7, 2022

rgwood commented Nov 7, 2022

add tests, deal with pipes, newlines, tabs for `to nuon` #6391

add tests, deal with pipes, newlines, tabs for `to nuon` #6391

rgwood commented Nov 7, 2022 •

edited

Loading