conditionally disable bottleneck #5560

keewis · 2021-07-01T23:12:03Z

As this came up in #5424 (and because I can't seem to reliably reproduce expected values in #4972 if it is enabled) this adds a option to disable bottleneck, even if it is installed.

In #5424 it was suggested to also allow replacing bottleneck with numbagg. If that's something we want (and numbagg supports the operations we need) I can try looking into renaming the new option to something like xr.set_options(accelerate_with="bottleneck") (or something else, if someone has a great idea).

Tests are missing because I have no idea how to check that this works (except by mocking bottleneck).

Closes Bottleneck bug with unusual strides - causes segfault or wrong number #5424
Tests added
Passes pre-commit run --all-files
User visible changes (including notable bug fixes) are documented in whats-new.rst

github-actions · 2021-07-01T23:49:55Z

Unit Test Results

        6 files ±0         6 suites ±0 57m 2s ⏱️ ±0s
16 217 tests ±0 14 483 ✔️ ±0 1 734 💤 ±0 0 ❌ ±0
90 498 runs ±0 82 324 ✔️ ±0 8 174 💤 ±0 0 ❌ ±0

Results for commit 3956b73. ± Comparison against base commit 3956b73.

♻️ This comment has been updated with latest results.

dcherian · 2021-07-02T00:09:36Z

We use it in other places too (for e.g.):

xarray/xarray/core/nputils.py

Lines 139 to 147 in c472f8a

 if ( 

 _USE_BOTTLENECK 

 and isinstance(values, np.ndarray) 

 and bn_func is not None 

 and not isinstance(axis, tuple) 

 and values.dtype.kind in "uifc" 

 and values.dtype.isnative 

 and (dtype is None or np.dtype(dtype) == values.dtype) 

 ):

xr.set_options(accelerate_with="bottleneck")

I like this idea but we should wait for more input.

max-sixty · 2021-07-02T00:20:32Z

Nice!

pandas uses use_bottleneck — and same for numba / numexpr.

To what extent would bottleneck & numba be mutually exclusive? Until everything is implemented in numbagg, I guess they won't be, and we might want separate options. Having a single option would be nicer otherwise.

keewis · 2021-07-21T17:23:45Z

as discussed in the meeting, we will keep use_bottleneck for now. I still don't know how to test this, but otherwise this should be ready for reviews and merging.

Edit: we also don't have alternatives for functions like duck_array_ops.push (used in ffill etc.) so disabling bottleneck would fail the function. Should we just silently use bottleneck, or raise an error?

dcherian

LGTM. I guess you would have to mock bottleneck to properly test this.

dcherian · 2021-07-21T17:35:36Z

so disabling bottleneck would fail the function. Should we just silently use bottleneck, or raise an error?

Ah this is a good test!

with xr.set_options(use_bottleneck=False):
	with pytest.raises(...):
		dataarray.ffill()

IMO it should raise an error so that use_bottleneck is a "global" control on whether xarray uses bottleneck or not. The context manager gives the user some flexibility to opt-in to using bottleneck where they want to.

keewis · 2021-08-11T21:18:05Z

I'm not sure how to test rolling: monkeypatching bottleneck.move_sum does not work because Rolling only accesses that on import, i.e. before the test is executed.

Everything else is should be done, though.

dcherian

just a suggestion on the error message.

Can you open a new issue about testing for rolling? I think it is OK to merge without that.

xarray/core/dataset.py

dcherian · 2021-08-12T14:41:30Z

Thanks @keewis. We can extend the tests for rolling later.

* upstream/main: (34 commits) Use same bool validator as other inputs (pydata#5703) conditionally disable bottleneck (pydata#5560) Refactor index vs. coordinate variable(s) (pydata#5636) pre-commit: autoupdate hook versions (pydata#5685) Flexible Indexes: Avoid len(index) in map_blocks (pydata#5670) Speed up _mapping_repr (pydata#5661) update the link to `scipy`'s intersphinx file (pydata#5665) Bump styfle/cancel-workflow-action from 0.9.0 to 0.9.1 (pydata#5663) pre-commit: autoupdate hook versions (pydata#5660) fix the binder environment (pydata#5650) Update api.rst (pydata#5639) Kwargs to rasterio open (pydata#5609) Bump codecov/codecov-action from 1 to 2.0.2 (pydata#5633) new blank whats-new for v0.19.1 v0.19.0 release notes (pydata#5632) remove deprecations scheduled for 0.19 (pydata#5630) Make typing-extensions optional (pydata#5624) Plots get labels from pint arrays (pydata#5561) Add to_numpy() and as_numpy() methods (pydata#5568) pin fsspec (pydata#5627) ...

* upstream/main: (307 commits) Use same bool validator as other inputs (pydata#5703) conditionally disable bottleneck (pydata#5560) Refactor index vs. coordinate variable(s) (pydata#5636) pre-commit: autoupdate hook versions (pydata#5685) Flexible Indexes: Avoid len(index) in map_blocks (pydata#5670) Speed up _mapping_repr (pydata#5661) update the link to `scipy`'s intersphinx file (pydata#5665) Bump styfle/cancel-workflow-action from 0.9.0 to 0.9.1 (pydata#5663) pre-commit: autoupdate hook versions (pydata#5660) fix the binder environment (pydata#5650) Update api.rst (pydata#5639) Kwargs to rasterio open (pydata#5609) Bump codecov/codecov-action from 1 to 2.0.2 (pydata#5633) new blank whats-new for v0.19.1 v0.19.0 release notes (pydata#5632) remove deprecations scheduled for 0.19 (pydata#5630) Make typing-extensions optional (pydata#5624) Plots get labels from pint arrays (pydata#5561) Add to_numpy() and as_numpy() methods (pydata#5568) pin fsspec (pydata#5627) ...

keewis added 2 commits July 2, 2021 00:50

add the "use_bottleneck" option

5056b45

conditionally disable bottleneck where possible

f136208

fix the option name

3623f8d

keewis added 2 commits July 21, 2021 19:20

Merge branch 'main' into conditionally-disable-bottleneck

4910da6

add a entry to whats-new.rst

83ff0bb

dcherian reviewed Jul 21, 2021

View reviewed changes

keewis added 3 commits July 30, 2021 11:24

also check use_bottleneck in ffill and bfill

514db3a

check for use_bottleneck in rank

ad91a08

Merge branch 'main' into conditionally-disable-bottleneck

cfeffa4

keewis force-pushed the conditionally-disable-bottleneck branch from 4ce8a6a to cfeffa4 Compare July 30, 2021 11:45

keewis added 3 commits July 30, 2021 13:58

split out the dask tests

26b6015

make sure bottleneck is not used for reduce functions

3c04ee1

Merge branch 'main' into conditionally-disable-bottleneck

0c70fd5

dcherian approved these changes Aug 12, 2021

View reviewed changes

xarray/core/dataset.py Outdated Show resolved Hide resolved

explain how to enable bottleneck in the error messages

76a22f3

dcherian merged commit 3956b73 into pydata:main Aug 12, 2021

keewis deleted the conditionally-disable-bottleneck branch August 12, 2021 14:48

mathause mentioned this pull request Oct 20, 2021

Rolling() gives values different from pd.rolling() #5877

Open

andersy005 mentioned this pull request May 6, 2022

bottleneck : Wrong mean for float32 array #1346

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

conditionally disable bottleneck #5560

conditionally disable bottleneck #5560

keewis commented Jul 1, 2021 •

edited

Loading

github-actions bot commented Jul 1, 2021 •

edited

Loading

dcherian commented Jul 2, 2021 •

edited

Loading

max-sixty commented Jul 2, 2021

keewis commented Jul 21, 2021 •

edited

Loading

dcherian left a comment

dcherian commented Jul 21, 2021

keewis commented Aug 11, 2021 •

edited

Loading

dcherian left a comment

dcherian commented Aug 12, 2021

conditionally disable bottleneck #5560

conditionally disable bottleneck #5560

Conversation

keewis commented Jul 1, 2021 • edited Loading

github-actions bot commented Jul 1, 2021 • edited Loading

Unit Test Results

dcherian commented Jul 2, 2021 • edited Loading

max-sixty commented Jul 2, 2021

keewis commented Jul 21, 2021 • edited Loading

dcherian left a comment

Choose a reason for hiding this comment

dcherian commented Jul 21, 2021

keewis commented Aug 11, 2021 • edited Loading

dcherian left a comment

Choose a reason for hiding this comment

dcherian commented Aug 12, 2021

keewis commented Jul 1, 2021 •

edited

Loading

github-actions bot commented Jul 1, 2021 •

edited

Loading

dcherian commented Jul 2, 2021 •

edited

Loading

keewis commented Jul 21, 2021 •

edited

Loading

keewis commented Aug 11, 2021 •

edited

Loading