cumsum, accumulate should not have default dim=1 #19451

StefanKarpinski · 2016-11-29T18:40:56Z

This is inconsistent with how non-cumulative functions work in Julia, e.g.:

julia> A = rand(3,4)
3×4 Array{Float64,2}:
 0.239183  0.420538  0.760069  0.349576
 0.768802  0.924563  0.934626  0.446843
 0.767848  0.761547  0.518525  0.758667

julia> sum(A)
7.650786585136748

julia> cumsum(A)
3×4 Array{Float64,2}:
 0.239183  0.420538  0.760069  0.349576
 1.00798   1.3451    1.6947    0.796419
 1.77583   2.10665   2.21322   1.55509

Correspondingly, rather than asymmetrically defaulting to the first dimension, cumsum(A) should either be an error or return an accumulation of the elements of A such that cumsum(A)[end] == sum(A) and cumsum(v')' == cumsum(v) for any vector v. One possibility is a cumulative sum in column-major order. Another is that each output value is the sum of the input values at, to the left of, or above that slot. I.e.:

julia> [sum(A[1:i,1:j]) for i=1:size(A,1), j=1:size(A,2)]
3×4 Array{Float64,2}:
 0.239183  0.65972  1.41979  1.76937
 1.00798   2.35308  4.04778  4.8442
 1.77583   3.88248  6.0957   7.65079

This answer has the above properties, can be computed efficiently, and is useful.
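To illustrate the "computed efficiently" claim, here is a minimal sketch that builds the same table by accumulating along each dimension in turn instead of re-summing every sub-block. The helper name prefixsum is hypothetical, and the dims keyword assumes Julia 1.0 or later (at the time of this issue the dimension was a positional argument). Integer inputs are used so the comparisons are exact.

```julia
# Hypothetical helper: n-dimensional prefix sum via one cumsum pass per dimension.
# Assumes Julia >= 1.0 for the `dims` keyword.
function prefixsum(A::AbstractArray)
    out = A
    for d in 1:ndims(A)
        out = cumsum(out, dims=d)   # accumulate along dimension d
    end
    return out
end

A = [1 2 3; 4 5 6]
P = prefixsum(A)

# Reference definition from the comprehension above.
naive = [sum(A[1:i, 1:j]) for i in 1:size(A, 1), j in 1:size(A, 2)]

@assert P == naive            # same table as the comprehension
@assert P[end] == sum(A)      # last entry is the total

# Row and column layouts of the same vector agree up to shape.
v = [1, 2, 3]
@assert vec(prefixsum(reshape(v, 1, :))) == prefixsum(v)
```

Each pass touches every element once, so the work grows with ndims(A) * length(A) rather than with the size of every sub-block.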

andreasnoack · 2016-11-29T18:58:41Z

I agree that cumsum shouldn't default to dim=1 but I think it is better to throw an error if no dimension is specified and ndims>1. I don't like that storage order becomes significant for anything but performance and floating point errors. Short term, there would also be a deprecation issue.
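The storage-order concern can be made concrete with a small sketch. The two helper names below are hypothetical; they flatten the same matrix in column-major and in row-major order before accumulating, and the results differ everywhere except at the first and last elements.

```julia
# Hypothetical helpers comparing two traversal orders for a "flat" cumulative sum.

# Accumulate in column-major (Julia's native storage) order.
colmajor_cumsum(A::AbstractMatrix) = reshape(cumsum(vec(A)), size(A))

# Accumulate in row-major order by transposing, accumulating, and transposing back.
function rowmajor_cumsum(A::AbstractMatrix)
    At = permutedims(A)
    return permutedims(reshape(cumsum(vec(At)), size(At)))
end

A = [1 2; 3 4]

@assert colmajor_cumsum(A) == [1 6; 4 10]
@assert rowmajor_cumsum(A) == [1 3; 6 10]
@assert colmajor_cumsum(A) != rowmajor_cumsum(A)   # traversal order is visible in the result
```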

StefanKarpinski · 2016-11-29T20:18:05Z

I agree that it's not desirable for storage order to become semantically visible. That's why I proposed the [sum(A[1:i,1:j]) for i=1:size(A,1), j=1:size(A,2)] definition as well. Wanting cumsum(v')' == cumsum(v) may tie into #4774 depending on how that pans out.

StefanKarpinski · 2017-09-01T17:05:28Z

See also #23542, #20041.

StefanKarpinski · 2017-09-05T17:27:37Z

The minimal plan of action here is:

1. Make not supplying a dimension an error unless all dimensions but the first are singleton.
2. Make accumulate on a row vector work as accumulate(f, r.').'.

In the future, this could be generalized to the n-dimensional prefix summation behavior I proposed above, which is generally useful and ties the desired behavior for vectors and row vectors together.
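The minimal plan above can be sketched as a wrapper. This is an illustration under stated assumptions, not the change that was merged into Base: strict_accumulate is a hypothetical name, a 1×n Matrix stands in for a row vector, and the dims keyword assumes Julia 1.0 or later.

```julia
# Hypothetical wrapper illustrating the two rules of the minimal plan.
function strict_accumulate(op, A::AbstractArray)
    if all(d -> size(A, d) == 1, 2:ndims(A))
        # Rule 1: every dimension but the first is a singleton, so dim 1 is unambiguous.
        return accumulate(op, A, dims=1)
    elseif ndims(A) == 2 && size(A, 1) == 1
        # Rule 2: a row behaves like transpose, accumulate, transpose back,
        # which is the same as accumulating along dimension 2.
        return accumulate(op, A, dims=2)
    else
        throw(ArgumentError("dims must be specified when more than one dimension is non-singleton"))
    end
end

@assert strict_accumulate(+, [1, 2, 3]) == [1, 3, 6]   # plain vector
@assert strict_accumulate(+, [1 2 3]) == [1 3 6]       # 1×3 row keeps its shape

# A genuine matrix is rejected instead of silently using dim 1.
rejected = try
    strict_accumulate(+, [1 2; 3 4])
    false
catch err
    err isa ArgumentError
end
@assert rejected
```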

andreasnoack mentioned this issue Dec 1, 2016

No cov(::AbstractMatrix, ::Bool) method #19471

Closed

StefanKarpinski added this to the 1.0 milestone Sep 1, 2017

StefanKarpinski assigned andreasnoack Sep 1, 2017

This was referenced Sep 1, 2017

cumsum inconvenient for RowVector #20041

Open

Removed default descriptions from multidimensional.jl #23542

Merged

JeffBezanson added the domain:arrays label Nov 20, 2017

andreasnoack mentioned this issue Nov 21, 2017

Deprecate cumsum, cumprod, cumsum_kbn, and accumulate when dim isn't specified #24684

Merged

andreasnoack closed this as completed in #24684 Nov 24, 2017
