Change `for i=1:length(A)` to `for i in eachindex(A)` #10858

timholy · 2015-04-17T10:13:59Z

PSA: when writing algorithms with potentially-multidimensional AbstractArrays, please write for i in eachindex(A) rather than for i = 1:length(A) whenever you don't have some reason to insist that i be an integer.

This fixes performance problems for many operations (see #1168 (comment)).

There are a few outstanding issues, some of which arise when trying to use one index for two different arrays in circumstances in which we have not previously required the shapes to match:

map!,

julia/base/abstractarray.jl

Line 1300 in ef06211

for i = 1:length(A)

and

julia/base/abstractarray.jl

Line 1337 in ef06211

for i = 1:length(A)
logical indexing,

julia/base/array.jl

Line 332 in ef06211

for i = 1:length(I)

,

julia/base/array.jl

Line 408 in ef06211

for i = 1:length(I)

,

julia/base/array.jl

Line 422 in ef06211

for i = 1:length(I)
ccopy!,

julia/base/array.jl

Line 1421 in ef06211

for i = 1:length(A)

and others of which pertain to returning an index to the user (should we return CartesianIndexes?)

find,

julia/base/array.jl

Line 1150 in ef06211

for i = 1:length(A)

and

julia/base/array.jl

Line 1164 in ef06211

for i=1:length(A)
findin,

julia/base/array.jl

Line 1271 in ef06211

@inbounds for i = 1:length(a)

and finally others where we may want to change the algorithm:

cummin/min/cummax/max,

julia/base/array.jl

Line 1558 in ef06211

for i = 1:length(A)

pao · 2015-04-17T12:35:34Z

This sounds like it should be a mailing list post, and an addition to the MATLAB differences list (not sure what other languages this is relevant for)? Should use of eachindex() in user code targeting v0.4 be considered the default?

lindahua · 2015-04-17T12:51:54Z

+1

A benchmark that shows the speed improvement + documentation of underlying functions (e.g. linear indexing) would be great!

timholy · 2015-04-17T13:10:57Z

eachindex is old news, and you can find benchmarks in various issues (#8432, #8501, #9329, #10507). This PR was just tackling the overdue issue of making more algorithms use it.

@lindahua, to catch you up: eachindex does linear indexing when it is fast (if the trait linearindexing(A) == LinearFast()), and cartesian indexing when not. So you can get the best of both worlds that way.

Yes, I agree that more needs to be done to advertise this.

mbauman · 2015-04-17T13:54:51Z

Documentation in #10859.

pao · 2015-04-17T14:55:34Z

eachindex is old news

Yes, I agree that more needs to be done to advertise this.

The team as a whole spends more time being awesome than I have time to keep up with the awesomeness, and I suspect I'm trying harder than the average Julia user. Thanks for highlighting the relevant issues for me.

IainNZ · 2015-04-17T15:02:45Z

Documentation commit: 9b4f212

I also didn't know this was a function for Mortal Men to use in their code, assumed it was just for the Elvish-kings under the Base.sky and Dwarf-lords in their Base.halls of Base.stone.

milktrader · 2015-04-17T15:02:46Z

Thanks for the PSA!

pao · 2015-04-17T15:56:09Z

I also didn't know this was a function for Mortal Men to use in their code, assumed it was just for the Elvish-kings under the Base.sky and Dwarf-lords in their Base.halls of Base.stone.

You're from New Zealand. Doesn't that count? (Also, it's morning here, and you've added substantially to my enjoyment of it. Thank you.)

Would it make sense for something along the lines of "you probably want to use this everywhere you're currently using the range 1:length(A) for iteration over an array" to be made explicit in that documentation? Otherwise it's not obvious that you'd want to use it.

timholy · 2015-04-17T16:18:48Z

It seems we should probably add something to the manual chapter on arrays.

timholy · 2015-04-17T16:25:42Z

This already needs rebasing. While I'm at it, what do people think about some of the TODO items above?
For the three categories:

Haven't previously been insisting on matching sizes: we could try adding those constraints and see what breaks, unless there are objections
Returning an index to the user: I think the big decision is whether we should have a sub2ind that works on CartesianIndex (and hence return an integer), or if we should return an array of CartesianIndexes.
Changing the algorithm: that can be done next time someone complains about their performance, I don't see any reason to tackle that now 😄

mbauman · 2015-04-17T16:47:49Z

Man, those are all tough decisions. The choice to make eachindex return either an integer or a CartesianIndex is a step towards putting Cartesian indices into user's hands more often (whether they realize it or not). But in that case, it seems more obvious that the returned element is specifically for indexing. There are lots of use-cases for find… and I can only see that causing trouble. Especially since it's generally an anti-pattern to index directly from the results of find; you should just be using the logical index instead.

Logical indexing feels like we're still doing it wrong. But I'm not sure how to make it better. See also #10065 (comment).

But I agree on punting the algorithm changes to another day, perhaps with a comment in the code about how it could be written better.

timholy · 2015-04-17T16:54:27Z

I agree that changing find semantics seems to be asking for trouble. One option would be find{T}(::Type{T}, A) and have T default to Int; people who want the cartesian version could use find(CartesianIndex, A).

I can rebase this and merge it, and leave hard decisions for another day 😄.

kmsquire · 2015-04-17T17:48:36Z

One option would be find{T}(::Type{T}, A) and have T default to Int; people who want the cartesian version could use find(CartesianIndex, A).

+1 -- this is really nice... although I'd love to have a shorter name for CartesianIndex.

This fixes performance problems for many SubArray operations

Since `done` is called on each iteration but `start` is called only once, it makes more sense to put this logic in `start`.

timholy · 2015-04-19T18:38:43Z

I realized that my proposal for find(CartesianIndex, A) is basically what findn does, except the latter returns a tuple of Vector{Int} indexes. So we should probably have one or the other but not both.

Change `for i=1:length(A)` to `for i in eachindex(A)`

timholy · 2015-04-20T01:48:35Z

As requested by @pao, mailing-list post is here.

tkelman · 2015-04-20T07:31:05Z

That bitarray segfault on appveyor is worrying... https://ci.appveyor.com/project/StefanKarpinski/julia/build/1.0.3946/job/ius9kwerpcskfbi8

More recent builds have passed though, so maybe it's just a case of #9176?

timholy · 2015-04-20T10:32:18Z

The earlier version of this passed on all 3 platforms, and all I did was fix a tiny merge conflict that had nothing to do with BitArrays. So I wasn't worried about this being the cause.

…g#10858)

timholy mentioned this pull request Apr 17, 2015

laplace equation benchmark performance #1168

Closed

garrison mentioned this pull request Apr 17, 2015

eachindex JuliaLang/Compat.jl#64

Closed

timholy added 2 commits April 19, 2015 13:13

Change i=1:length(A) to i in eachindex(A)

4297135

This fixes performance problems for many SubArray operations

Fix iteration for empty CartesianRanges

4216ae5

Since `done` is called on each iteration but `start` is called only once, it makes more sense to put this logic in `start`.

timholy force-pushed the teh/eachindex branch from 4dec65e to 4216ae5 Compare April 19, 2015 18:13

timholy mentioned this pull request Apr 19, 2015

Add more documentation on array iteration #10902

Merged

timholy added a commit that referenced this pull request Apr 20, 2015

Merge pull request #10858 from JuliaLang/teh/eachindex

3dbc828

Change `for i=1:length(A)` to `for i in eachindex(A)`

timholy merged commit 3dbc828 into master Apr 20, 2015

timholy deleted the teh/eachindex branch April 20, 2015 00:16

timholy mentioned this pull request Apr 20, 2015

Multi-argument eachindex #10906

Closed

timholy referenced this pull request Apr 26, 2015

faster copy! between arrays of different types (ref #11004)

102e840

stevengj added a commit to stevengj/julia that referenced this pull request Aug 24, 2015

change for i in 1:length(a) to i in eachindex(a) (continuing JuliaLan…

5df7daa

…g#10858)

stevengj mentioned this pull request Aug 24, 2015

change for i in 1:length(a) to i in eachindex(a) (continuing #10858) #12788

Closed

stevengj added a commit to stevengj/julia that referenced this pull request Aug 24, 2015

change for i in 1:length(a) to i in eachindex(a) (continuing JuliaLan…

bc557e2

…g#10858)

azwphy mentioned this pull request May 19, 2024

fix for ... eachindex(...) QuanEstimation/QuanEstimation.jl#100

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change `for i=1:length(A)` to `for i in eachindex(A)` #10858

Change `for i=1:length(A)` to `for i in eachindex(A)` #10858

timholy commented Apr 17, 2015

pao commented Apr 17, 2015

lindahua commented Apr 17, 2015

timholy commented Apr 17, 2015

mbauman commented Apr 17, 2015

pao commented Apr 17, 2015

IainNZ commented Apr 17, 2015

milktrader commented Apr 17, 2015

pao commented Apr 17, 2015

timholy commented Apr 17, 2015

timholy commented Apr 17, 2015

mbauman commented Apr 17, 2015

timholy commented Apr 17, 2015

kmsquire commented Apr 17, 2015

timholy commented Apr 19, 2015

timholy commented Apr 20, 2015

tkelman commented Apr 20, 2015

timholy commented Apr 20, 2015

Change for i=1:length(A) to for i in eachindex(A) #10858

Change for i=1:length(A) to for i in eachindex(A) #10858

Conversation

timholy commented Apr 17, 2015

pao commented Apr 17, 2015

lindahua commented Apr 17, 2015

timholy commented Apr 17, 2015

mbauman commented Apr 17, 2015

pao commented Apr 17, 2015

IainNZ commented Apr 17, 2015

milktrader commented Apr 17, 2015

pao commented Apr 17, 2015

timholy commented Apr 17, 2015

timholy commented Apr 17, 2015

mbauman commented Apr 17, 2015

timholy commented Apr 17, 2015

kmsquire commented Apr 17, 2015

timholy commented Apr 19, 2015

timholy commented Apr 20, 2015

tkelman commented Apr 20, 2015

timholy commented Apr 20, 2015

Change `for i=1:length(A)` to `for i in eachindex(A)` #10858

Change `for i=1:length(A)` to `for i in eachindex(A)` #10858