Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NullException with procs(::Int) when using --machinefile #11230

Closed
jaak-s opened this issue May 11, 2015 · 3 comments · Fixed by #11616
Closed

NullException with procs(::Int) when using --machinefile #11230

jaak-s opened this issue May 11, 2015 · 3 comments · Fixed by #11616
Labels
domain:parallelism Parallel or distributed computation

Comments

@jaak-s
Copy link
Contributor

jaak-s commented May 11, 2015

When starting Julia with --machinefile the command procs gives NullException:

jaak@node-1-14:~$ julia --machinefile test.machinefile 
...
  | | |_| | | | (_| |  |  Version 0.4.0-dev+4603 (2015-05-02 18:25 UTC)
 _/ |\__'_|_|_|\__'_|  |  Commit 803193e* (8 days old master)
|__/                   |  x86_64-linux-gnu

julia> workers()
2-element Array{Int64,1}:
 2
 3

julia> procs(2)
ERROR: NullException()
 in == at nullable.jl:42
 in anonymous at multi.jl:261
 in filter at ./array.jl:1321
 in procs at multi.jl:261

julia> procs(3)
ERROR: NullException()
 in == at nullable.jl:42
 in anonymous at multi.jl:261
 in filter at ./array.jl:1321
 in procs at multi.jl:261

julia> procs(1)
1-element Array{Int64,1}:
 1

No problem with general parallel commands:

julia> fetch(@spawnat 3 sum(rand(10,10)))
46.096993964605204

In my setup julia is started on node1 and the machine file is

node1
node2

The problem does not happen if julia is started with julia -p 2.

@jakebolewski
Copy link
Member

Hi @jaak-s, could you post the machinefile you are trying to use? Should be a simple fix.

@jakebolewski jakebolewski added the domain:parallelism Parallel or distributed computation label May 11, 2015
@jaak-s
Copy link
Contributor Author

jaak-s commented May 11, 2015

Machine file looks like this:

jaak@node1:~$ cat test.machinefile 
node1
node2

However, at the moment it seems any machine file will cause the same Exception, for example, I've tried:

node1
node1

or

node2

@jaak-s
Copy link
Contributor Author

jaak-s commented May 13, 2015

Btw, this bug also affects SharedArray when julia is started with machinefile

julia> SharedArray(Float64, 10, pids=[2])
ERROR: NullException()
 in == at nullable.jl:42
 in anonymous at multi.jl:261
 in filter at ./array.jl:1346
 in procs at multi.jl:261
 in SharedArray at sharedarray.jl:43
 in SharedArray at sharedarray.jl:113

amitmurthy added a commit that referenced this issue Jun 8, 2015
tkelman pushed a commit to tkelman/julia that referenced this issue Jun 8, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
domain:parallelism Parallel or distributed computation
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants