RuntimeError when loading data using data_silo #1927
tholor pushed a commit that referenced this issue on Jan 4, 2022:
…y open file descriptors from multiprocessing (#1928)
* fix #1687
* fix RuntimeError: received 0 items of ancdata
* Add an arg multiprocessing_strategy to DataSilo and DPR.train()
* Add latest docstring and tutorial changes
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Describe the bug
Loading data with data_silo raises a RuntimeError. The error seems related to the multiprocessing sharing strategy, which opens many file descriptors. Raising the open-file limit (ulimit) on my machine to 2048 does not help, and I cannot raise it any further.
A possible solution is to raise the file descriptor limit further (see fastai/fastai#23).
Unfortunately, the hard limit on my machine is set to 2048.
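If the descriptor limit cannot be raised, one workaround that may be worth trying is switching PyTorch's tensor sharing strategy from file descriptors to the file system. The sketch below is only an illustration: the helper name `use_file_system_sharing` is hypothetical, and whether this resolves the error on this particular setup is an assumption, not something confirmed in this issue.

```python
# Minimal sketch, assuming PyTorch is installed; the helper name is hypothetical.
try:
    import torch.multiprocessing as mp
except ImportError:  # keep the sketch importable on machines without PyTorch
    mp = None


def use_file_system_sharing():
    """Switch tensor sharing to the 'file_system' strategy when available.

    Returns the active strategy name, or None if PyTorch is not installed.
    """
    if mp is None:
        return None
    # 'file_system' shares tensors through named shared-memory files instead
    # of passing file descriptors between worker processes.
    if "file_system" in mp.get_all_sharing_strategies():
        mp.set_sharing_strategy("file_system")
    return mp.get_sharing_strategy()


active_strategy = use_file_system_sharing()
print("sharing strategy:", active_strategy)
```

The call has to run before any worker processes are started, so it would typically sit at the top of the training script.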
See file descriptors (open files) limit on machine:
#>ulimit -n
Increase file descriptors:
#>ulimit -n 2048
See hard limits:
#>ulimit -H -a
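The same checks can be done from inside the Python process with the standard-library `resource` module (Unix only). This is a rough sketch, and the helper name `raise_open_file_limit` is hypothetical. It raises the soft limit as far as the hard limit allows, which on the machine described here would still be 2048.

```python
import resource


def raise_open_file_limit(target=None):
    """Raise the soft open-file limit toward the hard limit.

    Returns (old_soft, new_soft, hard). The limit is never lowered.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    unlimited = resource.RLIM_INFINITY

    # Nothing to raise if the soft limit is already unlimited.
    if soft == unlimited:
        return soft, soft, hard

    if hard == unlimited:
        # No hard cap: only change the limit when an explicit target is given.
        new_soft = soft if target is None else max(soft, target)
    else:
        wanted = hard if target is None else min(target, hard)
        new_soft = max(soft, wanted)

    if new_soft != soft:
        try:
            resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))
        except (ValueError, OSError):
            # The OS refused the new value; keep the old limit.
            new_soft = soft
    return soft, new_soft, hard


old_soft, new_soft, hard = raise_open_file_limit()
print(f"soft limit: {old_soft} -> {new_soft} (hard limit: {hard})")
```

A process can only raise its own soft limit up to the hard limit, so this does not get past a hard cap of 2048.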
Error message
Expected behavior
No Error
Additional context
Running on a machine with 88 CPUs
To Reproduce
FAQ Check
System: