-
Notifications
You must be signed in to change notification settings - Fork 209
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
facing unknown errors while compiling exact similar code for parallelization on CPU #7
Comments
How big is your problem size? Parallelization requires setup time which is a constant overhead. SO what happens if you increase the amount of work by 10x? What is your code? |
Its a simple addition of N = 2^20 numbers. It is in the docs |
The bug tracker isn't a place for these questions, please open a Discourse thread with some more details (implementations of these functions, which GPU, etc) or drop by Slack if you have questions. |
Its resolved. |
Here, the time taken by parallel_add() is more than sequential_add(). I am not able to figure out why is this happening.
The text was updated successfully, but these errors were encountered: