Create random numbers in bigger chunks #72

denisalevi · 2017-03-22T18:20:43Z

Before, we created random numbers every clock cycle, which created significant overhead. This implementation still needs to call one `cudaMemcpyToSymbol` per clock cycle and per codeobject using rand/randn, which could be avoided.

This implementation generically generates at most 50MB of random numbers per codeobject and regenerates them once they are used up. This way, for small simulations we only generate once, and for bigger simulations (where memory is a potential limit) we don't generate too many numbers at once.
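To make the chunking idea concrete, here is a minimal host-side sketch in Python. It is not the code from this PR: the class name `ChunkedRandomBuffer`, its parameters, the 4-byte item size, and the reading of "50MB" as 50 * 1024 * 1024 bytes are all assumptions made for illustration, and plain Python lists stand in for device memory.

```python
import random

# Assumed reading of the "50MB" cap; the PR does not say MB vs MiB.
DEFAULT_MAX_BYTES = 50 * 1024 * 1024


class ChunkedRandomBuffer:
    """Hypothetical model of one codeobject's random number buffer."""

    def __init__(self, per_step, total_steps, max_bytes=DEFAULT_MAX_BYTES,
                 itemsize=4, seed=0):
        self.per_step = per_step  # numbers consumed per clock cycle
        # Whole clock cycles that fit under the memory cap (at least one).
        steps_that_fit = max(1, max_bytes // (per_step * itemsize))
        # Small simulations: a single chunk covers the whole run.
        self.steps_per_chunk = min(total_steps, steps_that_fit)
        self.rng = random.Random(seed)
        self.generations = 0  # how often a chunk was generated
        self._refill()

    def _refill(self):
        # Generate one full chunk and reset the read position.
        n = self.steps_per_chunk * self.per_step
        self.chunk = [self.rng.random() for _ in range(n)]
        self.offset = 0
        self.generations += 1

    def next_step(self):
        # Regenerate only when the current chunk is used up.
        if self.offset == len(self.chunk):
            self._refill()
        start = self.offset
        self.offset += self.per_step
        return self.chunk[start:self.offset]
```

With `per_step=10` and `total_steps=100`, everything fits under the cap, so the buffer is generated once. With an artificially small cap such as `max_bytes=400`, only 10 clock cycles fit per chunk and the buffer is regenerated every 10 cycles. Advancing `offset` each cycle plays the role of the per-cycle pointer update mentioned above.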

This is a quick and dirty solution that might fail if we run into memory limits. But for our current benchmarks it's good enough, and since we should probably use a cleaner buffer system at some point, I will leave it like this for now.

Create random numbers in bigger chunks

0c7013d

denisalevi merged commit 7dd0dec into master Mar 22, 2017

denisalevi deleted the remove_rng_overhead branch March 22, 2017 18:23
