@atillack
Member

This PR increases the maximum number of runs from 1000 to 8192. Beyond changing the MAX_NUM_OF_RUNS define, the Cuda path now allocates GPU memory based on the actual number of runs and population size rather than their maximum values, as the OpenCL path already does, so the higher limit does not waste GPU memory.

This requires a little bit of additional testing on the Cuda side.
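
To illustrate why sizing by the actual settings matters once the limit is 8192, here is a minimal host-side sketch of the buffer-size arithmetic. It is not the code from this PR: apart from MAX_NUM_OF_RUNS = 8192, every name and value (MAX_POPSIZE, GENOTYPE_LENGTH, population_bytes, worst_case_bytes, actual_bytes) is a hypothetical chosen for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>

// Only MAX_NUM_OF_RUNS = 8192 comes from the PR description.
constexpr std::uint64_t MAX_NUM_OF_RUNS = 8192;
// Assumed values, for illustration only.
constexpr std::uint64_t MAX_POPSIZE = 2048;
constexpr std::uint64_t GENOTYPE_LENGTH = 64;  // floats per individual

// Bytes needed for one buffer holding num_runs * pop_size genotypes.
constexpr std::uint64_t population_bytes(std::uint64_t num_runs,
                                         std::uint64_t pop_size) {
    return num_runs * pop_size * GENOTYPE_LENGTH * sizeof(float);
}

// Size when allocating for the compile-time maxima.
constexpr std::uint64_t worst_case_bytes() {
    return population_bytes(MAX_NUM_OF_RUNS, MAX_POPSIZE);
}

// Size when allocating for the settings actually requested,
// rejecting values outside the compile-time limits.
inline std::uint64_t actual_bytes(std::uint64_t num_runs,
                                  std::uint64_t pop_size) {
    if (num_runs == 0 || num_runs > MAX_NUM_OF_RUNS ||
        pop_size == 0 || pop_size > MAX_POPSIZE) {
        throw std::invalid_argument("runs or population size out of range");
    }
    return population_bytes(num_runs, pop_size);
}
```

Under these assumed values, and with 4-byte floats, a single buffer sized for the maxima needs 4 GiB, while the same buffer sized for 100 runs with a population of 150 needs 3,840,000 bytes. The result of actual_bytes would be what gets passed to the device allocation call.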

@atillack atillack requested a review from althea-hansel August 18, 2023 16:37
@diogomart
Member

E50s for cuda version with 100 runs look good, approving.

[Image: 79f13c7-ocl-128wi_vs_PR233-406d169-cuda-128wi-overlap]

@atillack
Member Author

@diogomart Thank you very much! Merging.

@atillack atillack merged commit 5f73e1b into ccsb-scripps:develop Nov 16, 2023