Fix #631 Allocation Of Memory On Jetson Boards#633
Conversation
…Boards SoC-like boards share the same memory for the device and host. Since we still do double buffering, we should only reserve *half* of the available "device" memory for particles.
|
Great job! 👍 I will try to test this on the jetson cluster here in Jena. |
|
fantastic, I already validated the bare function on our local jetson cluster (but not this pull yet). |
|
Test of with TBG_gpu_x=1
TBG_gpu_y=1
TBG_gpu_z=1
TBG_gridSize="-g 128 256 128"
TBG_steps="-s 1024"48MB for e- is not that much... they have nearly 2GB available in total, we space 450MB for cuda weirdness (random number states, overheads and similar). |
|
ok, looks better with init time: ~60sec works with P.S.: I recommend to uninstall the X server to get more memory for your sims :) |
|
on 3 Jetsons in Jena: with MPI rank 1 has only very limited memory in this run! This might be problematic. Should I rerun with verbose memory output? |
|
Wow still these improvements are awesome! Thank you for all the hard work!
|
|
@PrometheusPi do not use that much cells for the small GPUs, they only have 2GB to fit in the whole OS, processes and PIConGPU.
that is how it should look (for an empty cluster with minimal daemons running - well, we even have X11 on -.-). I used @sigkill thanks! :) |
|
@ax3l Yes I know that we have varying memory consumtion here.
Your example on ZIH/Jetson looks great. |
|
Yes, but the main thing is still: one GPU will use 2x more memory for the guards (the central one). with your setting that is quite relevant, too. Here are some performance tuning tips, especially for the network. P.S.: argh, now I want a PCI-E switch instead of an ethernet interconnect. Brrr. |
- uint8_t is well defined, char is not
Fix #631 Allocation Of Memory On Jetson Boards
SoC-like boards share the same memory for the device and host. Since we still do double buffering, we should only reserve half of the available "device" memory for particles.
Add
8forMEMORYtoPIC_VERBOSE, e.g.:Still needs run time testing:
Also:
MEMORYlogoutput about the selected choice