For a kernel_t, we can obtain the maximum active blocks per SM; let's add a config builder method which uses this value to set the grid dims.