To use more than 64 threads on Windows, you have to use Process Groups:
https://docs.microsoft.com/en-us/windows/win32/procthread/processor-groups?redirectedfrom=MSDN
Each spawned thread must be assigned to a process group. I don't know if it makes sense to handle this in rayon or libstd or what.