Implementing HERMES (fast ring packing) on GPU would enable LWE ciphertexts to be packed into a CKKS or BGV/BFV ciphertext to benefit from SIMD acceleration. It's currently quite slow on CPU (~10s), but would most probably benefit a lot from GPU acceleration.
Given the really competitive CKKS bit & gate bootstrapping results from this library, LWE->RLWE ring packing would enable batch bootstrapping of AP/GINX style ciphertexts.
Implementing HERMES (fast ring packing) on GPU would enable LWE ciphertexts to be packed into a CKKS or BGV/BFV ciphertext to benefit from SIMD acceleration. It's currently quite slow on CPU (~10s), but would most probably benefit a lot from GPU acceleration.
Given the really competitive CKKS bit & gate bootstrapping results from this library, LWE->RLWE ring packing would enable batch bootstrapping of AP/GINX style ciphertexts.