This repo seems to improve sdcpp performance by quite a bit. https://github.com/SealAILab/stable-diffusion-cpp