Followup to #173270: with suitable DemandedElts masks we can also handle the scalar intrinsics:

```
x86_sse_max_ss
x86_sse_min_ss
x86_sse2_max_sd
x86_sse2_min_sd
```

However, these will require a special expansion to 2 x `extractelement` + `maxnum`/`minnum` + `insertelement`, as only the lowest element is actually computed and the upper elements are passed through from the first operand: https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#text=_mm_max_ss&ig_expand=4391,4391
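
For illustration, a minimal scalar model of that expansion for `x86_sse_max_ss`, written as plain C++ rather than LLVM code. The names `V4F`, `maxSSExpanded`, `OperandDemand` and `demandedOperandElts` are hypothetical and exist only for this sketch; `std::fmax` stands in for `maxnum`, and NaN and signed-zero behaviour is not modelled.

```cpp
#include <array>
#include <cassert>
#include <cmath>

// Hypothetical stand-in for a <4 x float> value.
using V4F = std::array<float, 4>;

// Models the expansion: two extracts of lane 0, a scalar max, and an insert
// back into the first operand so that lanes 1..3 pass through unchanged.
// Assumption: std::fmax approximates maxnum; NaN/signed zero are ignored here.
V4F maxSSExpanded(const V4F &a, const V4F &b) {
  float a0 = a[0];              // extractelement a, 0
  float b0 = b[0];              // extractelement b, 0
  float m = std::fmax(a0, b0);  // scalar max on the lowest element only
  V4F r = a;                    // upper elements come from the first operand
  r[0] = m;                     // insertelement r, m, 0
  return r;
}

// Hypothetical demanded-elements model, one bit per lane (bit 0 = lane 0).
struct OperandDemand {
  unsigned lhs; // lanes demanded from the first operand
  unsigned rhs; // lanes demanded from the second operand
};

// The first operand feeds every demanded result lane; the second operand
// only ever contributes to lane 0.
OperandDemand demandedOperandElts(unsigned demandedResult) {
  return {demandedResult, demandedResult & 1u};
}

// Small usage check for the two helpers above.
void demo() {
  V4F r = maxSSExpanded(V4F{1.0f, 2.0f, 3.0f, 4.0f}, V4F{5.0f, 6.0f, 7.0f, 8.0f});
  assert(r[0] == 5.0f && r[1] == 2.0f && r[2] == 3.0f && r[3] == 4.0f);

  // If only the upper lanes are demanded, the second operand is not needed.
  OperandDemand d = demandedOperandElts(0xEu);
  assert(d.lhs == 0xEu && d.rhs == 0u);
}
```

The `min_ss` and the `sd` variants would follow the same shape with a scalar min and a two-lane vector respectively.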