[Git][ghc/ghc][master] x86 NCG: Better lowering for shuffleFloatX4# and shuffleDoubleX2#