The following should be done in x86 assembly (up to SSE4) language. Lets say I have a 128 bit XMM register:
xmm0: [d, c, b, a]
And I want another