I just started using SSE2 to optimize image processing, but I have no idea how to handle 3-channel, 24-bit color images. My pixel data is arranged as BGR BGR BGR ..., unsigned ch
I have slides on de-interleaving of 24-bit RGB pixels, which explain how to do it with SSE2 and SSSE3.
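To give a rough idea of what de-interleaving looks like in code, here is a minimal SSE2-only sketch. It is not taken from the slides; it is one known way to do it, based on repeating a "perfect shuffle" of 48 bytes four times. The names `shuffle48` and `deinterleave_bgr` are made up for this example, and it assumes tightly packed BGR bytes with no row padding. With SSSE3 the same job is usually done with fewer instructions using `_mm_shuffle_epi8`.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#if defined(__SSE2__) || defined(_M_X64)
#include <emmintrin.h> /* SSE2 intrinsics */
#define HAVE_SSE2 1
#endif

#ifdef HAVE_SSE2
/* Hypothetical helper: one perfect shuffle of the 48 bytes held in v0..v2.
 * The first 24 bytes are interleaved with the last 24 bytes, so the byte
 * at position p moves to position 2p mod 47 (position 47 stays put). */
static void shuffle48(__m128i *v0, __m128i *v1, __m128i *v2)
{
    __m128i a = *v0, b = *v1, c = *v2;
    *v0 = _mm_unpacklo_epi8(a, _mm_unpackhi_epi64(b, b));
    *v1 = _mm_unpacklo_epi8(_mm_unpackhi_epi64(a, a), c);
    *v2 = _mm_unpacklo_epi8(b, _mm_unpackhi_epi64(c, c));
}
#endif

/* Hypothetical function: split packed BGR pixels into three planes. */
void deinterleave_bgr(const uint8_t *src, size_t n_pixels,
                      uint8_t *b, uint8_t *g, uint8_t *r)
{
    size_t i = 0;
    assert(src && b && g && r);

#ifdef HAVE_SSE2
    /* 16 pixels (48 bytes) per iteration. After four shuffles the byte at
     * position p = 3k + c sits at 16p mod 47 = k + 16c, i.e. channel c of
     * pixel k ends up in byte k of register c. */
    for (; i + 16 <= n_pixels; i += 16) {
        const uint8_t *p = src + 3 * i;
        __m128i v0 = _mm_loadu_si128((const __m128i *)(p));
        __m128i v1 = _mm_loadu_si128((const __m128i *)(p + 16));
        __m128i v2 = _mm_loadu_si128((const __m128i *)(p + 32));
        for (int k = 0; k < 4; ++k)
            shuffle48(&v0, &v1, &v2);
        _mm_storeu_si128((__m128i *)(b + i), v0);
        _mm_storeu_si128((__m128i *)(g + i), v1);
        _mm_storeu_si128((__m128i *)(r + i), v2);
    }
#endif

    /* Scalar loop for the remaining pixels (or for everything when SSE2
     * is not available at compile time). */
    for (; i < n_pixels; ++i) {
        b[i] = src[3 * i + 0];
        g[i] = src[3 * i + 1];
        r[i] = src[3 * i + 2];
    }
}
```

Once the channels are in separate planes, each one can be processed with ordinary 16-bytes-at-a-time SSE2 code. Unaligned loads and stores are used here to keep the sketch simple; aligned buffers would allow `_mm_load_si128` / `_mm_store_si128` instead.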
Here are some answers to your question: