I\'m working with a user-defined quantity of bits (I\'m holding a three-dimensional array of bits, so the size increases cubically - assume no less then 512 bits), and need
An alternative could be BitMagic although I'm not sure it works on any other architecture than x86(it's heavily optimized using SIMD).