I am using the NEON memory copy with preload implementation from the ARM website with the Windows Embedded Compact 7 ARM assembler on a Cortex-A8 processor.
I notice that I get datatype misalignment exceptions when I provide that function with non word aligned values
For example:
; NEON memory copy with preload ALIGN LEAF_ENTRY NEONCopyPLD PLD [r1, #0xC0] VLDM r1!,{d0-d7} ;datatype misalignment VSTM r0!,{d0-d7} SUBS r2,r2,#0x40 MOV R0, #0 MOV PC, LR ENTRY_END size_t size = /* arbitrary */; size_t offset = 1; char* src = new char[ size + offset ]; char* dst = new char[ size ]; NEONCopyPLD( dst, src + offset, size ); memcpy( dst, src + offset, size ); /* works perfectly */ Is this expected for the VLDM command? The article doesn't mention that this implementation is limited to word-aligned values. Is it fixable? If so, how?