Hello there,
Is it worth to align scan lines in a image so each row begins on 16-aligned memory? That is, round up the stride to the next multiple of 16 bytes?
I assume this might help a bit when processing the entire image, but the real question is: does IPP cares?
If yes, along the same line, is it worth to 32-align scan lines on CPUs that have a 256 bit vector unit, or 64-align for AVX 512 chips?
Thanks,
Axel