I have read this article on data alignment: http://www.ibm.com/developerworks/library/pa-dalign/
The question that arises is: how do I find out the memory access granularity of a specific processor?
For example, an Intel Xeon. Unfortunately, Google doesn't help when I search for "memory access granularity Intel Xeon" or "memory alignment Intel Xeon".