This API provides the following types of memory routines:
barriers
special instruction intrinsics
cache operations
memcpy()-type routines
Note
memcpy()-like routines will typically always check for proper alignment and size increments of parameters using assert(), so make sure to build a release build (-DNDEBUG) for maximal gainz, when not debugging.