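Given the following low-level (SASS) instructions from the two most recent generations of NVIDIA GPUs (reference: http://docs.nvidia.com/cuda/cuda-binary-utilities/index.html), what differences in hardware/memory hierarchy design, possibly speculative, do they suggest, and what are the performance implications?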
Surface Memory Instructions (MAXWELL)
SUATOM Surface Reduction
SULD Surface Load
SURED Atomic Reduction on surface memory
SUST Surface Store
Surface Memory Instructions (KEPLER)
SUCLAMP Surface Clamp
SUBFM Surface Bit Field Merge
SUEAU Surface Effective Address
SULDGA Surface Load Generic Address
SUSTGA Surface Store Generic Address