ea8f99702d
Device property MaxSharedMemoryPerMultiprocessor set equal to totalGlobalMem (HIP path). Reason: MaxSharedMemoryPerMultiprocessor should be as the same as group memory size. Group memory will not be paged out, so, the physical memory size = total shared memory size = group region size. NVCC path remains untouched: CUDA's device property MaxSharedMemoryPerMultiprocessor is reported. hipify is updated as well.