b192beea52bbc083980b7944692eb4803a4b4b5d
Set affinity to the closest node of the current GPU. This reduces the latency to fetch kernel args since device would query the CPU cache of core which did the dispatch. This behavior is controlled with AMD_CPU_AFFINITY env var(disabled by default) Change-Id: I65afba62cb818ea25a311b88d1c0dd5c51330292
Opis
No description provided
Languages
C++
67.5%
C
20.6%
Python
6.6%
CMake
3.4%
Shell
0.6%
Pozostałe
1.1%