Enable cache flush after every -F iteration. Default : 0 (No cache flush)
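
The option's cadence can be sketched as follows. This is a minimal illustration, assuming the value passed to `-F` is an interval N and a flush is issued after every N-th iteration; `flush_due` and `flush_points` are hypothetical names and are not part of this commit, which only shows the `enable_cache_flush` variable and the `flush_icache` kernel.

```cpp
#include <vector>

// Hypothetical helper (not from the commit): reports whether a flush is due
// after finishing the 0-based iteration `iter`, given the value passed to -F.
// Assumption: -F is an interval, and 0 (the documented default) disables flushing.
bool flush_due(int iter, int flush_every) {
    if (flush_every <= 0) return false;    // default 0: never flush
    return (iter + 1) % flush_every == 0;  // flush after every flush_every-th iteration
}

// Hypothetical driver: collects the 0-based iterations after which a flush
// would be issued over a run of `iterations` iterations.
std::vector<int> flush_points(int iterations, int flush_every) {
    std::vector<int> points;
    for (int iter = 0; iter < iterations; ++iter) {
        // ... the timed collective would run here ...
        if (flush_due(iter, flush_every)) points.push_back(iter);
    }
    return points;
}
```

Under these assumptions, `flush_points(6, 2)` yields the iterations 1, 3, and 5, and `flush_points(6, 0)` yields none, which matches the documented default of no cache flush.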

saurabhAMD
2024-05-07 11:32:30 -05:00
parent 3c0728e8eb
commit 699478dadf
2 changed files with 1 addition and 1 deletion
+1
@@ -91,6 +91,7 @@ All tests support the same set of arguments :
* `-c,--check <check iteration count>` perform count iterations, checking correctness of results on each iteration. This can be quite slow on large numbers of GPUs. Default : 1.
* `-z,--blocking <0/1>` Make NCCL collective blocking, i.e. have CPUs wait and sync after each collective. Default : 0.
* `-G,--cudagraph <num graph launches>` Capture iterations as a CUDA graph and then replay specified number of times. Default : 0.
+* `-F,--cache_flush <cache flush after every -F iteration>` Enable cache flush after every -F iteration. Default : 0 (No cache flush).
## Unit tests
-1
@@ -123,7 +123,6 @@ static int enable_cache_flush = 0;
extern "C" __global__ void flush_icache()
{
-printf("flush_icache called \n");
asm __volatile__("s_icache_inv \n\t"
"s_nop 0 \n\t"
"s_nop 0 \n\t"