rocm-systems

Autor(a)	SHA1	Mensagem	Data
Oak Zeng	7593b41575	Unmap memory from GPU before free Change-Id: Ic33b17cbaee5de7908d37527254f4f146e6b71e3 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-11-01 08:58:55 -04:00
Yong Zhao	da9a404ac3	kfdtest: Add a function to wait indefinitely until user input This will facilitate the need of halting the execution during debugging. Change-Id: I058c81bbc9f655bbc477b2b542d6b43aa423324c Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-10-23 11:07:43 -04:00
Yong Zhao	ac5c433420	kfdtest: Add a Nop packet submission test for CP and SDMA queue The tests are useful to triage the fundamental queue submission functionality by excluding the packet format variable from the equation. Change-Id: I2c7fcda811f93bdefc1b62396233559416be44e7 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-10-22 13:42:33 -04:00
Yong Zhao	75c654cfea	kfdtest: Add SDMANopPacket class The class is very useful for triaging complex SDMA issues. Change-Id: Ib5de729f7fc62f41e894ef98d3967e7e1745d454 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-10-22 13:41:44 -04:00
Eric Huang	0174377351	kfdtest: add xgmi path for p2p tests When large bar is not available, we can use xgmi to do p2p tests. Change-Id: Ib7b59fb8a4d41f605739a0428973f6b2f1a3450f Signed-off-by: Eric Huang <JinhuiEric.Huang@amd.com>	2019-10-17 10:21:10 -04:00
Philip Cox	efe3769835	kfdtest: Check kfd debugger version in tests Need to check the kfd debugger version of the kernel before calling kfd debugger tests in kfdtest. If they are out of sync, the tests may fail. Change-Id: I1df5e89fb1199304e6fbe8973c60b76062514c03 Signed-off-by: Philip Cox <Philip.Cox@amd.com>	2019-10-10 14:32:54 -04:00
Philip Cox	b48f7d6ea3	kfdtest: Disable kfd debugger tests on gfx10 The KFD debugger is only supported on gfx9 platforms, so we need to restrict it from running on gfx10 platforms until it is supported. Change-Id: I500f0e20fda71021f2cce70a67fc8d9d042209fe Signed-off-by: Philip Cox <Philip.Cox@amd.com>	2019-10-10 12:50:12 -04:00
Oak Zeng	da789a2584	Fix memory map issue in KFDMemoryTest.CacheInvalidationOnRemoteWrite The memory need to be mapped for both local and remote GPU access Change-Id: I4aeaffc0851b6107fc91e9eaa6150764b06f5ca9 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-10-08 16:33:51 -05:00
Oak Zeng	d7c53bb1fa	Test new RW mtype for gfx908 Change-Id: Ia859c8f2e3c486f119772231a2d887f6783caf36 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-10-04 10:49:15 -04:00
Oak Zeng	51388973a1	Add gfx908 to asic family Change-Id: I838aa4be45ddfb34c5d36c519e28b4218fc32ba4 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-10-04 10:45:02 -04:00
Oak Zeng	63eb3948be	More parameter check in HsaMemoryBuffer constructor if parameter "zero" is set, check buffer host access. Change-Id: I9893062726fc240777405167a638cbea18fdf559 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-10-01 14:44:29 -04:00
Philip Yang	69d8f2d734	kfdtest: use flag NoNUMABind to allocate system memory Allocate system memory from node id 0 will fail on NUMA system which has no memory on node 0. Change to use new flag NoNUMABind to allocate system memory from NUMA nodes which have free memory. Change-Id: I8ef9ca28fc2ab5dd31d07a2d3eaf1d5886e798a0 Signed-off-by: Philip Yang <Philip.Yang@amd.com>	2019-09-16 12:25:01 -04:00
Yong Zhao	09d192802a	kfdtest: Remove useless test cases Those tests are actually did not function up to its expectation because some underlying functions such as suspend/resume and disable/enable KFD were not implemented. Those interfaces would never be implemented, so delete them. Change-Id: Ib5872ba2f35e307221e43791cda1782c6b6bb4d1 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-09-06 11:12:17 -04:00
Yong Zhao	9048fb5a1a	kfdtest: Implement SuspendAndWakeUp() Change-Id: I16384cd0ca35f15841344ded4d4237a0a862a934 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-09-06 11:11:49 -04:00
Felix Kuehling	01957d4da3	kfdtest: Use CoarseGrain for VRAM buffers Change-Id: I320099e61fc8f1012184fc68c248da4e224bc3ca Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-09-04 14:45:01 -04:00
Felix Kuehling	8f91d6a222	kfdtest: Enable more tests on gfx802 A number of tests are no longer broken on gfx802. Change-Id: If70c77423f8f14de59490ab8ca156b0c4e7b5cf1 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-08-30 19:06:24 -04:00
Jonathan Kim	1ff5cb33b2	add new queue bit test on clear event enable thunk query api to report if queue is newly created test new queue bit test on clear events. also fixup cleanup to disable debug trap. Change-Id: I3ebe2d85da66f28b8c82f0e68461ee7d32ec0b0d Signed-off-by: Jonathan Kim <Jonathan.Kim@amd.com> Reviewed-by: Philip Cox <Philip.Cox@amd.com> Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-08-23 11:25:32 -04:00
Jonathan Kim	c04ebb56b9	kfdtest: add query debug pending events test cases adding test cases to query debugger pending events Change-Id: I089754c508e476ce7b19e1cbd84235e4474b30c4 Signed-off-by: Jonathan Kim <Jonathan.Kim@amd.com> Reviewed-by: Kent Russell <Kent.Russell@amd.com>	2019-08-13 10:21:26 -04:00
Eric Huang	cdc10991a9	kfdtest: avoid TTM eviction in KFDMemoryTest.BigBufferStressTest Reserve half of dma32 zone for non-NUMA system. Change-Id: Id7aea7b6ff6cc1cc7983ecd95f8078b7f1be630c Signed-off-by: Eric Huang <JinhuiEric.Huang@amd.com>	2019-08-07 16:35:29 -04:00
Yong Zhao	3e4c42ef13	kfdtest: Improve the printing message for CuMasking tests Decimal is better than hex in this case. Change-Id: Ic15a9373e99160880b98d3dcd6827d551c87b77a Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-08-07 13:57:36 -04:00
shaoyunl	e65685df19	KFDTest: simply P2PBandwidthTest case for specified source/destination Only run the P2P test over the specified source and destination nodes if user already specify them Change-Id: Ia3c0195cead7f46e3e28507f3255d8c59a287ab8 Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-08-06 18:57:33 -04:00
shaoyunl	65ab296840	KFDTest: Set CS_W32_EN bit for wave32 workloads for GFX10 On GFX10, the wave size is determined by the COMPUTE_DISPATCH_INITIATOR value passed to DISPATCH_DIRECT.CS_W32_EN, default 0 value was giving 64 lane waves Change-Id: Ie8c407a24bd2825757ec481be62247b35047e5ca Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-08-06 18:57:14 -04:00
Yong Zhao	5ae5854302	kfdtest: Improve FlatScratchAccess by not hardcoding the value We should use the SE number reflected by NumShaderBanks of the node rather than hardcoding it. Change-Id: I945fb001f81ce506249cf485a7ce25aee8219bc7 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-08-01 23:10:28 -04:00
shaoyunl	78e754ca5b	KFDTest: Make shader compatiable for gfx9 and gfx10 Remove the CHIP name from the shader ISA and add wave_size(32) to make the same shader can be used for both GFX9 and GFX10 Change-Id: I16ea72f87980c3d9c11298e20c06a0a073fe9a28 Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-30 10:56:19 -04:00
shaoyunl	c0663be7e8	KFDTest: Enable KFDEvictTest.QueueTest for gfx1010 v_add_u32 was removed from gfx10, use carry-out explicit instruction v_add_co_u32 instead on both gfx9 and gfx10 Change-Id: I1fcd5956844457a676757ad13bdce7f5304bb34b Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-25 10:56:17 -04:00
shaoyunl	157f5b945f	KFDTest: Add CWSR test for gfx1010 Change-Id: I708b2fd5f06e0c7591ebe04c40576845cf2a891a Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-25 10:55:01 -04:00
shaoyunl	33bd5c520b	KFDTest: Test 8 SDMA queues per engine for gfx1010 Change-Id: I709112ab10b1f5684e6411f44682bdef14745b50 Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-25 10:54:44 -04:00
shaoyunl	85a9821519	KFDTest: Shader modification for gfx10 Modified shaders in KFD memory test to support gfx10. There is no gprs register for flat_scratch on gfx10. Use s_setreg_b32 instruction to set flat scratch base address register Change-Id: I505156a046056b61ce2d873343feb50ce635274a Signed-off-by: shaoyunl <shaoyun.liu@amd.com> Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-07-25 10:54:30 -04:00
Kent Russell	1ca1825b84	Remove KFDDBGTest.BasicAddressWatch This functionality doesn't work on GFX9+, and was disabled for gfx802. Remove the test altogether for now, especially since some kernel changes broke it on gfx803, and the functionality is deprecated now anyways. Leave the code for reference, but "#if 0" it to prevent it from compiling or being in the kfdtest binary Change-Id: I848b4f23201f18612cbdc122a5b46e4010c4af2a	2019-07-24 07:36:19 -04:00
shaoyunl	5b44be1907	KFDTest: Added gfx1010 SDMA fence packet support Change-Id: I33d824353d77317363b73ddc52cd182f86b8bc66 Signed-off-by: shaoyunl <shaoyun.liu@amd.com> Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-07-22 16:37:02 -04:00
shaoyunl	395750264d	KFDTest : Add family ID when building SDMA packet Some SDMA packet format might be different among asic versions Change-Id: Ic7eda7554c23e3972e168480874ca67a92677346 Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-22 16:36:49 -04:00
Yong Zhao	4baeef356f	kfdtest: Sumbit to SDMA ring when using libdrm command submission Because not all ASICs (like gfx908) have GFX rings, we should use SDMA rings instead of GFX rings. Change-Id: Ibcc9f9e555302ba4ce25ac76c2ca73b8c3962a58 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-07-19 16:58:22 -04:00
shaoyunl	b4e834ab61	KFDTest: remove the usage global g_TestGPUFamilyId Adjust the KFDTest for multi-gou support Change-Id: Ib3491e3f645d35fdba6ab702d65fcc86f48d3958 Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-19 13:26:49 -04:00
shaoyunl	e9882daf11	KFDTest : Add gfx1xxx release_mem and acquire_mem packet support use family ID as parameter when construct the packets Change-Id: I6c1706954ab7b8cbb8bef2aab16edf21f5e1abf0 Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-18 10:43:48 -04:00
Felix Kuehling	4e9ff4393d	kfdtest: MMBench: Test a more useful range of buffer sizes Currently the test only covers relatively small buffers sizes. It's useful to test buffer sizes up to 1GB to see the impact of features that target the efficiency of large buffer allocations and mappings. Change-Id: I2e8d5afd482894dbe2166f32d38091199b9c15e6 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-07-11 17:15:39 -04:00
Felix Kuehling	5475e618e5	kfdtest: Add multi-process oversubscription test This test is designed to reproduce soft-hangs cause by HWS running with oversubscription. Change-Id: I49861522b3ff5ba50df5ddc968545c35ccb25353 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-07-10 17:05:57 -04:00
Felix Kuehling	6704b051d2	kfdtest: Factor out multi-process test into a base class Create KFDMultiProcessTest base class for tests forking multiple child processes. Derive KFDEvictTest from that class. Change-Id: Ie5f3362c45be2b807bf7a83839ab3820352a67f9 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-07-10 17:05:57 -04:00
shaoyunl	c6ed539b44	Add gfx IsaGenerator Change-Id: I93ccb889b4bb7f0f5921a90cebbc0550d1eb3f7d Signed-off-by: Oak Zeng <Oak.Zeng@amd.com> Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-09 11:39:38 -04:00
shaoyunl	6cad92de6f	Added family ID for gfx1010 Change-Id: I1b9a2b5270e70d12f066906f4e6cfea2cbfc2110 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com> Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2019-07-09 11:38:57 -04:00
Oak Zeng	3b014adccc	Device HDP flush test Change-Id: I1c19e44caeee4a6e59200dceb718896fcff9bf82 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-07-07 21:59:37 -04:00
Yong Zhao	b507911ccd	kfdtest: Use SDMA engine information directly from the node Change-Id: Icd391c8e821fb0ff5a1094f21b880a97e6d417a3 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2019-06-28 00:47:15 -04:00
Oak Zeng	5d163cd821	Fix HostHdpFlush shader 1. Use s_mov_b32 to move 0xcafe to s18. s_movk_i32 is a sign extention move instruction. Oxcafe will be extended to 0xffffcafe which is not desired 2. Add wait to s_load_dword instruction to make sure memory read finish before the next store instruction. Change-Id: I665d1d471019edfaba5693e07cdc567d4103573f Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-06-21 11:51:51 -04:00
Philip Yang	4066dcd542	kfdtest: increase BigBufStressTest timeout and avoid VM fault If TTM eviction and restore happens, it may takes very long time if retry, the longest time is 5 minutes during my test. There is chance packet is submited to queue while eviction, we have to increase the Wait4PacketConsumption timeout. The queue will continue to execute after eviction and restore. If we upmap the memory from GPU while queue is evicted, this will cause VM fault. Change to unmap memory after queue is destroyed. Change-Id: I1b44e2274ea7b83398b2e3293578dad6947cb5af Signed-off-by: Philip Yang <Philip.Yang@amd.com>	2019-06-18 09:28:43 -04:00
Philip Yang	36776e9917	kfdtest: avoid BigBufStressTest run on NUMA node 0 Because dma32 zone is on node 0, use all system memory on node 0 will cause TTM eviction to free dma32 zone for other devices which only work with 32bit physical address. The TTM eviction and restore may take too long and cause queue timeout. Running on other NUMA nodes, the NUMA default memory policy is MPOL_PREFERRED, means TTM will get pages from local node first, and then get remaining pages from other nodes. Check /proc/buddyinfo can confirm this. Reset NUMA bind to all after the test. Change-Id: I39b373c07a2d5aa396f5c7602bffabab0481930f Signed-off-by: Philip Yang <Philip.Yang@amd.com>	2019-06-18 09:28:20 -04:00
Felix Kuehling	396a85e97b	kfdtest: Allocate PM4 queue and dispatch earlier KFDEvictTest.QueueTest Allocating these before the big memory allocations minimizes the chances of spurious out of memory errors. Change-Id: I94aff9ec7ea34d4dc98ae08ac4cf9dc335b3df7f Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-06-07 16:54:28 -04:00
Felix Kuehling	f474cf21cd	kfdtest: Reduce libdrm VRAM usage in eviction tests This reduces thrashing due to graphics submissions only and significantly speeds up the BasicTest when keeping idle compute processes evicted. In the BasicTest compute is always idle, so only one compute eviction and no restore is triggered. Then graphics submissions complete quickly without thrashing each other. Change-Id: Iae6da98903b20424a5097f235e1d09cf13e4b41b Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-06-07 16:54:28 -04:00
Felix Kuehling	6984f3e3b4	kfdtest: Add KFDEvictionTest.BurstyTest Change-Id: I748603b0b204ffc3ea33399ecbc022233a7447d3 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-06-07 16:54:28 -04:00
Felix Kuehling	6f5379d315	kfdtest: Pass timeout parameter to BaseQueue::Wait4PacketConsumption Change-Id: I0e88db5ca8e6712e9efc419a10eb4c49cedb6f62 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2019-06-07 16:54:28 -04:00
Eric Huang	47d1c17592	kfdtest: fix error injection failure in RAS test 1. umc error injection only accepts parameter "0 0". 2. flush output to file in order to make writing happen immediately. Change-Id: I8d3bde287caee6b90b6eec56c760f5a228be7595 Signed-off-by: Eric Huang <JinhuiEric.Huang@amd.com>	2019-05-30 16:38:15 -04:00
Eric Huang	d278b2579e	kfdtest: fix debugfs path bug in RAS test The path was wrong based on assumption that GPU dri render node starts from 0, because if there is a VGA device on board, node 0 will be VGA and node 1 will be GPU. So the fix will look at the name of GPU minor node and find the correct primary node on which RAS debugfs entry exists. Change-Id: Icc5e63ce48698d5d29105c0417e3bec8afa0a7c8 Signed-off-by: Eric Huang <JinhuiEric.Huang@amd.com>	2019-05-29 11:14:22 -04:00

1 2

100 Cometimentos