rocm-systems

Author	SHA1	Message	Date
Oak Zeng	1923d2e335	Revert "Create SDMA queue on specific engine" This reverts commit `acb80d7583`. Change-Id: Ia3e9db5fcba1fef80745c72c78b7c568b5c7315e Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-01-21 10:37:32 -06:00
Oak Zeng	742fa5d871	Revert "Add test to allocate SDMA queue on specific engine" This reverts commit `af5b320c47`. Change-Id: I262d91afc60ba2618bf4a857f162ea5236d54131 Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2019-01-21 10:36:54 -06:00
Philip Yang	b2e026fce3	kfdtest: increase KFDPerformanceTest.P2PBandWidthTest timeout value KFDPerformanceTest.P2PBandWidthTest[push, push] takes about 3 seconds on 4 gfx906, the default g_TestTimeout 2 seconds is not enough to wait for sDMA queue rptr is consumed. Use kfdtest command line option --timeout=6000, the test is finished and result is reasonable twice as P2PBandWidthTest[push, none]. Change P2PBandWidthTest wait timeout to 6 seconds. Add timeout argument to function WaitOnValue, BaseQueue.Wait4PacketConsumption SDMAQueue.Wait4PacketConsumption, PM4Queue.Wait4PacketConsumption with default value is g_TestTimeOut. Change-Id: I0aa04d644339feaeea695e41647ae66568beab9e Signed-off-by: Philip Yang <Philip.Yang@amd.com>	2019-01-04 12:53:55 -05:00
Yong Zhao	81b8815e1a	Add -fPIC flag when building sp3 library This will support the sp3 library built on one gcc version to be compatible with another gcc version. Change-Id: If67714bd63376dc781c56ed025be335fe54b2ba5 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2018-12-13 18:32:23 -05:00
Kent Russell	bcc348e3b9	kfdtest: Add gfx900/gfx906 IDs to run_kfdtest.sh Change-Id: Ib6ee418a432d1de79e2306b54d702132de3d06c5	2018-12-12 08:38:01 -05:00
Kent Russell	54807526b9	Add more SDMA-related tests to SDMA_BLACKLIST These tests all make use of an SDMAQueue in one way or another, so add them to the SDMA_BLACKLIST to be 100% certain Change-Id: Ic29e073c2f46249f3e5918145b13d276aec7bb33	2018-12-06 14:07:50 -05:00
Kent Russell	aa7c13264a	Add ZeroInitializationVram test to SDMA blacklist This test uses SDMA, so add it to the SDMA list Change-Id: I2dc2b0c4328e38e593d455de2103ebe1ef0adbc2	2018-12-06 11:14:26 -05:00
Kent Russell	3a2ec0111e	Temporarily remove SDMA tests from gfx906 SDMA is being flaky, so remove SDMA tests from it for now Change-Id: Ia3612566813f925804ab90d6235520da7cc65926	2018-12-05 08:41:16 -05:00
Kent Russell	381dba3932	Remove SDMAConcurrentCopies from gfx906 execution This is intermittently causing VM faults and excessive evictions, which causes the rest of the tests to fail. Take it out for now until someone can investigate Change-Id: I9c43890bc9f03a4a31efbc18df0df5e40a232c58	2018-11-28 10:01:35 -05:00
changzhu	c15cf2e9c3	kfdtest: fix SDMACopyParams build error on redhat 7.2 in KFDTestUtilQueue.cpp In file included from /usr/include/c++/4.8.2/algorithm:62:0, from /home/jenkins/libhsakmt/tests/kfdtest/src/KFDTestUtilQueue.cpp:24: /usr/include/c++/4.8.2/bits/stl_algo.h: In instantiation of ‘_RandomAccessIterator std::__unguarded_partition(_RandomAccessIterator, _RandomAccessIterator, const _Tp&, _Compare) [with _RandomAccessIterator = __gnu_cxx::__normal_iterator<SDMACopyParams*, std::vector<SDMACopyParams> >; _Tp = SDMACopyParams; _Compare = bool (*)(SDMACopyParams&, SDMACopyParams&)]’: /usr/include/c++/4.8.2/bits/stl_algo.h:2296:78: required from ‘_RandomAccessIterator std::__unguarded_partition_pivot(_RandomAccessIterator, _RandomAccessIterator, _Compare) [with _RandomAccessIterator = __gnu_cxx::__normal_iterator<SDMACopyParams*, std::vector<SDMACopyParams> >; _Compare = bool (*)(SDMACopyParams&, SDMACopyParams&)]’ /usr/include/c++/4.8.2/bits/stl_algo.h:2337:62: required from ‘void std::__introsort_loop(_RandomAccessIterator, _RandomAccessIterator, _Size, _Compare) [with _RandomAccessIterator = __gnu_cxx::__normal_iterator<SDMACopyParams*, std::vector<SDMACopyParams> >; _Size = long int; _Compare = bool (*)(SDMACopyParams&, SDMACopyParams&)]’ /usr/include/c++/4.8.2/bits/stl_algo.h:5499:44: required from ‘void std::sort(_RAIter, _RAIter, _Compare) [with _RAIter = __gnu_cxx::__normal_iterator<SDMACopyParams*, std::vector<SDMACopyParams> >; _Compare = bool (*)(SDMACopyParams&, SDMACopyParams&)]’ /home/jenkins/libhsakmt/tests/kfdtest/src/KFDTestUtilQueue.cpp:351:66: required from here /usr/include/c++/4.8.2/bits/stl_algo.h:2263:35: error: invalid initialization of reference of type ‘SDMACopyParams&’ from expression of type ‘const SDMACopyParams’ while (__comp(*__first, __pivot)) ^ /usr/include/c++/4.8.2/bits/stl_algo.h:2266:34: error: invalid initialization of reference of type ‘SDMACopyParams&’ from expression of type ‘const SDMACopyParams’ while (__comp(__pivot, *__last)) ^ Change-Id: I0fce0c7e6d0a0ce93b1e6522ee8f216615765568 Signed-off-by: changzhu <Changfeng.Zhu@amd.com>	2018-11-21 17:23:03 +08:00
Oak Zeng	af5b320c47	Add test to allocate SDMA queue on specific engine Change-Id: I5b5140e4119fc01db250d63cca7389cf80ec0d16 Signed-off-by: Oak Zeng <ozeng@amd.com>	2018-11-20 11:17:43 -05:00
shaoyunl	d8009b4fd3	KFDTest: fix failure when run KFDTest on multi-GPU small bar system On small bar multi-gpu system, hsaKmtMemoryMapToGPU will fail due to latest kernel P2P sanity check. Switch to use hsaKmtMemoryMapToGPUNodes to fix the failure Change-Id: Id8b6329d1243df0e908cc9a171b5c7f9156f4a8b Signed-off-by: shaoyunl <shaoyun.liu@amd.com>	2018-11-19 16:09:31 -05:00
Oak Zeng	acb80d7583	Create SDMA queue on specific engine Change-Id: Iece03795510d66b03324174203faa0ac9eb4fb7d Signed-off-by: Oak Zeng <ozeng@amd.com>	2018-11-13 14:52:57 -05:00
Oak Zeng	8d65e72045	Move m_Type to a local variable BaseQueue class has a member function GetQueueType so m_Type is duplicated. m_Type is only used in one function. Move it to a local variable. Change-Id: Ice144cf723178dd628cb49261c23d10605f9ee7d Signed-off-by: Oak Zeng <Oak.Zeng@amd.com>	2018-11-13 14:52:17 -05:00
Mike Li	3afce42b57	Changed scripts to include running kfdtest in docker container Change-Id: I822ff4869610df6abad846542d7c290b7a5aae79	2018-11-07 16:09:12 -05:00
xinhui pan	7a13bb4d66	kfdtest: blacklist KFDQMTest.SdmaEventInterrupt On gfx900+, the test sometimes timeout due to cp fw bug. Blacklist it until we address the root cause and have a fix. Change-Id: Iff600a6f6dbd86c56e034f530484205520bced32 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-10-19 15:29:54 -04:00
xinhui pan	ab4610cff7	kfdtest: Add more debug information of sdma event interrupt test We observe this test fails on gfx900+. Looks like the sdma packets are not executed at all after we submit sometimes. Run it with timeout 2s on gfx900. [ RUN ] KFDQMTest.SdmaEventInterrupt [----------] SDMACopyData FAIL! 1485262707170 VS 1485262747814 [----------] Event On Queue 1:0 Timeout, try to resubmit packets! [----------] The timeout event is signaled! [ ] Time Consumption (ns) [ ] 1: 1859427148 [ ] 2: 680148 [ ] 3: 6370 [ ] 4: 5481 /home/pp/code/compute/libhsakmt/tests/kfdtest/src/KFDQMTest.cpp:1670: Failure Value of: (ret) Actual: 31 Expected: HSAKMT_STATUS_SUCCESS Which is: 0 [----------] SDMACopyData FAIL! 1485367669958 VS 1485367750022 [----------] Event On Queue 2:1 Timeout, try to resubmit packets! [----------] The timeout event is signaled! [ ] Time Consumption (ns) [ ] 1: 1881615148 [ ] 2: 673629 [ ] 3: 6074 [ ] 4: 5481 /home/pp/code/compute/libhsakmt/tests/kfdtest/src/KFDQMTest.cpp:1670: Failure Value of: (ret) Actual: 31 Expected: HSAKMT_STATUS_SUCCESS Which is: 0 [----------] SDMACopyData FAIL! 1485427671250 VS 1485427751238 [----------] Event On Queue 2:1 Timeout, try to resubmit packets! [----------] The timeout event is signaled! [ ] Time Consumption (ns) [ ] 1: 1881508777 [ ] 2: 741629 [ ] 3: 6074 [ ] 4: 5481 /home/pp/code/compute/libhsakmt/tests/kfdtest/src/KFDQMTest.cpp:1670: Failure Value of: (ret) Actual: 31 Expected: HSAKMT_STATUS_SUCCESS Which is: 0 [ FAILED ] KFDQMTest.SdmaEventInterrupt (23675 ms) Change-Id: I7c1b752537d89782570df20838bf976578614f75 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-10-19 15:29:54 -04:00
Yong Zhao	d7e6d4706c	kfdtest: Clean up the indentations in PM4ReleaseMemoryPacket::InitPacket() Change-Id: I7f6b08697f6a68bf8c4a388c9f1cf3c3c8e6c81f Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2018-10-17 14:28:15 -04:00
Yong Zhao	77bab8596f	kfdtest: Improve the SignalEvent test Create an extra event so that the event id to test is non zero. That way we can be sure the context id received in kernel ISR is non zero, which is different from the default value 0 when context id is not set at all. Change-Id: I7e261d1bbb783d5afd15558c7ac00493b1218cef Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2018-10-17 14:27:54 -04:00
Gang Ba	52ec7f805e	drm/amdkfd: Added gfx904 and gfx803 for KFD. Change-Id: I4406dc70c776926feaecca3f2146d65259a80517 Signed-off-by: Gang Ba <gaba@amd.com>	2018-09-25 08:17:44 -04:00
Mike Li	c3b47c0959	kfdtest: Handle GPU resource management Currently the FindDRMRenderNode function will access the sysfs directly to find the render node. It doesn't work with the GPU management changes. Have changed code to call hsaKmtGetNodeProperties instead. Change-Id: I3bb537a323bc1e8c49f38d8aabc60c13e268aecd Signed-off-by: Mike Li <Tianxinmike.Li@amd.com>	2018-09-24 11:38:11 -04:00
xinhui pan	918a45a430	kfdtest: add P2POverheadTest This is to measure the latency + overhead of sdma packet consumption on p2p. It is similar to the QueueLatency test. What's more, the queue's overhead with different workload show more details. test result on two gfx900. [ RUN ] KFDPerformanceTest.P2POverheadTest [ ] Test (avg. ns) \| Size 4 8 16 64 256 1024 [ ] ----------------------------------------------------------------------- [ ] [push] [1 -> 0] 333 148 185 111 148 148 [ ] [push] [1 -> 1] 370 222 333 74 148 111 [ ] [push] [1 -> 2] 333 148 148 148 148 148 [ ] [push] [2 -> 0] 111 333 259 148 148 148 [ ] [push] [2 -> 1] 222 148 185 148 148 148 [ ] [push] [2 -> 2] 222 111 370 111 74 148 [ ] [pull] [1 -> 0] 370 296 296 148 185 148 [ ] [pull] [1 -> 1] 185 333 222 148 222 148 [ ] [pull] [1 -> 2] 222 444 259 148 185 111 [ ] [pull] [2 -> 0] 148 148 148 148 148 148 [ ] [pull] [2 -> 1] 148 148 148 148 148 148 [ ] [pull] [2 -> 2] 185 148 148 74 222 296 [ ] [push\|pull][1 -> 0] 1259 1222 1259 1074 1037 962 [ ] [push\|pull][1 -> 1] 1037 1037 1037 740 740 1000 [ ] [push\|pull][1 -> 2] 1259 1259 1296 1037 1000 1074 [ ] [push\|pull][2 -> 0] 1037 1037 1037 1074 1037 1148 [ ] [push\|pull][2 -> 1] 1037 1037 1037 1037 925 1074 [ ] [push\|pull][2 -> 2] 666 666 740 740 703 925 [ OK ] KFDPerformanceTest.P2POverheadTest (459 ms) Change-Id: I422263cb52f7ce184f6f1ff4466d04c239fbe9c9 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-09-24 09:28:00 -04:00
xinhui pan	e5a541eaf2	kfdtest: Add P2P bandwidth test The test measures the bandwidth between GPUs. Currently we do not care numa topology as some products really support across PCI-e root complex p2p. test result on two gfx900 system. [ RUN ] KFDPerformanceTest.P2PBandWidthTest [ ] Copy from node to node by [push, NONE] [ ] [1 -> 0] 6.13477 - 6.12695 GB/s [ ] [1 -> 2] 3.77734 - 3.76855 GB/s [ ] [2 -> 0] 6.67676 - 6.6543 GB/s [ ] [2 -> 1] 6.14453 - 6.12793 GB/s [ ] Copy from node to node by [pull, NONE] [ ] [1 -> 0] 6.10547 - 6.08105 GB/s [ ] [1 -> 2] 9.65527 - 9.65039 GB/s [ ] [2 -> 0] 6.49805 - 6.4873 GB/s [ ] [2 -> 1] 8.95508 - 8.85254 GB/s [ ] Full duplex copy from node to node by [push\|pull, NONE] [ ] [1 -> 0] 11.0986 - 11.0986 GB/s [ ] [1 -> 2] 7.54297 - 7.54297 GB/s [ ] [2 -> 0] 12.0264 - 11.9639 GB/s [ ] [2 -> 1] 12.0469 - 12.0371 GB/s [ ] Full duplex copy from node to node by [push, push] [ ] [1 <-> 2] 11.7324 - 11.4541 GB/s [ ] Full duplex copy from node to node by [pull, pull] [ ] [1 <-> 2] 11.4824 - 11.0508 GB/s [ ] Copy from node to multiple nodes by [push, NONE] [ ] [1 -> [0...2]] 5.625 - 5.73633 GB/s [ ] [2 -> [0...2]] 6.45801 - 6.4707 GB/s [ ] Copy from multiple nodes to node by [push, NONE] [ ] [[1...2] -> 0] 12.8379 - 12.2578 GB/s Now we can get more timestamp info like below. 
Copy from node to node by [push, NONE] [1 -> 0] [1 : 0] #-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-##-#-#-#-#-############################### [1 : 1] #################################################################################################### [1 -> 2] [1 : 0] #--#-#-#-#-#--#-#-#-#-#--#-#-#-#-#--#-#-#-#-#-#--#-#-#-#-#--#-#-#-#-#--#-#-#-#-#-#--#-#-#-#-#--#-#-###################################### [1 : 1] ##################################################################################################-# [2 -> 0] [2 : 0] ##-###-##-###-###-##-###-##-###-###-##-###-###-##-###-###-##-###-##-###-###-##-###-###-##-###-###-################# [2 : 1] ###############################################################################-#############-###-## [2 -> 1] [2 : 0] ##-##-##-##-##-###-##-##-##-##-##-###-##-##-##-##-###-##-##-##-##-##-###-##-##-##-##-###-##-##-##-#################### [2 : 1] ################################################################################-###-############-## [snip] Full duplex copy from node to node by [push, push] [1 <-> 2] [1 : 0] #-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-#-#-#-#################################### [1 : 1] ################-###################################################-############-####-############# [2 : 2] #-##-##-##-#-##-##-##-##-#-##-##-##-##-#-##-##-##-##-#-##-##-##-##-#-##-##-##-##-##-#-##-##-##-##-#-##-##-##-##-##-#-################## [2 : 3] #####-######-#####-######-#####-######-#####-######-#####-######-#####-######-#####-######-#####-######-#####-#####-## Full duplex copy from node to node by [pull, pull] [1 <-> 2] [1 : 0] ######################################################################-##-#-###############-####-### [1 : 1] #-#-#-##-#-#-##-#-#-##-#-#-##-#-#-##-#-#-##-#-#-##-#-#-#-##-#-#-##-#-#-##-#-#-##-#-#-##-#-#-##-#-#-############################ [2 : 2] 
##-##-##-##-###-##-##-##-##-###-##-##-##-###-##-##-##-##-###-##-##-##-###-##-##-##-##-###-##-##-##-##-###-##-##-##-###-##-##-############ [2 : 3] #-#-#-#-#-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#########-############# Copy from node to multiple nodes by [push, NONE] [1 -> [0...2]] [1 : 0] #-#--#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-#-############################### [1 : 1] ########################################################################################-###-###-### [2 -> [0...2]] [2 : 0] ##-##-##-###-##-###-##-##-###-##-###-##-##-###-##-###-##-###-##-##-###-##-###-##-##-###-##-###-##-################## [2 : 1] -################################################################################################-## Copy from multiple nodes to node by [push, NONE] [[1...2] -> 0] [1 : 0] #-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-##-#-#-#-#-#-#-#-#-##-#-#-#-############################### [1 : 1] ################################################################################################-#-# [2 : 2] ##-##-##-###-##-##-###-##-##-##-###-##-##-###-##-##-###-##-##-###-##-##-##-###-##-##-###-##-##-###-##-################## [2 : 3] #########################-#########################-#########################-######################### [ OK ] KFDPerformanceTest.P2PBandWidthTest (15982 ms) Change-Id: Ia90044191d51650ccb220476d31fb317aa3ad6ce Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-09-19 12:03:05 +08:00
xinhui pan	f618b3f075	kfdtest: add KFDTestUtilQueue Some infrastructures below, Implement SdmaTimePacket which records the global GPU timestamp. Introduce class AsyncMPSQ and AsyncMPMQ. AsyncMPSQ is aka async multiple packet single queue. It takes a set of packet when create and submits them to a GPU to run. While AsyncMPMQ is aka async multiple packet multiple queue. It manages a set of AsyncMPSQ, and use a forloop to do operations of AsyncMPSQ. Implement sdma_multicopy helper functions. Change-Id: I47e1d2ca9630113b2a1d85a0055f3f8ee629fb5f Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-09-19 12:03:05 +08:00
Xiaojie Yuan	247fa9f1e0	Use 'RecordProperty' to record performance scores For following test cases: - KFDQMTest.QueueLatency - KFDQMTest.BasicCuMaskingLinear - KFDQMTest.BasicCuMaskingEven - KFDMemoryTest.MMBandWidth - KFDMemoryTest.MMapLarge - KFDMemoryTest.MMBench v2: xml element cannot start with a number, so change the key name of MMBandWidth and MMBench accordingly xml element cannot contain whitespaces, so trim whitespaces in "VRAM " v3: introduce KFDLog-like way to use KFDRecord Change-Id: Ifc3ed5657621252a7b39dccf1ef4f50a92593f77 Signed-off-by: Xiaojie Yuan <xiaojie.yuan@amd.com>	2018-09-18 17:41:14 +08:00
xinhui pan	a6287ba919	kfdtest: Do not set GTEST_FLAG throw_on_failure This change is from commit 62f7dc2a("kfdtest: Do not set GTEST_FLAG throw_on_failure"). But it was unexpectedly reverted by commit 414042ab("kfdtest: Clean up comments"). So add this change back. Fix: `414042ab` Change-Id: Ia9e99c9ca17b99aab62b4db55017018ddae43dfb Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-09-11 10:25:56 +08:00
xinhui pan	07bd97a864	kfdtest: Fix queuelatency fail issue The timestamp written by releaseMemory packet might still not be visible when we fetch it. To fix this bug, use event-based wait. Change-Id: If2324eb3b3a632c711ee4dff4d03a93d5306c289 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-09-10 21:17:29 -04:00
Harish Kasiviswanathan	1fda429726	kfdtest: GetNodeIoLinkProperties: Display NodeFrom Use the NodeFrom returned by hsaKmtGetNodeIoLinkProperties() to check its correctness. Change-Id: I6ce436dc7c5d5b192bee21156292bd3eff77f916 Signed-off-by: Harish Kasiviswanathan <Harish.Kasiviswanathan@amd.com>	2018-09-10 09:44:24 -04:00
xinhui pan	9c7cfc0df2	kfdtest: Add event-based synchronization mechanism to queues Wait4PacketConsumption now can accept an event to wait for all submitted packets to be processed. Change-Id: I1497b7704e892b04d05811b8d3e4742237c1be57 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-09-04 21:21:19 -04:00
Felix Kuehling	608dddbe9d	kfdtest: Fix gfx902 blacklist Removed some tests from the blacklist that are now passing. Added two new tests that hang the GPU. Change-Id: I09e729590e5181311375058be492d387342ba2fe Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2018-08-31 15:04:50 -04:00
xinhui pan	a040a24243	kfdtest: Let BigBufferStressTest detect memory leak As it will alloc as much as small system memory to reach the allocation limit. We can try to alloc memory several times to see if any allocation in the previous step cause memory leak. Also we test if GPU can access these memory correctly or not. Change-Id: I309f9821b6bc99c212a6bfbc21fe3086ab589fd3 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-28 22:50:42 -04:00
xinhui pan	3e527bc7e8	kfdtest: add PM4EventInterrupt test Similar with SdmaEventInterrupt, verify event interrupt on pm4 queue. Change-Id: I0e43f26fd0d965126985820704215d2ef5e52c1a Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-24 13:21:01 +08:00
xinhui pan	bdb1f8a066	kfdtest: Let SdmaEventInterrupt test more meaningful Simulate some workload there to verify the sDMA event interrupt. Change-Id: Ib5ad0c238cc66898f7835e765df50427ef106b04 Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-24 11:27:34 +08:00
xinhui pan	1076075a1c	kfdtest: Add some asserts in BigBufferStressTest It should have PASS/FAIL report for the vram allocated size. Change-Id: I546c02c2ed02f1cfb5278e0dfd7b18ade39faafb Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-23 23:01:20 -04:00
Kent Russell	fe33461622	kfdtest: Consolidate logic for ASSERT vs EXPECT ASSERT failures result in immediate termination of the test. EXPECT returns a failure but continues execution. Reserve ASSERT for required functionality (node initialization, queue creation, etc) where the rest of the test cannot run if that call fails. Use EXPECT everywhere else Change-Id: I1c11326fc3ae22b50fa83b07b3b49af1e1f4e69e	2018-08-23 06:20:18 -04:00
Kent Russell	414042abf7	kfdtest: Clean up comments Consolidate style (use /* */ for multi-line), fix typos, use dword instead of DWORD/DWord Change-Id: I620e45c1687550db41127e45641b7d79d28223a1	2018-08-23 06:20:17 -04:00
xinhui pan	163fa2f3aa	kfdtest: use HSAuint64 instead of unsigned HSAint64 This should fix gtest compile errors. code like below has trouble, typedef char char8; typedef unsigned char uchar8; ASSERT_NE((uchar8)1, 0); ASSERT_NE((unsigned char8)1, 0); // compile error here or ASSERT_NE((unsigned char8)1, 0); ASSERT_NE((uchar8)1, 0); // compile error here HSA[u]int64 are alias. So ASSERT_XX((unsigned HSAint64)..) with ASSERT_XX((HSAuint64)..) fail to compile. Change-Id: I4c24bc699a69bd4f37c4bc8aaaa9f1a92a24a33e Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-16 16:03:52 +08:00
Yong Zhao	62f7dc2a48	kfdtest: Do not set GTEST_FLAG throw_on_failure The flag makes EXPECT_* to behave like ASSERT_*, which actually work against our favor, so disable the flag. Change-Id: I2ea1dfeaf916b396593a504d081148abdac0fc70 Signed-off-by: Yong Zhao <Yong.Zhao@amd.com>	2018-08-15 18:08:39 -04:00
Felix Kuehling	d3fdaaca3a	kfdtest: Enable more tests for gfx900 A lot of tests were disabled on gfx900 for historical reasons that are no longer valid. The only remaining one that won't work on gfx900 is BasicAddressWatch. Change-Id: I11507de0dfd31262713127d6cb15cc09c14b8b9f Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2018-08-15 14:22:19 -04:00
Kent Russell	f2bd7e1d52	kfdtest: Consolidate log messages for skipped tests When skipping a test, the output should be: Skipping test: <reason>. This will allow for easier identification, automation and general readability Change-Id: I98bda1c068f9dbc83aeea74f642b6101121f234d	2018-08-14 10:11:50 -04:00
Kent Russell	cb019f00cd	kfdtest: Consolidate indentation of multi-line function calls Make indentation consistent, which is that subsequent lines are aligned with the variables declared above Change-Id: I590f7768d93565145b986ad1fb6ac8e82f9c0d58	2018-08-14 08:18:07 -04:00
Kent Russell	dffac0a97e	kfdtest: Style cleanup Clean up the KFDTest style via CPPLint. Some warnings remain regarding volatile variables being cast to void*. This is the command used: cpplint.py --linelength=120 --filter=-readability/multiline_string,-readability/todo,-build/include,-runtime/references multiline_string is due to using ISA code todo is to avoid errors that we don't have TODO(username) instead of TODO include is about including the folder in the header includes references is regarding non-const references '&' being const or using pointers. That can be addressed later Change-Id: I3c6622da0a13dd33ab29b2bfff48be25e763b750	2018-08-14 08:17:57 -04:00
xinhui pan	3f7b6356fd	kfdtest: fix a memory leak issue in MMapLarge test When mapMemoryToGpu fails, we need unregister it with user address as the gpu address is not available. Change-Id: I4418eeaa7aa37008f5bffa144e2c2171f0d238fd Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-10 05:26:06 -04:00
xinhui pan	9d6d0911e4	kfdtest: make p2ptest go through all gpus Implement sDMA copy packet broadcast. Each time sDMA will copy its local vram to sysbuf and next GPU's vram. That will verify where the p2p link is broken. Currently we just test push of p2p. test result on 2 cpus, 4 gpus, numa enabled system. [ RUN ] KFDQMTest.P2PTest [ ] Test 2 -> 3 [ ] PASS 2 -> 3 [ ] Test 3 -> 4 [ ] PASS 3 -> 4 [ ] Test 4 -> 5 [ ] PASS 4 -> 5 [ ] Test 5 -> 0 [ ] PASS 5 -> 0 [ OK ] KFDQMTest.P2PTest (190 ms) Change-Id: Ie6fb2604109e39465b8a873b3bb42abc6259825a	2018-08-07 21:13:37 -04:00
Felix Kuehling	5c742f3e5e	kfdtest: Blacklist Fragmentation test on all chips This test has been intermittently failing for various reasons and was already disabled on all chips except Ellesmere. It stresses memory management in unusual ways by having lots of memory allocated but not mapped, which is not relevant to compute applications over ROCr. Change-Id: I6b791ca7e2e0fcfe93fc720063b4b56acfded751 Signed-off-by: Felix Kuehling <Felix.Kuehling@amd.com>	2018-08-03 20:14:46 -04:00
Eric Huang	3167e3b964	KFDEvictTest: change buffer size and add GFX vram allocation This is to coordinate kfd kernel vram limit change, and adding GFX vram allocation with submission of command nop is to trigger eviction. Change-Id: I18615cd13cfde034aae09c188ae3a82babde97b9 Signed-off-by: Eric Huang <JinHuiEric.Huang@amd.com>	2018-08-03 15:44:32 -04:00
Eric Huang	f8d19104aa	Kfdtest: Change and move drm device function into KFDBaseComponentTest It is for other test to reuse this function. Change-Id: Ib0dbc1a267a5bbcd8078ab3265677b53531f86f3 Signed-off-by: Eric Huang <JinHuiEric.Huang@amd.com>	2018-08-03 15:43:28 -04:00
Yong Zhao	f3e7870784	kfdtest: Evaluate whether a node is APU based on spec This will facilitate the user cases that some APU asics is used as dGPU. Change-Id: Ib3a79ae31a03e7a618c7785166f56282a7617127 Signed-off-by: Yong Zhao <yong.zhao@amd.com>	2018-08-02 11:36:40 -04:00
xinhui pan	86552aba4b	kfdtest: make the output of QueueLatency test more readable Change-Id: Ib33ac25509b23f2e5869bde126e3f11ef60f017e Signed-off-by: xinhui pan <xinhui.pan@amd.com>	2018-08-01 10:06:33 +08:00
Yong Zhao	1d43938ac7	kfdtest: Add run utility files for kfdtest A README.txt file is added to help the opensource community to use kfdtest effectively. After building, run_kfdtest.sh in the building output folder can be used to run the test. Change-Id: I9612d9d5a63bd4cdc3a328efd9961d3cc92a6ba5 Signed-off-by: Yong Zhao <yong.zhao@amd.com>	2018-07-31 00:02:04 -04:00
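
Note on commit c15cf2e9c3: the GCC 4.8.2 log quoted in that commit shows std::sort rejecting a comparator whose parameters are non-const references, because the library passes a const pivot to it. The commit's actual diff is not shown in this log, so the following is only a minimal sketch of the usual remedy, namely taking both comparator arguments by const reference. CopyParams, BySize and SortedBySize are hypothetical names for illustration, not the kfdtest types or functions.

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the SDMACopyParams type named in the error log;
// the real structure's fields are not shown in the commit message.
struct CopyParams {
    std::uint64_t size;  // assumed field: bytes to copy
    int node;            // assumed field: node issuing the copy
};

// Both parameters are const references, so the comparator accepts the
// const pivot that the GCC 4.8 std::sort implementation hands to it.
// Declaring them as plain CopyParams& would reproduce the quoted error.
static bool BySize(const CopyParams& a, const CopyParams& b) {
    return a.size < b.size;
}

// Returns a copy of the input ordered by ascending size.
std::vector<CopyParams> SortedBySize(std::vector<CopyParams> v) {
    std::sort(v.begin(), v.end(), BySize);
    return v;
}
```

A comparator written this way compiles on both old and new standard libraries, since it never needs to modify the elements it compares.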


55 Commits