Graf commitů

156 Commity

Autor SHA1 Zpráva Datum
Evgeny 1c11cd087b ocl samples disabling
Change-Id: I8012e226f96246ed70974048a87e67da9d01753c
2020-06-01 18:07:52 -05:00
Evgeny 9950b97567 disabling destruction
Change-Id: I2a7d05a8f597b3bc8bd07bffe7181f9dcace1cbf
2020-05-07 03:34:30 -05:00
Evgeny 3af87a7423 adding pid for kernel results to support multi-process profiling
Change-Id: I283228a4b4145599c5e637dd6faa771b9f4b6345
2020-05-05 05:35:32 -05:00
Evgeny 3ce98d33d4 get_time API: make public; extende with more time id: coarse and raw; added time error return value;
Change-Id: I1641eb2c38915222204617e07fc0bfb388bb8346
2020-04-30 02:38:18 -05:00
Evgeny fe70682184 toll destruction fix
Change-Id: If069c820526e21a0a4b80ac516f9669a81f34cab
2020-04-28 03:16:15 -05:00
Evgeny 1d638b6f79 test suite minor changes
Change-Id: I673055c74461c106b4f705c209e54d83b7d3fb17
2020-04-27 13:05:47 -05:00
Evgeny 3173448638 run script update
Change-Id: Iffa971ff68f31c4c751698ad962ddc2993455d13
2020-04-22 21:29:32 -05:00
Evgeny f819e1c463 elliminaring the need of AMD_INTERNAL_BUILD macro defined
Change-Id: Ie97aef943793b1e4f40b7c7397af313520b35beb
2020-04-09 23:41:51 -05:00
Evgeny fdb8f55e02 adding standalone intercept test; queeu_start/stop API fixed as public;
Change-Id: I5489a5ff69454985b955c9e4027f812168de1ecb
2020-04-08 04:31:52 -05:00
Evgeny aac37d4d1e fixing CPACK prefix to by just rocm root path
Change-Id: I8caf5832aa9536b1e891e07cb992dd5258b8755d
2020-03-29 01:09:11 -05:00
Evgeny 7be9a42ab3 fixing hsa intercept test
Change-Id: I2671dfc6a9bd3e01a0c926aa3ea367b8c7a0279e
2020-03-28 17:24:16 -05:00
Evgeny 9df9fddcfb PC sampling bringup
Change-Id: I0d041c4c8c3778f2c328cde38432bc72223706a3

pc sampling integration fix

Change-Id: Ia66ff876d2d99ec4d561daf8320b65d75f5cd2fe
2020-03-28 13:07:45 -05:00
Evgeny 299a08e4be update for gfx9 and 3.1
Change-Id: I0c218f8a2dcdad1c8deb44770d8a64ccd95a92fb
2020-03-24 19:07:11 -05:00
Evgeny ffa0a51408 adding mytest target
Change-Id: Ia82b971597a4876d99c85d7dbbf997faea83f0c9
2020-03-19 01:27:44 -05:00
Evgeny 7ecf9ef1e3 dynamic kernel lds size
Change-Id: I051b85550d12ca8662127615be7adfb12c5f9585
2020-02-27 17:44:22 -06:00
Evgeny 40730e34e4 adding AgentInfo::lds_block_size
Change-Id: I186893add96dc92570e710ae78b475897ebfe531
2020-02-18 14:00:19 -06:00
Evgeny d2324e780f hsa-interept mode enabling in head
Change-Id: I048a975deaae691868034ee361fea71ffb5bdd83
2020-02-14 00:22:36 -06:00
Evgeny 3d4c4d1cce device id/type for submit callback
Change-Id: I69b31bd28850198c8ca44e55bf8f2ff9540d92fc
2020-02-13 22:00:56 -06:00
Evgeny 4edc6c98cf adding agent to device callback data
Change-Id: I7d631e4faae685ce23704ac65308bf2aae0c157a
2020-02-13 22:00:56 -06:00
Evgeny 304568c1a3 kernel free on executable_destroy
Change-Id: I062e08fd5df94f4d21b443b74494dc6e95201f3d
2020-02-13 22:00:56 -06:00
Evgeny 086c29fff1 adding code object freeze handling, kernel code allocate/device callbacks
Change-Id: I6ec32a0c4e945260cd11f88c1d2a0a7f22d6e10c

documenting comments

Change-Id: I1b7af70d51042c48aef0474d0b6b54275b42c456
2020-02-13 22:00:56 -06:00
Evgeny f95ed067ba adding allocate/device/memcopy/submit callbacks
Change-Id: Ie83ae3cd57cdf5038e83db70891b103439c37d55
2020-02-13 22:00:08 -06:00
Evgeny 5615ade977 hsa_rsrc_factory sync
Change-Id: Idecbc0cdad6068eae5259cb043bdf5746b430aec
2020-01-30 19:29:59 -06:00
Evgeny 8565d41d7b adding C test
Change-Id: I006130d6711fd8e7bb222b985e4c64a801825f41
2019-12-29 05:31:25 -06:00
Evgeny 389cb50854 trace parameters fix
Change-Id: I5cd8c897b9982f9abc9e36c18a4a488bd65fbd30
2019-12-10 04:33:29 -06:00
Evgeny c8eb0641bf sgpr/vgpr reported sizes in terms of registers; lds size in bytes;
Change-Id: Id527687cc075a4e85ea4d0063ed8f336e34d2454
2019-11-27 15:19:14 -06:00
Evgeny 60043d198b code object tracking and v3 code object support
Change-Id: I081ada13f6364ea4401a97a485dedfa9bf8a45fe
2019-11-19 20:18:09 -06:00
Evgeny dc05345a78 setting of granulated lds, sgpr, vgpr reserved by HW for the kernel
Change-Id: Ib3e60c0b97787f56f769136480a46a8560e00cc8
2019-11-11 18:11:16 -05:00
Xianwei Zhang 51bdb2ce02 tests: replace 'bc' with 'awk' to be more portable
Calbulator package 'bc' is not always installed, occasionally
breaking the test scripts. As a fix, this patch gets rid of
'bc' and uses 'awk' instead.

Change-Id: I5f4dbf018d97581a626cf95d147c415755477771
2019-08-26 10:53:52 -04:00
Evgeny e5b938b5fb unordered metrics expressions config
Change-Id: I998423046abfd7af45f28071911276cce1c4f3da
2019-08-25 05:08:55 -05:00
Evgeny Shcherbakov 6fa4f1dc66 Merge "tests: improve scripts on rocm v2.6, and do more check" into amd-master 2019-08-23 15:02:06 -04:00
Xianwei Zhang 68bf049d49 tests: improve scripts on rocm v2.6, and do more check
'rocminfo' output format on v2.6 is slightly different, and thus
cache capacity and cache linesize can be incorrectly obtained.
Example: 'L1: 16KB' vs. 'L1: 16(0x10) KB'. This patch fixes the
formatting issue, and further does more check on profiled .csv
file.

Change-Id: I934ee3613a79fa8acc431a394e88a9e09833311b
2019-08-23 12:49:59 -04:00
Evgeny Shcherbakov 4f4e02de7a Merge "build: slightly change files to pass compilation" into amd-master 2019-08-23 11:44:44 -04:00
Evgeny aecc7abc76 Mem32Bwrites -> MemWrites32B renaming
Change-Id: I86f570b97178caa3d3d5fcbc5ecc716216dad504
2019-08-22 22:34:36 -05:00
Evgeny dff3256dd5 mi100 metrics fix
Change-Id: I862578a2b7827a88cc6926d636d5eb9a8320a244
2019-08-22 22:30:48 -05:00
Xianwei Zhang ee917344ff build: slightly change files to pass compilation
Compiled using Clang and encountered errors on ambiguous pointer
type, implicit conversion NULL to bool, and unused function, etc.
As a fix, this patch slightly updates the related codes.

Change-Id: I4a73d66293313dda36749036f5c73aaf2a445231
2019-08-22 23:29:16 -04:00
Evgeny 82ecf9e7a4 overwriting on inheritance
Change-Id: I6faf79874fc14e898c75972aa2f372e228105f7b
2019-08-08 01:54:29 -05:00
Evgeny 09eed4f35b gfx908 events update
Change-Id: Ieb7c72f66b7e5e5dfc29d97650a29722a6276a31
2019-08-07 19:44:31 -05:00
Evgeny 855a801ae7 adding gfx908 namespace
Change-Id: Ifd45385777be5066c8bfef23ee24bbabadf4dffa
2019-07-27 03:06:10 -05:00
Evgeny Shcherbakov 8092bd9b2b Merge "tests: make tests print msg and exit on errors" into amd-master 2019-07-24 14:20:25 -04:00
Evgeny Shcherbakov ed68e4c432 Merge "tests: remove TCC_EA_* counters to support gfx8" into amd-master 2019-07-24 14:20:14 -04:00
Xianwei Zhang 87e46dde4e tests: make tests print msg and exit on errors
Change-Id: Ie5af942551a6970a7414c30988a47ed57a9baf27
2019-07-23 17:18:54 -04:00
Evgeny Shcherbakov 710a9de411 Merge "integrating memory_validation bench in cmake build" into amd-master 2019-07-23 14:49:50 -04:00
Evgeny Shcherbakov 657241e848 Merge "xml: add extra events for gfx906 GPUs" into amd-master 2019-07-23 12:40:39 -04:00
Evgeny 6c5ca62575 integrating memory_validation bench in cmake build
Change-Id: I90ad85c7b1517c162804a00d5abb75007bf831fe
2019-07-23 11:14:44 -05:00
Xianwei Zhang 6152563612 tests: remove TCC_EA_* counters to support gfx8
TCC_EA_* counters are only available on gfx9 devices, and thus
the memory validation tests fail on gfx8. To fix the issue,
this patch removes TCC_EA_* metrics and keeps only the ones
supported by both gfx8 and gfx9.

Change-Id: Ib3889435b88f2e6970981304a72cee75d5343d4b
2019-07-23 05:00:28 -05:00
Evgeny Shcherbakov 1d9b6b4d4f Merge "tests: add top-level Makefile and run.sh" into amd-master 2019-07-22 16:09:47 -04:00
Xianwei Zhang a5a56425bb tests: add top-level Makefile and run.sh
Change-Id: Ieaef51422c11ec27d56ff19953f9a5dd1f563353
2019-07-18 05:21:20 -05:00
Evgeny 4ef7d7d156 sys trace adding
Change-Id: I8d0a3a34e14f897e635a42faa42eedc1a1f86ace
2019-07-18 00:41:49 -05:00
Xianwei Zhang ca9a714b77 xml: add extra events for gfx906 GPUs
Compard to gfx900, e.g., Vega 10, gfx906 adds extra counter events.
A typical difference is on TCC-EA that gfx906 (e.g., Vega 20) has 2
EAs per TCC, while only one single EA/TCC on gfx900. As such,
additional counters must be profiled to get correct results. This
patch adds one extra events to specifically handle gfx906.

Change-Id: Id6c9d37548a102c80bbfddcfa11e77d20f17431a
2019-07-16 12:56:45 -05:00