AMD
9e422660cd
Add support for gfx90a
...
Merge gfx90a support from the 'amd-npi' branch.
Change-Id: I9b51711ed4a1d2f1ed42ba9b83cb12136be228b8
[ROCm/rocprofiler commit: 4df3e0bd9a ]
2021-06-16 16:35:42 -07:00
Kent Russell
70680ba1dc
Cmake: Support static hsakmt
...
Add numa lib as this will be required with a static thunk
Look for static thunk of shared thunk cannot be found
Signed-off-by: Kent Russell <kent.russell@amd.com >
Change-Id: Idcaa0c785a0502c9f5fe42e2dfb9e0c1780f9d66
[ROCm/rocprofiler commit: 97c9efce38 ]
2021-04-27 12:18:02 -04:00
Laurent Morichetti
6624fe4628
Fix a compilation error with gcc-9.3.0
...
On Ubuntu 20.04, in Release mode, gcc fails with this error:
In file included from /usr/include/string.h:495,
from /opt/rocm/include/hsa/hsa_api_trace.h:57,
from ../rocprofiler/src/util/hsa_rsrc_factory.h:29,
from ../rocprofiler/src/util/hsa_rsrc_factory.cpp:25:
In function ‘char* strncpy(char*, const char*, size_t)’,
inlined from ‘const util::AgentInfo* util::HsaRsrcFactory::AddAgentInfo(hsa_agent_t)’ at ../rocprofiler/src/util/hsa_rsrc_factory.cpp:323:12:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: error: ‘char* __builtin___strncpy_chk(char*, const char*, long unsigned int, long unsigned int)’ specified bound depends on the length of the source argument [-Werror=stringop-overflow=]
106 | return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
| ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../rocprofiler/src/util/hsa_rsrc_factory.cpp: In member function ‘const util::AgentInfo* util::HsaRsrcFactory::AddAgentInfo(hsa_agent_t)’:
../rocprofiler/src/util/hsa_rsrc_factory.cpp:322:39: note: length computed here
322 | const int gfxip_label_len = strlen(agent_info->name) - 2;
| ~~~~~~^~~~~~~~~~~~~~~~~~
The error is caused by the following 2 lines:
const int gfxip_label_len = strlen(agent_info->name) - 2;
strncpy(agent_info->gfxip, agent_info->name, gfxip_label_len);
The size argument to strncpy should not depend on the input string.
Since the terminating character is not considered (the copy is at
most len - 2 bytes), using memcpy is preferable. Also, make sure
the destination does not overflow by clamping the size.
Change-Id: I0c5cf7e0daf4cd6fcf7092efb1d9fd4c02a6c639
[ROCm/rocprofiler commit: 304d3366af ]
2021-04-22 11:12:53 -07:00
Evgeny
c701f9705c
cleanup after separating for staging and npi branches
...
Change-Id: Iadd624df21b85f1590e901a8125680743e3281a3
[ROCm/rocprofiler commit: 780dfa37d4 ]
2021-04-08 20:37:47 +00:00
Evgeny
8c3ce30c94
SWDEV-265287 : integration spmltgen.py script
...
Change-Id: Ief3e93225fb6660e72a04e4bd4b379262b73c914
[ROCm/rocprofiler commit: 82d7bb2145 ]
2021-04-08 10:04:39 -04:00
Evgeny
eddcdabf2a
fixing gfx10 gfxip name
...
Change-Id: Ie58768d64117a616b1896489b505790cfa993054
[ROCm/rocprofiler commit: 64bdcaddc7 ]
2021-03-24 00:48:21 -05:00
Evgeny
0282e30855
SWDEV-274821 SPM initialization fix
...
Change-Id: I5e27928a60083eff328bab3e79937ce11bce11bd
[ROCm/rocprofiler commit: e2c9d13e5b ]
2021-03-22 09:18:36 +00:00
Evgeny
2adb15caff
SWDEV-255662 : spm kfd mode support
...
Change-Id: I840c7e92d3d5a59d8e5402c4d8ef86bc123dd07c
[ROCm/rocprofiler commit: 7e60bf163e ]
2020-12-02 13:02:45 -06:00
Evgeny
b781ea8577
fixing sqtt trace for zero size case
...
Change-Id: I75712485f518725af46a3b419339a212d1e762a0
[ROCm/rocprofiler commit: f2c9980647 ]
2020-12-01 18:19:51 -05:00
Evgeny
66490fca38
fixing c_str() as strdup
...
Change-Id: Ib5cb68d16ce66fd2ae072168de4c16895f32b57f
[ROCm/rocprofiler commit: ccc6005c25 ]
2020-10-27 14:45:51 -05:00
Evgeny
e06ff588ce
porting of AQL packet submit to new atomic HSA queue API
...
Change-Id: I654448a7a8627978395d426118a5cb3ba2a92058
[ROCm/rocprofiler commit: 96ff7582ce ]
2020-10-12 09:26:27 -05:00
Evgeny
87e1a397a3
SWDEV-252747 : testing using v3 object
...
Change-Id: I427df765d1be55bd2851ce441238b3eaa46cca4f
[ROCm/rocprofiler commit: 169e36f379 ]
2020-10-09 06:38:46 -04:00
Evgeny
fc99b9a657
enable contexts wait
...
Change-Id: Ie2adf04662fddc8051fb5418904c9c659e264d78
[ROCm/rocprofiler commit: 0d164ba672 ]
2020-09-21 21:06:03 -04:00
Evgeny
2d42e93cdf
kernel objects dumping
...
Change-Id: I5a16e05b7df438efa903948701b65a9ced99e5f3
initial codeobj event implementation
Change-Id: Ia7fac3c2b9897a004cfe88c4de82ba8c18284196
update - codeobj event implementation
Change-Id: I2b91b6e689875af03f0086f5a0872a97a629fd83
update2 - codeobj event implementation
Change-Id: Icff75f14fd21963e40db95373fa74880957a9e32
fix - codeobj event implementation
Change-Id: I76c33c875cb429fb12a974bb408b217f187b4536
URI buffer fix - codeobj event implementation
Change-Id: I7ce1a758e021455da3fe5b8a6e4ae3ab46e9760e
HSA events exposing
Change-Id: I3664ab4e5111c4ccedaf068dcb19f48055f0ef9b
HSA events data struct normalizing
Change-Id: I365ef0db45e0a9314bd2a1a4d29dd4eb4e91297d
[ROCm/rocprofiler commit: 8850e46071 ]
2020-09-11 10:01:54 -05:00
Xianwei Zhang
9c62afb8ba
concurrent: enable/fix the related settings
...
Concurrent profiling relies on the aqlprofile read_api
and tracker. This patch set those options to enable
the concurrent profiling.
Change-Id: Ib97d4d8facfbc11f2684d83109397cd13f117d5e
[ROCm/rocprofiler commit: b445610cd1 ]
2020-08-26 16:04:57 -04:00
Evgeny
a331990ee4
optimization mechanism fix: correct tracker handler; kernel name query on completion;
...
Change-Id: I14da152b4ac3c7d8fd1af2f54e9d71f834071622
[ROCm/rocprofiler commit: 80747de208 ]
2020-08-03 23:34:49 -05:00
Evgeny
832ab03e46
kernel name filtering fix - handling [] brakets
...
Change-Id: I46a62d991a52045694640837393df229cf7a3133
[ROCm/rocprofiler commit: 7364edcc5b ]
2020-07-29 18:47:31 -05:00
Xianwei Zhang
f95b383dfa
pmc: add support of concurrent kernel profiling
...
The profiling was only enabled in serial mode, i.e., kernels
are serialized in execution, and counters are reset at each
kernel start and read at kernel completion. This patch adds
the concurrent mode, by issuing the process-level start
packet to reset counters, and then reading twice at kernel
start and end time to obtain the counter value difference.
The new concurrent profiling usage needs the integration
with the corresponding augment at aqlprofile side.
Change-Id: I94b4442eadc8c64b8fba51b1e4916fc8b895ad21
[ROCm/rocprofiler commit: 61c9df4631 ]
2020-07-16 14:39:46 -05:00
Evgeny
3d44ea1a05
counters dumping optimization
...
Change-Id: I8c694e5380e15179453148dd9ab3a3e51b6db861
[ROCm/rocprofiler commit: 2a7f77b290 ]
2020-07-15 09:57:41 -05:00
Evgeny
d22c364abe
cleanup
...
Change-Id: I518f741d3d3d0bb514936c4db8b727bdc79d9fdc
[ROCm/rocprofiler commit: f214238880 ]
2020-06-24 12:22:22 -05:00
Evgeny
c75faaaa37
concurrent sqtt support
...
Change-Id: I91391fafabc93aefa5d244d870ef82b96a59dc52
[ROCm/rocprofiler commit: 9f7e936d70 ]
2020-06-23 20:00:49 -04:00
Evgeny
4672ded98d
ocl samples disabling
...
Change-Id: I8012e226f96246ed70974048a87e67da9d01753c
[ROCm/rocprofiler commit: 1c11cd087b ]
2020-06-01 18:07:52 -05:00
Evgeny
7d2307424e
disabling destruction
...
Change-Id: I2a7d05a8f597b3bc8bd07bffe7181f9dcace1cbf
[ROCm/rocprofiler commit: 9950b97567 ]
2020-05-07 03:34:30 -05:00
Evgeny
212aefac7d
adding pid for kernel results to support multi-process profiling
...
Change-Id: I283228a4b4145599c5e637dd6faa771b9f4b6345
[ROCm/rocprofiler commit: 3af87a7423 ]
2020-05-05 05:35:32 -05:00
Evgeny
0bea31fe48
get_time API: make public; extende with more time id: coarse and raw; added time error return value;
...
Change-Id: I1641eb2c38915222204617e07fc0bfb388bb8346
[ROCm/rocprofiler commit: 3ce98d33d4 ]
2020-04-30 02:38:18 -05:00
Evgeny
e3514895c6
toll destruction fix
...
Change-Id: If069c820526e21a0a4b80ac516f9669a81f34cab
[ROCm/rocprofiler commit: fe70682184 ]
2020-04-28 03:16:15 -05:00
Evgeny
c1da03616e
test suite minor changes
...
Change-Id: I673055c74461c106b4f705c209e54d83b7d3fb17
[ROCm/rocprofiler commit: 1d638b6f79 ]
2020-04-27 13:05:47 -05:00
Evgeny
55dfe8ec4b
run script update
...
Change-Id: Iffa971ff68f31c4c751698ad962ddc2993455d13
[ROCm/rocprofiler commit: 3173448638 ]
2020-04-22 21:29:32 -05:00
Evgeny
31a7eac580
elliminaring the need of AMD_INTERNAL_BUILD macro defined
...
Change-Id: Ie97aef943793b1e4f40b7c7397af313520b35beb
[ROCm/rocprofiler commit: f819e1c463 ]
2020-04-09 23:41:51 -05:00
Evgeny
5b725bbe7a
adding standalone intercept test; queeu_start/stop API fixed as public;
...
Change-Id: I5489a5ff69454985b955c9e4027f812168de1ecb
[ROCm/rocprofiler commit: fdb8f55e02 ]
2020-04-08 04:31:52 -05:00
Evgeny
05c3b879d9
fixing CPACK prefix to by just rocm root path
...
Change-Id: I8caf5832aa9536b1e891e07cb992dd5258b8755d
[ROCm/rocprofiler commit: aac37d4d1e ]
2020-03-29 01:09:11 -05:00
Evgeny
943b92d4c7
fixing hsa intercept test
...
Change-Id: I2671dfc6a9bd3e01a0c926aa3ea367b8c7a0279e
[ROCm/rocprofiler commit: 7be9a42ab3 ]
2020-03-28 17:24:16 -05:00
Evgeny
67e1502ce5
PC sampling bringup
...
Change-Id: I0d041c4c8c3778f2c328cde38432bc72223706a3
pc sampling integration fix
Change-Id: Ia66ff876d2d99ec4d561daf8320b65d75f5cd2fe
[ROCm/rocprofiler commit: 9df9fddcfb ]
2020-03-28 13:07:45 -05:00
Evgeny
0e767e9864
update for gfx9 and 3.1
...
Change-Id: I0c218f8a2dcdad1c8deb44770d8a64ccd95a92fb
[ROCm/rocprofiler commit: 299a08e4be ]
2020-03-24 19:07:11 -05:00
Evgeny
c22f2a46dd
adding mytest target
...
Change-Id: Ia82b971597a4876d99c85d7dbbf997faea83f0c9
[ROCm/rocprofiler commit: ffa0a51408 ]
2020-03-19 01:27:44 -05:00
Evgeny
99d4641ee7
dynamic kernel lds size
...
Change-Id: I051b85550d12ca8662127615be7adfb12c5f9585
[ROCm/rocprofiler commit: 7ecf9ef1e3 ]
2020-02-27 17:44:22 -06:00
Evgeny
d657afa812
adding AgentInfo::lds_block_size
...
Change-Id: I186893add96dc92570e710ae78b475897ebfe531
[ROCm/rocprofiler commit: 40730e34e4 ]
2020-02-18 14:00:19 -06:00
Evgeny
a40ba7a793
hsa-interept mode enabling in head
...
Change-Id: I048a975deaae691868034ee361fea71ffb5bdd83
[ROCm/rocprofiler commit: d2324e780f ]
2020-02-14 00:22:36 -06:00
Evgeny
c1b7e4353a
device id/type for submit callback
...
Change-Id: I69b31bd28850198c8ca44e55bf8f2ff9540d92fc
[ROCm/rocprofiler commit: 3d4c4d1cce ]
2020-02-13 22:00:56 -06:00
Evgeny
988274443a
adding agent to device callback data
...
Change-Id: I7d631e4faae685ce23704ac65308bf2aae0c157a
[ROCm/rocprofiler commit: 4edc6c98cf ]
2020-02-13 22:00:56 -06:00
Evgeny
9d2081ceb4
kernel free on executable_destroy
...
Change-Id: I062e08fd5df94f4d21b443b74494dc6e95201f3d
[ROCm/rocprofiler commit: 304568c1a3 ]
2020-02-13 22:00:56 -06:00
Evgeny
59bc8da846
adding code object freeze handling, kernel code allocate/device callbacks
...
Change-Id: I6ec32a0c4e945260cd11f88c1d2a0a7f22d6e10c
documenting comments
Change-Id: I1b7af70d51042c48aef0474d0b6b54275b42c456
[ROCm/rocprofiler commit: 086c29fff1 ]
2020-02-13 22:00:56 -06:00
Evgeny
3b84973737
adding allocate/device/memcopy/submit callbacks
...
Change-Id: Ie83ae3cd57cdf5038e83db70891b103439c37d55
[ROCm/rocprofiler commit: f95ed067ba ]
2020-02-13 22:00:08 -06:00
Evgeny
ec6fd99b30
hsa_rsrc_factory sync
...
Change-Id: Idecbc0cdad6068eae5259cb043bdf5746b430aec
[ROCm/rocprofiler commit: 5615ade977 ]
2020-01-30 19:29:59 -06:00
Evgeny
7287ff939b
adding C test
...
Change-Id: I006130d6711fd8e7bb222b985e4c64a801825f41
[ROCm/rocprofiler commit: 8565d41d7b ]
2019-12-29 05:31:25 -06:00
Evgeny
67b1e9af0d
trace parameters fix
...
Change-Id: I5cd8c897b9982f9abc9e36c18a4a488bd65fbd30
[ROCm/rocprofiler commit: 389cb50854 ]
2019-12-10 04:33:29 -06:00
Evgeny
6dd6ea0046
sgpr/vgpr reported sizes in terms of registers; lds size in bytes;
...
Change-Id: Id527687cc075a4e85ea4d0063ed8f336e34d2454
[ROCm/rocprofiler commit: c8eb0641bf ]
2019-11-27 15:19:14 -06:00
Evgeny
9b183d909e
code object tracking and v3 code object support
...
Change-Id: I081ada13f6364ea4401a97a485dedfa9bf8a45fe
[ROCm/rocprofiler commit: 60043d198b ]
2019-11-19 20:18:09 -06:00
Evgeny
c34655f1ff
setting of granulated lds, sgpr, vgpr reserved by HW for the kernel
...
Change-Id: Ib3e60c0b97787f56f769136480a46a8560e00cc8
[ROCm/rocprofiler commit: dc05345a78 ]
2019-11-11 18:11:16 -05:00
Xianwei Zhang
a67147ca5b
tests: replace 'bc' with 'awk' to be more portable
...
Calbulator package 'bc' is not always installed, occasionally
breaking the test scripts. As a fix, this patch gets rid of
'bc' and uses 'awk' instead.
Change-Id: I5f4dbf018d97581a626cf95d147c415755477771
[ROCm/rocprofiler commit: 51bdb2ce02 ]
2019-08-26 10:53:52 -04:00