Gráfico de commits

177 Commits

Autor SHA1 Mensaje Fecha
AMD 9e422660cd Add support for gfx90a
Merge gfx90a support from the 'amd-npi' branch.

Change-Id: I9b51711ed4a1d2f1ed42ba9b83cb12136be228b8


[ROCm/rocprofiler commit: 4df3e0bd9a]
2021-06-16 16:35:42 -07:00
Kent Russell 70680ba1dc Cmake: Support static hsakmt
Add numa lib as this will be required with a static thunk
Look for static thunk of shared thunk cannot be found

Signed-off-by: Kent Russell <kent.russell@amd.com>
Change-Id: Idcaa0c785a0502c9f5fe42e2dfb9e0c1780f9d66


[ROCm/rocprofiler commit: 97c9efce38]
2021-04-27 12:18:02 -04:00
Laurent Morichetti 6624fe4628 Fix a compilation error with gcc-9.3.0
On Ubuntu 20.04, in Release mode, gcc fails with this error:

In file included from /usr/include/string.h:495,
                 from /opt/rocm/include/hsa/hsa_api_trace.h:57,
                 from ../rocprofiler/src/util/hsa_rsrc_factory.h:29,
                 from ../rocprofiler/src/util/hsa_rsrc_factory.cpp:25:
In function ‘char* strncpy(char*, const char*, size_t)’,
    inlined from ‘const util::AgentInfo* util::HsaRsrcFactory::AddAgentInfo(hsa_agent_t)’ at ../rocprofiler/src/util/hsa_rsrc_factory.cpp:323:12:
/usr/include/x86_64-linux-gnu/bits/string_fortified.h:106:34: error: ‘char* __builtin___strncpy_chk(char*, const char*, long unsigned int, long unsigned int)’ specified bound depends on the length of the source argument [-Werror=stringop-overflow=]
  106 |   return __builtin___strncpy_chk (__dest, __src, __len, __bos (__dest));
      |          ~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
../rocprofiler/src/util/hsa_rsrc_factory.cpp: In member function ‘const util::AgentInfo* util::HsaRsrcFactory::AddAgentInfo(hsa_agent_t)’:
../rocprofiler/src/util/hsa_rsrc_factory.cpp:322:39: note: length computed here
  322 |     const int gfxip_label_len = strlen(agent_info->name) - 2;
      |                                 ~~~~~~^~~~~~~~~~~~~~~~~~

The error is caused by the following 2 lines:

    const int gfxip_label_len = strlen(agent_info->name) - 2;
    strncpy(agent_info->gfxip, agent_info->name, gfxip_label_len);

The size argument to strncpy should not depend on the input string.

Since the terminating character is not considered (the copy is at
most len - 2 bytes), using memcpy is preferable. Also, make sure
the destination does not overflow by clamping the size.

Change-Id: I0c5cf7e0daf4cd6fcf7092efb1d9fd4c02a6c639


[ROCm/rocprofiler commit: 304d3366af]
2021-04-22 11:12:53 -07:00
Evgeny c701f9705c cleanup after separating for staging and npi branches
Change-Id: Iadd624df21b85f1590e901a8125680743e3281a3


[ROCm/rocprofiler commit: 780dfa37d4]
2021-04-08 20:37:47 +00:00
Evgeny 8c3ce30c94 SWDEV-265287 : integration spmltgen.py script
Change-Id: Ief3e93225fb6660e72a04e4bd4b379262b73c914


[ROCm/rocprofiler commit: 82d7bb2145]
2021-04-08 10:04:39 -04:00
Evgeny eddcdabf2a fixing gfx10 gfxip name
Change-Id: Ie58768d64117a616b1896489b505790cfa993054


[ROCm/rocprofiler commit: 64bdcaddc7]
2021-03-24 00:48:21 -05:00
Evgeny 0282e30855 SWDEV-274821 SPM initialization fix
Change-Id: I5e27928a60083eff328bab3e79937ce11bce11bd


[ROCm/rocprofiler commit: e2c9d13e5b]
2021-03-22 09:18:36 +00:00
Evgeny 2adb15caff SWDEV-255662 : spm kfd mode support
Change-Id: I840c7e92d3d5a59d8e5402c4d8ef86bc123dd07c


[ROCm/rocprofiler commit: 7e60bf163e]
2020-12-02 13:02:45 -06:00
Evgeny b781ea8577 fixing sqtt trace for zero size case
Change-Id: I75712485f518725af46a3b419339a212d1e762a0


[ROCm/rocprofiler commit: f2c9980647]
2020-12-01 18:19:51 -05:00
Evgeny 66490fca38 fixing c_str() as strdup
Change-Id: Ib5cb68d16ce66fd2ae072168de4c16895f32b57f


[ROCm/rocprofiler commit: ccc6005c25]
2020-10-27 14:45:51 -05:00
Evgeny e06ff588ce porting of AQL packet submit to new atomic HSA queue API
Change-Id: I654448a7a8627978395d426118a5cb3ba2a92058


[ROCm/rocprofiler commit: 96ff7582ce]
2020-10-12 09:26:27 -05:00
Evgeny 87e1a397a3 SWDEV-252747 : testing using v3 object
Change-Id: I427df765d1be55bd2851ce441238b3eaa46cca4f


[ROCm/rocprofiler commit: 169e36f379]
2020-10-09 06:38:46 -04:00
Evgeny fc99b9a657 enable contexts wait
Change-Id: Ie2adf04662fddc8051fb5418904c9c659e264d78


[ROCm/rocprofiler commit: 0d164ba672]
2020-09-21 21:06:03 -04:00
Evgeny 2d42e93cdf kernel objects dumping
Change-Id: I5a16e05b7df438efa903948701b65a9ced99e5f3

initial codeobj event implementation

Change-Id: Ia7fac3c2b9897a004cfe88c4de82ba8c18284196

update - codeobj event implementation

Change-Id: I2b91b6e689875af03f0086f5a0872a97a629fd83

update2 - codeobj event implementation

Change-Id: Icff75f14fd21963e40db95373fa74880957a9e32

fix - codeobj event implementation

Change-Id: I76c33c875cb429fb12a974bb408b217f187b4536

URI buffer fix - codeobj event implementation

Change-Id: I7ce1a758e021455da3fe5b8a6e4ae3ab46e9760e

HSA events exposing

Change-Id: I3664ab4e5111c4ccedaf068dcb19f48055f0ef9b

HSA events data struct normalizing

Change-Id: I365ef0db45e0a9314bd2a1a4d29dd4eb4e91297d


[ROCm/rocprofiler commit: 8850e46071]
2020-09-11 10:01:54 -05:00
Xianwei Zhang 9c62afb8ba concurrent: enable/fix the related settings
Concurrent profiling relies on the aqlprofile read_api
and tracker. This patch set those options to enable
the concurrent profiling.

Change-Id: Ib97d4d8facfbc11f2684d83109397cd13f117d5e


[ROCm/rocprofiler commit: b445610cd1]
2020-08-26 16:04:57 -04:00
Evgeny a331990ee4 optimization mechanism fix: correct tracker handler; kernel name query on completion;
Change-Id: I14da152b4ac3c7d8fd1af2f54e9d71f834071622


[ROCm/rocprofiler commit: 80747de208]
2020-08-03 23:34:49 -05:00
Evgeny 832ab03e46 kernel name filtering fix - handling [] brakets
Change-Id: I46a62d991a52045694640837393df229cf7a3133


[ROCm/rocprofiler commit: 7364edcc5b]
2020-07-29 18:47:31 -05:00
Xianwei Zhang f95b383dfa pmc: add support of concurrent kernel profiling
The profiling was only enabled in serial mode, i.e., kernels
are serialized in execution, and counters are reset at each
kernel start and read at kernel completion. This patch adds
the concurrent mode, by issuing the process-level start
packet to reset counters, and then reading twice at kernel
start and end time to obtain the counter value difference.
The new concurrent profiling usage needs the integration
with the corresponding augment at aqlprofile side.

Change-Id: I94b4442eadc8c64b8fba51b1e4916fc8b895ad21


[ROCm/rocprofiler commit: 61c9df4631]
2020-07-16 14:39:46 -05:00
Evgeny 3d44ea1a05 counters dumping optimization
Change-Id: I8c694e5380e15179453148dd9ab3a3e51b6db861


[ROCm/rocprofiler commit: 2a7f77b290]
2020-07-15 09:57:41 -05:00
Evgeny d22c364abe cleanup
Change-Id: I518f741d3d3d0bb514936c4db8b727bdc79d9fdc


[ROCm/rocprofiler commit: f214238880]
2020-06-24 12:22:22 -05:00
Evgeny c75faaaa37 concurrent sqtt support
Change-Id: I91391fafabc93aefa5d244d870ef82b96a59dc52


[ROCm/rocprofiler commit: 9f7e936d70]
2020-06-23 20:00:49 -04:00
Evgeny 4672ded98d ocl samples disabling
Change-Id: I8012e226f96246ed70974048a87e67da9d01753c


[ROCm/rocprofiler commit: 1c11cd087b]
2020-06-01 18:07:52 -05:00
Evgeny 7d2307424e disabling destruction
Change-Id: I2a7d05a8f597b3bc8bd07bffe7181f9dcace1cbf


[ROCm/rocprofiler commit: 9950b97567]
2020-05-07 03:34:30 -05:00
Evgeny 212aefac7d adding pid for kernel results to support multi-process profiling
Change-Id: I283228a4b4145599c5e637dd6faa771b9f4b6345


[ROCm/rocprofiler commit: 3af87a7423]
2020-05-05 05:35:32 -05:00
Evgeny 0bea31fe48 get_time API: make public; extende with more time id: coarse and raw; added time error return value;
Change-Id: I1641eb2c38915222204617e07fc0bfb388bb8346


[ROCm/rocprofiler commit: 3ce98d33d4]
2020-04-30 02:38:18 -05:00
Evgeny e3514895c6 toll destruction fix
Change-Id: If069c820526e21a0a4b80ac516f9669a81f34cab


[ROCm/rocprofiler commit: fe70682184]
2020-04-28 03:16:15 -05:00
Evgeny c1da03616e test suite minor changes
Change-Id: I673055c74461c106b4f705c209e54d83b7d3fb17


[ROCm/rocprofiler commit: 1d638b6f79]
2020-04-27 13:05:47 -05:00
Evgeny 55dfe8ec4b run script update
Change-Id: Iffa971ff68f31c4c751698ad962ddc2993455d13


[ROCm/rocprofiler commit: 3173448638]
2020-04-22 21:29:32 -05:00
Evgeny 31a7eac580 elliminaring the need of AMD_INTERNAL_BUILD macro defined
Change-Id: Ie97aef943793b1e4f40b7c7397af313520b35beb


[ROCm/rocprofiler commit: f819e1c463]
2020-04-09 23:41:51 -05:00
Evgeny 5b725bbe7a adding standalone intercept test; queeu_start/stop API fixed as public;
Change-Id: I5489a5ff69454985b955c9e4027f812168de1ecb


[ROCm/rocprofiler commit: fdb8f55e02]
2020-04-08 04:31:52 -05:00
Evgeny 05c3b879d9 fixing CPACK prefix to by just rocm root path
Change-Id: I8caf5832aa9536b1e891e07cb992dd5258b8755d


[ROCm/rocprofiler commit: aac37d4d1e]
2020-03-29 01:09:11 -05:00
Evgeny 943b92d4c7 fixing hsa intercept test
Change-Id: I2671dfc6a9bd3e01a0c926aa3ea367b8c7a0279e


[ROCm/rocprofiler commit: 7be9a42ab3]
2020-03-28 17:24:16 -05:00
Evgeny 67e1502ce5 PC sampling bringup
Change-Id: I0d041c4c8c3778f2c328cde38432bc72223706a3

pc sampling integration fix

Change-Id: Ia66ff876d2d99ec4d561daf8320b65d75f5cd2fe


[ROCm/rocprofiler commit: 9df9fddcfb]
2020-03-28 13:07:45 -05:00
Evgeny 0e767e9864 update for gfx9 and 3.1
Change-Id: I0c218f8a2dcdad1c8deb44770d8a64ccd95a92fb


[ROCm/rocprofiler commit: 299a08e4be]
2020-03-24 19:07:11 -05:00
Evgeny c22f2a46dd adding mytest target
Change-Id: Ia82b971597a4876d99c85d7dbbf997faea83f0c9


[ROCm/rocprofiler commit: ffa0a51408]
2020-03-19 01:27:44 -05:00
Evgeny 99d4641ee7 dynamic kernel lds size
Change-Id: I051b85550d12ca8662127615be7adfb12c5f9585


[ROCm/rocprofiler commit: 7ecf9ef1e3]
2020-02-27 17:44:22 -06:00
Evgeny d657afa812 adding AgentInfo::lds_block_size
Change-Id: I186893add96dc92570e710ae78b475897ebfe531


[ROCm/rocprofiler commit: 40730e34e4]
2020-02-18 14:00:19 -06:00
Evgeny a40ba7a793 hsa-interept mode enabling in head
Change-Id: I048a975deaae691868034ee361fea71ffb5bdd83


[ROCm/rocprofiler commit: d2324e780f]
2020-02-14 00:22:36 -06:00
Evgeny c1b7e4353a device id/type for submit callback
Change-Id: I69b31bd28850198c8ca44e55bf8f2ff9540d92fc


[ROCm/rocprofiler commit: 3d4c4d1cce]
2020-02-13 22:00:56 -06:00
Evgeny 988274443a adding agent to device callback data
Change-Id: I7d631e4faae685ce23704ac65308bf2aae0c157a


[ROCm/rocprofiler commit: 4edc6c98cf]
2020-02-13 22:00:56 -06:00
Evgeny 9d2081ceb4 kernel free on executable_destroy
Change-Id: I062e08fd5df94f4d21b443b74494dc6e95201f3d


[ROCm/rocprofiler commit: 304568c1a3]
2020-02-13 22:00:56 -06:00
Evgeny 59bc8da846 adding code object freeze handling, kernel code allocate/device callbacks
Change-Id: I6ec32a0c4e945260cd11f88c1d2a0a7f22d6e10c

documenting comments

Change-Id: I1b7af70d51042c48aef0474d0b6b54275b42c456


[ROCm/rocprofiler commit: 086c29fff1]
2020-02-13 22:00:56 -06:00
Evgeny 3b84973737 adding allocate/device/memcopy/submit callbacks
Change-Id: Ie83ae3cd57cdf5038e83db70891b103439c37d55


[ROCm/rocprofiler commit: f95ed067ba]
2020-02-13 22:00:08 -06:00
Evgeny ec6fd99b30 hsa_rsrc_factory sync
Change-Id: Idecbc0cdad6068eae5259cb043bdf5746b430aec


[ROCm/rocprofiler commit: 5615ade977]
2020-01-30 19:29:59 -06:00
Evgeny 7287ff939b adding C test
Change-Id: I006130d6711fd8e7bb222b985e4c64a801825f41


[ROCm/rocprofiler commit: 8565d41d7b]
2019-12-29 05:31:25 -06:00
Evgeny 67b1e9af0d trace parameters fix
Change-Id: I5cd8c897b9982f9abc9e36c18a4a488bd65fbd30


[ROCm/rocprofiler commit: 389cb50854]
2019-12-10 04:33:29 -06:00
Evgeny 6dd6ea0046 sgpr/vgpr reported sizes in terms of registers; lds size in bytes;
Change-Id: Id527687cc075a4e85ea4d0063ed8f336e34d2454


[ROCm/rocprofiler commit: c8eb0641bf]
2019-11-27 15:19:14 -06:00
Evgeny 9b183d909e code object tracking and v3 code object support
Change-Id: I081ada13f6364ea4401a97a485dedfa9bf8a45fe


[ROCm/rocprofiler commit: 60043d198b]
2019-11-19 20:18:09 -06:00
Evgeny c34655f1ff setting of granulated lds, sgpr, vgpr reserved by HW for the kernel
Change-Id: Ib3e60c0b97787f56f769136480a46a8560e00cc8


[ROCm/rocprofiler commit: dc05345a78]
2019-11-11 18:11:16 -05:00
Xianwei Zhang a67147ca5b tests: replace 'bc' with 'awk' to be more portable
Calbulator package 'bc' is not always installed, occasionally
breaking the test scripts. As a fix, this patch gets rid of
'bc' and uses 'awk' instead.

Change-Id: I5f4dbf018d97581a626cf95d147c415755477771


[ROCm/rocprofiler commit: 51bdb2ce02]
2019-08-26 10:53:52 -04:00