Граф коммитов

64 Коммитов

Автор SHA1 Сообщение Дата
Ammar ELWazir ef96663e82 SWDEV-374256: GPU Kernel Dispatch Trace Period Support
Change-Id: Idaabe82a30013e3aba4bcb65bd0a89ce2d14ad97


[ROCm/rocprofiler commit: 472624e3bd]
2023-06-21 12:46:33 -04:00
Giovanni LB a7e8182a21 SWDEV-405575: Added gfx941 and gfx942
Change-Id: I45a49cd64a76d3ae32c209497c70fe27b5be212b


[ROCm/rocprofiler commit: e1285e3fd4]
2023-06-19 11:11:37 -04:00
Ammar ELWazir 7647f6988f V1/V2 API Library Separation
V1 library will be supported as librocprofiler64.so and V2 will be supported as librocprofiler64v2.so and headers will be rocprofiler.h for V1 and v2/rocprofiler.h for v2

Change-Id: Ibe5bdbf2f79f0175342c648e917ae77918186604


[ROCm/rocprofiler commit: 9e62e066fe]
2023-05-02 22:44:43 -04:00
gobhardw c9a9e34844 SWDEV-374072 : rocprof gpu selector fix
Change-Id: I155e63a5dc1ecbacd76d80b0df76da99b645ed9f


[ROCm/rocprofiler commit: 14977e4dc1]
2023-03-29 15:55:06 +00:00
Ammar ELWazir de4abd0d0f Adding rocprofilerv2
Change-Id: Ic0cc280ba207d2b8f6ccae1cd4ac3184152fc1ad


[ROCm/rocprofiler commit: 8032adb64f]
2023-03-09 13:20:33 +00:00
Laurent Morichetti 6e1ea79067 Fix vgpr count calculation for gfx90a and gfx940
Read accum_offset from compute_pgm_rsrc3 to report both the arch vgprs
and the accum vgprs

Change-Id: I99e746d54a6a1671e343da5658cc6ce970f79939


[ROCm/rocprofiler commit: 5fd1c7e8e3]
2022-08-03 14:02:36 -07:00
Ranjith Ramakrishnan a5a941cbfc SWDEV-345870 - Correct include paths for new directory layout
Use hsa header files from /opt/rocm-ver/include rather than using wrapper files from /opt/rocm-ver/hsa/include/hsa

Change-Id: Id7a9bde19447cd2a0fd6e03b11c08471f09c2a46


[ROCm/rocprofiler commit: e7eb195924]
2022-07-14 16:08:41 -07:00
Chun Yang 62a76c8ebb SWDEV-296922 : Incorrect rounding due to integer division in rocprofiler metrics
Changed derived metrics to double from int64.
Fixed standalone test due to int64 to float change
Fixed intercept test due to int64 to float change.

Change-Id: I49631c187406ae9dd94a869b3bb13772012e8cdf


[ROCm/rocprofiler commit: f9017cbdc5]
2021-09-23 14:52:35 -07:00
Evgeny c701f9705c cleanup after separating for staging and npi branches
Change-Id: Iadd624df21b85f1590e901a8125680743e3281a3


[ROCm/rocprofiler commit: 780dfa37d4]
2021-04-08 20:37:47 +00:00
Evgeny 8c3ce30c94 SWDEV-265287 : integration spmltgen.py script
Change-Id: Ief3e93225fb6660e72a04e4bd4b379262b73c914


[ROCm/rocprofiler commit: 82d7bb2145]
2021-04-08 10:04:39 -04:00
Evgeny 0282e30855 SWDEV-274821 SPM initialization fix
Change-Id: I5e27928a60083eff328bab3e79937ce11bce11bd


[ROCm/rocprofiler commit: e2c9d13e5b]
2021-03-22 09:18:36 +00:00
Evgeny 2adb15caff SWDEV-255662 : spm kfd mode support
Change-Id: I840c7e92d3d5a59d8e5402c4d8ef86bc123dd07c


[ROCm/rocprofiler commit: 7e60bf163e]
2020-12-02 13:02:45 -06:00
Evgeny b781ea8577 fixing sqtt trace for zero size case
Change-Id: I75712485f518725af46a3b419339a212d1e762a0


[ROCm/rocprofiler commit: f2c9980647]
2020-12-01 18:19:51 -05:00
Evgeny 66490fca38 fixing c_str() as strdup
Change-Id: Ib5cb68d16ce66fd2ae072168de4c16895f32b57f


[ROCm/rocprofiler commit: ccc6005c25]
2020-10-27 14:45:51 -05:00
Evgeny fc99b9a657 enable contexts wait
Change-Id: Ie2adf04662fddc8051fb5418904c9c659e264d78


[ROCm/rocprofiler commit: 0d164ba672]
2020-09-21 21:06:03 -04:00
Evgeny 2d42e93cdf kernel objects dumping
Change-Id: I5a16e05b7df438efa903948701b65a9ced99e5f3

initial codeobj event implementation

Change-Id: Ia7fac3c2b9897a004cfe88c4de82ba8c18284196

update - codeobj event implementation

Change-Id: I2b91b6e689875af03f0086f5a0872a97a629fd83

update2 - codeobj event implementation

Change-Id: Icff75f14fd21963e40db95373fa74880957a9e32

fix - codeobj event implementation

Change-Id: I76c33c875cb429fb12a974bb408b217f187b4536

URI buffer fix - codeobj event implementation

Change-Id: I7ce1a758e021455da3fe5b8a6e4ae3ab46e9760e

HSA events exposing

Change-Id: I3664ab4e5111c4ccedaf068dcb19f48055f0ef9b

HSA events data struct normalizing

Change-Id: I365ef0db45e0a9314bd2a1a4d29dd4eb4e91297d


[ROCm/rocprofiler commit: 8850e46071]
2020-09-11 10:01:54 -05:00
Xianwei Zhang 9c62afb8ba concurrent: enable/fix the related settings
Concurrent profiling relies on the aqlprofile read_api
and tracker. This patch set those options to enable
the concurrent profiling.

Change-Id: Ib97d4d8facfbc11f2684d83109397cd13f117d5e


[ROCm/rocprofiler commit: b445610cd1]
2020-08-26 16:04:57 -04:00
Evgeny a331990ee4 optimization mechanism fix: correct tracker handler; kernel name query on completion;
Change-Id: I14da152b4ac3c7d8fd1af2f54e9d71f834071622


[ROCm/rocprofiler commit: 80747de208]
2020-08-03 23:34:49 -05:00
Evgeny 832ab03e46 kernel name filtering fix - handling [] brakets
Change-Id: I46a62d991a52045694640837393df229cf7a3133


[ROCm/rocprofiler commit: 7364edcc5b]
2020-07-29 18:47:31 -05:00
Xianwei Zhang f95b383dfa pmc: add support of concurrent kernel profiling
The profiling was only enabled in serial mode, i.e., kernels
are serialized in execution, and counters are reset at each
kernel start and read at kernel completion. This patch adds
the concurrent mode, by issuing the process-level start
packet to reset counters, and then reading twice at kernel
start and end time to obtain the counter value difference.
The new concurrent profiling usage needs the integration
with the corresponding augment at aqlprofile side.

Change-Id: I94b4442eadc8c64b8fba51b1e4916fc8b895ad21


[ROCm/rocprofiler commit: 61c9df4631]
2020-07-16 14:39:46 -05:00
Evgeny 3d44ea1a05 counters dumping optimization
Change-Id: I8c694e5380e15179453148dd9ab3a3e51b6db861


[ROCm/rocprofiler commit: 2a7f77b290]
2020-07-15 09:57:41 -05:00
Evgeny d22c364abe cleanup
Change-Id: I518f741d3d3d0bb514936c4db8b727bdc79d9fdc


[ROCm/rocprofiler commit: f214238880]
2020-06-24 12:22:22 -05:00
Evgeny c75faaaa37 concurrent sqtt support
Change-Id: I91391fafabc93aefa5d244d870ef82b96a59dc52


[ROCm/rocprofiler commit: 9f7e936d70]
2020-06-23 20:00:49 -04:00
Evgeny 7d2307424e disabling destruction
Change-Id: I2a7d05a8f597b3bc8bd07bffe7181f9dcace1cbf


[ROCm/rocprofiler commit: 9950b97567]
2020-05-07 03:34:30 -05:00
Evgeny 212aefac7d adding pid for kernel results to support multi-process profiling
Change-Id: I283228a4b4145599c5e637dd6faa771b9f4b6345


[ROCm/rocprofiler commit: 3af87a7423]
2020-05-05 05:35:32 -05:00
Evgeny e3514895c6 toll destruction fix
Change-Id: If069c820526e21a0a4b80ac516f9669a81f34cab


[ROCm/rocprofiler commit: fe70682184]
2020-04-28 03:16:15 -05:00
Evgeny 943b92d4c7 fixing hsa intercept test
Change-Id: I2671dfc6a9bd3e01a0c926aa3ea367b8c7a0279e


[ROCm/rocprofiler commit: 7be9a42ab3]
2020-03-28 17:24:16 -05:00
Evgeny 67e1502ce5 PC sampling bringup
Change-Id: I0d041c4c8c3778f2c328cde38432bc72223706a3

pc sampling integration fix

Change-Id: Ia66ff876d2d99ec4d561daf8320b65d75f5cd2fe


[ROCm/rocprofiler commit: 9df9fddcfb]
2020-03-28 13:07:45 -05:00
Evgeny 99d4641ee7 dynamic kernel lds size
Change-Id: I051b85550d12ca8662127615be7adfb12c5f9585


[ROCm/rocprofiler commit: 7ecf9ef1e3]
2020-02-27 17:44:22 -06:00
Evgeny d657afa812 adding AgentInfo::lds_block_size
Change-Id: I186893add96dc92570e710ae78b475897ebfe531


[ROCm/rocprofiler commit: 40730e34e4]
2020-02-18 14:00:19 -06:00
Evgeny a40ba7a793 hsa-interept mode enabling in head
Change-Id: I048a975deaae691868034ee361fea71ffb5bdd83


[ROCm/rocprofiler commit: d2324e780f]
2020-02-14 00:22:36 -06:00
Evgeny c1b7e4353a device id/type for submit callback
Change-Id: I69b31bd28850198c8ca44e55bf8f2ff9540d92fc


[ROCm/rocprofiler commit: 3d4c4d1cce]
2020-02-13 22:00:56 -06:00
Evgeny 988274443a adding agent to device callback data
Change-Id: I7d631e4faae685ce23704ac65308bf2aae0c157a


[ROCm/rocprofiler commit: 4edc6c98cf]
2020-02-13 22:00:56 -06:00
Evgeny 9d2081ceb4 kernel free on executable_destroy
Change-Id: I062e08fd5df94f4d21b443b74494dc6e95201f3d


[ROCm/rocprofiler commit: 304568c1a3]
2020-02-13 22:00:56 -06:00
Evgeny 59bc8da846 adding code object freeze handling, kernel code allocate/device callbacks
Change-Id: I6ec32a0c4e945260cd11f88c1d2a0a7f22d6e10c

documenting comments

Change-Id: I1b7af70d51042c48aef0474d0b6b54275b42c456


[ROCm/rocprofiler commit: 086c29fff1]
2020-02-13 22:00:56 -06:00
Evgeny 3b84973737 adding allocate/device/memcopy/submit callbacks
Change-Id: Ie83ae3cd57cdf5038e83db70891b103439c37d55


[ROCm/rocprofiler commit: f95ed067ba]
2020-02-13 22:00:08 -06:00
Evgeny 67b1e9af0d trace parameters fix
Change-Id: I5cd8c897b9982f9abc9e36c18a4a488bd65fbd30


[ROCm/rocprofiler commit: 389cb50854]
2019-12-10 04:33:29 -06:00
Evgeny 6dd6ea0046 sgpr/vgpr reported sizes in terms of registers; lds size in bytes;
Change-Id: Id527687cc075a4e85ea4d0063ed8f336e34d2454


[ROCm/rocprofiler commit: c8eb0641bf]
2019-11-27 15:19:14 -06:00
Evgeny 9b183d909e code object tracking and v3 code object support
Change-Id: I081ada13f6364ea4401a97a485dedfa9bf8a45fe


[ROCm/rocprofiler commit: 60043d198b]
2019-11-19 20:18:09 -06:00
Evgeny c34655f1ff setting of granulated lds, sgpr, vgpr reserved by HW for the kernel
Change-Id: Ib3e60c0b97787f56f769136480a46a8560e00cc8


[ROCm/rocprofiler commit: dc05345a78]
2019-11-11 18:11:16 -05:00
Xianwei Zhang 3dce66bd80 build: slightly change files to pass compilation
Compiled using Clang and encountered errors on ambiguous pointer
type, implicit conversion NULL to bool, and unused function, etc.
As a fix, this patch slightly updates the related codes.

Change-Id: I4a73d66293313dda36749036f5c73aaf2a445231


[ROCm/rocprofiler commit: ee917344ff]
2019-08-22 23:29:16 -04:00
Evgeny 2c091d01d0 adding SPM test
Change-Id: I55dde52e668206e238ea860dd11e0ed5501a582a


[ROCm/rocprofiler commit: 988283533f]
2019-06-04 15:34:21 -05:00
Evgeny 1fecf4774c adding SPM trace support
Change-Id: I07c0d3ec5c4ec51ea00fea539c930c6ed27b1163


[ROCm/rocprofiler commit: 6d6eb2b2d0]
2019-06-04 05:07:19 -05:00
Evgeny 65ea957d1f extending xontext pool API
Change-Id: I56809354a9ea991489479117160f3d4f451f82b3


[ROCm/rocprofiler commit: 8c15ff4bf8]
2019-03-29 11:12:40 -05:00
Evgeny 1ecd03ea5e improving rocprof tracing usage
Change-Id: I4db513c93406a7038b8b681d12a0dc4298975f01


[ROCm/rocprofiler commit: f834529669]
2019-03-12 18:19:17 -05:00
Evgeny 33e8160a54 fix of reported completion signal in th ecase when it is proxied by the tracker
Change-Id: Ic1100fde03bc34e0130405285d32d6b97a031967


[ROCm/rocprofiler commit: 996b64309a]
2019-02-20 19:21:35 -06:00
Evgeny 55664e9ae4 trace input parsing refactoring
Change-Id: Ib5828561249ab7c15c5ed58b932f03839b207295


[ROCm/rocprofiler commit: 20a1db9d6d]
2019-01-18 20:59:29 -06:00
Evgeny a33b1cc58d ctx-wait option
Change-Id: I8a7cf97458f27169774e3a38ee5007b2ca6bf450


[ROCm/rocprofiler commit: 4c0c64493c]
2019-01-17 02:25:08 -06:00
Evgeny 28b8257ef8 fix
Change-Id: I2c13b9610037baf9e5b0db675bac19426fe8239e


[ROCm/rocprofiler commit: 7dd087ebf0]
2019-01-16 19:08:00 -06:00
Evgeny 63150d03e1 hsa memcopy tracking
Change-Id: I61bc4ea01a62b587d5941f9d0e9e358a6ed71b5a


[ROCm/rocprofiler commit: 2b2a279ce9]
2019-01-07 01:53:27 -06:00