Ammar ELWazir
ef96663e82
SWDEV-374256: GPU Kernel Dispatch Trace Period Support
...
Change-Id: Idaabe82a30013e3aba4bcb65bd0a89ce2d14ad97
[ROCm/rocprofiler commit: 472624e3bd ]
2023-06-21 12:46:33 -04:00
Giovanni LB
a7e8182a21
SWDEV-405575: Added gfx941 and gfx942
...
Change-Id: I45a49cd64a76d3ae32c209497c70fe27b5be212b
[ROCm/rocprofiler commit: e1285e3fd4 ]
2023-06-19 11:11:37 -04:00
Ammar ELWazir
7647f6988f
V1/V2 API Library Separation
...
V1 library will be supported as librocprofiler64.so and V2 will be supported as librocprofiler64v2.so and headers will be rocprofiler.h for V1 and v2/rocprofiler.h for v2
Change-Id: Ibe5bdbf2f79f0175342c648e917ae77918186604
[ROCm/rocprofiler commit: 9e62e066fe ]
2023-05-02 22:44:43 -04:00
gobhardw
c9a9e34844
SWDEV-374072 : rocprof gpu selector fix
...
Change-Id: I155e63a5dc1ecbacd76d80b0df76da99b645ed9f
[ROCm/rocprofiler commit: 14977e4dc1 ]
2023-03-29 15:55:06 +00:00
Ammar ELWazir
de4abd0d0f
Adding rocprofilerv2
...
Change-Id: Ic0cc280ba207d2b8f6ccae1cd4ac3184152fc1ad
[ROCm/rocprofiler commit: 8032adb64f ]
2023-03-09 13:20:33 +00:00
Laurent Morichetti
6e1ea79067
Fix vgpr count calculation for gfx90a and gfx940
...
Read accum_offset from compute_pgm_rsrc3 to report both the arch vgprs
and the accum vgprs
Change-Id: I99e746d54a6a1671e343da5658cc6ce970f79939
[ROCm/rocprofiler commit: 5fd1c7e8e3 ]
2022-08-03 14:02:36 -07:00
Ranjith Ramakrishnan
a5a941cbfc
SWDEV-345870 - Correct include paths for new directory layout
...
Use hsa header files from /opt/rocm-ver/include rather than using wrapper files from /opt/rocm-ver/hsa/include/hsa
Change-Id: Id7a9bde19447cd2a0fd6e03b11c08471f09c2a46
[ROCm/rocprofiler commit: e7eb195924 ]
2022-07-14 16:08:41 -07:00
Chun Yang
62a76c8ebb
SWDEV-296922 : Incorrect rounding due to integer division in rocprofiler metrics
...
Changed derived metrics to double from int64.
Fixed standalone test due to int64 to float change
Fixed intercept test due to int64 to float change.
Change-Id: I49631c187406ae9dd94a869b3bb13772012e8cdf
[ROCm/rocprofiler commit: f9017cbdc5 ]
2021-09-23 14:52:35 -07:00
Evgeny
c701f9705c
cleanup after separating for staging and npi branches
...
Change-Id: Iadd624df21b85f1590e901a8125680743e3281a3
[ROCm/rocprofiler commit: 780dfa37d4 ]
2021-04-08 20:37:47 +00:00
Evgeny
8c3ce30c94
SWDEV-265287 : integration spmltgen.py script
...
Change-Id: Ief3e93225fb6660e72a04e4bd4b379262b73c914
[ROCm/rocprofiler commit: 82d7bb2145 ]
2021-04-08 10:04:39 -04:00
Evgeny
0282e30855
SWDEV-274821 SPM initialization fix
...
Change-Id: I5e27928a60083eff328bab3e79937ce11bce11bd
[ROCm/rocprofiler commit: e2c9d13e5b ]
2021-03-22 09:18:36 +00:00
Evgeny
2adb15caff
SWDEV-255662 : spm kfd mode support
...
Change-Id: I840c7e92d3d5a59d8e5402c4d8ef86bc123dd07c
[ROCm/rocprofiler commit: 7e60bf163e ]
2020-12-02 13:02:45 -06:00
Evgeny
b781ea8577
fixing sqtt trace for zero size case
...
Change-Id: I75712485f518725af46a3b419339a212d1e762a0
[ROCm/rocprofiler commit: f2c9980647 ]
2020-12-01 18:19:51 -05:00
Evgeny
66490fca38
fixing c_str() as strdup
...
Change-Id: Ib5cb68d16ce66fd2ae072168de4c16895f32b57f
[ROCm/rocprofiler commit: ccc6005c25 ]
2020-10-27 14:45:51 -05:00
Evgeny
fc99b9a657
enable contexts wait
...
Change-Id: Ie2adf04662fddc8051fb5418904c9c659e264d78
[ROCm/rocprofiler commit: 0d164ba672 ]
2020-09-21 21:06:03 -04:00
Evgeny
2d42e93cdf
kernel objects dumping
...
Change-Id: I5a16e05b7df438efa903948701b65a9ced99e5f3
initial codeobj event implementation
Change-Id: Ia7fac3c2b9897a004cfe88c4de82ba8c18284196
update - codeobj event implementation
Change-Id: I2b91b6e689875af03f0086f5a0872a97a629fd83
update2 - codeobj event implementation
Change-Id: Icff75f14fd21963e40db95373fa74880957a9e32
fix - codeobj event implementation
Change-Id: I76c33c875cb429fb12a974bb408b217f187b4536
URI buffer fix - codeobj event implementation
Change-Id: I7ce1a758e021455da3fe5b8a6e4ae3ab46e9760e
HSA events exposing
Change-Id: I3664ab4e5111c4ccedaf068dcb19f48055f0ef9b
HSA events data struct normalizing
Change-Id: I365ef0db45e0a9314bd2a1a4d29dd4eb4e91297d
[ROCm/rocprofiler commit: 8850e46071 ]
2020-09-11 10:01:54 -05:00
Xianwei Zhang
9c62afb8ba
concurrent: enable/fix the related settings
...
Concurrent profiling relies on the aqlprofile read_api
and tracker. This patch set those options to enable
the concurrent profiling.
Change-Id: Ib97d4d8facfbc11f2684d83109397cd13f117d5e
[ROCm/rocprofiler commit: b445610cd1 ]
2020-08-26 16:04:57 -04:00
Evgeny
a331990ee4
optimization mechanism fix: correct tracker handler; kernel name query on completion;
...
Change-Id: I14da152b4ac3c7d8fd1af2f54e9d71f834071622
[ROCm/rocprofiler commit: 80747de208 ]
2020-08-03 23:34:49 -05:00
Evgeny
832ab03e46
kernel name filtering fix - handling [] brakets
...
Change-Id: I46a62d991a52045694640837393df229cf7a3133
[ROCm/rocprofiler commit: 7364edcc5b ]
2020-07-29 18:47:31 -05:00
Xianwei Zhang
f95b383dfa
pmc: add support of concurrent kernel profiling
...
The profiling was only enabled in serial mode, i.e., kernels
are serialized in execution, and counters are reset at each
kernel start and read at kernel completion. This patch adds
the concurrent mode, by issuing the process-level start
packet to reset counters, and then reading twice at kernel
start and end time to obtain the counter value difference.
The new concurrent profiling usage needs the integration
with the corresponding augment at aqlprofile side.
Change-Id: I94b4442eadc8c64b8fba51b1e4916fc8b895ad21
[ROCm/rocprofiler commit: 61c9df4631 ]
2020-07-16 14:39:46 -05:00
Evgeny
3d44ea1a05
counters dumping optimization
...
Change-Id: I8c694e5380e15179453148dd9ab3a3e51b6db861
[ROCm/rocprofiler commit: 2a7f77b290 ]
2020-07-15 09:57:41 -05:00
Evgeny
d22c364abe
cleanup
...
Change-Id: I518f741d3d3d0bb514936c4db8b727bdc79d9fdc
[ROCm/rocprofiler commit: f214238880 ]
2020-06-24 12:22:22 -05:00
Evgeny
c75faaaa37
concurrent sqtt support
...
Change-Id: I91391fafabc93aefa5d244d870ef82b96a59dc52
[ROCm/rocprofiler commit: 9f7e936d70 ]
2020-06-23 20:00:49 -04:00
Evgeny
7d2307424e
disabling destruction
...
Change-Id: I2a7d05a8f597b3bc8bd07bffe7181f9dcace1cbf
[ROCm/rocprofiler commit: 9950b97567 ]
2020-05-07 03:34:30 -05:00
Evgeny
212aefac7d
adding pid for kernel results to support multi-process profiling
...
Change-Id: I283228a4b4145599c5e637dd6faa771b9f4b6345
[ROCm/rocprofiler commit: 3af87a7423 ]
2020-05-05 05:35:32 -05:00
Evgeny
e3514895c6
toll destruction fix
...
Change-Id: If069c820526e21a0a4b80ac516f9669a81f34cab
[ROCm/rocprofiler commit: fe70682184 ]
2020-04-28 03:16:15 -05:00
Evgeny
943b92d4c7
fixing hsa intercept test
...
Change-Id: I2671dfc6a9bd3e01a0c926aa3ea367b8c7a0279e
[ROCm/rocprofiler commit: 7be9a42ab3 ]
2020-03-28 17:24:16 -05:00
Evgeny
67e1502ce5
PC sampling bringup
...
Change-Id: I0d041c4c8c3778f2c328cde38432bc72223706a3
pc sampling integration fix
Change-Id: Ia66ff876d2d99ec4d561daf8320b65d75f5cd2fe
[ROCm/rocprofiler commit: 9df9fddcfb ]
2020-03-28 13:07:45 -05:00
Evgeny
99d4641ee7
dynamic kernel lds size
...
Change-Id: I051b85550d12ca8662127615be7adfb12c5f9585
[ROCm/rocprofiler commit: 7ecf9ef1e3 ]
2020-02-27 17:44:22 -06:00
Evgeny
d657afa812
adding AgentInfo::lds_block_size
...
Change-Id: I186893add96dc92570e710ae78b475897ebfe531
[ROCm/rocprofiler commit: 40730e34e4 ]
2020-02-18 14:00:19 -06:00
Evgeny
a40ba7a793
hsa-interept mode enabling in head
...
Change-Id: I048a975deaae691868034ee361fea71ffb5bdd83
[ROCm/rocprofiler commit: d2324e780f ]
2020-02-14 00:22:36 -06:00
Evgeny
c1b7e4353a
device id/type for submit callback
...
Change-Id: I69b31bd28850198c8ca44e55bf8f2ff9540d92fc
[ROCm/rocprofiler commit: 3d4c4d1cce ]
2020-02-13 22:00:56 -06:00
Evgeny
988274443a
adding agent to device callback data
...
Change-Id: I7d631e4faae685ce23704ac65308bf2aae0c157a
[ROCm/rocprofiler commit: 4edc6c98cf ]
2020-02-13 22:00:56 -06:00
Evgeny
9d2081ceb4
kernel free on executable_destroy
...
Change-Id: I062e08fd5df94f4d21b443b74494dc6e95201f3d
[ROCm/rocprofiler commit: 304568c1a3 ]
2020-02-13 22:00:56 -06:00
Evgeny
59bc8da846
adding code object freeze handling, kernel code allocate/device callbacks
...
Change-Id: I6ec32a0c4e945260cd11f88c1d2a0a7f22d6e10c
documenting comments
Change-Id: I1b7af70d51042c48aef0474d0b6b54275b42c456
[ROCm/rocprofiler commit: 086c29fff1 ]
2020-02-13 22:00:56 -06:00
Evgeny
3b84973737
adding allocate/device/memcopy/submit callbacks
...
Change-Id: Ie83ae3cd57cdf5038e83db70891b103439c37d55
[ROCm/rocprofiler commit: f95ed067ba ]
2020-02-13 22:00:08 -06:00
Evgeny
67b1e9af0d
trace parameters fix
...
Change-Id: I5cd8c897b9982f9abc9e36c18a4a488bd65fbd30
[ROCm/rocprofiler commit: 389cb50854 ]
2019-12-10 04:33:29 -06:00
Evgeny
6dd6ea0046
sgpr/vgpr reported sizes in terms of registers; lds size in bytes;
...
Change-Id: Id527687cc075a4e85ea4d0063ed8f336e34d2454
[ROCm/rocprofiler commit: c8eb0641bf ]
2019-11-27 15:19:14 -06:00
Evgeny
9b183d909e
code object tracking and v3 code object support
...
Change-Id: I081ada13f6364ea4401a97a485dedfa9bf8a45fe
[ROCm/rocprofiler commit: 60043d198b ]
2019-11-19 20:18:09 -06:00
Evgeny
c34655f1ff
setting of granulated lds, sgpr, vgpr reserved by HW for the kernel
...
Change-Id: Ib3e60c0b97787f56f769136480a46a8560e00cc8
[ROCm/rocprofiler commit: dc05345a78 ]
2019-11-11 18:11:16 -05:00
Xianwei Zhang
3dce66bd80
build: slightly change files to pass compilation
...
Compiled using Clang and encountered errors on ambiguous pointer
type, implicit conversion NULL to bool, and unused function, etc.
As a fix, this patch slightly updates the related codes.
Change-Id: I4a73d66293313dda36749036f5c73aaf2a445231
[ROCm/rocprofiler commit: ee917344ff ]
2019-08-22 23:29:16 -04:00
Evgeny
2c091d01d0
adding SPM test
...
Change-Id: I55dde52e668206e238ea860dd11e0ed5501a582a
[ROCm/rocprofiler commit: 988283533f ]
2019-06-04 15:34:21 -05:00
Evgeny
1fecf4774c
adding SPM trace support
...
Change-Id: I07c0d3ec5c4ec51ea00fea539c930c6ed27b1163
[ROCm/rocprofiler commit: 6d6eb2b2d0 ]
2019-06-04 05:07:19 -05:00
Evgeny
65ea957d1f
extending xontext pool API
...
Change-Id: I56809354a9ea991489479117160f3d4f451f82b3
[ROCm/rocprofiler commit: 8c15ff4bf8 ]
2019-03-29 11:12:40 -05:00
Evgeny
1ecd03ea5e
improving rocprof tracing usage
...
Change-Id: I4db513c93406a7038b8b681d12a0dc4298975f01
[ROCm/rocprofiler commit: f834529669 ]
2019-03-12 18:19:17 -05:00
Evgeny
33e8160a54
fix of reported completion signal in th ecase when it is proxied by the tracker
...
Change-Id: Ic1100fde03bc34e0130405285d32d6b97a031967
[ROCm/rocprofiler commit: 996b64309a ]
2019-02-20 19:21:35 -06:00
Evgeny
55664e9ae4
trace input parsing refactoring
...
Change-Id: Ib5828561249ab7c15c5ed58b932f03839b207295
[ROCm/rocprofiler commit: 20a1db9d6d ]
2019-01-18 20:59:29 -06:00
Evgeny
a33b1cc58d
ctx-wait option
...
Change-Id: I8a7cf97458f27169774e3a38ee5007b2ca6bf450
[ROCm/rocprofiler commit: 4c0c64493c ]
2019-01-17 02:25:08 -06:00
Evgeny
28b8257ef8
fix
...
Change-Id: I2c13b9610037baf9e5b0db675bac19426fe8239e
[ROCm/rocprofiler commit: 7dd087ebf0 ]
2019-01-16 19:08:00 -06:00
Evgeny
63150d03e1
hsa memcopy tracking
...
Change-Id: I61bc4ea01a62b587d5941f9d0e9e358a6ed71b5a
[ROCm/rocprofiler commit: 2b2a279ce9 ]
2019-01-07 01:53:27 -06:00