Venkateshwar Reddy Kandula
997b36f5bc
[rocprofiler][navi4] Remove navi4x support on rocprofv2. ( #307 )
...
* Remove navi4x support on rocprofv2.
* remove gfx12 from build scripts.
* bug fix.
* address comments.
* update changelog
* Update CHANGELOG.md
* Update CHANGELOG.md
* Update CHANGELOG.md
* address comments
Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com >
---------
Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com >
Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com >
2025-09-22 03:17:29 -05:00
Giovanni Baraldi
d63f98bd00
SWDEV-508485: Adding MFMA F8 metric
...
Change-Id: I947d2645e8dc1544d198a4e0a02500feee10d89d
[ROCm/rocprofiler commit: f19bed4c8c ]
2025-01-09 12:18:02 -06:00
Giovanni Baraldi
4ab71520ac
SWDEV-490031: Adding ops 16,32,64 metrics for rdc
...
Change-Id: Ia694a79425aeda15ecbb3fef993d880220651ef7
[ROCm/rocprofiler commit: edc9228dab ]
2024-12-31 04:17:12 -06:00
Giovanni Baraldi
2c7baf6c89
SWDEV-490031: Fixing activity metrics
...
Change-Id: Id4e74b0f3ff35d892de05e044faf399d98199354
[ROCm/rocprofiler commit: 59e9e3a0c3 ]
2024-12-17 10:32:01 -06:00
Giovanni baraldi
2baa5bd4ab
SWDEV-495749: Adding SIMD_UTILIZATION metric
...
Change-Id: I38afd5db02de7a416d11274823bc4a0c326f1fbe
[ROCm/rocprofiler commit: bcc709c174 ]
2024-12-16 10:22:57 -06:00
Giovanni Baraldi
dc25330772
Revert "SWDEV-310289: Adding SPI pipe selection"
...
This reverts commit 896404efb3 .
Reason for revert: Requires priv_cp_queues=1
Change-Id: Ia6c78ac25b88d7ef4703654075d54e672a6e320c
[ROCm/rocprofiler commit: a6328a1481 ]
2024-10-25 02:57:52 -04:00
Manjunath-Jakaraddi
17fac40bae
SWDEV-481162: Updating MfmaUtil metric RDC
...
Change-Id: I60efa183edc14b6f870f7b6a82f223ea2c9789e5
[ROCm/rocprofiler commit: f84ecfe99b ]
2024-10-14 16:48:56 -05:00
Giovanni LB
896404efb3
SWDEV-310289: Adding SPI pipe selection
...
Change-Id: I4856d284df3dccaa100a2341211ae09e11c63ecd
[ROCm/rocprofiler commit: e5e2c6041d ]
2024-10-12 01:14:49 -04:00
Benjamin Welton
4dd298f312
Added FP64_ACTIVE and ENGINE_ACTIVE
...
Should replicate DCGM_FI_PROF_EVAL_FLOPS_64 and
DCGM_FI_PROF_GR_ENGINE_ACTIVE respectively. See
https://ontrack-internal.amd.com/browse/SWDEV-490046
and
https://ontrack-internal.amd.com/browse/SWDEV-490031
Change-Id: Ia79f6a1601beac48a350493f2e83ce322c1d8d33
[ROCm/rocprofiler commit: 6d80088c84 ]
2024-10-11 15:51:09 -07:00
Giovanni LB
8b22cf86a7
SWDEV-487621: Fixing BW measurement in MI300
...
Change-Id: Ib513009616214a1f3f3568571e58d79259692cfc
[ROCm/rocprofiler commit: bddd5b51dd ]
2024-10-07 16:29:09 -03:00
Giovanni LB
d3e5a88536
SWDEV-479522: trace-start off to also disable kernel tracing
...
Change-Id: I027be24f93a201b82752327830820a24540b24d9
[ROCm/rocprofiler commit: 2a3c24565a ]
2024-08-20 23:29:50 -04:00
Lang Yu
2230af4b1d
SWDEV-467545 - Add rocprofiler support for gfx1150/gfx1151
...
Change-Id: I2cddc36981f6d815c865d180a1daf1b8a7e0633f
Signed-off-by: Lang Yu <lang.yu@amd.com >
[ROCm/rocprofiler commit: 7313e52f35 ]
2024-07-09 22:40:10 -04:00
jatang
c8d58d1986
SWDEV-458392 - Add gfx12 support.
...
Change-Id: I91bb6a3329bf77f26005a345c18b63b86922028a
[ROCm/rocprofiler commit: e7b96b1e71 ]
2024-06-17 13:24:48 -04:00
Saurabh Verma
437d39de9e
RDC metrics in v1
...
Change-Id: Iaa8cd0a37da37729df76362f10a0bb63c317a498
[ROCm/rocprofiler commit: a63b6fcbd2 ]
2024-06-11 17:00:39 -05:00
Saurabh Verma
25eda9f856
Fixing occupancy metrics for MI300
...
Adding changes for v1 xml which was missed in change 6cf9df4ff0
Change-Id: I338f2736ee61e316522f1ce42cee74abec201499
[ROCm/rocprofiler commit: 2047bf4b8b ]
2024-06-11 11:47:57 -05:00
Mythreya
6a4321c4e3
Add MI200/MI300 counters
...
Revision - Addition [Impact SoC: MI200, MI300]
Note: this set of counters are important help understand the
bottleneck.
1. TCC_TAG_STALL
a. Metric: TCC_TAG_STALL/TCC_CYCLE: percentage of time TCC
tag lookup pipeline is stalled
2. TCP_TCR_TCP_STALL_CYCLES
a. Metric: TCP_TCR_TCP_STALL_CYCLES/TCP_GATE_EN1: percentage
of time TCP is stalled by TCR
Revision - Addition [Impact SoC: MI300]
3. TCC_BUBBLE:
a. Definition: Number of 128-byte read requests sent to EA
b. Revised Metric #1 , TCC-EA Read BW:
ReadBW = 128 * TCC_BUBBLE
+ 64 * (TCC_EA0_RDREQ - TCC_BUBBLE - TCC_EA0_RDREQ_32B)
+ 32 * TCC_EA0_RDREQ_32B
c. Revised Metric #2 : TCC_EA Read Latency
ReadLatency = TCC_EA0_RDREQ_LEVEL / (TCC_BUBBLE + TCC_EA0_RDREQ)
/* [Fineprint] More detailed arithmetic:
* ReadLatency = TCC_EA0_RDREQ_LEVEL / (#32B_req + #64B_req + #128B_req * 2)
*/
Change-Id: I0a2dfc1b64ca97023b1e8ba0f9830330b3034946
[ROCm/rocprofiler commit: 46e02a9866 ]
2023-10-30 15:38:46 -04:00
Mythreya
1471dc8a76
Remove non-functional counters for MI200 and MI300
...
Counters removed for MI300 (gfx940)
TCP_TCC_WRITE_REQ_HOLE_LATENCY
TCP_TCC_WRITE_REQ_LATENCY
TCP_TCC_READ_REQ_LATENCY
TCP_TCP_LATENCY
Counters removed for MI200 and MI300 (gfx90a and gfx940 respectively)
TA_BUFFER_COALESCABLE_WAVEFRONT
TA_FLAT_COALESCABLE_WAVEFRONT
TCC_EA0_WRREQ_IO_CREDIT_STALL
TCC_EA0_WRREQ_GMI_CREDIT_STALL
TCC_EA0_WRREQ_DRAM_CREDIT_STALL
TCC_EA0_RDREQ_IO_CREDIT_STALL
TCC_EA0_RDREQ_GMI_CREDIT_STALL
TCC_EA0_RDREQ_DRAM_CREDIT_STALL
Change-Id: Ic3d1e7bf35495f35b1239f03ca6420e949421386
[ROCm/rocprofiler commit: 1fae494b12 ]
2023-10-26 12:50:57 -04:00
gobhardw
ea5ecec246
SWDEV-427554 Fixing mainline ASAN build
...
Change-Id: I63cd047ceb75dea5f8ed6f84946e1ec209c7d812
[ROCm/rocprofiler commit: 5d390717b5 ]
2023-10-18 21:42:47 +05:30
Ammar ELWazir
5e77e5008c
SWDEV-302415: Fixing Kernel Dispatchs with trace-start off option
...
Change-Id: I225b88cb769d994f1007e7bc66f176e7fa40db05
[ROCm/rocprofiler commit: d816f133d1 ]
2023-10-16 09:39:42 -04:00
Giovanni LB
3badb4ba81
SWDEV-423659: Disabling HIP_ACTIVITY when HSA_ACTIVITY is enabled.
...
Change-Id: If64fabdcd0d8a718dd0017c2bc821a94c999e87e
[ROCm/rocprofiler commit: 7418c52cc8 ]
2023-09-26 01:13:21 -04:00
Giovanni LB
552814f227
SWDEV-419944: Added metrics for gfx1102
...
Change-Id: I5c69ff716f530d130710c0687f20e5bc990a60eb
[ROCm/rocprofiler commit: 43e259e5da ]
2023-09-11 13:59:33 -04:00
Ammar ELWazir
6eb06cf201
Pull from Github
...
Squashed commit of the following:
commit f029195705a15700380c6f832ba5d15d46fd6de7
Author: Jonathan R. Madsen <jrmadsen@users.noreply.github.com >
Date: Thu Jul 13 14:38:56 2023 -0500
Formatting workflows for source (clang-format) and cmake (cmake-format) (#4 )
* Add .cmake-format.yaml file
* Add formatting workflow
* provide base input for creating PR
* Update scheme for extracting branch name
- disable running formatting on push to amd-staging branch
* patch .cmake-format.yaml for find_package signature
- apparently cmake-format doesn't format the full signature of find_package
* run formatting (clang-format v11) (#7 )
Co-authored-by: jrmadsen <jrmadsen@users.noreply.github.com >
* run cmake formatting (cmake-format) (#6 )
Co-authored-by: jrmadsen <jrmadsen@users.noreply.github.com >
---------
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
commit bc4d135fdd8a1a9e51235f18a5d575fd2b3735e6
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Thu Jul 13 12:55:17 2023 -0500
Removing Build cache for potential issues with auto-generated header files (#5 )
Change-Id: I9e2319f4335e2f88585ffa6fac2bd88a1c952e6e
commit ce86dea6a311d44d880fa684eb78f3329295e2a4
Author: Jonathan R. Madsen <jrmadsen@users.noreply.github.com >
Date: Thu Jul 13 11:08:58 2023 -0500
Fix decltype(<hsa-function>) function pointer usage (#3 )
- the following is done in several places:
decltype(hsa_memory_allocate)* hsa_memory_allocate
- above can cause compiler errors
- replace decltype(<hsa-function>) with decltype(::<hsa-function>)
- this ensures that the type within the decltype is recognized as the global scope HSA function, not the variable
- in many places, the variable has a "_fn" suffix to prevent this issue but added '::' anyway for consistency
commit ac49fdd92a72e9c99394253a02da413a6c2e3b3a
Merge: a07946a 03a0855
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Wed Jul 12 11:36:24 2023 -0500
Merge pull request #2 from ROCm-Developer-Tools/gerrit-amd-staging
Pull from gerrit
commit 03a085588cffe863e8f466de67be1cfb205b675a
Merge: c26b32b a07946a
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Wed Jul 12 10:57:30 2023 -0500
Merge branch 'amd-staging' into gerrit-amd-staging
commit a07946a5cd4c670c83c27ad1a076a9d4567ce6d7
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 15:46:04 2023 +0000
Enabling Cached Builds
commit 525e494a7f13941077a8fd4ad6840904db4d27d4
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 04:53:54 2023 +0000
Updating missed GPU Targets
commit 42c75862f628c9bee7cfb7dc04dff2619430efbc
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 04:43:02 2023 +0000
Adding V1 Testing
commit 9d72fd4aee85e4b0c12e717060d2730fa5b73be1
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 03:34:31 2023 +0000
Fixing Artifacts directory path
commit f4000cc558b3b2e4676f7994f7ce8c8e6f94518e
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 03:27:26 2023 +0000
Fixing CMake for test build job
commit 2ce8115d4c33948c3c8f957f545a95a04e1d6cd2
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 03:16:18 2023 +0000
Fixing Ubuntu CMake for ubuntu test build
commit 6d0ed439191be900748d0c025157f9d689a73ec7
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 01:28:41 2023 +0000
Removing Navi21
commit e349a7642e5ae5eb03ab9fcd0a0f74f09f78cab5
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 01:14:14 2023 +0000
Removing Navi21
commit fefd02fe68d2a4bca7ec2e381960ad004ee9fc5b
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 00:42:48 2023 +0000
Fixing CMake Job
commit 2ea46abf7bf92643efa8c549fa70346ffbd79d65
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 00:35:13 2023 +0000
Fixing CMake Job
commit d99d681ed1999c5fcf291dc678b11a77205fb0f3
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Wed Jul 12 00:32:13 2023 +0000
Fixing Pull Latest Dockers and CMake Jobs
commit dfc4498072d13b4a1df3a63047d34c682c3d9a29
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Tue Jul 11 23:54:21 2023 +0000
Fixing CMake job
commit 919efe04de707f7c702031be15c3e2c5f8442cbb
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Tue Jul 11 23:52:13 2023 +0000
Adding Pull Last dockers job
commit be1b1256e8b0e05308e8f7e7e69bee3acca55281
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Tue Jul 11 18:25:40 2023 -0500
Update cmake.yml
commit 212299fa4355ae6ec18f9aaacbb79c51ea6c6f97
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Tue Jul 11 18:23:35 2023 -0500
Update cmake.yml
commit 7c2c1327086a61466cc6cac39f70865c051a8bc7
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Tue Jul 11 18:18:53 2023 -0500
Update cmake.yml
commit 191b5ce007e612e814c1d7a3afb4ad398f3852e1
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Tue Jul 11 16:03:22 2023 -0500
Update cmake.yml
commit 8824113d95f3e13c7ce4d0af8e0d9d8f522a6c4a
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Tue Jul 11 16:28:09 2023 +0000
Fixing Pull from Gerrit job name
Change-Id: I9e7ed9a27a13ca49d62c93bdadb30f0057e4d385
commit cc3d5e4b02ffb439e8cc2b3efa53527c376f9982
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Tue Jul 11 16:21:43 2023 +0000
Adding Staging sync job
Change-Id: I0551f43878b0678ce4b3e74e27d62357cf95ad95
commit b9be2eee71380a2e6dd34d520e92d0c4209277a0
Author: Ammar ELWazir <Ammar.ELWazir@amd.com >
Date: Tue Jul 11 15:57:11 2023 +0000
Fixing build.sh
Change-Id: Ia987b0244f0875370d5fe69907b3f5e9cea914de
commit 9eee33a95a1abd656a7ac5ca10a9f245e9825431
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 21:39:46 2023 -0500
Update cmake.yml
commit 7093b85a78497140e8b52632ca2a002bdaeacd62
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 21:33:29 2023 -0500
Update cmake.yml
commit f54697172c72a67740f9fdfa0c217b6ea6931576
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 21:01:26 2023 -0500
Update cmake.yml
commit 1b6620e16f8940386b0f4f04e69e2410d21c0e26
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 20:21:02 2023 -0500
Update cmake.yml
commit a94bec740c6b42c4b79c87bca20fa87b99bf060d
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 19:46:35 2023 -0500
Update cmake.yml
commit 85d6b29d4375a69d575c18ece8542c50f2ddfcc3
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 19:34:39 2023 -0500
Update cmake.yml
commit 8c004887cf1435f1a6214c3d2455299a8a27bd4c
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 19:31:17 2023 -0500
Update cmake.yml
commit a14a9168e17d9348a53c6e9c9a47ba1edb4c4509
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 19:25:46 2023 -0500
Update cmake.yml
commit 000f2f40b84e6a2f7d4becdbf5aed01436ca4c83
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 19:08:18 2023 -0500
Update cmake.yml
commit a28a53d56731cad848fa9133d1c4dbaa8fc7afa7
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 19:03:39 2023 -0500
Update cmake.yml
commit a6a2db01027f0b01fdfbb5997ddb772c7f51b649
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 18:21:53 2023 -0500
Update cmake.yml
commit 118ef2a88b2d44e3207c31c343da3e5e5ec6f176
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 17:55:57 2023 -0500
Update cmake.yml
commit 03c4c232396440cd0be6d2dd7baf4ceea1c2589d
Author: Ammar ELWazir <aelwazir@amd.com >
Date: Mon Jul 10 17:48:49 2023 -0500
Create cmake.yml
Change-Id: I77992f15694e77cbae49c56f9ff02f4f9079235d
[ROCm/rocprofiler commit: d4a33cf33a ]
2023-07-13 20:54:30 -04:00
Saurabh Verma
7ea6d5692d
SWDEV-400688: Correction for block instance count referenced in xml for MI300 metrics
...
Change-Id: I8b84f5d018d64104ed3d1bedeff272fd5e7437ca
[ROCm/rocprofiler commit: b7d045c672 ]
2023-06-21 16:26:59 -04:00
Ammar ELWazir
ef96663e82
SWDEV-374256: GPU Kernel Dispatch Trace Period Support
...
Change-Id: Idaabe82a30013e3aba4bcb65bd0a89ce2d14ad97
[ROCm/rocprofiler commit: 472624e3bd ]
2023-06-21 12:46:33 -04:00
Giovanni LB
dda0379742
SWDEV-298742: Added occupancy metrics
...
Change-Id: I67e375ad06535bbb8cc864b78840ce3962bcc58e
[ROCm/rocprofiler commit: a1508035dc ]
2023-06-19 12:10:22 -04:00
Giovanni LB
a7e8182a21
SWDEV-405575: Added gfx941 and gfx942
...
Change-Id: I45a49cd64a76d3ae32c209497c70fe27b5be212b
[ROCm/rocprofiler commit: e1285e3fd4 ]
2023-06-19 11:11:37 -04:00
Saurabh Verma
dcd5f1a397
MI300 counters support for rocprof and rocprofv2(Accumulation from all xccs)
...
1. Xml files updated for gfx940 counters
2. File plugin changes to allow rocprofv2 backward compatibility for results.csv
3. Changes in rocprofv2 script to use tblextr.py, to generate results.csv just like rocprof
Change-Id: I7798f4411ce01f6fbfffb126de654ed806ca7045
(cherry picked from commit 86cbaf38c436be876f0426fa27803b1e64d90378)
[ROCm/rocprofiler commit: 8f82ff6a46 ]
2023-05-30 21:41:54 -05:00
Kiumars Sabeti
a260b63b96
SWDEV-380635: adding gfx11 architecture to rocprofiler which includes navi31 and navi32 for now
...
Change-Id: Ib2a93a34688471c82b5db0dc10e8da58452dba21
[ROCm/rocprofiler commit: 997c771723 ]
2023-05-05 15:39:18 -04:00
Ammar ELWazir
7647f6988f
V1/V2 API Library Separation
...
V1 library will be supported as librocprofiler64.so and V2 will be supported as librocprofiler64v2.so and headers will be rocprofiler.h for V1 and v2/rocprofiler.h for v2
Change-Id: Ibe5bdbf2f79f0175342c648e917ae77918186604
[ROCm/rocprofiler commit: 9e62e066fe ]
2023-05-02 22:44:43 -04:00
gobhardw
c9a9e34844
SWDEV-374072 : rocprof gpu selector fix
...
Change-Id: I155e63a5dc1ecbacd76d80b0df76da99b645ed9f
[ROCm/rocprofiler commit: 14977e4dc1 ]
2023-03-29 15:55:06 +00:00
Kiumars Sabeti
0b6e0186d3
SWDEV-387039: Modified gfx90a section to inherit from gfx9 base and removed derived counters that are defined in the gfx9 base from gfx90a section to avoid duplication
...
Change-Id: I653e116bc47fe11b57e663c2827d177149b00c5b
[ROCm/rocprofiler commit: a9f1237c53 ]
2023-03-29 15:55:06 +00:00
Ammar ELWazir
de4abd0d0f
Adding rocprofilerv2
...
Change-Id: Ic0cc280ba207d2b8f6ccae1cd4ac3184152fc1ad
[ROCm/rocprofiler commit: 8032adb64f ]
2023-03-09 13:20:33 +00:00
Saurabh Verma
6dc0459613
Adding missing MI200 metrics
...
Change-Id: I410f50e03d38bb03cf43e743318eb1242e7d6518
[ROCm/rocprofiler commit: 225bddf148 ]
2023-01-11 18:00:46 +00:00
Kiumars Sabeti
f893fecf87
SWDEV-369023: Added two new counters SQ_INSTS_TEX_LOAD and SQ_INSTS_TEX_STORE for gfx10.These two new counters are replacement for SQ_INSTS_VMEM_RD and SQ_INSTS_VMEM_WR which are not supported in gfx10 architecture
...
Change-Id: I4c4101eea27f9073492ae42c70a30a002f4d8834
[ROCm/rocprofiler commit: a9a82ee107 ]
2022-12-09 20:41:45 -05:00
Kiumars Sabeti
d5974aba78
SWDEV-302380: [ROCm QA][Mainline][Navi21] 6 tests are failing in rocprofiler-stg2
...
This is an attempt to support basic and derived counters for navi21. This code will not work correctly unless we add navi counters to metrics.xml and gfx_metrics.xml
Change-Id: Ied06a81345a6fbb02fa0fde1889d94bbe64e9a03
[ROCm/rocprofiler commit: b53fd84ade ]
2022-08-05 17:31:37 -04:00
Laurent Morichetti
6e1ea79067
Fix vgpr count calculation for gfx90a and gfx940
...
Read accum_offset from compute_pgm_rsrc3 to report both the arch vgprs
and the accum vgprs
Change-Id: I99e746d54a6a1671e343da5658cc6ce970f79939
[ROCm/rocprofiler commit: 5fd1c7e8e3 ]
2022-08-03 14:02:36 -07:00
Saurabh Verma
eaebfe0954
SWDEV-297195: Corrected units for some counters. Units changed to quad-cycles units where required.
...
Change-Id: Ia6b0387ac6ec4210bb9482d85ae5635fc7c3c9d0
[ROCm/rocprofiler commit: 18dedbaee8 ]
2022-07-21 17:22:17 -05:00
Ranjith Ramakrishnan
a5a941cbfc
SWDEV-345870 - Correct include paths for new directory layout
...
Use hsa header files from /opt/rocm-ver/include rather than using wrapper files from /opt/rocm-ver/hsa/include/hsa
Change-Id: Id7a9bde19447cd2a0fd6e03b11c08471f09c2a46
[ROCm/rocprofiler commit: e7eb195924 ]
2022-07-14 16:08:41 -07:00
Saurabh Verma
e27d5da8c0
SWDEV-298750:Approval to make internal profile counters public
...
Added approved HW counters for MI200. Also added derived metrics for the same
Change-Id: I1c6abfdfde4e4fd4ba8bd5eec0557ad08fd71c77
[ROCm/rocprofiler commit: 6d233c65d7 ]
2022-05-17 16:44:16 -05:00
Chun Yang
b9d8cc066a
SWDEV-324379 : Expose FP64 and FP32 performance counters on on AMD profilers for MI200
...
Change-Id: I2c38ccc297872dfc1896314ceadbed98dc761766
[ROCm/rocprofiler commit: 26c479c72a ]
2022-03-17 14:06:24 -07:00
Chun Yang
62a76c8ebb
SWDEV-296922 : Incorrect rounding due to integer division in rocprofiler metrics
...
Changed derived metrics to double from int64.
Fixed standalone test due to int64 to float change
Fixed intercept test due to int64 to float change.
Change-Id: I49631c187406ae9dd94a869b3bb13772012e8cdf
[ROCm/rocprofiler commit: f9017cbdc5 ]
2021-09-23 14:52:35 -07:00
AMD
9e422660cd
Add support for gfx90a
...
Merge gfx90a support from the 'amd-npi' branch.
Change-Id: I9b51711ed4a1d2f1ed42ba9b83cb12136be228b8
[ROCm/rocprofiler commit: 4df3e0bd9a ]
2021-06-16 16:35:42 -07:00
Evgeny
c701f9705c
cleanup after separating for staging and npi branches
...
Change-Id: Iadd624df21b85f1590e901a8125680743e3281a3
[ROCm/rocprofiler commit: 780dfa37d4 ]
2021-04-08 20:37:47 +00:00
Evgeny
8c3ce30c94
SWDEV-265287 : integration spmltgen.py script
...
Change-Id: Ief3e93225fb6660e72a04e4bd4b379262b73c914
[ROCm/rocprofiler commit: 82d7bb2145 ]
2021-04-08 10:04:39 -04:00
Evgeny
0282e30855
SWDEV-274821 SPM initialization fix
...
Change-Id: I5e27928a60083eff328bab3e79937ce11bce11bd
[ROCm/rocprofiler commit: e2c9d13e5b ]
2021-03-22 09:18:36 +00:00
Evgeny
2adb15caff
SWDEV-255662 : spm kfd mode support
...
Change-Id: I840c7e92d3d5a59d8e5402c4d8ef86bc123dd07c
[ROCm/rocprofiler commit: 7e60bf163e ]
2020-12-02 13:02:45 -06:00
Evgeny
b781ea8577
fixing sqtt trace for zero size case
...
Change-Id: I75712485f518725af46a3b419339a212d1e762a0
[ROCm/rocprofiler commit: f2c9980647 ]
2020-12-01 18:19:51 -05:00
Evgeny
66490fca38
fixing c_str() as strdup
...
Change-Id: Ib5cb68d16ce66fd2ae072168de4c16895f32b57f
[ROCm/rocprofiler commit: ccc6005c25 ]
2020-10-27 14:45:51 -05:00
Evgeny
fc99b9a657
enable contexts wait
...
Change-Id: Ie2adf04662fddc8051fb5418904c9c659e264d78
[ROCm/rocprofiler commit: 0d164ba672 ]
2020-09-21 21:06:03 -04:00
Evgeny
2d42e93cdf
kernel objects dumping
...
Change-Id: I5a16e05b7df438efa903948701b65a9ced99e5f3
initial codeobj event implementation
Change-Id: Ia7fac3c2b9897a004cfe88c4de82ba8c18284196
update - codeobj event implementation
Change-Id: I2b91b6e689875af03f0086f5a0872a97a629fd83
update2 - codeobj event implementation
Change-Id: Icff75f14fd21963e40db95373fa74880957a9e32
fix - codeobj event implementation
Change-Id: I76c33c875cb429fb12a974bb408b217f187b4536
URI buffer fix - codeobj event implementation
Change-Id: I7ce1a758e021455da3fe5b8a6e4ae3ab46e9760e
HSA events exposing
Change-Id: I3664ab4e5111c4ccedaf068dcb19f48055f0ef9b
HSA events data struct normalizing
Change-Id: I365ef0db45e0a9314bd2a1a4d29dd4eb4e91297d
[ROCm/rocprofiler commit: 8850e46071 ]
2020-09-11 10:01:54 -05:00