38 Commits

Author SHA1 Message Date
Venkateshwar Reddy Kandula 997b36f5bc [rocprofiler][navi4] Remove navi4x support on rocprofv2. (#307)
* Remove navi4x support on rocprofv2.

* remove gfx12 from build scripts.

* bug fix.

* address comments.

* update changelog

* Update CHANGELOG.md

* Update CHANGELOG.md

* Update CHANGELOG.md

* address comments

Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>

---------

Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
Co-authored-by: Swati Rawat <120587655+SwRaw@users.noreply.github.com>
2025-09-22 03:17:29 -05:00
Giovanni Baraldi 4ab71520ac SWDEV-490031: Adding ops 16,32,64 metrics for rdc
Change-Id: Ia694a79425aeda15ecbb3fef993d880220651ef7


[ROCm/rocprofiler commit: edc9228dab]
2024-12-31 04:17:12 -06:00
Giovanni Baraldi 2c7baf6c89 SWDEV-490031: Fixing activity metrics
Change-Id: Id4e74b0f3ff35d892de05e044faf399d98199354


[ROCm/rocprofiler commit: 59e9e3a0c3]
2024-12-17 10:32:01 -06:00
Giovanni baraldi 2baa5bd4ab SWDEV-495749: Adding SIMD_UTILIZATION metric
Change-Id: I38afd5db02de7a416d11274823bc4a0c326f1fbe


[ROCm/rocprofiler commit: bcc709c174]
2024-12-16 10:22:57 -06:00
Manjunath-Jakaraddi 17fac40bae SWDEV-481162: Updating MfmaUtil metric RDC
Change-Id: I60efa183edc14b6f870f7b6a82f223ea2c9789e5


[ROCm/rocprofiler commit: f84ecfe99b]
2024-10-14 16:48:56 -05:00
Benjamin Welton 4dd298f312 Added FP64_ACTIVE and ENGINE_ACTIVE
Should replicate DCGM_FI_PROF_EVAL_FLOPS_64 and
DCGM_FI_PROF_GR_ENGINE_ACTIVE respectively. See
https://ontrack-internal.amd.com/browse/SWDEV-490046
and
https://ontrack-internal.amd.com/browse/SWDEV-490031

Change-Id: Ia79f6a1601beac48a350493f2e83ce322c1d8d33


[ROCm/rocprofiler commit: 6d80088c84]
2024-10-11 15:51:09 -07:00
Giovanni LB 8b22cf86a7 SWDEV-487621: Fixing BW measurement in MI300
Change-Id: Ib513009616214a1f3f3568571e58d79259692cfc


[ROCm/rocprofiler commit: bddd5b51dd]
2024-10-07 16:29:09 -03:00
Lang Yu 2230af4b1d SWDEV-467545 - Add rocprofiler support for gfx1150/gfx1151
Change-Id: I2cddc36981f6d815c865d180a1daf1b8a7e0633f
Signed-off-by: Lang Yu <lang.yu@amd.com>


[ROCm/rocprofiler commit: 7313e52f35]
2024-07-09 22:40:10 -04:00
jatang c8d58d1986 SWDEV-458392 - Add gfx12 support.
Change-Id: I91bb6a3329bf77f26005a345c18b63b86922028a


[ROCm/rocprofiler commit: e7b96b1e71]
2024-06-17 13:24:48 -04:00
Saurabh Verma 437d39de9e RDC metrics in v1
Change-Id: Iaa8cd0a37da37729df76362f10a0bb63c317a498


[ROCm/rocprofiler commit: a63b6fcbd2]
2024-06-11 17:00:39 -05:00
Saurabh Verma 25eda9f856 Fixing occupancy metrics for MI300
Adding the changes for the v1 xml that were missed in change 6cf9df4ff0

Change-Id: I338f2736ee61e316522f1ce42cee74abec201499


[ROCm/rocprofiler commit: 2047bf4b8b]
2024-06-11 11:47:57 -05:00
Mythreya 6a4321c4e3 Add MI200/MI300 counters
Revision - Addition [Impact SoC: MI200, MI300]
    Note: this set of counters is important for understanding
          bottlenecks.
    1.	TCC_TAG_STALL
        a.	Metric: TCC_TAG_STALL/TCC_CYCLE: percentage of time the TCC
            tag lookup pipeline is stalled
    2.	TCP_TCR_TCP_STALL_CYCLES
        a.	Metric: TCP_TCR_TCP_STALL_CYCLES/TCP_GATE_EN1: percentage
            of time the TCP is stalled by the TCR

Revision - Addition [Impact SoC: MI300]
    3.	TCC_BUBBLE:
        a.	Definition: Number of 128-byte read requests sent to EA
        b.	Revised Metric #1, TCC-EA Read BW:
        ReadBW =   128 * TCC_BUBBLE
                + 64 * (TCC_EA0_RDREQ - TCC_BUBBLE - TCC_EA0_RDREQ_32B)
                + 32 * TCC_EA0_RDREQ_32B
        c.	Revised Metric #2: TCC_EA Read Latency
        ReadLatency = TCC_EA0_RDREQ_LEVEL / (TCC_BUBBLE + TCC_EA0_RDREQ)

    /* [Fineprint] More detailed arithmetic:
    * ReadLatency = TCC_EA0_RDREQ_LEVEL / (#32B_req + #64B_req + #128B_req * 2)
    */
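
    The arithmetic above can be sketched in Python as follows. This is only
    an illustration under stated assumptions: the function names are
    hypothetical, the counter values are passed in as plain numbers (nothing
    here reads counters through rocprofiler), and the sample values in the
    usage lines are made up. Note that the ReadBW expression as written
    yields a byte total; turning it into a rate needs a time base that this
    message does not specify.

```python
def stall_fraction(stall_count, total_count):
    """Ratio form of the stall metrics, e.g. TCC_TAG_STALL / TCC_CYCLE.

    Returns a fraction in [0, 1]; scaling by 100 for a percentage is left
    to the caller. Returns 0.0 when the denominator is zero.
    """
    return stall_count / total_count if total_count else 0.0


def tcc_ea_read_bytes(tcc_bubble, tcc_ea0_rdreq, tcc_ea0_rdreq_32b):
    """Byte total from the revised TCC-EA Read BW expression (MI300)."""
    req_128b = tcc_bubble                       # 128-byte read requests
    req_32b = tcc_ea0_rdreq_32b                 # 32-byte read requests
    req_64b = tcc_ea0_rdreq - tcc_bubble - tcc_ea0_rdreq_32b  # remainder
    return 128 * req_128b + 64 * req_64b + 32 * req_32b


def tcc_ea_read_latency(tcc_ea0_rdreq_level, tcc_bubble, tcc_ea0_rdreq):
    """TCC_EA0_RDREQ_LEVEL / (TCC_BUBBLE + TCC_EA0_RDREQ)."""
    denom = tcc_bubble + tcc_ea0_rdreq
    return tcc_ea0_rdreq_level / denom if denom else 0.0


# Made-up sample counter values, for illustration only.
bubble, rdreq, rdreq_32b, level = 10, 100, 20, 5500
read_bytes = tcc_ea_read_bytes(bubble, rdreq, rdreq_32b)
latency = tcc_ea_read_latency(level, bubble, rdreq)

# The fine-print denominator (#32B + #64B + 2 * #128B) matches the short form.
req_64b = rdreq - bubble - rdreq_32b
assert rdreq_32b + req_64b + 2 * bubble == bubble + rdreq
```

    Guarding the divisions against a zero denominator is a choice made for
    this sketch, not something the commit message prescribes.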

Change-Id: I0a2dfc1b64ca97023b1e8ba0f9830330b3034946


[ROCm/rocprofiler commit: 46e02a9866]
2023-10-30 15:38:46 -04:00
Mythreya 1471dc8a76 Remove non-functional counters for MI200 and MI300
Counters removed for MI300 (gfx940)
    TCP_TCC_WRITE_REQ_HOLE_LATENCY
    TCP_TCC_WRITE_REQ_LATENCY
    TCP_TCC_READ_REQ_LATENCY
    TCP_TCP_LATENCY

Counters removed for MI200 and MI300 (gfx90a and gfx940 respectively)
    TA_BUFFER_COALESCABLE_WAVEFRONT
    TA_FLAT_COALESCABLE_WAVEFRONT
    TCC_EA0_WRREQ_IO_CREDIT_STALL
    TCC_EA0_WRREQ_GMI_CREDIT_STALL
    TCC_EA0_WRREQ_DRAM_CREDIT_STALL
    TCC_EA0_RDREQ_IO_CREDIT_STALL
    TCC_EA0_RDREQ_GMI_CREDIT_STALL
    TCC_EA0_RDREQ_DRAM_CREDIT_STALL

Change-Id: Ic3d1e7bf35495f35b1239f03ca6420e949421386


[ROCm/rocprofiler commit: 1fae494b12]
2023-10-26 12:50:57 -04:00
Giovanni LB 552814f227 SWDEV-419944: Added metrics for gfx1102
Change-Id: I5c69ff716f530d130710c0687f20e5bc990a60eb


[ROCm/rocprofiler commit: 43e259e5da]
2023-09-11 13:59:33 -04:00
Saurabh Verma 7ea6d5692d SWDEV-400688: Correction for block instance count referenced in xml for MI300 metrics
Change-Id: I8b84f5d018d64104ed3d1bedeff272fd5e7437ca


[ROCm/rocprofiler commit: b7d045c672]
2023-06-21 16:26:59 -04:00
Giovanni LB dda0379742 SWDEV-298742: Added occupancy metrics
Change-Id: I67e375ad06535bbb8cc864b78840ce3962bcc58e


[ROCm/rocprofiler commit: a1508035dc]
2023-06-19 12:10:22 -04:00
Giovanni LB a7e8182a21 SWDEV-405575: Added gfx941 and gfx942
Change-Id: I45a49cd64a76d3ae32c209497c70fe27b5be212b


[ROCm/rocprofiler commit: e1285e3fd4]
2023-06-19 11:11:37 -04:00
Saurabh Verma dcd5f1a397 MI300 counters support for rocprof and rocprofv2 (accumulation from all XCCs)
1. XML files updated for gfx940 counters
2. File plugin changes to keep rocprofv2 backward compatible for results.csv
3. Changes in the rocprofv2 script to use tblextr.py to generate results.csv, just like rocprof

Change-Id: I7798f4411ce01f6fbfffb126de654ed806ca7045
(cherry picked from commit 86cbaf38c436be876f0426fa27803b1e64d90378)


[ROCm/rocprofiler commit: 8f82ff6a46]
2023-05-30 21:41:54 -05:00
Kiumars Sabeti a260b63b96 SWDEV-380635: adding gfx11 architecture to rocprofiler which includes navi31 and navi32 for now
Change-Id: Ib2a93a34688471c82b5db0dc10e8da58452dba21


[ROCm/rocprofiler commit: 997c771723]
2023-05-05 15:39:18 -04:00
Kiumars Sabeti 0b6e0186d3 SWDEV-387039: Modified gfx90a section to inherit from gfx9 base and removed derived counters that are defined in the gfx9 base from gfx90a section to avoid duplication
Change-Id: I653e116bc47fe11b57e663c2827d177149b00c5b


[ROCm/rocprofiler commit: a9f1237c53]
2023-03-29 15:55:06 +00:00
Saurabh Verma 6dc0459613 Adding missing MI200 metrics
Change-Id: I410f50e03d38bb03cf43e743318eb1242e7d6518


[ROCm/rocprofiler commit: 225bddf148]
2023-01-11 18:00:46 +00:00
Kiumars Sabeti d5974aba78 SWDEV-302380: [ROCm QA][Mainline][Navi21] 6 tests are failing in rocprofiler-stg2
This is an attempt to support basic and derived counters for navi21. This code will not work correctly unless we add navi counters to metrics.xml and gfx_metrics.xml.

Change-Id: Ied06a81345a6fbb02fa0fde1889d94bbe64e9a03


[ROCm/rocprofiler commit: b53fd84ade]
2022-08-05 17:31:37 -04:00
Saurabh Verma e27d5da8c0 SWDEV-298750: Approval to make internal profile counters public
Added approved HW counters for MI200. Also added derived metrics for the same counters.

Change-Id: I1c6abfdfde4e4fd4ba8bd5eec0557ad08fd71c77


[ROCm/rocprofiler commit: 6d233c65d7]
2022-05-17 16:44:16 -05:00
AMD 9e422660cd Add support for gfx90a
Merge gfx90a support from the 'amd-npi' branch.

Change-Id: I9b51711ed4a1d2f1ed42ba9b83cb12136be228b8


[ROCm/rocprofiler commit: 4df3e0bd9a]
2021-06-16 16:35:42 -07:00
Evgeny 58968594c7 Mem32Bwrites -> MemWrites32B renaming
Change-Id: I86f570b97178caa3d3d5fcbc5ecc716216dad504


[ROCm/rocprofiler commit: aecc7abc76]
2019-08-22 22:34:36 -05:00
Evgeny e5169bf5b3 mi100 metrics fix
Change-Id: I862578a2b7827a88cc6926d636d5eb9a8320a244


[ROCm/rocprofiler commit: dff3256dd5]
2019-08-22 22:30:48 -05:00
Evgeny fc6e208b50 overwriting on inheritance
Change-Id: I6faf79874fc14e898c75972aa2f372e228105f7b


[ROCm/rocprofiler commit: 82ecf9e7a4]
2019-08-08 01:54:29 -05:00
Evgeny 55680ff541 gfx908 events update
Change-Id: Ieb7c72f66b7e5e5dfc29d97650a29722a6276a31


[ROCm/rocprofiler commit: 09eed4f35b]
2019-08-07 19:44:31 -05:00
Evgeny 05f8d9ebca adding gfx908 namespace
Change-Id: Ifd45385777be5066c8bfef23ee24bbabadf4dffa


[ROCm/rocprofiler commit: 855a801ae7]
2019-07-27 03:06:10 -05:00
Xianwei Zhang 62b33801c4 xml: add extra events for gfx906 GPUs
Compared to gfx900 (e.g., Vega 10), gfx906 adds extra counter events.
A typical difference is in TCC-EA: gfx906 (e.g., Vega 20) has 2
EAs per TCC, while gfx900 has only a single EA per TCC. As such,
additional counters must be profiled to get correct results. This
patch adds extra events specifically to handle gfx906.

Change-Id: Id6c9d37548a102c80bbfddcfa11e77d20f17431a


[ROCm/rocprofiler commit: ca9a714b77]
2019-07-16 12:56:45 -05:00
Xianwei Zhang f5b54bff3d test: add tests to validate cache/mem blocks
To validate cache and memory block profiling, this patch adds
tests that profile dedicated kernels using specified counters,
compare the profiled results against the expected ones, and
report whether each test passes or fails. The tests focus on cache
hit/miss and memory fetch/write size.

Change-Id: Icbc8096a6e15256dec66297597a57c7665a533b8


[ROCm/rocprofiler commit: 8b445d2c00]
2019-07-16 12:39:38 -05:00
Evgeny 4c1a1ce831 adding xml inheritance
Change-Id: Iafbfb8a6fb1936cf1315c1f5215879e99187da77


[ROCm/rocprofiler commit: 46fd670818]
2019-07-16 09:20:53 -07:00
Evgeny c3c9815c49 minor changes
Change-Id: I16444de8c5d83a9f33392ac870d2fc188978ab54


[ROCm/rocprofiler commit: ea0a1110dc]
2018-12-18 09:51:47 -06:00
Evgeny cde69c4b14 rpl_run data extracting error message; comments for intermediate metrics
Change-Id: I4d9f97ed475d3ec328aafae40f1bc80b6bf69bcc


[ROCm/rocprofiler commit: 812c17ed77]
2018-11-01 13:38:01 -05:00
Evgeny cbdf1990eb - cleanup;
- metrics descriptions;

Change-Id: I9087f41a4ba3ed6c94039ac921407aa13529bf6c


[ROCm/rocprofiler commit: 3b7b958a5a]
2018-08-20 00:40:10 -05:00
Evgeny 368f9f2b5d license annotations
Change-Id: I96b8f625a03b707b45aee6b8e7dbfabe073294c6


[ROCm/rocprofiler commit: add56e26ae]
2018-06-25 19:52:28 -05:00
Evgeny ae9c4f2ea4 adding dispatch_packet field to rocprofiler_callback_data_t
Change-Id: I3cbe256130fa23dd24b094f27286d566ed339230


[ROCm/rocprofiler commit: b8fc3818de]
2018-05-22 13:17:03 -05:00
Evgeny 20d448818e test structure refactoring, unifying with aqlprofile
Change-Id: Icad642413c7edb27885effc9da3524cf3899f8de


[ROCm/rocprofiler commit: 71044909ce]
2018-05-03 16:47:18 -05:00