adding L2 model with updated legend and removal of large images (#718)

* adding L2 model with updated legend and removal of large images

* changed image name to perf_model

[ROCm/rocprofiler-compute commit: 783193c75f]
anisha-amd
2025-05-28 14:03:47 -04:00
committed by GitHub
parent 689746e2cd
commit 27f5e06dd2
2 changed files with 4 additions and 24 deletions
@@ -311,7 +311,7 @@ Request flow
The following is a diagram that illustrates how L2↔Fabric requests are reported
by ROCm Compute Profiler:
-.. figure:: ../data/performance-model/fabric.png
+.. figure:: ../data/performance-model/l2perf_model.png
    :align: center
    :alt: L2-Fabric transaction flow on AMD Instinct MI-series accelerators
    :width: 800
@@ -350,38 +350,18 @@ accelerators local HBM, and an uncached write request. The request-flow
diagram marks *exclusive* classifications as a splitting of the flow,
while *non-exclusive* requests do not split the flow line. For example,
a request is either a 32B Write Request OR a 64B Write request, as the
-flow splits at this point:
-.. figure:: ../data/performance-model/split.*
-   :align: center
-   :alt: Splitting request flow
-   :width: 800
-   Splitting request flow
+flow splits at this point.
However, continuing along, the same request might be an atomic request and an
-uncached write request, as reflected by a non-split flow:
-.. figure:: ../data/performance-model/nosplit.*
-   :align: center
-   :alt: Non-splitting request flow
-   :width: 800
-   Non-splitting request flow
+uncached write request, as reflected by a non-split flow.
Finally, we note that :ref:`uncached <memory-type>` read requests (e.g., to
:ref:`fine-grained memory <memory-type>`) are handled specially on CDNA
accelerators, as indicated in the request flow diagram. These are
expected to be counted as a 64B Read Request, and *if* they are requests
to uncached memory (denoted by the dashed line), they will also be
-counted as *two* uncached read requests (that is, the request is split):
+counted as *two* uncached read requests (that is, the request is split).
-.. figure:: ../data/performance-model/uncached.*
-   :align: center
-   :alt: Uncached read-request splitting
-   :width: 800
-   Uncached read-request splitting.
.. _l2-request-metrics:
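With the per-case figures removed, the counting rules described in the text above can be illustrated with a small sketch. This is not ROCm Compute Profiler code: the class, function names, and label strings below are hypothetical and chosen only to mirror the prose. It assumes a write request has exactly one size (32B or 64B, the *exclusive* split) and may additionally be atomic and/or uncached (the *non-exclusive* classifications), and that an uncached read is counted once as a 64B read and twice as an uncached read.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class WriteRequest:
    """Hypothetical model of one L2-to-Fabric write request."""
    size_bytes: int          # exclusive: either 32 or 64
    atomic: bool = False     # non-exclusive: may combine with other labels
    uncached: bool = False   # non-exclusive: may combine with other labels


def classify_write(req: WriteRequest) -> list[str]:
    """Return every classification that applies to a write request."""
    if req.size_bytes not in (32, 64):
        raise ValueError("a write request is either 32B or 64B")
    # Exclusive split: exactly one size label is emitted.
    labels = [f"{req.size_bytes}B Write Request"]
    # Non-exclusive labels: the flow does not split, so both may apply.
    if req.atomic:
        labels.append("Atomic Request")
    if req.uncached:
        labels.append("Uncached Write Request")
    return labels


def count_read(uncached: bool) -> Counter:
    """Count one read request following the rule described in the text."""
    counts = Counter({"64B Read Request": 1})
    if uncached:
        # The request is split: it is also counted as two uncached reads.
        counts["Uncached Read Request"] += 2
    return counts
```

For example, `classify_write(WriteRequest(64, atomic=True, uncached=True))` yields one size label plus both non-exclusive labels, while `count_read(True)` records a single 64B read alongside two uncached reads.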
Binary file not shown (new image size: 50 KiB).