adding L2 model with updated legend and removal of large images (#718)

* adding L2 model with updated legend and removal of large images

* changed image name to perf_model
このコミットが含まれているのは:
anisha-amd
2025-05-28 14:03:47 -04:00
committed by GitHub
コミット 783193c75f
2個のファイルの変更4行の追加24行の削除
+4 -24
ファイルの表示
@@ -311,7 +311,7 @@ Request flow
The following is a diagram that illustrates how L2↔Fabric requests are reported
by ROCm Compute Profiler:
.. figure:: ../data/performance-model/fabric.png
.. figure:: ../data/performance-model/l2perf_model.png
:align: center
:alt: L2-Fabric transaction flow on AMD Instinct MI-series accelerators
:width: 800
@@ -350,38 +350,18 @@ accelerators local HBM, and an uncached write request. The request-flow
diagram marks *exclusive* classifications as a splitting of the flow,
while *non-exclusive* requests do not split the flow line. For example,
a request is either a 32B Write Request OR a 64B Write request, as the
flow splits at this point:
.. figure:: ../data/performance-model/split.*
:align: center
:alt: Splitting request flow
:width: 800
Splitting request flow
flow splits at this point.
However, continuing along, the same request might be an atomic request and an
uncached write request, as reflected by a non-split flow:
.. figure:: ../data/performance-model/nosplit.*
:align: center
:alt: Non-splitting request flow
:width: 800
Non-splitting request flow
uncached write request, as reflected by a non-split flow.
Finally, we note that :ref:`uncached <memory-type>` read requests (e.g., to
:ref:`fine-grained memory <memory-type>`) are handled specially on CDNA
accelerators, as indicated in the request flow diagram. These are
expected to be counted as a 64B Read Request, and *if* they are requests
to uncached memory (denoted by the dashed line), they will also be
counted as *two* uncached read requests (that is, the request is split):
counted as *two* uncached read requests (that is, the request is split).
.. figure:: ../data/performance-model/uncached.*
:align: center
:alt: Uncached read-request splitting
:width: 800
Uncached read-request splitting.
.. _l2-request-metrics:
バイナリファイルは表示されません。

変更後

幅:  |  高さ:  |  サイズ: 50 KiB