adding L2 model with updated legend and removal of large images (#718)
* adding L2 model with updated legend and removal of large images
* changed image name to perf_model
[ROCm/rocprofiler-compute commit: 783193c75f]
Этот коммит содержится в:
@@ -311,7 +311,7 @@ Request flow
|
||||
The following is a diagram that illustrates how L2↔Fabric requests are reported
|
||||
by ROCm Compute Profiler:
|
||||
|
||||
.. figure:: ../data/performance-model/fabric.png
|
||||
.. figure:: ../data/performance-model/l2perf_model.png
|
||||
:align: center
|
||||
:alt: L2-Fabric transaction flow on AMD Instinct MI-series accelerators
|
||||
:width: 800
|
||||
@@ -350,38 +350,18 @@ accelerator’s local HBM, and an uncached write request. The request-flow
|
||||
diagram marks *exclusive* classifications as a splitting of the flow,
|
||||
while *non-exclusive* requests do not split the flow line. For example,
|
||||
a request is either a 32B Write Request OR a 64B Write request, as the
|
||||
flow splits at this point:
|
||||
|
||||
.. figure:: ../data/performance-model/split.*
|
||||
:align: center
|
||||
:alt: Splitting request flow
|
||||
:width: 800
|
||||
|
||||
Splitting request flow
|
||||
flow splits at this point.
|
||||
|
||||
However, continuing along, the same request might be an atomic request and an
|
||||
uncached write request, as reflected by a non-split flow:
|
||||
|
||||
.. figure:: ../data/performance-model/nosplit.*
|
||||
:align: center
|
||||
:alt: Non-splitting request flow
|
||||
:width: 800
|
||||
|
||||
Non-splitting request flow
|
||||
uncached write request, as reflected by a non-split flow.
|
||||
|
||||
Finally, we note that :ref:`uncached <memory-type>` read requests (e.g., to
|
||||
:ref:`fine-grained memory <memory-type>`) are handled specially on CDNA
|
||||
accelerators, as indicated in the request flow diagram. These are
|
||||
expected to be counted as a 64B Read Request, and *if* they are requests
|
||||
to uncached memory (denoted by the dashed line), they will also be
|
||||
counted as *two* uncached read requests (that is, the request is split):
|
||||
counted as *two* uncached read requests (that is, the request is split).
|
||||
|
||||
.. figure:: ../data/performance-model/uncached.*
|
||||
:align: center
|
||||
:alt: Uncached read-request splitting
|
||||
:width: 800
|
||||
|
||||
Uncached read-request splitting.
|
||||
|
||||
.. _l2-request-metrics:
|
||||
|
||||
|
||||
Двоичные данные
Двоичный файл не отображается.
|
После Ширина: | Высота: | Размер: 50 KiB |
Ссылка в новой задаче
Block a user