In specific circumstances, the GPU can co-issue two VALU instructions in the same clock cycle. This may result in an observed VALU Utilization and FP64 VALU FLOP values above the theoretical peak. This is expected hardware behavior and not a measurement error.
This dual-issue capability can be further investigated via:
***ROCm Compute Viewer**: The Instructions view shows when two instructions are issued to the VALU in the same cycle.
***On MI350 and newer platforms**: Starting in ROCm 7.2.0, the ``Dual-issue VALU Utilization`` metric shows the % of time when VALU is executing dual-issued instructions.
When ROCm Compute Profiler detects values exceeding their theoretical peaks, it displays a warning message indicating this behavior.
When profiling, you may see the following warning:
..code-block::text
WARNING: Counter variance corrected: X value(s) adjusted (max Y% deviation from multi-pass collection).
This indicates that ROCm Compute Profiler detected and corrected negative values in derived metrics. This is expected behavior, not an error.
**Why does this happen?**
Hardware performance counters are collected across multiple profiling passes. When calculating derived metrics that involve subtraction (such as ``A - B``), small run-to-run variance can occasionally produce negative results. Since negative event counts are physically impossible, these values are automatically clamped to zero.
**When should I be concerned?**
***Deviation < 1%**: Normal hardware variance. No action needed.
***Deviation ≥ 1%**: The warning is displayed. Results are still valid, but variance was higher than typical.
This correction primarily affects L2 cache metrics where counter subtraction is used to derive values like remote read/write traffic, but run-to-run variations may impact the accuracy of a number of derived metrics in ROCm Compute Profiler.