rocm-systems/tests/workloads/Axes4/mi100/prev_analysis/201.csv
colramos-amd 62d130b458 Initial commit
2022-11-04 14:49:36 -05:00

1.5 KiB

Metric,Value,Unit,Peak,PoP
VALU FLOPs,,Gflops,23070.72,
VALU IOPs,,Gops,23070.72,
MFMA FLOPs (BF16),,Gflops,92282.88,
MFMA FLOPs (F16),,Gflops,184565.76,
MFMA FLOPs (F32),,Gflops,46141.44,
MFMA FLOPs (F64),,Gflops,46141.44,
MFMA IOPs (Int8),,Gops,184565.76,
Active CUs,95,Cus,120,79.16666666666667
SALU Util,2.096732825456871,Pct,100,2.096732825456871
VALU Util,59.23170182173713,Pct,100,59.23170182173713
MFMA Util,,Pct,100,
VALU Active Threads/Wave,63.967701311913,Threads,64,99.94953329986406
IPC - Issue,0.8437250831887582,Instr/cycle,5,16.874501663775163
LDS BW,0.0,Gb/sec,23070.72,0.0
LDS Bank Conflict,,Conflicts/access,32,
Instr Cache Hit Rate,99.99344038252147,Pct,100,99.99344038252147
Instr Cache BW,1409.8925710298083,Gb/s,4614.144,30.55588579441405
Scalar L1D Cache Hit Rate,99.35620448523953,Pct,100,99.35620448523953
Scalar L1D Cache BW,43.04568147791508,Gb/s,4614.144,0.9329071974761749
Vector L1D Cache Hit Rate,20.508982035928145,Pct,100,20.508982035928145
Vector L1D Cache BW,1218.566004817546,Gb/s,11535.36,10.56374490971713
L2 Cache Hit Rate,0.3931390058893899,Pct,100,0.3931390058893899
L2-Fabric Read BW,837.836025789315,Gb/s,1228.8,68.1832703279065
L2-Fabric Write BW,4.944144824583556,Gb/s,1228.8,0.40235553585478157
L2-Fabric Read Latency,451.5625182005534,Cycles,,
L2-Fabric Write Latency,145.41346555482866,Cycles,,
Wave Occupancy,3336.2725639727946,Wavefronts,4800,69.50567841609988
Instr Fetch BW,690.2595629641929,Gb/s,2307.072,29.91929003360939
Instr Fetch Latency,18.669321297310752,Cycles,,
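
The PoP column appears to be the measured value as a percentage of the peak (for example, 95 active CUs out of 120 gives 79.17). That reading is inferred from the numbers in this file, not from any accompanying documentation. The sketch below recomputes PoP for a few rows copied from the table, under that assumption. The names `SAMPLE`, `percent_of_peak`, and `recompute_pop` are hypothetical helpers, not part of any profiler tool.

```python
import csv
import io

# A few rows copied from the table; an empty field means "not reported".
SAMPLE = """Metric,Value,Unit,Peak,PoP
Active CUs,95,Cus,120,79.16666666666667
MFMA Util,,Pct,100,
IPC - Issue,0.8437250831887582,Instr/cycle,5,16.874501663775163
L2-Fabric Read Latency,451.5625182005534,Cycles,,
"""


def percent_of_peak(value, peak):
    """Return 100 * value / peak, or None if either field is empty.

    Assumption: PoP means "percent of peak"; this is inferred from the data.
    """
    if value == "" or peak == "":
        return None
    return 100.0 * float(value) / float(peak)


def recompute_pop(csv_text):
    """Map each metric name to its recomputed PoP (None if not computable)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {
        row["Metric"]: percent_of_peak(row["Value"], row["Peak"])
        for row in reader
    }


if __name__ == "__main__":
    for metric, pop in recompute_pop(SAMPLE).items():
        print(metric, pop)
```

Rows with no measured value (such as MFMA Util) or no peak (such as the latency metrics) yield `None`, which matches the empty PoP fields in the file.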