Files
rocm-systems/tests/workloads/LDS/mi100/prev_analysis/201.csv
T
colramos-amd 62d130b458 Initial commit
2022-11-04 14:49:36 -05:00

1.5 KiB

1MetricValueUnitPeakPoP
2VALU FLOPsGflops23070.72
3VALU IOPsGops23070.72
4MFMA FLOPs (BF16)Gflops92282.88
5MFMA FLOPs (F16)Gflops184565.76
6MFMA FLOPs (F32)Gflops46141.44
7MFMA FLOPs (F64)Gflops46141.44
8MFMA IOPs (Int8)Gops184565.76
9Active CUs92Cus12076.66666666666667
10SALU Util2.104062127635719Pct1002.104062127635719
11VALU Util59.587355669324566Pct10059.587355669324566
12MFMA UtilPct100
13VALU Active Threads/Wave63.96669242997463Threads6499.94795692183536
14IPC - Issue0.8437169643268545Instr/cycle516.87433928653709
15LDS BW0.0Gb/sec23070.720.0
16LDS Bank ConflictConflicts/access32
17Instr Cache Hit Rate99.99346464516054Pct10099.99346464516054
18Instr Cache BW1409.5594562466663Gb/s4614.14430.54866636686385
19Scalar L1D Cache Hit Rate99.35620448529356Pct10099.35620448529356
20Scalar L1D Cache BW43.04660637176443Gb/s4614.1440.9329272422309411
21Vector L1D Cache Hit Rate20.508982035928145Pct10020.508982035928145
22Vector L1D Cache BW1219.7352677278018Gb/s11535.3610.573881246253276
23L2 Cache Hit Rate0.3929551729113038Pct1000.3929551729113038
24L2-Fabric Read BW837.7149327910072Gb/s1228.868.1734157544765
25L2-Fabric Write BW4.9808899694266895Gb/s1228.80.4053458633973543
26L2-Fabric Read Latency430.0370775789851Cycles
27L2-Fabric Write Latency146.25406284063632Cycles
28Wave Occupancy3317.098394422987Wavefronts480069.1062165504789
29Instr Fetch BW690.2144294664407Gb/s2307.07229.91733372285047
30Instr Fetch Latency18.52969728392234Cycles