27585a8a2b
* Add MI 350 hardware information
* Refactor MI GPU YAML file and corresponding interface
* Add SoC file for gfx950 architecture
* Add analysis report configs for MI 350 containing existing metrics
* Add placeholder None valued metrics for previous architectures to make
baseline comparison work
* Enable testing on MI 350
* Analysis config metric changes
- SPI changes
- Update metric formula for default SPI pipe counter
- Use efficiently collected pipe wise SPI counters
- Add SPI Wave Occupancy
- Add Scheduler-Pipe Wave Utilization
- Update formula for VGPR Writes
- Add Scheduler-Pipe FIFO Full Rate
- CPC changes
- Add CPC SYNC FIFO Full Rate
- Add CPC CANE Stall Rate
- Add CPC ADC Utilization
- SQ changes
- Add VALU co-issue efficiency
- Add F6F4 datatype metrics
- Update formula for total FLOPs by adding F6F4 counters
- Add LDS STORE / LOAD / ATOMIC metrics
- Add LDS STORE / LOAD / ATOMIC bandwidth
- Add LDS FIFO and TA ADDR / CMD / DATA FIFO full rates
* Collect TCP_TCP_LATENCY_sum only for gfx950 (MI 350)
* Do not inject SQ_ACCUM_PREV_HIRES unnecesarily
* Do not hardcode memory and shader clock speeds
* Write num_hbm_channels to sysinfo.csv instead of hbm_bw while profiling
* Move generate sysinfo.csv to pre processing step of profiling
* Add warnings to use --specs-correction for missing sysinfo.csv values during analysis phase
* Update CHANGELOG
* Analysis phase warning to use --specs-correction when needed
[ROCm/rocprofiler-compute commit: f9aa7be97c]
809 B
809 B
| 1 | workload_name | command | ip_blocks | timestamp | version | hostname | cpu_model | sbios | linux_distro | linux_kernel_version | amd_gpu_kernel_version | cpu_memory | gpu_memory | rocm_version | vbios | compute_partition | memory_partition | gpu_series | gpu_model | gpu_arch | gpu_chip_id | gpu_l1 | gpu_l2 | cu_per_gpu | simd_per_cu | se_per_gpu | wave_size | workgroup_max_size | max_waves_per_cu | max_sclk | max_mclk | cur_sclk | cur_mclk | total_l2_chan | lds_banks_per_cu | sqc_per_gpu | pipes_per_gpu | num_xcd | num_hbm_channels |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | vcopy | tests/vcopy -n 1048576 -b 256 -i 3 | SQ|LDS|SQC|TA|TD|TCP|TCC|SPI|CPC|CPF | Fri Mar 28 22:43:57 2025 (UTC) | 3 | f77021840818 | AMD Ryzen Threadripper PRO 7985WX 64-Cores | AMDVBS1052957N.FD | Ubuntu 22.04.5 LTS | 5.15.0-70-generic | 527456116 | 6.5.0-831 | 113-M3550101-100 | SPX | NPS1 | MI350 | MI350 | gfx950 | 30112 | 32 | 4096 | 128 | 4 | 16 | 64 | 1024 | 32 | 0 | 0 | 128 | 32 | 64 | 4 | 8 | 128 |