f9aa7be97c
* Add MI 350 hardware information
* Refactor MI GPU YAML file and corresponding interface
* Add SoC file for gfx950 architecture
* Add analysis report configs for MI 350 containing existing metrics
* Add placeholder None-valued metrics for previous architectures to make baseline comparison work
* Enable testing on MI 350
* Analysis config metric changes
  - SPI changes
    - Update metric formula for default SPI pipe counter
    - Use efficiently collected pipe wise SPI counters
    - Add SPI Wave Occupancy
    - Add Scheduler-Pipe Wave Utilization
    - Update formula for VGPR Writes
    - Add Scheduler-Pipe FIFO Full Rate
  - CPC changes
    - Add CPC SYNC FIFO Full Rate
    - Add CPC CANE Stall Rate
    - Add CPC ADC Utilization
  - SQ changes
    - Add VALU co-issue efficiency
    - Add F6F4 datatype metrics
    - Update formula for total FLOPs by adding F6F4 counters
    - Add LDS STORE / LOAD / ATOMIC metrics
    - Add LDS STORE / LOAD / ATOMIC bandwidth
    - Add LDS FIFO and TA ADDR / CMD / DATA FIFO full rates
* Collect TCP_TCP_LATENCY_sum only for gfx950 (MI 350)
* Do not inject SQ_ACCUM_PREV_HIRES unnecessarily
* Do not hardcode memory and shader clock speeds
* Write num_hbm_channels to sysinfo.csv instead of hbm_bw while profiling
* Move sysinfo.csv generation to the pre-processing step of profiling
* Add warnings to use --specs-correction for missing sysinfo.csv values during analysis phase
* Update CHANGELOG
* Analysis phase warning to use --specs-correction when needed
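
The `--specs-correction` warning described above could be driven by a check along these lines. This is a minimal sketch, not the project's implementation: the helper `find_missing_specs`, the set of strings treated as missing, and the two-column sample are assumptions made for illustration. Only the column names and the `--specs-correction` flag come from this commit.

```python
import csv
import io
import warnings

# Assumption: these placeholder strings count as "missing" in sysinfo.csv.
MISSING_VALUES = {"", "NA", "None"}


def find_missing_specs(csv_text, required):
    """Return the required columns whose value is missing in the first data row."""
    row = next(csv.DictReader(io.StringIO(csv_text)))
    # A column that is absent entirely is treated the same as an empty value.
    return [name for name in required
            if (row.get(name) or "").strip() in MISSING_VALUES]


# Hypothetical two-column excerpt of a sysinfo.csv file.
sample = "max_mclk,num_hbm_channels\n1600,\n"
missing = find_missing_specs(sample, ["max_mclk", "num_hbm_channels"])
if missing:
    warnings.warn(
        "Missing sysinfo.csv values: " + ", ".join(missing)
        + "; consider supplying them with --specs-correction."
    )
```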
832 B
| workload_name | command | ip_blocks | timestamp | version | hostname | cpu_model | sbios | linux_distro | linux_kernel_version | amd_gpu_kernel_version | cpu_memory | gpu_memory | rocm_version | vbios | compute_partition | memory_partition | gpu_model | gpu_arch | gpu_l1 | gpu_l2 | cu_per_gpu | simd_per_cu | se_per_gpu | wave_size | workgroup_max_size | max_waves_per_cu | max_sclk | max_mclk | cur_sclk | cur_mclk | total_l2_chan | lds_banks_per_cu | sqc_per_gpu | pipes_per_gpu | hbm_bw | num_xcd | num_hbm_channels |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| path | ./tests/vcopy -n 1048576 -b 256 -i 3 | SQ\|LDS\|SQC\|TA\|TD\|TCP\|TCC\|SPI\|CPC\|CPF\|roofline | Thu 21 Mar 2024 04:16:46 PM (CDT) | 2 | t007-002.hpcfund | AMD EPYC 7V13 64-Core Processor | American Megatrends Inc.0602 | Rocky Linux 9.1 (Blue Onyx) | 5.14.0-162.18.1.el9_1.x86_64 |  | 527650760 |  | 6.0.2-115 | 113-D67301-059 | NA | NA | MI200 | gfx90a | 16 | 8192 | 104 | 4 | 8 | 64 | 1024 | 32 | 1700 | 1600 | 1700 | 1600 | 32 | 32 | 56 | 4 | 1638.4 | 1 | 32 |
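
The `hbm_bw` value in the row above is consistent with deriving bandwidth from `max_mclk` and `num_hbm_channels`, which is presumably why storing the channel count is sufficient. A minimal sketch, assuming 32 bytes transferred per channel per memory clock: that constant and the function name `peak_hbm_bandwidth_gbps` are assumptions chosen because they reproduce the 1638.4 figure, not something the commit states.

```python
def peak_hbm_bandwidth_gbps(max_mclk_mhz, num_hbm_channels,
                            bytes_per_channel_per_clk=32):
    """Estimate peak HBM bandwidth in GB/s.

    Assumption: each channel moves `bytes_per_channel_per_clk` bytes per
    memory clock. MHz * bytes gives MB/s; dividing by 1000 gives GB/s.
    """
    return max_mclk_mhz * num_hbm_channels * bytes_per_channel_per_clk / 1000.0


# Values taken from the sysinfo.csv row: max_mclk = 1600, num_hbm_channels = 32.
bw = peak_hbm_bandwidth_gbps(1600, 32)
print(bw)  # 1638.4, matching the hbm_bw column
```

Computing the bandwidth at analysis time from the stored channel count and clock keeps the profiling output free of a derived value that would go stale if the clock is later corrected.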