Files
rocm-systems/projects/rocprofiler-compute/CHANGES
T
Cole Ramos 2d99a5b16f Update CHANGES for v1.0.10
Signed-off-by: Cole Ramos <colramos@amd.com>

[ROCm/rocprofiler-compute commit: 9770196763]
2023-08-22 12:45:36 -05:00

96 sor
4.1 KiB
Plaintext

Version 1.0.10 (22 Aug 2023)
* critical patch for detection of llvm in rocm installs on SLURM systems
Version 1.0.9 (17 Aug 2023)
* add units to L2 per-channel panel (#133)
* new quickstart guide for Grafana setup in docs (#135)
* more detail on kernel and dispatch filtering in docs (#136, #137)
* patch manual join utility for ROCm >5.2.x (#139)
* add % of peak values to low level speed-of-light panels (#140)
* patch critical bug in Grafana by removing a deprecated plugin (#141)
* enhancements to KernelName demangeler (#142)
* general metric updates and enhancements (#144, #155, #159)
* add min/max/avg breakdown to instruction mix panel (#154)
Version 1.0.8 (30 May 2023)
* add `--kernel-names` option to toggle kernelName overlay in standalone roofline plot (#93)
* remove unused python modules (#96)
* fix empirical roofline calculation for single dispatch workloads (#97)
* match color of arithmetic intensity points to corresponding bw lines
* ux improvements in standalone GUI (#101)
* enhanced readability for filtering dropdowns in standalone GUI (#102)
* new logfile to capture rocprofiler output (#106)
* roofline support for sles15 sp4 and future service packs (#109)
* adding dockerfiles for all supported Linux distros
* new examples for `--roof-only` and `--kernel` options added to documentation
* enable cli analysis in Windows (#110)
* optional random port number in standalone GUI (#111)
* limit length of visible kernelName in `--kernel-names` option (#115)
* adjust metric definitions (#117, #130)
* manually merge rocprof runs, overriding default rocprofiler implementation (#125)
* fixed compatibility issues with Python 3.11 (#131)
Version 1.0.8-PR2 (17 Apr 2023)
* ux improvements in standalone GUI (#101)
* enhanced readability for filtering dropdowns in standalone GUI (#102)
* new logfile to capture rocprofiler output (#106)
* roofline support for sles15 sp4 and future service packs (#109)
* adding dockerfiles for all supported Linux distros
* new examples for `--roof-only` and `--kernel` options added to documentation
Version 1.0.8-PR1 (13 Mar 2023)
* add `--kernel-names` option to toggle kernelName overlay in standalone roofline plot (#93)
* remove unused python modules (#96)
* fix empirical roofline calculation for single dispatch workloads (#97)
* match color of arithmetic intensity points to corresponding bw lines
Version 1.0.7 (21 Feb 2023)
* update documentation (#52, #64)
* improved detection of invalid command line arguments (#58, #76)
* enhancements to standalone roofline (#61)
* enable Omniperf on systems with X-server (#62)
* raise minimum version requirement for rocm (#64)
* enable baseline comparison in CLI analysis (#65)
* add multi-normalization to new metrics (#68, #81)
* support alternative profilers (#70)
* add MI100 configs to override rocprofiler's incomplete default (#75)
* improve error message when no GPU(s) detected (#85)
* separate CI tests by Linux distro and add status badges
Version 1.0.6 (21 Dec 2022)
* CI update: documentation now published via github action (#22)
* better error detection for incomplete ROCm installs (#56)
Version 1.0.5 (13 Dec 2022)
* store application command-line parameters in profiling output (#27)
* enable additional normalizations in CLI mode (#30)
* add missing ubuntu 20.04 roofline binary to packaging (#34)
* update L1 bandwidth metric calculations (#36)
* add L1 <-> L2 bandwidth calculation (#37)
* documentation updates (#38, #41)
* enhanced subprocess logging to identify critical errors in rocprofiler (#50)
* maintain git sha in production installs from tarball (#53)
Version 1.0.4 (11 Nov 2022)
* update python requirements.txt with minimum versions for numpy and pandas
* addition of progress bar indicator in web-based GUI (#8)
* reduced default content for web-based GUI to reduce load times (#9)
* minor packaging and CI updates
* variety of documentation updates
* added an optional argument to vcopy.cpp workload example to specify device id
Version 1.0.3 (07 Nov 2022)
* initial Omniperf release