2d99a5b16f
Signed-off-by: Cole Ramos <colramos@amd.com>
[ROCm/rocprofiler-compute commit: 9770196763]
Version 1.0.10 (22 Aug 2023)

* critical patch for detection of LLVM in ROCm installs on SLURM systems

Version 1.0.9 (17 Aug 2023)

* add units to L2 per-channel panel (#133)
* new quickstart guide for Grafana setup in docs (#135)
* more detail on kernel and dispatch filtering in docs (#136, #137)
* patch manual join utility for ROCm >5.2.x (#139)
* add % of peak values to low-level speed-of-light panels (#140)
* patch critical bug in Grafana by removing a deprecated plugin (#141)
* enhancements to KernelName demangler (#142)
* general metric updates and enhancements (#144, #155, #159)
* add min/max/avg breakdown to instruction mix panel (#154)

Version 1.0.8 (30 May 2023)

* add `--kernel-names` option to toggle kernelName overlay in standalone roofline plot (#93)
* remove unused Python modules (#96)
* fix empirical roofline calculation for single dispatch workloads (#97)
* match color of arithmetic intensity points to corresponding bandwidth lines

* UX improvements in standalone GUI (#101)
* enhanced readability for filtering dropdowns in standalone GUI (#102)
* new logfile to capture rocprofiler output (#106)
* roofline support for SLES 15 SP4 and future service packs (#109)
* add Dockerfiles for all supported Linux distros
* new examples for `--roof-only` and `--kernel` options added to documentation

* enable CLI analysis in Windows (#110)
* optional random port number in standalone GUI (#111)
* limit length of visible kernelName in `--kernel-names` option (#115)
* adjust metric definitions (#117, #130)
* manually merge rocprof runs, overriding default rocprofiler implementation (#125)
* fix compatibility issues with Python 3.11 (#131)

Version 1.0.8-PR2 (17 Apr 2023)

* UX improvements in standalone GUI (#101)
* enhanced readability for filtering dropdowns in standalone GUI (#102)
* new logfile to capture rocprofiler output (#106)
* roofline support for SLES 15 SP4 and future service packs (#109)
* add Dockerfiles for all supported Linux distros
* new examples for `--roof-only` and `--kernel` options added to documentation

Version 1.0.8-PR1 (13 Mar 2023)

* add `--kernel-names` option to toggle kernelName overlay in standalone roofline plot (#93)
* remove unused Python modules (#96)
* fix empirical roofline calculation for single dispatch workloads (#97)
* match color of arithmetic intensity points to corresponding bandwidth lines

Version 1.0.7 (21 Feb 2023)

* update documentation (#52, #64)
* improved detection of invalid command line arguments (#58, #76)
* enhancements to standalone roofline (#61)
* enable Omniperf on systems with X-server (#62)
* raise minimum version requirement for ROCm (#64)
* enable baseline comparison in CLI analysis (#65)
* add multi-normalization to new metrics (#68, #81)
* support alternative profilers (#70)
* add MI100 configs to override rocprofiler's incomplete default (#75)
* improve error message when no GPU(s) detected (#85)
* separate CI tests by Linux distro and add status badges

Version 1.0.6 (21 Dec 2022)

* CI update: documentation now published via GitHub Action (#22)
* better error detection for incomplete ROCm installs (#56)

Version 1.0.5 (13 Dec 2022)

* store application command-line parameters in profiling output (#27)
* enable additional normalizations in CLI mode (#30)
* add missing Ubuntu 20.04 roofline binary to packaging (#34)
* update L1 bandwidth metric calculations (#36)
* add L1 <-> L2 bandwidth calculation (#37)
* documentation updates (#38, #41)
* enhanced subprocess logging to identify critical errors in rocprofiler (#50)
* maintain git SHA in production installs from tarball (#53)

Version 1.0.4 (11 Nov 2022)

* update Python requirements.txt with minimum versions for numpy and pandas
* add progress bar indicator in web-based GUI (#8)
* reduce default content in web-based GUI to shorten load times (#9)
* minor packaging and CI updates
* variety of documentation updates
* add an optional argument to vcopy.cpp workload example to specify device id

Version 1.0.3 (07 Nov 2022)

* initial Omniperf release