Commit Graph

64218 Commits

Author SHA1 Message Date
Julia Jiang 17ffa13035 SWDEV-538999 - Make correction in porting guide for launch_bounds (#646) 2025-08-26 16:55:52 -04:00
Kian Cossettini 07a7b9b845 Use rocprofiler-SDK for OMPT tracing (#702)
Switch to using SDK for OMPT tracing and remove older OMPT code path
2025-08-26 16:54:01 -04:00
systems-assistant[bot] 5f4e0dc889 SWDEV-538789 - Add multi stream kernel dispatch perf test (#556)
Co-authored-by: Pengda Xie <pengda.xie@amd.com>
2025-08-26 13:42:11 -07:00
Milan Radosavljevic df7b9d559f Fix collecting of stream IDs for rocpd (#751) 2025-08-26 16:17:42 -04:00
systems-assistant[bot] ded5b86e83 SWDEV-540609 - capture of MIOpen OCL kernels needs remainder globalWorkSize (#431)
Co-authored-by: Rakesh Roy <rakesh.roy@amd.com>
2025-08-26 16:11:31 -04:00
Jason Bonnell 296a4021f9 [rocprofiler-compute] Fix rocprofiler-compute workflows (#761)
* add working-directory to ver_check step in rocprofiler-compute-packaging.yml

* Remove compute mi-rhel9 workflow badge since workflow is no longer in develop

* Update actions to v5 in rocprofiler-compute-docs

* Add working directory to steps in rocprofiler-compute-docs.yml

* Revert back to v4 pages

* Remove rocprofiler-compute-docs.yml workflow

* Remove docs workflow badge from rocprofiler-compute in README.md

* Remove rocprofiler-compute-packaging.yml, update README.md badges
2025-08-26 14:29:15 -04:00
vedithal-amd 323d06c79c [rocprofiler-compute] Add database output format to analyze mode (#748)
Analysis data dump

* Add `--output-format` and `--output-name` options to analyze mode

* Remove `--output` and `-save-dfs` options from analyze mode

* Add documentation on `rocpd` output format and analysis database file

* Create sqlite3 database using the object-relational mapping (ORM) provided
  by the sqlalchemy library

* Fix metrics config to remove metrics marked as `null`, fix `Unit` header, add
  missing `title`

* Add test cases to ensure analysis data dump works
2025-08-26 14:15:05 -04:00
Satyanvesh Dittakavi 09cfa97156 SWDEV-551218 - Fix hip on nvidia build failures (#642)
* Rebase and address merge conflicts

* SWDEV-551218 - Fix hip on nvidia build failures
2025-08-26 23:40:35 +05:30
Milan Radosavljevic 96a46962ad Change amd_smi and cpu_freq modules to use trace cache for rocpd (#690)
* Move amd-smi to use caching mechanism

* Add VCN and JPEG activity to rocpd

* Switch cpu_freq to use caching mechanism

* Different approach with xcp activity & applied suggestions from code review

* Applied suggestions from code review

* Fix shadowing

* Applied suggestions from code review
2025-08-26 14:00:04 -04:00
systems-assistant[bot] 7601798fa7 SWDEV-545953 - Add Implementation for hipStreamGetId (#434)
Authored-by: Satyanvesh Dittakavi <Satyanvesh.Dittakavi@amd.com>
2025-08-26 22:47:55 +05:30
systems-assistant[bot] 832af6d472 SWDEV-545953 - Add Nvidia mapping for hipStreamGetId (#456)
Co-authored-by: Satyanvesh Dittakavi <Satyanvesh.Dittakavi@amd.com>
2025-08-26 21:35:17 +05:30
systems-assistant[bot] 3e62d0d2e6 SWDEV-545953 - Add hipStreamGetId API header (#428)
Authored-by: Satyanvesh Dittakavi <Satyanvesh.Dittakavi@amd.com>
2025-08-26 21:33:26 +05:30
Jimbo c03048d68e Implement hipMemAllocationTypeUncached in hipMemCreate (#747)
* Revert "SWDEV-547589 - Add hipDeviceMallocUncached to hipMemCreate (#815)"

This reverts commit 5ce7103555.

* Revert "SWDEV-547589 - comment for flag hipDeviceMallocUncached in hipMemcreate (#339)"

This reverts commit 04dac5eae3.

* SWDEV-551942 - implement hipMemAllocationTypeUncached in hipMemCreate
2025-08-26 11:34:49 -04:00
Julia Jiang 202aa7ff8c SWDEV-525231 - Remove Memory Manager support in 7.0 (#741) 2025-08-26 11:13:36 -04:00
MachineTom f1ed57e54d SWDEV-550626 - Make atomics test pass with new compiler (#731)
Change pinned host memory to device memory so that
atomics Min/Max tests can pass with new compiler patch
in integer types.
2025-08-25 22:30:55 -04:00
xuchen-amd e8081bd91a Update mi350 output files for unit tests. (#744) 2025-08-25 21:27:10 -04:00
shwetakhatri-amd 79400a1f23 rocr: GFX12+ - Fix trap handler to process SW trap ID correctly (#736)
When stochastic sampling is not active, the trap handler incorrectly
branches to .check_exceptions, bypassing the software trap ID checks
and in turn not advancing the PC. Fixed the issue to always check software
traps regardless of PC sampling state.

Co-authored-by: Shweta Khatri <shweta.khatri@amd.com>
2025-08-25 19:20:37 -04:00
SaleelK ddba20579d SWDEV-551080 - Fix hipMemcpyDeviceToDeviceNoCU path (#683)
* hipMemcpyDeviceToDeviceNoCU should always take SDMA path as per the
  flag usage
2025-08-25 15:13:02 -07:00
xuchen-amd 5c8b34ddf5 [rocprofiler-compute][TUI] Add interactive metric description (#718) 2025-08-25 15:53:55 -04:00
vedithal-amd 9a02dae75f [rocprofiler-compute] [Bugfix] Fix analysis not working with rocpd (#704)
* fix rocpd roofline

* Improve rocpd test by using dynamic workload folder

* bugfix

* fix ruff format
2025-08-25 11:46:55 -04:00
systems-assistant[bot] b645010655 Using semaphore to sync with all peer processes in finalization stage (#169)
* Using semaphore to sync with all peer processes in finalization stage

[rocprofv3] Implement synchronization using POSIX semaphore in finalization

* clang format code

* clang 11 format code

* Add process sync option for rocprofv3

* Default value of process sync is false

* Update source/lib/rocprofiler-sdk-tool/tool.cpp

Apply suggestion by Copilot

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* update according to comments

* add new line to helper.hpp

---------

Co-authored-by: Huanran Wang <huanrwan@amd.com>
Co-authored-by: Huanran Wang <huanran.wang@amd.com>
Co-authored-by: Madsen, Jonathan <Jonathan.Madsen@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-25 08:57:41 -05:00
vedithal-amd 748c9b74d9 Update standalone binary to use python 3.9 (#725)
* Update standalone docker to python 3.9

* Add TUI files

* Fix docker files to work with monorepo

* Update standalone binary documentation
2025-08-25 07:57:08 -04:00
cfreeamd a013e141b7 Revert "rocr: river interface changes" (#724)
This commit reverts the following related commits which cause
test failures:

6d15779b3e rocr/driver: add PC sampling support to driver interface
56cb9390ff rocr/driver: add PC sampling support to driver interface
76bf829f09 rocr/driver: add ASAN header page management to Driver class
a47c060d6a rocr/driver: add ASAN header page management to Driver class
02d7eaf3b7 rocr: add memory sharing call to Driver interface
9312468655 rocr: add memory sharing call to Driver interface
2025-08-25 12:44:26 +05:30
Ashutosh Mishra f2f7f03d61 Fix buffer overrun (#655)
Assigning a null terminator at
the end of the string wrote
past the end of the allocated
buffer. This patch corrects that.

Signed-off-by: Ashutosh Mishra <ashutosh.mishra@amd.com>
2025-08-25 09:41:25 +05:30
ywang103-amd 2a216ecbc1 pc sampling unit tests (#194) 2025-08-23 10:13:22 -04:00
Taylor Ding b5c8c8bcb1 Eval metrics performance optimizations (#435)
Post-analysis eval metrics performance optimizations.
2025-08-22 16:35:48 -04:00
jamessiddeley-amd 5deeea71df [rocprof-compute] Update Formatting (#671)
* updated rocprof-compute formatting

* fixed ammolite peak variables in parser.py

* format parser.py

* update formatting rocprof_compute_base
2025-08-22 12:22:17 -04:00
Julia Jiang e9e98daf24 SWDEV-524348 - Update description for multiProcessorCount (#648) 2025-08-22 21:40:02 +05:30
Danylo Lytovchenko 2ff2316227 Adjust clang format to the new versions, revert broken macro layout (#714) 2025-08-22 17:23:22 +02:00
xuchen-amd ed877433f3 [rocprofiler-compute][TUI] Updates and refactor. (#703) 2025-08-22 11:02:32 -04:00
Danylo Lytovchenko 59a8c8011d Add validate PR description workflow (#689)
* Add validate PR description workflow

* Fix PR validation workflow name
2025-08-22 16:26:17 +02:00
David Yat Sin a1597a358a rocr: Expose flag to allocate uncached memory (#674)
Add new flag for clients to directly request uncached memory
2025-08-22 09:52:39 -04:00
Daniel Su 5eb4ff1ca7 [Ex CI] add rocprofiler status badge (#643) 2025-08-22 00:46:34 -04:00
Todd tiantuo Li bbf0d34b48 SWDEV-550231 - fix hipGetDriverEntryPoint_common (#651) 2025-08-21 18:57:16 -07:00
Venkateshwar Reddy Kandula a77f7d5770 [aqlprofile] set gpu agent name while registering agent (#687)
* set gpu agent name, which will be used for gfx minor for pm4factory

* Update pm4_factory.cpp

* EOF.

* addr comments

---------

Co-authored-by: Venkateshwar Reddy Kandula <venkateshwar.kandula1306@gmail.com>
2025-08-21 15:16:28 -05:00
Julia Jiang d741a88f39 SWDEV-525231 - Update changelog for 7.0 on develop (#647) 2025-08-21 15:59:54 -04:00
David Galiffi 847580dd9e Update minimum_cmake_required to match version used in CI (#679)
- Update minimum_cmake_required to match version used in CI
  - We should match the minimum version that we test against

- Ensure ".S" files are treated as assembly.
2025-08-21 15:56:47 -04:00
systems-assistant[bot] c7b9533836 [Docs] Update using-pc-sampling (#157) 2025-08-21 11:14:16 -04:00
David Yat Sin 87b348c51d rocr: Fix hsa_amd_pointer_info regression (#638)
Fix regression when hsa_amd_pointer_info is called on a pointer that was
allocated using non-VMM APIs. The helper function VMemoryPtrInfo should
return error when the address is not found so that PtrInfo does the
lookup via Thunk.
2025-08-21 10:25:50 -04:00
xuchen-amd 6bf5648c6d [rocprofiler-compute] Handle amd-smi rocm==6.4.2 json output issue. (#667) 2025-08-21 10:06:04 -04:00
Hadi Naeisseh b2857b5db9 SWDEV-543981 Part 2 This is a new branch to avoid the many errors in the previous PR due to migration (#672)
Co-authored-by: hnaeisse_amdeng <hadi.naeisseh@amd.com>
2025-08-21 09:06:57 -04:00
David Galiffi 31c9d3424c Update README.md (#640)
* Update README.md

* Update README.md
2025-08-20 17:39:49 -04:00
xuchen-amd 0bf66a519c [rocprofiler-compute][TUI] Restructure Performance Metrics (#232) 2025-08-20 17:00:54 -04:00
Joseph Macaranas f5ac5efd79 Language Runtime README Changes (#650)
- Renaming old `README.md` files to keep their information intact.
- Default `README.md` files will have the deprecation notice to be mirrored back into the individual repos.
- Change ROCR-Runtime mirroring to `develop` branch.
2025-08-20 15:48:55 -04:00
vedithal-amd ed53090c25 Join SQC_* csv files as well (#669) 2025-08-20 15:33:48 -04:00
David Galiffi d111e9a297 [rocprofiler-systems] Add Debian 12 workflows (#402)
* Create CI dockers for debian 12

* Create Debian workflow

Signed-off-by: David Galiffi <David.Galiffi@amd.com>

* Fixing typo

* Apply suggestions from code review

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Update default value for script's "VERSIONS" variable

* Fix Docker build warnings

LegacyKeyValueFormat: "ENV key=value" should be used instead of legacy "ENV key value" format

* Refactored the check for `pip install --break-system-packages`

---------

Signed-off-by: David Galiffi <David.Galiffi@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-20 14:58:49 -04:00
systems-assistant[bot] 4d5db26646 SWDEV-549707 - Fixed memleak in hipMemImportFromShareableHandle (#442)
Co-authored-by: Ioannis Assiouras <Ioannis.Assiouras@amd.com>
2025-08-20 19:28:59 +01:00
systems-assistant[bot] 621da5410a SWDEV-465041 - Avoid wait in device enqueue (#443)
If we have PCIE atomics then we can avoid workaround in the scheduler, which requires an explicit wait on CPU
2025-08-20 12:46:47 -04:00
cfallows-amd 8bc0063f2e Update rocprofiler-compute workflows and CODEOWNERS (#221)
Update super repo codeowners: add Pratik to any doc changes for rocprofiler-compute.
Remove rocprofiler-compute-mi-rhel9.yml from super repo workflow folder: unused and out of date, coverage is already in RHEL8 workflow.
Remove CODEOWNERS, dependabot.yml, and all workflows from the projects/rocprofiler-compute/ project: confirmed these do not do anything anymore after being copied into super repo's main ./github/ section.

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

* Add a branch filter for push events

* Update .github/CODEOWNERS

Use reviewers team instead of using the individual profiles.

Co-authored-by: David Galiffi <David.Galiffi@amd.com>

Update owners section for rocprofiler-compute codeowners, fix spacing

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>

---------

Signed-off-by: Carrie Fallows <Carrie.Fallows@amd.com>
Co-authored-by: David Galiffi <David.Galiffi@amd.com>
2025-08-20 11:17:25 -04:00
Danylo Lytovchenko f7338717ae SWDEV-470698 - fix formatting, add format check workflow (#657) 2025-08-20 19:58:06 +05:30