Commit graph

64291 commits

Author SHA1 Message Date
GunaShekar, Ajay c14988b6ee SWDEV-546178 - hipModuleLoadFatBinary hip header (#292)
* SWDEV-546178 - hipModuleLoadFatBinary hip header

* SWDEV-546178 - rebase and fix minor comments

* SWDEV-546178 - clean up header function declaration

[ROCm/hip commit: 4a4959b49e]
2025-08-15 13:13:16 -07:00
GunaShekar, Ajay 4a4959b49e SWDEV-546178 - hipModuleLoadFatBinary hip header (#292)
* SWDEV-546178 - hipModuleLoadFatBinary hip header

* SWDEV-546178 - rebase and fix minor comments

* SWDEV-546178 - clean up header function declaration
2025-08-15 13:13:16 -07:00
Arandjelovic, Marko e91332340a SWDEV-542815 - Fix unused-parameter compiler warnings (#719)
* SWDEV-542815 - Fix unused-parameter compiler warnings

* SWDEV-542815 - Run clang-format

* SWDEV-542815 - Run clang-format

* SWDEV-542815 - Run clang-format

* SWDEV-542815 - Run clang-format

[ROCm/clr commit: 1385d159ba]
2025-08-15 22:10:23 +02:00
Arandjelovic, Marko 1385d159ba SWDEV-542815 - Fix unused-parameter compiler warnings (#719)
* SWDEV-542815 - Fix unused-parameter compiler warnings

* SWDEV-542815 - Run clang-format

* SWDEV-542815 - Run clang-format

* SWDEV-542815 - Run clang-format

* SWDEV-542815 - Run clang-format
2025-08-15 22:10:23 +02:00
Naeisseh, Hadi 1d9c8b7f6d SWDEV-546485 Port and clean up for all tests in catch/perftests/memory folder. (#558)
* SWDEV-546485 Port and clean up for hipPerfBufferCopyRectSpeed

* SWDEV-546485 Port and clean up for hipPerfDevMemReadSpeed

* SWDEV-546485 Port and clean up for hipPerfDevMemWriteSpeed

* SWDEV-546485 Port and clean up for hipPerfHostNumaAlloc

* SWDEV-546485 Port and clean up for hipPerfMemcpy

* SWDEV-546485 Port and clean up for hipPerfMemMallocCpyFree

* SWDEV-546485 Port and clean up for hipPerfMemset

* SWDEV-546485 Port and clean up for hipPerfSampleRate

* SWDEV-546485 Port and clean up for hipPerfSharedMemReadSpeed

* SWDEV-546485 Ported and fixed up segfault for hipPerfMemFill

* SWDEV-545485 Returning to unedited stage

[ROCm/hip-tests commit: 04469c0cde]
2025-08-15 13:09:19 -07:00
Naeisseh, Hadi 04469c0cde SWDEV-546485 Port and clean up for all tests in catch/perftests/memory folder. (#558)
* SWDEV-546485 Port and clean up for hipPerfBufferCopyRectSpeed

* SWDEV-546485 Port and clean up for hipPerfDevMemReadSpeed

* SWDEV-546485 Port and clean up for hipPerfDevMemWriteSpeed

* SWDEV-546485 Port and clean up for hipPerfHostNumaAlloc

* SWDEV-546485 Port and clean up for hipPerfMemcpy

* SWDEV-546485 Port and clean up for hipPerfMemMallocCpyFree

* SWDEV-546485 Port and clean up for hipPerfMemset

* SWDEV-546485 Port and clean up for hipPerfSampleRate

* SWDEV-546485 Port and clean up for hipPerfSharedMemReadSpeed

* SWDEV-546485 Ported and fixed up segfault for hipPerfMemFill

* SWDEV-545485 Returning to unedited stage
2025-08-15 13:09:19 -07:00
GunaShekar, Ajay 0d63a5abd3 SWDEV-546178 - hipModuleLoadFatBinary implementation (#785)
* SWDEV-546178 - hipModuleLoadFatBinary implementation

* SWDEV-546178 - hipModuleLoadFatBinary CHANGELOG update

* SWDEV-546178 - fix roctracer build issue due to hip_prof_str.h

[ROCm/clr commit: f6736d57a5]
2025-08-15 13:06:24 -07:00
GunaShekar, Ajay f6736d57a5 SWDEV-546178 - hipModuleLoadFatBinary implementation (#785)
* SWDEV-546178 - hipModuleLoadFatBinary implementation

* SWDEV-546178 - hipModuleLoadFatBinary CHANGELOG update

* SWDEV-546178 - fix roctracer build issue due to hip_prof_str.h
2025-08-15 13:06:24 -07:00
Manocha, Rahul b3ccf487da SWDEV-545952 - API definitions for hipStreamSet/GetAttribute (#831)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/clr commit: 0f49c4a97f]
2025-08-15 12:51:35 -07:00
Manocha, Rahul 0f49c4a97f SWDEV-545952 - API definitions for hipStreamSet/GetAttribute (#831)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-08-15 12:51:35 -07:00
Arandjelovic, Marko a2409bebd0 SWDEV-542815 - Fix unused-parameter compiler warning (#263)
* SWDEV-542815 - Fix unused-parameter compiler warning

* SWDEV-542815 - Run clang-format

[ROCm/hip commit: f94cc3711e]
2025-08-15 12:41:35 -07:00
Arandjelovic, Marko f94cc3711e SWDEV-542815 - Fix unused-parameter compiler warning (#263)
* SWDEV-542815 - Fix unused-parameter compiler warning

* SWDEV-542815 - Run clang-format
2025-08-15 12:41:35 -07:00
Hernandez, Gerardo ef0bb78310 SWDEV-525231 - clarify that reduce sync operations are new intrinsics in the 7.0 CHANGELOG (#876)
SWDEV-525231 - clarify that reduce sync operations are new intrinsics in 7.0, not an existing one

[ROCm/clr commit: a5be0f5346]
2025-08-15 12:40:05 -07:00
Hernandez, Gerardo a5be0f5346 SWDEV-525231 - clarify that reduce sync operations are new intrinsics in the 7.0 CHANGELOG (#876)
SWDEV-525231 - clarify that reduce sync operations are new intrinsics in 7.0, not an existing one
2025-08-15 12:40:05 -07:00
Hernandez, Gerardo f9dba251b7 SWDEV-420237 - add documentation for reduce sync operations (#21)
* SWDEV-420237 - add documentation for reduce sync operations

* SWDEV-515087 - resolve some comments

* SWDEV-420237 - Rewrite sentence about the usage of HIP_ENABLE_WARP_SYNC_TYPES

Co-authored-by: Hartgrove, Randy <randy.hartgrove@amd.com>

* SWDEV-420237 - Update docs as HIP_ENABLE_WARP_SYNC_BUILTINS is not used anymore

* SWDEV-420237 - change wording 'would be' for 'are'

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - fix redundant wording

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - use 'and'/'or' consistently when talking about supported types

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - Do not use future tense in reduce sync operation documentation

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - make performance remark an 'informational note', regarding reduce sync operations performance

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - do not use future tense

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-543182 - changing more future tense that should be present tense

---------

Co-authored-by: Hartgrove, Randy <randy.hartgrove@amd.com>
Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

[ROCm/hip commit: 07ddf90631]
2025-08-15 12:39:33 -07:00
Hernandez, Gerardo 07ddf90631 SWDEV-420237 - add documentation for reduce sync operations (#21)
* SWDEV-420237 - add documentation for reduce sync operations

* SWDEV-515087 - resolve some comments

* SWDEV-420237 - Rewrite sentence about the usage of HIP_ENABLE_WARP_SYNC_TYPES

Co-authored-by: Hartgrove, Randy <randy.hartgrove@amd.com>

* SWDEV-420237 - Update docs as HIP_ENABLE_WARP_SYNC_BUILTINS is not used anymore

* SWDEV-420237 - change wording 'would be' for 'are'

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - fix redundant wording

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - use 'and'/'or' consistently when talking about supported types

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - Do not use future tense in reduce sync operation documentation

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - make performance remark an 'informational note', regarding reduce sync operations performance

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-420237 - do not use future tense

Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>

* SWDEV-543182 - changing more future tense that should be present tense

---------

Co-authored-by: Hartgrove, Randy <randy.hartgrove@amd.com>
Co-authored-by: Sahasrabuddhe, Sameer <Sameer.Sahasrabuddhe@amd.com>
2025-08-15 12:39:33 -07:00
Luo, Phoebe 83d3897df9 SWDEV-546217 Complete hip-test Port to Catch2 Framework [Stream and Compute Folder] (#559)
* SWDEV-546498 hipPerfDeviceConcurrency

* SWDEV-546500 hipPerfStreamConcurrency

* SWDEV-546502 hipPerfStreamCreateCopyDestroy.c

* SWDEV-546479 hipPerfDotProduct

* SWDEV-546482 hipPerfMandelbrot

[ROCm/hip-tests commit: 9fdc9a98b7]
2025-08-15 12:38:33 -07:00
Luo, Phoebe 9fdc9a98b7 SWDEV-546217 Complete hip-test Port to Catch2 Framework [Stream and Compute Folder] (#559)
* SWDEV-546498 hipPerfDeviceConcurrency

* SWDEV-546500 hipPerfStreamConcurrency

* SWDEV-546502 hipPerfStreamCreateCopyDestroy.c

* SWDEV-546479 hipPerfDotProduct

* SWDEV-546482 hipPerfMandelbrot
2025-08-15 12:38:33 -07:00
GunaShekar, Ajay ceaa8c8475 SWDEV-546178 - hipModuleLoadFatBinary nvidia impl (#26)
[ROCm/hipother commit: 5107c35166]
2025-08-15 12:37:44 -07:00
GunaShekar, Ajay 5107c35166 SWDEV-546178 - hipModuleLoadFatBinary nvidia impl (#26) 2025-08-15 12:37:44 -07:00
GunaShekar, Ajay daa2814cf3 SWDEV-541623 - cuda parity hipTexObjectCreate when Array is null (#734)
[ROCm/clr commit: 54e592beec]
2025-08-15 12:35:33 -07:00
GunaShekar, Ajay 54e592beec SWDEV-541623 - cuda parity hipTexObjectCreate when Array is null (#734) 2025-08-15 12:35:33 -07:00
Luo, Phoebe 12a1235939 SWDEV-543981 - Performance Test Improvement for Dispatch Speed and Kernel Latency (#527)
* SWDEV-543981 new kernel latency test with different timing modes and taking multiple iterations of same test

* SWDEV-543981 cleanup

* SWDEV-543981 removed outdated hit test

* SWDEV-543981 Updated timing kernel

[ROCm/hip-tests commit: d227a8110c]
2025-08-15 12:34:44 -07:00
Luo, Phoebe d227a8110c SWDEV-543981 - Performance Test Improvement for Dispatch Speed and Kernel Latency (#527)
* SWDEV-543981 new kernel latency test with different timing modes and taking multiple iterations of same test

* SWDEV-543981 cleanup

* SWDEV-543981 removed outdated hit test

* SWDEV-543981 Updated timing kernel
2025-08-15 12:34:44 -07:00
Manocha, Rahul 59536c1331 SWDEV-545952 - Header changes for hipStreamSet/GetAttribute (#311)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: c9b9717f57]
2025-08-15 12:27:37 -07:00
Manocha, Rahul c9b9717f57 SWDEV-545952 - Header changes for hipStreamSet/GetAttribute (#311)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-08-15 12:27:37 -07:00
Kian Cossettini 947adacb3a [rocprofiler-systems] Fixed build-docker script failure on sparse checkout (#364)
Created `containers.yml` file in docker folder.
Updated `build-docker.sh` script to read from this file to determine supported OS + ROCm combinations.
2025-08-15 13:50:43 -04:00
Ammar ELWazir 3dd35a51f4 ROCProfiler SDK CI Fix (#365) 2025-08-15 12:18:28 -05:00
Arandjelovic, Marko 4e4caf3ff8 SWDEV-512604 - Use the same context during Image creation (#800)
[ROCm/clr commit: be73d024e2]
2025-08-15 18:31:11 +02:00
Arandjelovic, Marko be73d024e2 SWDEV-512604 - Use the same context during Image creation (#800) 2025-08-15 18:31:11 +02:00
Daniel Su d3e29aaea9 [Ex CI] add rocprofiler pipeline ID (#355) 2025-08-15 10:20:02 -04:00
hkasivis 53ba025a2e libhsakmt: Don't use MADV_DONTFORK for paged memory (#356)
Also advice parameter of madvise() system call is not a bitmask. So fix
that also

v2: Use MAP_SHARED instead of MAP_PRIVATE. This avoids MMU notifiers and
    evictions.

Signed-off-by: Harish Kasiviswanathan <Harish.Kasiviswanathan@amd.com>
2025-08-15 09:22:20 -04:00
Dittakavi, Satyanvesh e81464f1ad SWDEV-547629 - Fix hipLaunchHostFunc as it is supported during stream capture (#823)
[ROCm/clr commit: ddd96c1e5d]
2025-08-15 15:19:59 +05:30
Dittakavi, Satyanvesh ddd96c1e5d SWDEV-547629 - Fix hipLaunchHostFunc as it is supported during stream capture (#823) 2025-08-15 15:19:59 +05:30
Joseph Macaranas 7e661bb310 Migration Date Changes (#350) 2025-08-15 02:29:03 -04:00
RahulC 1996dddb73 Update rocm_ci_caller.yml - disable the trigger for draft PRs (#358) 2025-08-14 16:48:44 -07:00
jokim-amd 700afd2d17 Re-Enable IPC DMA Bufs by default
Let ROCr use the new IPC-DMA bufs path.
2025-08-14 18:49:09 -04:00
Arandjelovic, Marko c33359ec62 SWDEV-532479 - Add tracking of hostcall memory allocations (#416)
* SWDEV-532479 - Add tracking of hostcall memory allocations

* SWDEV-532479 - Remove hostcall allocations if request is received

* SWDEV-532479 - Cleanup

* SWDEV-532479 - Naming fix

* SWDEV-532479 - Add new separator after each new function

[ROCm/clr commit: b58faa2f37]
2025-08-15 00:17:24 +02:00
Arandjelovic, Marko b58faa2f37 SWDEV-532479 - Add tracking of hostcall memory allocations (#416)
* SWDEV-532479 - Add tracking of hostcall memory allocations

* SWDEV-532479 - Remove hostcall allocations if request is received

* SWDEV-532479 - Cleanup

* SWDEV-532479 - Naming fix

* SWDEV-532479 - Add new separator after each new function
2025-08-15 00:17:24 +02:00
Manocha, Rahul 2b79ca9435 SWDEV-546328 - API definitions for hipMemsetD2D8/D16/D32 (#789)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/clr commit: 5f86622adc]
2025-08-14 14:47:28 -07:00
Manocha, Rahul 5f86622adc SWDEV-546328 - API definitions for hipMemsetD2D8/D16/D32 (#789)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-08-14 14:47:28 -07:00
Manocha, Rahul 31cf3d8487 SWDEV-546328 - hipMemsetD2D API declarations (#294)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/hip commit: fb19412da3]
2025-08-14 14:47:21 -07:00
Manocha, Rahul fb19412da3 SWDEV-546328 - hipMemsetD2D API declarations (#294)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>
2025-08-14 14:47:21 -07:00
systems-assistant[bot] a42c58ff25 Update Workflows for Ubuntu 24.04 (#111)
* Updated ubuntu noble workflow

* Actually use ROCm when supposed to; Added Python 3.13; Set ompt flag to ON

* Print compiler version outside of if logic

Co-authored-by: David Galiffi <David.Galiffi@amd.com>

* Move compiler version info

Co-authored-by: David Galiffi <David.Galiffi@amd.com>

* Updated `working-directory`

set to `projects/rocprofiler-systems`

* Update .github/workflows/rocprofiler-systems-ubuntu-noble.yml

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* Simplifying some of the build options

---------

Co-authored-by: Kian Cossettini <Kian.Cossettini@amd.com>
Co-authored-by: David Galiffi <David.Galiffi@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-08-14 16:31:56 -04:00
jamessiddeley-amd 2936a468aa [rocprof-compute] updated formatting workflow and docs (#348)
* updated formatting workflow and docs

* restored requirements.txt
2025-08-14 14:12:24 -04:00
Arandjelovic, Marko 208d124f54 SWDEV-547453 Release the kernel command if the operation returns an error (#807)
* SWDEV-547453 Release the kernel command if the operation returns an error

* SWDEV-547453 - Initialize parameters_ to default value

* SWDEV-547453 - Run clang-format

[ROCm/clr commit: a15957fee9]
2025-08-14 20:08:53 +02:00
Arandjelovic, Marko a15957fee9 SWDEV-547453 Release the kernel command if the operation returns an error (#807)
* SWDEV-547453 Release the kernel command if the operation returns an error

* SWDEV-547453 - Initialize parameters_ to default value

* SWDEV-547453 - Run clang-format
2025-08-14 20:08:53 +02:00
systems-assistant[bot] 3fd8af5974 rocr: SvmPrefetch to a particular node (#294)
Previously regardless of hsa_agent passed the prefetch is always driven
to node 0, now the agent of interest may be properly prefetched.

Signed-off-by: Sunday Clement <Sunday.Clement@amd.com>
Co-authored-by: Sunday Clement <Sunday.Clement@amd.com>
Co-authored-by: systems-assistant[bot] <systems-assistant[bot]@users.noreply.github.com>
2025-08-14 09:52:45 -04:00
Giovanni Lenzi Baraldi 75eb06dc18 Add realtime clock agent info to ATT records (#319)
* Add agent info to ATT records

* Fix names

* Update projects/aqlprofile/src/pm4/sqtt_builder.h
2025-08-14 13:35:00 +02:00
Stojiljkovic, Vladana 5775e9202b SWDEV-545489, SWDEV-545082 - Free marker when ipc event is recorded (#784)
Co-authored-by: Rahul Manocha <rmanocha@amd.com>

[ROCm/clr commit: da068a2492]
2025-08-14 11:08:53 +02:00