Alysa Liu
5b75ec6a09
rocr: Fix error when internal signal is destroyed ( #1845 )
...
Fix error when we destroy internal signals during shutdown.
Fix init dependency on uninitialized value.
2025-11-26 16:22:57 -08:00
Ioannis Assiouras
a598f9138b
Fix flaky test Unit_hipStreamAddCallback_StrmSyncTiming ( #2022 )
2025-11-26 22:52:58 +00:00
Shadi Dashmiz
962b99f925
SWDEV-567514: Remove default stream wait ( #1977 )
...
- when virtual map command is called
- can create deadlock
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com >
2025-11-26 15:11:52 -05:00
Kian Cossettini
76a23eab14
[rocprofiler-systems] Add support for ompt_callback_thread_begin ( #1681 )
...
* Add thread_begin callback
* Make OMPT callbacks that are instant have start_ts = end_ts
2025-11-26 13:38:04 -05:00
Rahul Manocha
bc6f29c04a
Fix and enable VMM tests on cuda ( #1855 )
...
* Fix and enable VMM tests on cuda
* Minor syntax fixes
---------
Co-authored-by: Rahul Manocha <rmanocha@amd.com >
2025-11-26 08:48:47 -08:00
AidanBeltonS
d849b88aef
SWDEV-558080 - Add recommended granularity ( #1176 )
...
* Add recommended granularity
* Improve granularity testing
* Update based on feedback
2025-11-26 16:10:58 +00:00
Yiltan
f9caef6908
Add rocshmem_int64_p ( #335 )
...
[ROCm/rocshmem commit: d9e2890222 ]
2025-11-26 10:31:23 -05:00
Yiltan
d9e2890222
Add rocshmem_int64_p ( #335 )
2025-11-26 10:31:23 -05:00
Adam Pryor
422253f871
Implement PTL support ( #1957 )
...
* Implement PTL support
Signed-off-by: adapryor <Adam.pryor@amd.com >
(cherry picked from commit 45bc31292e7940a3b8fca044ef7df22047b95733)
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
---------
Signed-off-by: adapryor <Adam.pryor@amd.com >
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
Co-authored-by: Maisam Arif <Maisam.Arif@amd.com >
2025-11-26 08:33:27 -06:00
Matt Arsenault
f089217e6a
SWDEV-548892 - Stop using ockl steadyctr function ( #1882 )
...
Directly use the builtin
2025-11-26 09:29:06 -05:00
amilanov-amd
da9bb4efae
SWDEV-503089 - Fix and enable disabled HIP tests from math group ( #1319 )
...
* SWDEV-503089 - Fix and enable disabled HIP tests from math group
* SWDEV-503089 - Move single precision reduced run to a common function
2025-11-26 10:34:05 +01:00
Todd tiantuo Li
ee48f6221d
SWDEV-562708 - change default maximum SVM size to 256GB ( #1731 )
2025-11-25 23:59:39 -08:00
Matt Arsenault
9fbb062505
SWDEV-548892 - Stop using ocml isinf wrapper ( #1854 )
2025-11-25 22:21:37 -05:00
Karthik Jayaprakash
740a06d567
SWDEV-559267 - Use CLPrint to DevLogPrintf with Log Level - detail debug. ( #1160 )
2025-11-25 19:25:32 -05:00
German Andryeyev
93682f2f75
SWDEV-567852 - Clean-up hip::init() ( #1948 )
2025-11-25 19:05:41 -05:00
cadolphe-amd
cce94f6ee0
SWDEV-557412 - Incorporate proper chunk offset when remapping virtual memory ( #1848 )
...
* SWDEV-557412 - Incorporate proper offset when remapping virtual memory
* Fix condition to check if VMHeap allocation address matches a chunk address
* Move offset calculation outside if/else block
---------
Co-authored-by: JeniferC99 <150404595+JeniferC99@users.noreply.github.com >
2025-11-25 18:05:25 -05:00
marantic-amd
daf8596ce9
[rocprof-sys] Process all information regarding agents and store them as extdata in rocpd database ( #1880 )
...
## Motivation
Resolved: SWDEV-566226
The current implementation of agents inside of rocprof-systems keeps just the minimal necessary set of information required for populating the `info_agent` table inside of rocpd database. There is a sufficient amount of data that is being left out from database, so this change should fix that and store the additional agent information as an `extdata` row inside of `info_agent` table.
## Technical Details
This PR introduces additional filed inside of `agent` structure inside which is representing the JSON formatted string of all the additional information we can acquire about particular agent. This data is processed and added during the initial fetching of agents, and afterwards pushed inside of the database.
---------
Co-authored-by: David Galiffi <David.Galiffi@amd.com >
2025-11-25 17:33:12 -05:00
itrowbri
304c2b82b0
Updated rocprofv3.py to ignore old attach duration msec value ( #1980 )
2025-11-25 16:30:54 -06:00
Victor Zhang
ede71ca3b0
SWDEV-567829 - populateFormatStringHashMap: relax printf hash collisi… ( #1944 )
...
* SWDEV-567829 - populateFormatStringHashMap: relax printf hash collision check for duplicate format strings
* function optimized by ai
2025-11-25 17:19:27 -05:00
isaki001
cf11e2f39f
add back missing proxy-counter updates ( #2052 )
...
[ROCm/rccl commit: da183596cd ]
2025-11-25 15:22:34 -06:00
isaki001
da183596cd
add back missing proxy-counter updates ( #2052 )
2025-11-25 15:22:34 -06:00
anujshuk-amd
85b5c03f36
[rocprof-sys] Fix test build failure on RHEL 10 ( #1955 )
...
## Motivation
To solve: SWDEV-566076
FFmpeg versions >= 58.134 no longer expose read_seek and read_seek2 function pointers in AVInputFormat,
requiring alternative seek detection methods. This pull request updates the `VideoDemuxer` class to improve compatibility with newer versions of FFmpeg. The main change is how the code determines whether the input file is seekable, addressing differences in FFmpeg API versions.
## Technical Details
In `video_demuxer.h`, added a conditional check for `USE_AVCODEC_GREATER_THAN_58_134` to set `is_seekable_` to `true` for newer FFmpeg versions, since `read_seek` and `read_seek2` are no longer exposed in `AVFormatContext`. For older versions, the previous method of checking these fields remains in place. The conditional compilation
now assumes seek capability is available for newer FFmpeg versions.
2025-11-25 15:25:05 -05:00
Bindhiya Kanangot Balakrishnan
e8c3b22734
[SWDEV-556483] Fix runtime PM suspend causing test failures ( #1931 )
...
Added runtime PM detection and DRM ioctl-based device wake
to handle GPUs in BACO state. Modified tests to wake
suspended devices before reading sysfs files.
---------
Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com >
2025-11-25 13:36:45 -06:00
Aryan Salmanpour
d4898975b3
Update optional GPU targets ( #675 )
...
[ROCm/rocdecode commit: 014e0d3e65 ]
2025-11-25 10:15:59 -08:00
Aryan Salmanpour
014e0d3e65
Update optional GPU targets ( #675 )
2025-11-25 10:15:59 -08:00
Aryan Salmanpour
650f1227d3
Update optional GPU targets ( #206 )
...
[ROCm/rocjpeg commit: 5fbcffb11f ]
2025-11-25 10:15:38 -08:00
Aryan Salmanpour
5fbcffb11f
Update optional GPU targets ( #206 )
2025-11-25 10:15:38 -08:00
usrihari123
47e53ec6f3
Update rocpd docs ( #1276 )
2025-11-25 22:33:12 +05:30
German Andryeyev
2c5754844f
SWDEV-465041 - Enable direct dispatch under Linux by default. ( #1934 )
2025-11-25 11:30:32 -05:00
Victor Zhang
92fcc928b6
SWDEV-526773 - Modify LaunchDelayKernel to set a hard coded WallClock… ( #1911 )
...
* SWDEV-526773 - Modify LaunchDelayKernel to set a hard coded WallClock value when it's not avaliable
* Change hardcode clockrate in unit of KHz.
2025-11-25 11:21:03 -05:00
Ethan Trinh
2042191e23
Suppress deprecated-declaration warnings ( #1817 )
...
Co-authored-by: JeniferC99 <150404595+JeniferC99@users.noreply.github.com >
2025-11-25 10:31:30 -05:00
Ethan Trinh
bef946de1c
SWDEV-555551 - Remove hip-test warnings in linux ( #1031 )
...
Co-authored-by: JeniferC99 <150404595+JeniferC99@users.noreply.github.com >
2025-11-25 10:31:15 -05:00
Jason Bonnell
e68873c170
Gersemi formatting for rocprofiler-compute ( #1997 )
...
* Run gersemi formatting on cmake files in compute
* Run gersemi again but on updated version
2025-11-25 09:49:16 -05:00
Gerardo Hernandez
8abfee9f26
SWDEV-541351 - fix use of uninitialized memory in Unit___hip_atomic_compare_exchange tests ( #1976 )
2025-11-25 11:02:14 +00:00
solaiys
3466ec5458
Added PCIE Atomic Operations enable check. ( #1746 )
...
* Added PCIE Atomic Operations enable check.
Tests if atomic operations are enabled for GPU devices.
Displays the Atomic routing capability via Link capability and status.
Signed-off-by: Saravanan Solaiyappan <saravanan.solaiyappan@amd.com >
2025-11-25 14:29:30 +05:30
Gerardo Hernandez
c87014a54c
SWDEV-534207 fix order of kernel launch parameters when calling notifiedKernel in some tests: kernel<<<gridDim, blockDim>>> instead of kernel<<<blockDim, gridDim>>>. This was causing out of bounds accesses ( #1860 )
2025-11-25 06:37:47 +00:00
Pengda Xie
6c31785eaf
SWDEV-562761 - Cleanup static fatbin on runtime teardown ( #1873 )
2025-11-24 21:57:46 -08:00
darren-amd
16e7ee32e6
[rocm-smi-lib] Add iomanip include to frequencies_read ( #1797 )
2025-11-24 16:38:21 -05:00
Young Hui - AMD
a4f533fa92
[rocpd] Fix rocpd convenience scripts to accept --automerge-limit parameter ( #1926 )
...
* remove double RocpdImportData calls from execute() in each module
* formatting fix
2025-11-24 14:50:27 -05:00
Maisam Arif
1f7fc8d8a7
Fixed wrapper to respect symlink pathing ( #1984 )
...
Signed-off-by: Maisam Arif <Maisam.Arif@amd.com >
2025-11-24 13:14:46 -06:00
systems-assistant[bot]
c404fbd851
[SWDEV-560235] Add gpu_board and base_board temperatures to monitor ( #1906 )
...
* Add helpers for gpu_board and base_board temperatures
* Added gpu_board and base_board temperatures arguments for non-default monitor subcommand
Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com >
Co-authored-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com >
2025-11-24 13:12:09 -06:00
Marius Brehler
2dc32d645b
Explicitly load versioned libamdhip64.so ( #1872 )
...
* Explicitly load versioned libamdhip64.so
* Fix syntax errors
* Fix when patching happens in Windows workflow
---------
Co-authored-by: Joseph Macaranas <145489236+jayhawk-commits@users.noreply.github.com >
Co-authored-by: ammallya <ameyakeshava.mallya@amd.com >
2025-11-24 10:05:05 -08:00
Kapil S. Pawar
566671910a
[RcclReplayer] JSON <-> BIN log format conversion tool ( #2056 )
...
* Add replay log format converter
* Add Log Sanitizer
* Add no timestamp option (nts) to sanitizer
[ROCm/rccl commit: 5fd86021a8 ]
2025-11-24 11:51:36 -06:00
Kapil S. Pawar
5fd86021a8
[RcclReplayer] JSON <-> BIN log format conversion tool ( #2056 )
...
* Add replay log format converter
* Add Log Sanitizer
* Add no timestamp option (nts) to sanitizer
2025-11-24 11:51:36 -06:00
sluzynsk-amd
2cf9faa93f
SWDEV-563777 - fix warnings related to inconsistent overrides ( #1625 )
...
This patch adds missing override keywords. Fixes this class of warnings.
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com >
2025-11-24 18:50:07 +01:00
Nilesh M Negi
8c928e60f9
[AzureCI] Increase timeout of per PR and nightly pipeline to 240 mins ( #2074 )
...
[ROCm/rccl commit: db52690c2a ]
2025-11-24 10:55:36 -06:00
Nilesh M Negi
db52690c2a
[AzureCI] Increase timeout of per PR and nightly pipeline to 240 mins ( #2074 )
2025-11-24 10:55:36 -06:00
habajpai-amd
1a3564a51a
[rocprof-sys] Fix fork() handling for GPU profiling and AMD SMI ( #1930 )
...
- Fix fork() handling for GPU profiling and AMD SMI
- Add hipMallocConcurrency test for CI with GPU
2025-11-24 09:21:27 -05:00
marantic-amd
ebd55d2ce0
Track process_sampler state for CPU sampling ( #1993 )
2025-11-24 15:03:08 +01:00
Aleksandar Djordjevic
a5d554b85a
[rocprofiler-systems] Implement GTest/GMock integration for unit testing ( #1777 )
...
* googletest project set up
---------
Co-authored-by: Aleksandar Djordjevic <adjordje@amd.com >
Co-authored-by: Milan Radosavljevic <milan.radosavljevic@amd.com >
2025-11-24 11:49:30 +01:00