Vladana Stojiljkovic
30cb2d0e67
SWDEV-498061 - Add capture support for hipModuleLaunchCooperativeKernel
...
Change-Id: I5ed188e046c680c2785b3952391f59ed1d0c21b8
2025-01-16 10:54:30 -05:00
Marko Arandjelovic
db8527f655
SWDEV-489619 - Fix memcpy tests with capture stream enabled
...
- Added missing validation as graph node should not be created
if parameters are invalid
- Fix conversion of input params to graphNode params
Change-Id: I37ab04942b5fb2eb07386850cb7dbbf26f9ca967
2025-01-16 10:31:04 -05:00
Marko Arandjelovic
c46f843b99
SWDEV-504084 - Make hipModuleGetFunction use the device the module is loaded on
...
If a module is loaded on one device, hipModuleGetFunction and other similar APIs should be able to run successfully from another device.
Change-Id: I96084cbd6c6dcf2a81019779a6ab1842ef2f35d1
2025-01-16 10:16:42 -05:00
Ioannis Assiouras
5e3a29078d
SWDEV-505503 - Use internal device synchronize function in __hipUnregisterFatBinary
...
This is to avoid calling the HIP_INIT macro during the shutdown process.
Change-Id: I2e65f6e10491918a17445ee1e8ddd08286070358
2025-01-15 18:57:34 -05:00
Sourabh Betigeri
9d8d35ae40
SWDEV-507960 - Return with error code if stream of type hipStreamLegacy is being attempted to destroy
...
Change-Id: Iee7ada6a5a905b44360a7e4049fc8b1a45c80db0
2025-01-13 18:17:37 -05:00
Ioannis Assiouras
44b6b6813d
SWDEV-503760 - Only consider allocations that are less than X% larger in a mempool request
...
Change-Id: I94acbca606fd4c575e2e1a9e34959ce650571867
2025-01-13 16:57:26 -05:00
Rahul Manocha
2b32d9aada
SWDEV-497288 - Add validation checks to hipGraphExecNodeSetParams
...
Change-Id: If8cde47bc8e62414333768e01064298d8a3d80ee
2025-01-13 16:49:45 -05:00
Anusha GodavarthySurya
683a942364
SWDEV-480209 - Make internal callbacks non-blocking
...
Change-Id: Ic918d08f341abfd9a7c167d09f9c723cdc43157f
2025-01-10 02:16:11 -05:00
Saleel Kudchadker
a22c45d635
SWDEV-508004 - Improve hipEventRecord
...
- Resolve stream once for event record. We should avoid calling
getStream again in addMarker
Change-Id: I78448c4f151ae10a5c8e8c248b2f4078b84191cb
2025-01-09 16:47:46 -05:00
German Andryeyev
45a12208b6
SWDEV-507019 - Change the function lock to the module lock
...
Multiple functions can be located in the same module.
Change-Id: Ia4ca3db64fe5b0822584059d3770c91103665c63
2025-01-09 12:35:29 -05:00
Rahul Manocha
05baf9ff22
SWDEV-504215 - fix rocalution perf drop by disabling cpu wait
...
Change-Id: I878f3420073b05cc6241f524ac428e47c0ce823d
2025-01-07 17:02:24 -05:00
Marko Arandjelovic
7e152bb0f3
SWDEV-506234 - Refactor validation in hip_memory
...
Change-Id: I9d69695e4b6668e6de00f1f6b060862872358340
2024-12-31 00:35:25 +02:00
Jaydeep Patel
dff8197b1d
SWDEV-505276 - Parent graph of orig node and original graph of cloned node should be same.
...
Change-Id: I6ebc21cc42e41ad5d952a69fb3b3cb095f32cffb
2024-12-24 04:32:14 -05:00
taosang2
d82d6a78cf
SWDEV-479958 - Support different address mode
...
Support different address modes in X, Y, Z directions
Change-Id: If1db5a8af33c92dd14b48968c3e8eceb97daea6c
2024-12-23 16:39:54 -05:00
Ioannis Assiouras
158b6a29e0
SWDEV-505504 - Disable vectorization in GetHipDispatchTable
...
Change-Id: Id33144623555a5d25e029ca644f6274610dcd0ad
2024-12-20 17:47:07 -05:00
Jaydeep Patel
a05a02e527
SWDEV-505205 - Fix hipStreamLegacy segfault with hipStreamWaitEvent.
...
Change-Id: I17fdaf7ac323507f99a7c071066944296537489c
2024-12-20 04:18:21 -05:00
German Andryeyev
0640d36019
SWDEV-504658 - Reduce the lock scope for kernel look-up
...
The vector with all kernels is preallocated on the executable init.
Thus, reduce the scope of global lock to the binary creation only.
Change-Id: I73035013a6562175069137e895bba815f466ee35
2024-12-18 17:04:51 -05:00
Sourabh Betigeri
cd9db5a2fa
SWDEV-505277 - Adds hipStreamBatchMemOp in the enum of hcc_map
...
Change-Id: I6e58dfbe4ba13db8717edc36020fefabc9ddbe23
2024-12-18 05:38:58 -05:00
German Andryeyev
e3efce20be
SWDEV-504650 - Switch to shared_mutex for events
...
Use shared mutex for events validation
Change-Id: Iff291c758d9edd65717c506150f3b9d39e5306ba
2024-12-17 11:04:58 -05:00
Ioannis Assiouras
e8b2fdab96
SWDEV-483134 - Remove hipExtHostAlloc API
...
Change-Id: I60777ef5c56b60dd8100d0d794ca10fb3b96a555
2024-12-16 17:13:49 -05:00
Anusha GodavarthySurya
13e2e797c0
SWDEV-469422 - Derive GraphExec from Graph and ChildGraphNode from GraphExec
...
Change-Id: I54d67a1665355579bc249d8ff4f9806e9ee14588
2024-12-16 00:43:57 -05:00
Sourabh Betigeri
f1c05e9026
SWDEV-421020 - Adds hipGraphAddBatchMemOp, SetGetParams and execSetParams APIs
...
Change-Id: Ieccecfe6173cc68fd3c01f86c99f7cc09fe194a3
2024-12-13 06:23:39 +00:00
Sourabh Betigeri
03dbcd8ca7
SWDEV-440866 - [hip-roclr] Adds support to batch memory operations APIs
...
Change-Id: I5ac63a6626af8c2b4ac382c52dfe1aaf0b3716b8
2024-12-12 19:29:24 -05:00
Tao Sang
3ad8f1b811
SWDEV-496667 - Support gfx9-4-generic target
...
Support gfx9-4-generic target to cover mi3XX.
Support features sramecc and xnack in generic target.
Improve some code formats.
Add more log on compiler.
Change-Id: I6b3c6af55c60cffd43ce6f17b75998f751b75713
2024-12-12 14:43:39 -05:00
Saleel Kudchadker
537f2fffc9
SWDEV-301667 - Improve logging for hip_memory
...
Change-Id: Id624b2c91e6b701bc0ee561a0c193f2c66654890
2024-12-12 14:42:00 -05:00
Anusha GodavarthySurya
28cbf2bc4f
SWDEV-469422 - Refacor childgraph node
...
Remove static functions in graph
Change-Id: I4df94915f81f250acaea60398aea32ef0ed658e2
2024-12-12 12:38:24 -05:00
Jaydeep Patel
c8afd7109d
SWDEV-496544 - Sync with stream if it is different than srcMemory stream.
...
Change-Id: I9c0f94a9531555278a51202ec7203961e1344c2e
2024-12-12 06:17:39 -05:00
Branislav Brzak
89dfdc4dbe
SWDEV-490860 - Do signal_is_required detection post graph schedule
...
Change-Id: Iaf1067a811aeac3d16c08de954036e219b545e07
2024-12-09 03:57:44 -05:00
Jaydeep Patel
0d4823ff88
SWDEV-502532 - Exit graph launch in case of empty graph.
...
Change-Id: Ifb6ab14ca6810cbc1c9e38c59d1d9e7d367358d9
2024-12-07 12:27:53 -05:00
Ioannis Assiouras
e80442fdbf
SWDEV-497759 - Fix memObj offset computation for hipHostRegister on Windows mgpu
...
On Windows, hipHostRegister may add a single object in the MemObjMap
that maps to memory that is allocated on different devices.
This change ensures that the offset that is returned from
getMemoryObject() is computed relative to the memory that is allocated
on the current device.
Change-Id: I5fd3af200bf6f4926fdeaea12dcb9d0154d3a843
2024-12-05 16:18:10 -05:00
Marko Arandjelovic
78f62d3230
SWDEV-495609 - Change include path for rocclr/utils
...
- Header files inside rocclr/utils when included from hipamd or opencl should be included as #include "rocclr/utils/xxx.h" instead of "utils/xxx.h"
Change-Id: Ic0760c33b9d091f5620dec67e5482c9698d22093
2024-12-05 11:44:20 -05:00
Branislav Brzak
f2dba978f5
SWDEV-490860 - Include signal_is_required in dot file dumps
...
Change-Id: Iec4b433b11fbecb71a4ce68beb7d6f681d25b8e6
2024-12-05 04:51:59 -05:00
Shane Xiao
231b2410a0
SWDEV-492049 - Remove the handle of Phy Mem from Memobj
...
The hipGraph will use VMM by default when allocating memory.
However, the handle of Phy mem has been added to Memobj by default.
Since the Memobj will track the whole address range from handle to
handle + size, this needs the system to reserve the whole address
range. If the system range have not reserved by the system, then it
will have the potential issue that clr finds the Memobj incorrectly.
This patch removes the handle from the Memobj to fix this potential
issue.
Change-Id: I2da38e6b2d11d0d48e1afe66c46899500c290624
2024-12-04 19:39:52 -05:00
German Andryeyev
efd3ea4b30
SWDEV-501403 - Switch to std::shared_mutex for streamSetLock
...
Shared mutex allows to have access to the list of streams from
multiple threads at the same time.
Change-Id: Ibee64b846cde03321d5b17dbee2829c0bab7e7d6
2024-12-04 12:06:51 -05:00
Anusha GodavarthySurya
b89977d518
SWDEV-469422 - Always schedule graph nodes
...
Change-Id: Icc636527fa19e7bf3eb111bc4b1bb9a5f9acff73
2024-12-03 23:44:23 -05:00
Saleel Kudchadker
6611cc015d
SWDEV-494149 - Improve hipGet/Set Device
...
Change-Id: If8975687a3ba9caadafc48a0066f19a4ebaab9e2
2024-12-03 13:36:38 -05:00
Sourabh Betigeri
2ca644cf22
Revert "SWDEV-440866 - [hip-roclr] Adds support to batch memory operations APIs"
...
This reverts commit bd5d8e9baf .
Reason for revert: hipInfo fails on windows. Updating llvm amd-mainline-closed
Change-Id: I57e1fa1945188b0bc0a799c4f3d540f2b7713003
2024-12-02 16:46:12 -05:00
Marko Arandjelovic
08aee16573
SWDEV-499794 - Update AQL packet after updating GraphNode
...
Change-Id: I332d70bdf42a276894a548a02d636e370c2ca08c
2024-12-02 12:29:35 -05:00
Sourabh Betigeri
bd5d8e9baf
SWDEV-440866 - [hip-roclr] Adds support to batch memory operations APIs
...
Change-Id: I449ffca44bbb04d13348d112e896d603c70fd485
2024-11-30 17:54:32 -05:00
Anusha GodavarthySurya
c47f9dda58
SWDEV-469422 - Cleanup graph code remove parallellists and nodewaitlists
...
Change-Id: I00c7b2894333bd13d47b913d3fcdd6e1ffcb741f
2024-11-30 04:40:51 -05:00
taosang2
f1f4f40c5b
SWDEV-447973 - Support generic targets
...
Change-Id: I32db83843e45e0f013591493aafd7a532c881e16
2024-11-29 10:12:10 -05:00
Vladana Stojiljkovic
b75b0d9a53
SWDEV-494612- Add capture support for hipLaunchCooperativeKernel
...
Change-Id: I6b3c6af55c60cffd43ce6f47b75998f750b75703
2024-11-29 08:17:41 -05:00
Anusha GodavarthySurya
fb7ad8361c
SWDEV-489084 - Update max streams for graph
...
Change-Id: I6d0992b2e80ebf3184911593a4f3574327b2e9c3
2024-11-29 08:16:16 -05:00
Anusha GodavarthySurya
06e6561eb5
SWDEV-489084 - Avoid using queue colliding with the graph launch stream
...
Change-Id: I3ecaf8836c8e0883441275139041c702aba0937e
2024-11-29 08:15:58 -05:00
Sebastian Luzynski
019abdc3bd
SWDEV-465085 - replace asserts inside API calls
...
This change replaces some asserts, that were only available in debug
mode, with standard error handling.
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com >
Change-Id: I112f9e56f921abd72daf0d11e4ecdcb7b1a9f9e6
2024-11-29 04:11:39 -05:00
Marko Arandjelovic
e94d9b1763
SWDEV-489617 - Make any host to any host memcpy synchronous
...
Change-Id: I2a29d1a433508f9b4b67b48c47bb4a4eebac0cb3
2024-11-29 03:48:28 -05:00
Satyanvesh Dittakavi
e3b8754448
SWDEV-477584 - Match hipGetLastError behavior with CUDA using env var
...
Change-Id: I4c5acff180ae904028f7c5fdf4e109ffd1f0c4ef
2024-11-28 01:33:52 -05:00
Anusha GodavarthySurya
17e7b7c2ef
SWDEV-472840 SWDEV-461980 - Fix null stream sync performance
...
=> If null stream is not created during sync skip nullstrm creation
=> Do cpu wait on blocking & null stream if it exists
Change-Id: I90d6ced6a2dd1782ba58f3fed4e3608fc0efa55a
2024-11-27 10:29:15 -05:00
Satyanvesh Dittakavi
70b20857e9
SWDEV-494808 - Do not allow hipMallocAsync/hipFreeAsync when another stream is capturing
...
hipMallocAsync/hipFreeAsync APIs should return error stating
operation is not supported, if a stream is actively capturing
and is different from the passed stream
Change-Id: I2a1b8260c5eb22d99a936ac529d6788a83f81a17
2024-11-26 12:12:56 -05:00
Rahul Manocha
1ac9286b5a
SWDEV-497288 - Fix Segfault when kernel param is nullptr
...
1) for case where kernelParam_.func = nullptr, the validation
fails in setParam call and memory is not alloced for kernelParams
2) destructor path segfaults trying to free the kernelParams memory.
3) copy of params is done after function validation is successful.
Change-Id: I6338e0c89f259632e4115f0508e2f240bc207fd9
2024-11-26 11:34:41 -05:00