Sameer Sahasrabuddhe
4e2fd192eb
SWDEV-491314 - enable _sync() functions with 64-bit mask argument
...
Change-Id: Ieb13a9e1b2fc49ff225a05a51056d1212d95ae57
2024-12-01 10:16:59 -05:00
Sourabh Betigeri
bd5d8e9baf
SWDEV-440866 - [hip-roclr] Adds support to batch memory operations APIs
...
Change-Id: I449ffca44bbb04d13348d112e896d603c70fd485
2024-11-30 17:54:32 -05:00
Anusha GodavarthySurya
c47f9dda58
SWDEV-469422 - Clean up graph code: remove parallellists and nodewaitlists
...
Change-Id: I00c7b2894333bd13d47b913d3fcdd6e1ffcb741f
2024-11-30 04:40:51 -05:00
taosang2
f1f4f40c5b
SWDEV-447973 - Support generic targets
...
Change-Id: I32db83843e45e0f013591493aafd7a532c881e16
2024-11-29 10:12:10 -05:00
Vladana Stojiljkovic
b75b0d9a53
SWDEV-494612 - Add capture support for hipLaunchCooperativeKernel
...
Change-Id: I6b3c6af55c60cffd43ce6f47b75998f750b75703
2024-11-29 08:17:41 -05:00
Anusha GodavarthySurya
fb7ad8361c
SWDEV-489084 - Update max streams for graph
...
Change-Id: I6d0992b2e80ebf3184911593a4f3574327b2e9c3
2024-11-29 08:16:16 -05:00
Anusha GodavarthySurya
06e6561eb5
SWDEV-489084 - Avoid using queue colliding with the graph launch stream
...
Change-Id: I3ecaf8836c8e0883441275139041c702aba0937e
2024-11-29 08:15:58 -05:00
Sebastian Luzynski
019abdc3bd
SWDEV-465085 - replace asserts inside API calls
...
This change replaces some asserts, which were only active in debug
builds, with standard error handling.
Signed-off-by: Sebastian Luzynski <Sebastian.Luzynski@amd.com>
Change-Id: I112f9e56f921abd72daf0d11e4ecdcb7b1a9f9e6
2024-11-29 04:11:39 -05:00
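The difference between a debug-only assert and standard error handling, as described in SWDEV-465085 above, can be sketched roughly as follows. This is a minimal illustration, not the runtime's code: `StatusSketch`, `fillAsserting`, and `fillChecked` are hypothetical names, and the status enum is not the HIP error enum.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical status codes for illustration; not the HIP error enum.
enum class StatusSketch { Success, InvalidValue };

// Before: the check disappears when NDEBUG is defined (release builds),
// so a null pointer would be dereferenced there.
inline StatusSketch fillAsserting(int* dst, std::size_t count, int value) {
  assert(dst != nullptr);
  for (std::size_t i = 0; i < count; ++i) dst[i] = value;
  return StatusSketch::Success;
}

// After: the check is always compiled in and is reported to the caller.
inline StatusSketch fillChecked(int* dst, std::size_t count, int value) {
  if (dst == nullptr) return StatusSketch::InvalidValue;
  for (std::size_t i = 0; i < count; ++i) dst[i] = value;
  return StatusSketch::Success;
}
```

With the checked variant, a caller passing a null pointer gets an error code in every build type instead of undefined behavior in release builds.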
Marko Arandjelovic
e94d9b1763
SWDEV-489617 - Make any host to any host memcpy synchronous
...
Change-Id: I2a29d1a433508f9b4b67b48c47bb4a4eebac0cb3
2024-11-29 03:48:28 -05:00
Aidan Belton-Schure
f42a87dc2f
SWDEV-485827 release initial_heap_buffer_
...
This PR adds the initialization and release of initial_heap_buffer_
to prevent memory leaks.
Change-Id: I4ab8721b439a1a3a6f6e53d63d870e572f7c984a
2024-11-28 10:31:26 -05:00
Satyanvesh Dittakavi
e3b8754448
SWDEV-477584 - Match hipGetLastError behavior with CUDA using env var
...
Change-Id: I4c5acff180ae904028f7c5fdf4e109ffd1f0c4ef
2024-11-28 01:33:52 -05:00
Anusha GodavarthySurya
17e7b7c2ef
SWDEV-472840 SWDEV-461980 - Fix null stream sync performance
...
=> If the null stream has not been created by the time of the sync, skip null stream creation
=> Do a CPU wait on the blocking streams and on the null stream if it exists
Change-Id: I90d6ced6a2dd1782ba58f3fed4e3608fc0efa55a
2024-11-27 10:29:15 -05:00
Aidan Belton-Schure
9652d69575
SWDEV-436099 Use new amdgcn_ballot builtin
...
Change-Id: I024fabc6c5b3f39c66885eb7615953f4d0432e9a
2024-11-27 04:34:50 -05:00
Satyanvesh Dittakavi
70b20857e9
SWDEV-494808 - Do not allow hipMallocAsync/hipFreeAsync when another stream is capturing
...
The hipMallocAsync/hipFreeAsync APIs should return an error stating that
the operation is not supported if a stream is actively capturing
and is different from the passed stream.
Change-Id: I2a1b8260c5eb22d99a936ac529d6788a83f81a17
2024-11-26 12:12:56 -05:00
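The rule stated in SWDEV-494808 above can be modeled with a small check. This is only a sketch under assumptions: `StreamSketch`, `CaptureStatus`, and `checkAsyncAllocAllowed` are hypothetical stand-ins, not the real HIP types, error codes, or runtime functions.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-ins; not the real HIP types or error codes.
enum class CaptureStatus { Success, NotSupported };

struct StreamSketch {
  bool capturing = false;
};

// Returns NotSupported when any stream other than `target` is capturing.
// A capture on `target` itself does not trigger the error.
inline CaptureStatus checkAsyncAllocAllowed(
    const std::vector<const StreamSketch*>& streams,
    const StreamSketch* target) {
  for (const StreamSketch* s : streams) {
    if (s != target && s->capturing) return CaptureStatus::NotSupported;
  }
  return CaptureStatus::Success;
}
```

In this model an allocation on a stream that is itself capturing is still allowed; only a capture in progress on a different stream is rejected.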
amd-jmacaran
31ee9f02ff
SWDEV-458516 - External CI: Support commits on both staging and mainline
...
Change-Id: I1970029bdd6469a7a79c28b63b451e8be462e142
2024-11-26 11:36:18 -05:00
Rahul Manocha
1ac9286b5a
SWDEV-497288 - Fix Segfault when kernel param is nullptr
...
1) When kernelParam_.func is nullptr, validation fails in the
setParam call and no memory is allocated for kernelParams.
2) The destructor path then segfaults trying to free the kernelParams memory.
3) The copy of params is now done only after function validation succeeds.
Change-Id: I6338e0c89f259632e4115f0508e2f240bc207fd9
2024-11-26 11:34:41 -05:00
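The ordering described in SWDEV-497288 above (validate first, copy afterwards, and keep the destructor safe when nothing was allocated) might look roughly like this. `KernelNodeSketch` and its members are hypothetical and do not mirror the runtime's actual class.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <cstring>

// Hypothetical node type for illustration; not the runtime's actual class.
struct KernelNodeSketch {
  const void* func = nullptr;
  void* paramsCopy = nullptr;  // stays null until validation succeeds

  KernelNodeSketch() = default;
  KernelNodeSketch(const KernelNodeSketch&) = delete;
  KernelNodeSketch& operator=(const KernelNodeSketch&) = delete;

  // Validates first, and only then allocates and copies the parameters.
  bool setParams(const void* f, const void* params, std::size_t size) {
    if (f == nullptr || params == nullptr || size == 0) return false;
    void* copy = std::malloc(size);
    if (copy == nullptr) return false;
    std::memcpy(copy, params, size);
    std::free(paramsCopy);  // free(nullptr) is a no-op
    paramsCopy = copy;
    func = f;
    return true;
  }

  // Safe even if setParams never succeeded, because paramsCopy is null.
  ~KernelNodeSketch() { std::free(paramsCopy); }
};
```

Because the pointer is initialized to null and only assigned after a successful copy, a failed validation leaves nothing for the destructor to free.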
Rahul Manocha
e0c11624e5
SWDEV-497288 - Enable hipGraphExecSetParams for Ext SemWait and SemSignal Nodes
...
Change-Id: I7184a3a04ac17d3d841222ae1559db66d73a429c
2024-11-26 11:34:18 -05:00
Anusha GodavarthySurya
9820480cbd
SWDEV-491643 - AQL packets are captured for kernels; disable SDMA profiling
...
hsa_amd_profiling_async_copy_enable takes 45 us on its first call. Disable SDMA profiling when enqueuing captured kernel packets and for the accumulate command.
Change-Id: I80b51a58c46bccc9c1025e9331515f57c97b5a2a
2024-11-26 08:37:31 -05:00
kjayapra-amd
39b64b57d9
SWDEV-496106 - Use lock_ops to avoid race condition in safe_stream.
...
Change-Id: I5e98c34a41434e2f7d2e6406ea30ddcf01b1091c
2024-11-26 06:56:56 -05:00
Anusha GodavarthySurya
25a893658a
SWDEV-491643 - DEBUG_HIP_FORCE_GRAPH_QUEUES is 1 enable capture path
...
Change-Id: Ibddd50592232b090bf5eab8395fe78a36bb3a14a
2024-11-25 05:21:10 -05:00
Jaydeep Patel
3f3f3d0f1c
SWDEV-498816 - A detached thread might exit, so accessing its pthread_t to check validity is invalid.
...
Hence, the check is not required if the thread has already finished processing packets.
Change-Id: If1b43a169a06203f3e1ab0529cf592879496d7c4
2024-11-24 01:38:47 -05:00
Ioannis Assiouras
2cc9f59774
SWDEV-494409 - Fixed segfault caused by aligned_free called on unaligned memory
...
Change-Id: I8ddea5c309b568db25c92789bf71be341d19c3ba
2024-11-22 03:08:04 +00:00
Saleel Kudchadker
2273a1dbdc
SWDEV-497886 - Fix unaligned size copy for kernel args
...
Change-Id: If6675b98178aeb35f376d6994555cbf941b048c3
2024-11-21 14:30:04 -05:00
Maneesh Gupta
ec6f83b544
Revert "SWDEV-475482 - hipTexObjectCreate align with cuda"
...
This reverts commit efce2f77c4.
Reason for revert: Even though this change is valid, this would break backward compatibility.
Change-Id: I9c7cab83198c8d5c8485b11194099162e3e7a874
2024-11-21 00:55:32 -05:00
German Andryeyev
9473f143c2
SWDEV-486602 - Disable sysmem pool
...
Currently amd::Monitor can work in FILO mode for the active waits
and delay the wakeup of some threads. That may cause problems
with the current sysmem pool design.
Change-Id: I145081478d1e0b282d8838855c5718f09cf54b69
2024-11-20 11:35:28 -05:00
Rahul Manocha
efce2f77c4
SWDEV-475482 - hipTexObjectCreate align with cuda
...
Support zero width and height for hipTexObjectCreate to align
with CUDA.
Change-Id: I5d4c48625faf5f060ed2a7e634ec65e4ecac9da5
2024-11-19 23:28:55 -05:00
Rahul Manocha
a2d8199402
SWDEV-494409 - Fix destructor path for Static and Dynamic CO
...
1) For Dynamic CO variables, free the device pointer in
DynCO destructor instead of DeviceVar destructor.
2) For Static CO Remove Fatbinary,
only call hipFree for valid device Vars instead of all devices.
Change-Id: I84291f5371b2c05d1d0bcdb4f9c6bd122e7c9b21
2024-11-19 13:04:44 -05:00
German Andryeyev
e2eeb20c00
SWDEV-494231 - Revert TS optimization
...
Runtime may use checkGpuTime() for the wait and not just for the GPU time queries. Hence, the call can't be skipped if profiling isn't enabled.
More changes are required for this optimization.
Change-Id: I79e8918312e755d75f0d26685f2fdc604a8ffb18
2024-11-19 10:17:38 -05:00
Jaydeep Patel
d997f78be4
SWDEV-498077 - Check topoOrder_ before accessing it.
...
Change-Id: I10e3c24ca8dc1009b8ac8ac27b3e9a6296f9a7ee
2024-11-19 04:50:47 -05:00
Ioannis Assiouras
1a710dabcf
SWDEV-496003 - Add support for dynamic functions in hipFuncSetAttribute
...
Modified hipFuncSetAttribute to handle pointers to dynamic functions
returned by hipModuleGetFunction.
Change-Id: I54b98f9d31a79630dd7edcd363fad81f1d89219b
2024-11-19 04:50:15 -05:00
Saleel Kudchadker
e4d29e228d
SWDEV-495987 - Use shared mutex for MemObj maps
...
- Remove the binning logic. Although useful, it doesn't work in the current
scenario because there is no upper limit on the size of an allocation. If an
app or framework uses the entire VRAM and then creates suballocations, binning
would result in failure.
Change-Id: Icc27c13e433bb4a1f03e82028d8718488b43bfa5
2024-11-19 04:28:20 -05:00
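As an illustration of the shared-mutex approach named in SWDEV-495987 above, the following sketch guards a single ordered map with `std::shared_mutex`, so lookups can run concurrently while inserts and erases take exclusive access. `AllocationMapSketch` is a hypothetical class, not the runtime's MemObj map, and the single unbinned map is an assumption about what replaces binning. It requires C++17.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <mutex>
#include <shared_mutex>

// Hypothetical registry for illustration; not the runtime's MemObj map.
class AllocationMapSketch {
 public:
  // Writers take the lock exclusively.
  void insert(std::uintptr_t base, std::size_t size) {
    std::unique_lock<std::shared_mutex> lock(mutex_);
    ranges_[base] = size;
  }

  void erase(std::uintptr_t base) {
    std::unique_lock<std::shared_mutex> lock(mutex_);
    ranges_.erase(base);
  }

  // Readers share the lock. Finds the allocation containing addr and
  // stores its base address; returns false if no allocation contains it.
  bool find(std::uintptr_t addr, std::uintptr_t* base) const {
    std::shared_lock<std::shared_mutex> lock(mutex_);
    auto it = ranges_.upper_bound(addr);
    if (it == ranges_.begin()) return false;
    --it;  // last entry whose base is <= addr
    if (addr - it->first >= it->second) return false;
    *base = it->first;
    return true;
  }

 private:
  mutable std::shared_mutex mutex_;
  std::map<std::uintptr_t, std::size_t> ranges_;
};
```

A single ordered map places no limit on allocation size, which is the constraint the commit message gives for dropping binning; the trade-off is that all lookups contend on one lock, which the shared (reader) mode is meant to soften.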
Ioannis Assiouras
0fa632c52c
SWDEV-483134 - Reverted the deprecation of hipHostMalloc/hipHostFree
...
Change-Id: I065031b8438330c5eb7a7ae510b8d6619b500d04
2024-11-18 04:30:31 -05:00
Julia Jiang
3bd32d7d4b
SWDEV-497618 - Define GL group of doxygen for HIP API
...
Change-Id: Ia43f25201df960b905e5fd45481f486e7bffabdb
2024-11-15 11:25:51 -05:00
Alex Xie
d48a4fec24
SWDEV-494876 - OCLtst P2P test fail
...
A variable is used without assignment.
Change-Id: Ie1dac8daaa06c5d348dbf1cfa0aca65f84b12816
2024-11-15 10:52:38 -05:00
Julia Jiang
a27c627731
SWDEV-497324 - Rename CLR as Compute Language Runtime
...
Change-Id: Ie2854eeac764fba770b296a63d76d547139d3cc0
2024-11-13 15:04:12 -05:00
Ioannis Assiouras
9f1cc590d8
SWDEV-493913 - Replaced calls to clGetExtensionFunctionAddress in ocltst
...
Replaced clGetExtensionFunctionAddress calls with
clGetExtensionFunctionAddressForPlatform to ensure
interoperability with distribution ICD loaders.
Change-Id: I560a62459f2ad222750e65e869b98d6b6ec56665
2024-11-12 15:56:17 +00:00
Jaydeep Patel
24c57cb984
SWDEV-496544 - Reset mem alloc node count for AutoFreeOnLaunch.
...
Change-Id: Ib32b04584548a46632606ecd85b58c6ce4a5894d
2024-11-11 11:03:32 +00:00
taosang2
cc25c5d646
SWDEV-487356 - Fix AMD LOG compile warning
...
Change-Id: I757185f9c7c12f736e266219b67daf5836d2a125
2024-11-09 12:57:22 -05:00
Saleel Kudchadker
582dc7dd6d
SWDEV-446123 - Revert "Match hipGetLastError behavior with CUDA using env var"
...
This reverts commit 5f477900a3.
Reason for revert: <INSERT REASONING HERE>
Change-Id: I11a456655393bcf4b82d749ce7259bc1b78d1424
2024-11-08 20:35:13 -05:00
Satyanvesh Dittakavi
5f477900a3
SWDEV-446123 - Match hipGetLastError behavior with CUDA using env var
...
Change-Id: Iaec697c1304d746376ecf2bfe2ad683b15ee189f
2024-11-07 12:02:34 -05:00
Alex Xie
b5aab7f6b4
SWDEV-491168 - expose one more API in glGetExtensionFuncionAddress
...
Change-Id: Ia78b6b9377d99298c68968ef0eddd87534e0f750
2024-11-07 10:15:05 -05:00
Branislav Brzak
6e55f40313
Revert "SWDEV-465201 - Match negative hipModuleLoad return with Cuda"
...
This reverts commit 68e5aeb93d.
Reason for revert: Breaking change that will be merged in at a later date
Change-Id: Idd300492cc08a57c50decc22df287ddcc5463c88
2024-11-07 09:51:50 -05:00
Vladana Stojiljkovic
62e1d50c2a
Revert "SWDEV-491452 - Allow hipMemAdvise capturing only in relaxed mode"
...
This reverts commit 02bbe11e56.
Reason for revert: It is considered a breaking change.
Change-Id: Ia1f7b27a6d3caa4a9fe12bee469f5f0a6b8327fd
2024-11-07 09:50:21 -05:00
Vladana Stojiljkovic
ed458615c4
Revert "SWDEV-492768 - Match hipStreamAddCallback capture behavior with nvidia"
...
This reverts commit f6c8bbf4dc.
Reason for revert: It is considered a breaking change.
Change-Id: I9b2dfc60f62eec7268c0374b0b33e45eca7b3bbb
2024-11-07 09:49:55 -05:00
Vladana Stojiljkovic
48df130d87
Revert "SWDEV-490474 - Allow hipMallocManaged capturing only in relaxed mode"
...
This reverts commit 6deecf1bfe.
Reason for revert: It is considered a breaking change.
Change-Id: I6bfd81dcc16d36304efa2f151969427249a4122f
2024-11-07 04:33:07 -05:00
Julia Jiang
11a4cb3c34
SWDEV-494501 - Fix CTS api negative sub-test failures
...
Change-Id: I0dd67467d977569fca45dcdad28a96c839bae33b
2024-11-05 14:58:01 -05:00
Ajay
aa4f27c29b
SWDEV-495842 - use find_program clang, llvm-mc instead of hip_package
...
Reduces the set of files needed to build the HIP runtime on Windows.
Change-Id: I1d27b72d3b0551b9261524606f08361d00463394
2024-11-04 23:09:11 -05:00
Branislav Brzak
68e5aeb93d
SWDEV-465201 - Match negative hipModuleLoad return with Cuda
...
Change-Id: Id6c682e7f9ec8d34e9c8d5d5933f31e003c67e27
2024-11-04 03:58:06 -05:00
Jaydeep Patel
c24bc05cac
SWDEV-491409 - Handle DynCO path to delete DeviceVars.
...
Change-Id: I757185e9c7c12f236e266219b67daf5836d2a122
2024-11-04 01:52:26 -05:00
Saleel Kudchadker
f1e98ab6e4
SWDEV-483586 - Do not take pinned path for read/write
...
- When GPU_FORCE_BLIT_COPY_SIZE is set do not take pinned path
Change-Id: Iaa065db63cc8fda61f82e6c9701e9fdaec5c54cb
2024-11-01 12:55:15 -04:00