Граф коммитов

11771 Коммитов

Автор SHA1 Сообщение Дата
German Andryeyev 2d492a201b SWDEV-423317 - Enable GPU wait for hip sync calls
hipStreamSynchronize and hipDeviceSynchronize won't longer wait
for CPU commands in DD mode

Change-Id: I079c8bbfc34ddc6d3e2d74c92a34665877e512a5


[ROCm/clr commit: fbea58ba11]
2023-09-22 13:04:27 -04:00
Jatin Chaudhary f62807d411 SWDEV-421819 - Add ptrdiff_t and clock_t.
Rework types inside new __HIP_NO_STD_DEFS__ guard

Change-Id: I2b67381c01ba497b323864afe0a8d1402e396748


[ROCm/clr commit: 49bb6a4072]
2023-09-22 05:07:52 -04:00
pghafari f088f06be6 SWDEV-369567 - [ABI Break] updating gridY and Z max dimension check
Change-Id: If515fda4938d3c5f455a02b4a65adaad76dc0aed


[ROCm/clr commit: 96f41fad07]
2023-09-22 04:52:27 -04:00
sdashmiz 0e64e42c60 SWDEV-330665 - [ABI Break] check for attribute
- Add support for DevAttrHostRegisterSupported

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: Ica8d323eda97b65ef0088aa99c84dd36cfe4edf3


[ROCm/clr commit: 8705322f10]
2023-09-22 04:51:24 -04:00
Satyanvesh Dittakavi 9ee9f04cdd SWDEV-407358 - Address MIGraphx -Werror issue with unsafe buffer access
Change-Id: If4189dfbf12b670af60782f09ff8cf9b9867c299


[ROCm/clr commit: 16fb3ca923]
2023-09-22 04:50:32 -04:00
Ioannis Assiouras fb17aa5c41 SWDEV-401489 - [ABI Break] Removed hipModuleLaunchKernelExt
Change-Id: I0b31d9db755f8a89ead203dbd238a93c18b76855


[ROCm/clr commit: fe2d747bcc]
2023-09-21 23:50:16 -04:00
kjayapra-amd 142492d9eb SWDEV-422580 - Adding pcie.function to the PCI address string in hipGetDevicePCIBusId.
Change-Id: Iaa73b6028ac7b1d05cc310405d6fe622d764320f


[ROCm/clr commit: 369fdc4d87]
2023-09-21 12:22:05 -04:00
Sourabh Betigeri 5d160eb7ee SWDEV-326798 - [ABI Break] Fixes null stream sync behavior
Change-Id: I84ad21f61779145c198dc68bf0fe02a57d34bf64


[ROCm/clr commit: 3f88fe850b]
2023-09-21 11:57:58 -04:00
Jatin Chaudhary 6abb81112d SWDEV-422808 - Add E5M2 and E4M3 for nvidia headers
Change-Id: Ib40e75c1b2e18d75164607a46e95fc322363c08e


[ROCm/clr commit: 15ba5c0004]
2023-09-21 11:57:04 -04:00
Satyanvesh Dittakavi be8dbcf736 SWDEV-375607 - Fix hipGraphExecUpdate behavior
Addresses the below scenarios like parameters mismatch
in memcopy node, difference in the count of nodes, difference
in the dependencies of the nodes.

Change-Id: I31c6516fb27cc1007809f1b50306fdb0c2356ccc


[ROCm/clr commit: f16d336e32]
2023-09-21 11:17:49 -04:00
Satyanvesh Dittakavi 78a3dc739d SWDEV-395996 - Add HIPRTC support for missing hip headers
hip headers such as hip_math_constants.h, library_types.h,
hip_common.h, channel_descriptor.h, device_functions.h, hip_complex.h,
hip_texture_types.h, math_functions.h, surface_types.h
are added in HIPRTC

Change-Id: I4a4c198449ceb609c3ff55e00b43056c1f085431


[ROCm/clr commit: d7d0f11318]
2023-09-21 03:11:06 -04:00
German c3d48d80fa SWDEV-404889 - Debugger support, trap handler
Add trap handler code into runtime and compile/load during
device initialization. The current interface for trap handler in
PAL is obsolete and the new interface will be provided later.

Change-Id: I1fa702c5d1f2e6731f781369c980d546cf422328


[ROCm/clr commit: e1d34cb24f]
2023-09-19 16:05:02 -04:00
Ioannis Assiouras 7d69dd7598 SWDEV-416741 - Improve logging in find_package(hip)
Change-Id: Iaaaa595c668e5a8e977c9d2d31beca481748f637


[ROCm/clr commit: 5e51335ef6]
2023-09-18 19:10:38 -04:00
Ajay c9c3d951f2 SWDEV-421102 - make clr compatible with c++20
Change-Id: Ia692817232caa3ee97dc0b2139645a0474b582e1


[ROCm/clr commit: 2bae1b1387]
2023-09-18 11:28:29 -04:00
David Salinas f1325c1adf SWDEV-421615 - Correct required perl modules
Change-Id: I4e7e19e666042c0e8bfdb0ae0bfb2fc8d3ffa565


[ROCm/clr commit: bf88980e0a]
2023-09-15 11:46:48 -04:00
Jaydeep Patel e688bfee2c SWDEV-415914 - Use settings from device's setting object.
Change-Id: I4b0cbab361e9e2d5b8164066ec37de1fa212e783


[ROCm/clr commit: d18ae85833]
2023-09-14 19:16:50 +00:00
Julia Jiang 78ced05253 SWDEV-401314 - Fix failure on MI300 for progvar_prog_scope_misc basic test
Change-Id: I56701b050023199a3e56dc5a78553c0c330b8537


[ROCm/clr commit: 42799c9f21]
2023-09-14 11:32:25 -04:00
kjayapra-amd 22bc3d52a3 SWDEV-417244 - Create Unique File Desc Store and guarantee no duplicates fds are opened.
Change-Id: I7be7fee52b673f0b1b3facdfbd847b57f2a47cde


[ROCm/clr commit: 85b645731c]
2023-09-13 17:03:18 -04:00
Alex Xie c0bb71bd74 SWDEV-421710 - Remove unused GL function pointers
Change-Id: Ifd77314b51d4b1494428bf8b611051245dbc3feb


[ROCm/clr commit: 2804aa8e34]
2023-09-12 18:17:04 -04:00
Jaydeep Patel 5b450d5551 SWDEV-418038 - Drop GPU_ENABLE_WAVE32_MODE from RTC.
Change-Id: Iac11f26714e8b3bae14cbf69933ef7c8c8be47da


[ROCm/clr commit: a37c95d7ed]
2023-09-08 02:13:41 -04:00
Anusha GodavarthySurya f464a6ade6 SWDEV-301667 - Optimize performance when graph has single branch
Three for loops iterate over all graph nodes for UpdateStream, FillCommands and
EnqueueCommands has performance drop for large graphs.

Change-Id: I077accf3a4680d5d944b73200fd6498a7a48f25c


[ROCm/clr commit: 530dc6de2a]
2023-09-07 23:35:36 -04:00
Ioannis Assiouras a0ee7e2784 SWDEV-419996 - Allow both kernelParams and extra arguments to be set to null
When kernel function expects no parameters no error should be returned
if both kernelParams and extra arguments are set to null.

Change-Id: I5941bcc400b6fb380e623bdae0233ae3e4f73815


[ROCm/clr commit: 315082e554]
2023-09-07 12:57:46 -04:00
German 5d9912f48b SWDEV-407533 - [ABI Break]Remove Wavelimiter
Change-Id: I6a2f6fb5a0c3acea93fa0200a69679783e76f5bd


[ROCm/clr commit: 7be3a5e33e]
2023-09-07 09:58:41 -04:00
Ioannis Assiouras 97d7b15428 SWDEV-420328 - Initialize AMD_LOG_MASK with decimals instead of hex
Change-Id: Id25510863c51206bca2e50fc93d6e1e1c5cbbfea


[ROCm/clr commit: 1302d6f119]
2023-09-07 03:04:37 -04:00
Jacob Lambert 1ed5b0b9c5 Revert "SWDEV-410182: Link device-libs when compiling source to bitcode"
This reverts commit 58e62063f3.

Reason for revert: There are currently some outstanding issues with the COMPILE_SOURCE_WITH_DEVICE_LIBS Comgr action (https://ontrack-internal.amd.com/browse/SWDEV-386072). Once these LLVM issues have been resovled, we can safely re-apply this patch

Change-Id: I8501967af8496ea50d6e4a97399e45db51bbed1e


[ROCm/clr commit: 19526e46e6]
2023-09-06 21:34:03 -04:00
Saleel Kudchadker 586f4abcd7 SWDEV-301667 - Balance AQL packet scopes
- Acquire and Release scopes for AQL need to be balanced. These were
missing on the AMD_OPT_FLUSH=0 path.

Change-Id: Ibf4132eb96800f155d7b664359c790d68a353e60


[ROCm/clr commit: 42c9cd0320]
2023-09-06 13:26:39 -04:00
David Salinas 4963cb0b77 SWDEV-419396 - hipcc.bin.exe and hipconfig.bin.exe missing from build
Correct HIPAMD cmake file for finding/instaling hipcc.bin

Change-Id: I3aa56331615fa81a489464210a9be734e917aa47


[ROCm/clr commit: e1efdfbca1]
2023-09-06 10:55:54 -04:00
Jaydeep Patel f5175f145d SWDEV-417317 - Bypass agent init for HIP.
Change-Id: I3396cf7c1feced03d29f5e5ff3c77474f4f7e1ec


[ROCm/clr commit: 198b7b412d]
2023-09-06 00:40:54 -04:00
Ioannis Assiouras 60522dbdaf SWDEV-419664 - Set reserved0 field of hsa_kernel_dispatch_packet to zero
Change-Id: I2e9c503f1c393548aacad47d40da90a2126e522c


[ROCm/clr commit: 96ac99b4a0]
2023-09-05 19:10:33 -04:00
Ioannis Assiouras 0066c6cb6b SWDEV-420222 - Removed implementation of hipMemcpyArray functions
hipMemcpyArrayToArray, hipMemcpyFromArrayAsync, and hipMemcpyToArrayAsync
are deprecated in cuda and are missing the headers in hip_runtime_api.h.
Removed their implementation from hip_memory.cpp.

Change-Id: I9720aec6241515c56c66b7e90a37b2ed53347eb2


[ROCm/clr commit: 2bc6661737]
2023-09-04 13:35:08 +01:00
Rahul Garg 87bbe33acd SWDEV-418360 - Continue parsing modules even when some module don't have
right binaries built for current device

Change-Id: Icc6a4233fbcede9b2dc6feb78e6bfbd761684c7f


[ROCm/clr commit: c9cf3bdc16]
2023-08-31 14:54:59 -04:00
Ioannis Assiouras f28fa0277e SWDEV-415937 - Make _IMPORT_PREFIX in hip*targets.cmake point to a relative path
Change-Id: Ib0f4321f2954254d0486d741d057f182cb2481a7


[ROCm/clr commit: 209525d6e4]
2023-08-31 08:10:50 -04:00
Sourabh Betigeri 22f367a172 SWDEV-418855 - Limits the 'no GWS' approach to gfx940, gfx11and gfx12
Change-Id: Iab2d34d3142902517124cec7ef3461cf7aa4b98c


[ROCm/clr commit: 7dc78d234d]
2023-08-30 23:48:02 -04:00
German 476e7f7022 SWDEV-418038 - Remove opengl32.lib from linking in runtime
Change-Id: If9416742cd8dc6a35adaccce802d65888e808ef2


[ROCm/clr commit: ad56c20cd0]
2023-08-30 14:07:28 -04:00
Satyanvesh Dittakavi 92c16d7d04 SWDEV-419034 - Use MADV_HUGEPAGE for large host allocations
Change-Id: I80bb1839cdd47eb64a97467c8b01fcdf37195ad5


[ROCm/clr commit: 94e70bee26]
2023-08-30 14:03:26 -04:00
Juan Manuel Martinez 58e62063f3 SWDEV-410182: Link device-libs when compiling source to bitcode
This is related to SWDEV-410182, but it's not enough to fix it.

Functions from device-libs are precompiled into llvm-ir in a "target agnostic" way
(in reality, it's not 100% target agnostic, which brings us many headaches).

When linking builtins (like device-libs) from the command line, we use the flag
-mlink-builtin-bitcode. The difference between regular linking of bitcode and
this flag is that the later propagates target-specific attributes. If this
attributes are not propagated, we can end up with incosistent target attributes.

Comgr provides the action AMD_COMGR_ACTION_COMPILE_SOURCE_WITH_DEVICE_LIBS_TO_BC
for this exact reason. The old action is currently deprecated and this one should
be used.

Change-Id: I518415214debdf4fedf0b1d81456d6e9fb8a3d19


[ROCm/clr commit: f3dc04a50d]
2023-08-30 04:24:17 -04:00
kjayapra-amd 76cc68ff80 SWDEV-381625 - Parse compiler and linker options from environment variable.
Change-Id: Id5a012b678e5973c4b64dff84444a909aefae006


[ROCm/clr commit: 6a0f80a03d]
2023-08-29 20:24:27 -04:00
German db1e03f276 SWDEV-3 - Move PAL to version 818
Restore PAL platform destruction.
Update CmdAllocatorCreateInfo::AllocInfo for the new interface.

Change-Id: Iea418eed7ee26166039a4a9cc1999438856e9097


[ROCm/clr commit: bd00826446]
2023-08-29 12:46:28 -04:00
Jaydeep Patel 6258f8254f SWDEV-375596 - Device context should not be changed while graph exec update for graph nodes other than empty and host node.
Change-Id: I89d734b111aae5b042386175d2786ce9002771b9


[ROCm/clr commit: d2655d84fd]
2023-08-27 22:58:32 -04:00
Saleel Kudchadker 0a26b75238 SWDEV-301667 - Use large signal pool
Use large signal pool if profiler is connected or profiling forced
enabled. This is needed to mitigate signal creation overhead when
profiling as signals are attached to every packet and deeper batch may
show overhead of signal allocation.

Change-Id: I8034b8a20b55328b87d593bf044f59672f9653e8


[ROCm/clr commit: 1ec0ba3537]
2023-08-24 19:17:05 -04:00
German 3f4bbcfdba SWDEV-407533 - [ABI Break]Purge unused env vars
Change-Id: I627950e8ebb6299affc602754a20d442dbe42b14


[ROCm/clr commit: 077311153a]
2023-08-24 14:11:40 -04:00
Jatin Chaudhary 98b3f79d45 SWDEV-418230 - fix warning seen in HIP headers
This PR fixes warning seen on HIP headers due to -Wignored-attributes.
The warning is attributes for a variable must precede definition.
This also fixes a potential issue with C-style header where templates
were declared outside __cplusplus.

Change-Id: I78bfd391717c7891afb442ef79812ea630c0ff49


[ROCm/clr commit: 8fe5100d16]
2023-08-24 08:41:08 -04:00
Tao Sang 3fdd346cf2 SWDEV-417727 - Fix hipSignalExternalSemaphoresAsync()
This reverts commit cab71e6e00.

Implement the right way to make ExternalSemaphores be signalled
only after prior works on the stream have been finished.

Change-Id: I9d5974e05d5f229170b928db4566c14e40e3cbaa


[ROCm/clr commit: d433df4761]
2023-08-23 22:31:27 -04:00
Saleel Kudchadker d7a63b9d66 SWDEV-301667 - Port optimization to save extra packet to graphs
Change-Id: Ibaf64a4efe070c42620e6e153c1862a4a0b15664


[ROCm/clr commit: e1e5d071ba]
2023-08-23 16:58:21 -04:00
taosang2 cab71e6e00 SWDEV-417727 - Fix hipSignalExternalSemaphoresAsync()
Let ExternalSemaphores be signalled only after prior works on the
stream have been finished.

Change-Id: I856917db905f68f55fdf484f5267f7fe8ea3117f


[ROCm/clr commit: 44a3935cda]
2023-08-23 14:58:37 -04:00
German 85d075fa82 SWDEV-404889 - Inital change for debugger support
- Program unique AQL index for debugger. The logic manages AQL array of packets per HW queue.
- Provide debug state to PAL

Change-Id: I38fa1f5435fa711fd1d44dc391f2e61eb2a25efa


[ROCm/clr commit: d97cc0abbd]
2023-08-23 13:21:58 -04:00
kjayapra-amd b432ced424 SWDEV-408473 - Add wait time of 10 us if the waiting signal copy was < 24K.
Change-Id: I438ec9eb07e5034042a4a9a5e6e51d74daba2c83


[ROCm/clr commit: 6f5277c701]
2023-08-23 10:46:33 -04:00
Jatin Chaudhary 105212ef57 SWDEV-414425 - __half2's member variable should be __half instead of unsigned short
We currently have __half2 made up of unsigned short instead of __half.
This prevents users to do operation seamlessly when they want to operate on individual components.

Change-Id: I856917db905f68055fdf484f526707fe8ea3117d


[ROCm/clr commit: 19afdf719e]
2023-08-22 04:25:19 -04:00
Satyanvesh Dittakavi bee336d360 SWDEV-416443 - hipPointerGetAttributes should return correct type for managed memory
Change-Id: Ic197cd8c9d3f5405616ed6e4924ed5577e636b94


[ROCm/clr commit: 84eae1f0a9]
2023-08-21 07:47:32 -04:00
Juan Manuel MARTINEZ CAAMAÑO 0cf78a6dfa Revert "SWDEV-410182: Do not force wave64 for geekbench"
This patch did not consider the dicussions in SWDEV-270908

> "we found that in GeekBench5, forcing Wave64 instead of the default
> Wave32 compute policy yields big gains in every subtest except one"

This reverts commit 10e2958197.

Change-Id: Ice1728585b9d1b2c1b36a06cfa0b8c47cb2bfa49


[ROCm/clr commit: 06dcaae619]
2023-08-21 03:29:32 -04:00