Commit Graph

11608 Commits

Author SHA1 Message Date
ajay c4e5952687 SWDEV-404940 - __amd_rocclr_initHeap in pal.
Avoid Seg fault on Windows

Change-Id: I3901c003dd33222ec9e2600c519d8503f5e81c01


[ROCm/clr commit: 169d17f47f]
2023-06-16 11:48:42 -04:00
Jatin Chaudhary 0cd45b8cc7 SWDEV-399522 - Fix issue #2 raised on the GitHub clr repo. Adds signed long long atomics
Change-Id: I09c386d45e219e472d2da1449f678b9d17fe3e2e


[ROCm/clr commit: be02e1e524]
2023-06-16 08:53:45 -04:00
Jaydeep Patel de6662f1b7 SWDEV-403768 - Check mem type is managed and result mem addresses are not null.
Change-Id: Ib40c468a981e0dd42b2da65aeb7543b11af4bc90


[ROCm/clr commit: a6e72de83d]
2023-06-16 08:49:21 -04:00
Jaydeep Patel 600ad0e9dc SWDEV-403770 - Add validations.
Change-Id: If36202ba9777d2a0d6f8ee92982dbff92bafe81a


[ROCm/clr commit: 0a20da39cd]
2023-06-16 02:37:25 -04:00
sdashmiz 2216908962 SWDEV-403638 - Fix warnings
- disable deprecated function use warning
- disable size_t to 'type' warning
- disable conversion from 'type1' to 'type2' warning

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I64161fd37cf56de3d132102103267ae8da40193a


[ROCm/clr commit: 38a67df312]
2023-06-15 12:17:22 -04:00
sdashmiz 2e960fae8f SWDEV-403063 - Clean up hip graph
- Replace dynamic allocation with embedded struct in hip graph

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: Idfe3f7393fa50d6e510d4c88f779408f96d3accb


[ROCm/clr commit: cc85289739]
2023-06-15 12:17:09 -04:00
German af5944dc71 SWDEV-311270 - Add IPC support for memory pools
Initial implementation for hipMemPoolExportToShareableHandle,
hipMemPoolImportFromShareableHandle,
hipMemPoolExportPointer and hipMemPoolImportPointer

Change-Id: I0ebdc48e9163b394ded560adca6c38bbc5aee7d1


[ROCm/clr commit: 1a0c3e4dc4]
2023-06-15 11:36:52 -04:00
Jaydeep Patel d6086f9d69 SWDEV-403766 - Release mem obj during device reset.
Change-Id: I8a93061c9c4ed692bf7ba1b872af370f79a13a5d


[ROCm/clr commit: ec75a3e60e]
2023-06-15 09:52:50 -04:00
Jatin Chaudhary 8913c4de4f SWDEV-400136 - change make_complex function name to make_hipComplex
Change-Id: Ia276610286df80ec35fe11f303eb9919eeda87dd


[ROCm/clr commit: f7f70a0880]
2023-06-15 09:02:38 -04:00
Saleel Kudchadker 87652b6c93 SWDEV-384557 - Use toggle for SDMA query
- Use HIP_USE_SDMA_QUERY env var toggle for new API use. Env var is 0 by
default

Change-Id: If725a0c41e15f78a1a6c3f47942954fe9240b4db


[ROCm/clr commit: 8d193c32bb]
2023-06-15 01:02:24 -04:00
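The environment-variable toggle described in the entry above can be illustrated with a minimal sketch. The helper name `read_env_flag` is hypothetical and is not the runtime's actual flag mechanism; the sketch only shows the general pattern of an integer toggle that defaults to 0 when the variable is unset.

```cpp
#include <cstdlib>

// Hypothetical helper (not taken from the ROCm sources): reads an integer
// flag from the environment and returns `fallback` when the variable is
// unset, empty, or does not start with a number.
inline int read_env_flag(const char* name, int fallback) {
    const char* raw = std::getenv(name);
    if (raw == nullptr || *raw == '\0') return fallback;
    char* end = nullptr;
    const long value = std::strtol(raw, &end, 10);
    // end == raw means strtol consumed no digits at all.
    return (end == raw) ? fallback : static_cast<int>(value);
}
```

A caller would then write something like `read_env_flag("HIP_USE_SDMA_QUERY", 0) != 0` to decide whether to take the new code path.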
Jacob Lambert 52eaffa1a4 Revert "SWDEV-371628 - Shift device lib linking into clang driver"
This reverts commit 2e664d2492.

Reason for revert: Performance regressions and failures observed. Need to investigate those before re-applying the patch

Change-Id: I42ba0605797f9bdcfb5d5102927dd01405cf05e3


[ROCm/clr commit: 8047d8e3e8]
2023-06-15 00:56:08 -04:00
Jason Tang 6edd795788 SWDEV-404928 - Remove rocm-opencl depends of rocm-ocl-icd
Will replace rocm-ocl-icd with packaging of khronos opencl-icd-loader

Change-Id: I26da58e80fd1426b9908d0b524f93b17ab27626d


[ROCm/clr commit: a4bba6d705]
2023-06-14 14:35:57 -04:00
Ioannis Assiouras af03b2c8d2 SWDEV-386069 - Added nvidia and AMD config.cmake files as part of the dev package
Change-Id: I3c9a65b287822d91407996ca59ac90051b749923


[ROCm/clr commit: d2e36bc047]
2023-06-07 04:55:52 -04:00
Jaydeep Patel 063b3d4e51 SWDEV-403995 - uniform_work_group_size is an integer type: 1 for true and 0 for false. Lightning kernel's init sets the uniform work group flag, so there is no need to set it from options.
Change-Id: Ife45c1e3ccef03d16d7e07c3e0714fd8fd388fd3


[ROCm/clr commit: 2c9c58e7bb]
2023-06-06 23:05:25 -04:00
Rahul Garg cea26614db SWDEV-379678 - Take hipcc/hipconfig from hipcc
Change-Id: I53a6e5108e142860747186c0e7ed4fc2cb384a69


[ROCm/clr commit: 17c052ca9a]
2023-06-06 17:21:59 -04:00
jatang 34f68be605 SWDEV-373396 - OCLCreateBuffer: make the CPU checkResult() 4 times faster
Change-Id: If20cd6b509896a748f74ab5296cb85f2c4a9f04c


[ROCm/clr commit: a456182888]
2023-06-06 17:15:34 -04:00
Ioannis Assiouras 917dc7ee82 SWDEV-401852 - Do not define hipCreateChannelDesc for long types in LP64
Change-Id: I56881a229ea666269d575c6bdfde5081562ec009


[ROCm/clr commit: 4f5dfa77bd]
2023-06-06 14:28:15 -04:00
Jaydeep Patel 7be6ca6306 SWDEV-404118 - Change to uint.
Change-Id: Ib9ced171a4330e868a584bfb86d7ee5418e784ed


[ROCm/clr commit: 9daf8e3177]
2023-06-06 13:17:56 -04:00
Ajay 06213e3b4b SWDEV-400515 - populate __HIP_NO_IMAGE_SUPPORT flag
MI300 does not support image APIs.
Apps should use __HIP_NO_IMAGE_SUPPORT instead of checking the arch

Change-Id: I91178bfd27ea7b7188e7a958a876c0264f4469aa


[ROCm/clr commit: 16c6b365c2]
2023-06-06 12:11:41 -04:00
German 17c6b36204 SWDEV-403871 - Add MS platform recognition
Change-Id: Ie01e876f9322e35f4430352d1e4328f613133ee5


[ROCm/clr commit: c0e9734c35]
2023-06-06 11:57:07 -04:00
Maneesh Gupta 0be27c38f8 SWDEV-397358 - Revert "Return hipErrorNotSupported for unsupported APIs"
This reverts commit 32a73102ea.

Change-Id: Idb18e1ce6bcfc1da144fabf1ba01329f4f6604a5
(cherry picked from commit 5c38999efe11243dcdd8cc5244172e6840c47492)


[ROCm/clr commit: 68238c99d2]
2023-06-05 04:55:33 -04:00
Satyanvesh Dittakavi d7b529ff6e SWDEV-395996 - Add double precision constants to math constants header
Fixes a typo and adds missing constants matching cuda

Change-Id: Iebf47d107af361cbd356c2a3b9cd6ef121d01363


[ROCm/clr commit: 287f607d2b]
2023-06-05 03:31:27 -04:00
Jaydeep Patel 911002f262 SWDEV-403762 - Fill pciBusId even if len <=12.
Change-Id: Iaf1b3314f4ae5597d42d54eb4c977667ecbb9cfb


[ROCm/clr commit: 0064fac887]
2023-06-02 10:47:21 +00:00
Ranjith Ramakrishnan fe2a510c08 SWDEV-366831 - Compile time flag to switch between #warning and #error message
Using backward compatibility paths will provide a #warning message by default.
Compile time option added to enable/disable the #error message.
Updated the backward compatibility message

Change-Id: I3bab00df26145991b32176d2d76977c2e953bf5f


[ROCm/clr commit: f788150132]
2023-06-01 04:22:19 -04:00
Anusha GodavarthySurya 729779a3b3 SWDEV-399098 - Change hip_init lock to use std::call_once
Updating the hip_init lock to use std::call_once fixed performance
drops in TF benchmarks for FP16

Change-Id: Ib1125ac66806b232057ba183e296ea4d0642d08d


[ROCm/clr commit: 2f83719d12]
2023-06-01 02:04:29 -04:00
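The `std::call_once` pattern named in the entry above can be sketched as follows. All names here (`sketch::ensure_initialized`, `do_init`, `g_init_runs`) are hypothetical stand-ins, not the actual hip_init code; the sketch only shows how one-time initialization can be guarded without taking a full lock on every call.

```cpp
#include <atomic>
#include <mutex>

// Hypothetical names throughout: this is not the actual hip_init code.
namespace sketch {

std::once_flag g_init_flag;        // guards the one-time setup
std::atomic<int> g_init_runs{0};   // counts how often the setup body ran

// Stand-in for an expensive one-time runtime initialization.
void do_init() { g_init_runs.fetch_add(1); }

// Safe to call from any thread, any number of times. The setup body runs
// exactly once; calls made after it completes return without re-running it.
void ensure_initialized() { std::call_once(g_init_flag, do_init); }

}  // namespace sketch
```

Every API entry point could call `sketch::ensure_initialized()` first; only the first caller pays for the setup.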
Jacob Lambert 7ee73c61b0 SWDEV-375055 - Re-enable Comgr unbundler
With recent upstream changes (D145770), we can now use the
Comgr unbundler without requiring an env field in the supplied
targetID. For users, this is consistent with previous legacy
unbundler behavior.

Change-Id: I5f085b0fa1ad352bbbb282b75367c206b75f279f


[ROCm/clr commit: 443f912c7f]
2023-05-31 16:14:08 -04:00
Jacob Lambert 911dbabdff SWDEV-380857 - Remove space in "Linker Program" used to set Comgr object name
Not a required change, but it does make dealing with temporary files generated
by Comgr easier.

Change-Id: I9c43138dd2a6c4fea965b57fbce7a087ab2bbd28


[ROCm/clr commit: 1171518b97]
2023-05-30 13:47:26 -04:00
Jaydeep Patel 6ab03957bb SWDEV-401847 - Update atomicMin/Max for float and double.
Change-Id: Ib5be459b8a24f0739e299ed12c9f877f8baa02b1


[ROCm/clr commit: d576356505]
2023-05-29 14:27:16 -04:00
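For readers unfamiliar with how a floating-point atomic minimum can be built from compare-and-swap, here is a minimal host-side sketch using `std::atomic<float>`. It is an illustration under assumptions, not the HIP device implementation touched by the entry above: the function name is hypothetical and NaN inputs are not handled.

```cpp
#include <atomic>

// Hypothetical helper, not the HIP device implementation.
// Atomically stores min(target, value) and returns the previously stored
// value. NaN handling is deliberately left out of this sketch.
inline float atomic_min_float(std::atomic<float>& target, float value) {
    float observed = target.load(std::memory_order_relaxed);
    // Retry while the stored value is still larger than the candidate.
    // On failure, compare_exchange_weak refreshes `observed` with the
    // value currently stored, so the comparison is re-evaluated.
    while (value < observed &&
           !target.compare_exchange_weak(observed, value,
                                         std::memory_order_relaxed)) {
    }
    return observed;
}
```

An atomic maximum follows the same shape with the comparison reversed.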
sdashmiz 9a3d3aef8c SWDEV-360031 - Event record clean up
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: Ic7b5edd501c5f61b2bce87252ac040cbc4083006


[ROCm/clr commit: 1e9dfdfe99]
2023-05-29 11:42:16 -04:00
Vikram 5ba0fb5802 SWDEV-354050 - Runtime support for HIP non hostcall printf
Relates to https://reviews.llvm.org/D150427.

Each printf call populates the buffer with the following data:
1. Control DWord - contains info regarding stream, format string constness and size of data frame
   (see http://gerrit-git.amd.com/c/lightning/ec/device-libs/+/857722 for more info)
2. Hash of the format string (if constant) else the format string itself
3. Printf arguments (each aligned to 8 byte boundary)

Change-Id: I7e320deb343921b4b4cfaf08a2be2883e0bc1f65


[ROCm/clr commit: 7b6a8f1702]
2023-05-26 01:55:20 -04:00
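The third item in the entry above (arguments aligned to an 8-byte boundary) can be sketched as below. The helper names are hypothetical, and the control DWord and format-string hash are intentionally omitted because their layout is not given here; the sketch assumes the buffer length is already a multiple of 8 when an argument is appended.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Rounds n up to the next multiple of 8.
inline std::size_t align_up_8(std::size_t n) {
    return (n + 7) & ~std::size_t{7};
}

// Hypothetical helper: appends `size` raw bytes to `buf`, then zero-pads so
// that the next argument starts on an 8-byte boundary. Assumes buf.size()
// is already a multiple of 8 on entry.
inline void append_arg(std::vector<std::uint8_t>& buf, const void* data,
                       std::size_t size) {
    const std::size_t start = buf.size();
    buf.resize(start + align_up_8(size), 0);  // new bytes are zero-filled
    if (size != 0) std::memcpy(buf.data() + start, data, size);
}
```

With this scheme a 4-byte `int` and an 8-byte `double` each occupy one 8-byte slot.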
Jeremy Newton b1266e557e SWDEV-361988 - Fix hip library install permissions
"FILES" installs files as 644, but we want libraries to be 755, which
we can do with "PROGRAMS".

Signed-off-by: Jeremy Newton <Jeremy.Newton@amd.com>
Change-Id: I155ed77482839ff6d71f90239a014d239e20f4b8


[ROCm/clr commit: 2cda949920]
2023-05-25 22:47:32 -04:00
Satyanvesh Dittakavi 094b2bfa9e SWDEV-401850 - Fix hipCreateChannelDescHalf APIs
- Address ChannelDescHalf1 and ChannelDescHalf2 APIs not returning
correct Channel Descriptors in amd headers
- Add missing hipCreateChannelDescHalf APIs in nvidia headers

Change-Id: I558847425c2459d3cde329ca2e926d882bb0a9dd


[ROCm/clr commit: 586731c2cc]
2023-05-25 04:36:24 -04:00
Ioannis Assiouras babff0ff73 SWDEV-401902 - Set default CMAKE_BUILD_TYPE in the top level CMakeLists.txt
Change-Id: I3a3e24da2dc7eec5feb1f748ea49af3df40d4cd9


[ROCm/clr commit: f945adf0bd]
2023-05-24 14:11:18 -04:00
Ioannis Assiouras 5f30552398 SWDEV-386069 - Allow find_package(hip) to work on nvidia platform
Change-Id: Ifadf683a6db42d4adadce92c7ebf6bd3e8d2245a


[ROCm/clr commit: 7e871698e3]
2023-05-24 14:12:22 +01:00
Jacob Lambert 2d9db8a102 SWDEV-371628 - Remove now-redundant Comgr devlib call from hipRTC
Because hipRTC is now using the newer
AMD_COMGR_COMPILE_SOURCE_WITH_DEVICE_LIBS_TO_BC, and now that this
action has been fixed for HIP compilations in Comgr, hipRTC no
longer needs a separate Comgr call to link in the device libs.

Change-Id: Ibf9024cbaaab825584566e8d0b5fce60d7063dd8


[ROCm/clr commit: 283dd8352d]
2023-05-23 18:34:59 -04:00
AravindanC 554b6efd98 SWDEV-383579 - Handling compiler-rt unrecognized error msg
Change-Id: Ia9070a575282bd263b0bb51eab1280552126c79e


[ROCm/clr commit: 82deb581cb]
2023-05-23 11:28:26 -04:00
Jaydeep Patel 0a6fe33ab7 SWDEV-401231 - getMemoryObject returns nullptr for host system memory as arena is disabled for XNACK.
Change-Id: I84348e0ba2debf69f65c0e85179dad7827d30806


[ROCm/clr commit: b07a44c253]
2023-05-23 09:51:08 -04:00
Ranjith Ramakrishnan 849bfd9196 SWDEV-310152 - Use the RUNPATH provided by build scripts
RUNPATH in libraries will be : $ORIGIN
RUNPATH in binaries will be : $ORIGIN/../lib

Change-Id: I87b6a7d1f58f20499c3a0913d03701ac687d910d


[ROCm/clr commit: 31d1420c54]
2023-05-23 08:00:34 -04:00
Saleel Kudchadker a5715294fb SWDEV-301667 - Add a flag for gpuvm kernargs
HIP_FORCE_DEV_KERNARG=1 will create a device allocation for kernel arg
segment. Flag is 0 by default.

Change-Id: Iaaf5a149f3be8596568878d5d272268baf067c60


[ROCm/clr commit: 5436d362b1]
2023-05-22 11:23:48 -04:00
Saleel Kudchadker 9394aa2671 SWDEV-384557 - Do not fall back to compute
- Use the regular copy API if we exhaust free SDMA engines, and do not
fall back to compute copy. Falling back to compute affects performance
for numerous apps that are GPU bound

Change-Id: I75c767eff0b9f5ada324301c5c327fe2c23a9806


[ROCm/clr commit: 60d9a4ebab]
2023-05-22 11:23:23 -04:00
Alex Voicu e266f8834b SWDEV-301667 - Kernelarg gpuvm
Add aligned, nontemporal `memcpy` for kernarg.

Change-Id: I5d8ac76904feaf793b45ec2ea5fbd1069be20068


[ROCm/clr commit: 06df9e2efd]
2023-05-22 11:21:14 -04:00
kjayapra-amd 62302ac4fa SWDEV-393910 - Adding CAS expanders under gfx941 tag.
Change-Id: I2c4bcf56be419d1f037b8555ad254a2dc49d0c5b


[ROCm/clr commit: feb22250f3]
2023-05-22 09:59:05 -04:00
Ioannis Assiouras fce1d56110 SWDEV-401495 - Removed hip_hcc.def.in
Change-Id: Ia84503f0b9582f32de819f4ff95630fb024cd613


[ROCm/clr commit: 9d06e8c2d3]
2023-05-19 11:47:07 +01:00
sdashmiz a88c64f86d SWDEV-367877 - Detect cycle in graph
- detect cycle when graph is instantiated

- remove level calculation from add/remove node

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I3f7432f91f70aec8e4fd866b2766256f8a9a0cfe

graph-cycle-corrections

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I8a3cec9a5a503aac6ea1e85ff3dd2b972790fb1d


[ROCm/clr commit: 8578da8a3d]
2023-05-18 09:44:39 -04:00
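Detecting a cycle once, when the whole graph is available, can be sketched with Kahn's algorithm (a topological sort by in-degree). This is an assumption about one reasonable technique, not a description of what the commit above actually implements; `has_cycle` and the adjacency-list representation are hypothetical.

```cpp
#include <cstddef>
#include <queue>
#include <vector>

// Hypothetical sketch. edges[u] lists the nodes that depend on u, i.e. u
// must run before them. Every index in edges[u] must be < edges.size().
// Returns true if the dependency graph contains a cycle.
inline bool has_cycle(const std::vector<std::vector<std::size_t>>& edges) {
    const std::size_t n = edges.size();
    std::vector<std::size_t> indegree(n, 0);
    for (const auto& outs : edges)
        for (std::size_t v : outs) ++indegree[v];

    // Start from nodes with no unmet dependencies.
    std::queue<std::size_t> ready;
    for (std::size_t u = 0; u < n; ++u)
        if (indegree[u] == 0) ready.push(u);

    std::size_t visited = 0;
    while (!ready.empty()) {
        const std::size_t u = ready.front();
        ready.pop();
        ++visited;
        for (std::size_t v : edges[u])
            if (--indegree[v] == 0) ready.push(v);
    }
    // Nodes on a cycle never reach in-degree zero, so they are never visited.
    return visited != n;
}
```

Running the check once at instantiation avoids redoing ordering work on every node insertion or removal, at the cost of reporting the error later.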
Jaydeep Patel f7c3760996 SWDEV-389199 - Add GL Memory object to memory map so that it is acknowledged by getMemoryObject.
Change-Id: Ia58bc30ba8be016c1f004f6af01dd767f45f7905


[ROCm/clr commit: 97dc73f581]
2023-05-18 01:10:33 -04:00
pghafari 9eda493469 SWDEV-399734, SWDEV-399733 - update maxGridDim for y and z
Change-Id: Id5b1dd2491798dd20a0672362f445ab4de164f6d


[ROCm/clr commit: fb82b292c2]
2023-05-17 19:03:04 -04:00
Jacob Lambert 2e664d2492 SWDEV-371628 - Shift device lib linking into clang driver
Previously, we used the following approach and Comgr actions
for device lib linking:

    AMD_COMGR_COMPILE_SOURCE_TO_BC (compile with clang driver)
    AMD_COMGR_ADD_DEVICE_LIBRARIES (link in device libs with
      llvm-link API)

However, the clang driver can link in device libraries as part
of compilation, assuming a --rocm-path is set. In this context,
this is accomplished by using the following Comgr action instead:

  AMD_COMGR_COMPILE_SOURCE_WITH_DEVICE_LIBS_TO_BC (compile and
    link in device libs with clang driver)

Change-Id: I661465865365afecc44aa15d4df91bfab361af8d


[ROCm/clr commit: a4c5c44008]
2023-05-17 17:40:56 -04:00
sdashmiz 915677e9a0 SWDEV-396533 - correct _shfl function to match cuda
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I311419fd25c055339f25fe0c7a132ec9ee225600


[ROCm/clr commit: 23e99dbb07]
2023-05-17 14:19:46 -04:00
AravindanC 27616948ac SWDEV-336531 - Fix for relative path issue in debuginfo
Change-Id: Ie36d1a93f40ddc3a42823eb83e041af8c55bcdee


[ROCm/clr commit: 3f4b70cafd]
2023-05-17 13:06:26 -04:00
Cordell Bloor f7a7997c66 SWDEV-346940 - Remove default AMDGPU_TARGETS
hipcc and clang++ both have logic to detect the installed hardware
and to automatically select the appropriate AMDGPU target when it is
left unspecified. When the AMDGPU_TARGETS property is initialized with
a set of default values, it results in the addition of an explicit set
of --offload-arch flags being passed. These explicit architecture flags
disable the architecture autodetection in the compiler.

The resulting behaviour from setting fixed defaults makes it unpleasant
to compile with CMake because they increase the build times for projects
unless they are overridden (as most users do not need to build for all
five default architectures). The fixed defaults are also troublesome for
users with hardware not included in the default set (e.g., gfx1011,
gfx1031, gfx1100).

A possible alternative might be to detect the architecture within
hip-config.cmake rather than running the detection logic on each
compiler invocation. However, this approach is simpler.

Change-Id: I9495d766b7eed03852eb4dc72b0aabe4100bc32c
Signed-off-by: Cordell Bloor <Cordell.Bloor@amd.com>


[ROCm/clr commit: e1bed6f354]
2023-05-16 16:35:09 -06:00