نمودار کامیت

11577 کامیت‌ها

مولف SHA1 پیام تاریخ
Satyanvesh Dittakavi 094b2bfa9e SWDEV-401850 - Fix hipCreateChannelDescHalf APIs
- Address ChannelDescHalf1 and ChannelDescHalf2 APIs not returning
correct Channel Descriptors in amd headers
- Add missing hipCreateChannelDescHalf APIs in nvidia headers

Change-Id: I558847425c2459d3cde329ca2e926d882bb0a9dd


[ROCm/clr commit: 586731c2cc]
2023-05-25 04:36:24 -04:00
Ioannis Assiouras babff0ff73 SWDEV-401902 - Set default CMAKE_BUILD_TYPE in the top level CMakeLists.txt
Change-Id: I3a3e24da2dc7eec5feb1f748ea49af3df40d4cd9


[ROCm/clr commit: f945adf0bd]
2023-05-24 14:11:18 -04:00
Ioannis Assiouras 5f30552398 wSWDEV-386069 - Allow find_package(hip) to work on nvidia platform
Change-Id: Ifadf683a6db42d4adadce92c7ebf6bd3e8d2245a


[ROCm/clr commit: 7e871698e3]
2023-05-24 14:12:22 +01:00
Jacob Lambert 2d9db8a102 SWDEV-371628 - Remove now-redundant Comgr devlib call from hipRTC
Because hipRTC is now using the newer
AMD_COMGR_COMPILE_SOURCE_WITH_DEVICE_LIBS_TO_BC, and now that this
action has been fixed for HIP compilations in Comgr, hipRTC no
longer needs separate Comgr call to link in the device libs.

Change-Id: Ibf9024cbaaab825584566e8d0b5fce60d7063dd8


[ROCm/clr commit: 283dd8352d]
2023-05-23 18:34:59 -04:00
AravindanC 554b6efd98 SWDEV-383579 - Handling compiler-rt unrecognized error msg
Change-Id: Ia9070a575282bd263b0bb51eab1280552126c79e


[ROCm/clr commit: 82deb581cb]
2023-05-23 11:28:26 -04:00
Jaydeep Patel 0a6fe33ab7 SWDEV-401231 - getMemoryObject returns nullptr for host system memory as arena is disabled for XNACK.
Change-Id: I84348e0ba2debf69f65c0e85179dad7827d30806


[ROCm/clr commit: b07a44c253]
2023-05-23 09:51:08 -04:00
Ranjith Ramakrishnan 849bfd9196 SWDEV-310152 - Use the RUNPATH provided by build scripts
RUNPATH in libraries will be : $ORIGIN
RUNPATH in binaries will be : $ORIGIN/../lib

Change-Id: I87b6a7d1f58f20499c3a0913d03701ac687d910d


[ROCm/clr commit: 31d1420c54]
2023-05-23 08:00:34 -04:00
Saleel Kudchadker a5715294fb SWDEV-301667 - Add a flag for gpuvm kernargs
HIP_FORCE_DEV_KERNARG=1 will create a device allocation for kernel arg
segment. Flag is 0 by default.

Change-Id: Iaaf5a149f3be8596568878d5d272268baf067c60


[ROCm/clr commit: 5436d362b1]
2023-05-22 11:23:48 -04:00
Saleel Kudchadker 9394aa2671 SWDEV-384557 - Do not fall back to compute
- Use regular copy API if we exhaust free SDMA engines and not fall back
to compute copy. Falling to compute is affecting performance for
numerous apps that are GPU bound

Change-Id: I75c767eff0b9f5ada324301c5c327fe2c23a9806


[ROCm/clr commit: 60d9a4ebab]
2023-05-22 11:23:23 -04:00
Alex Voicu e266f8834b SWDEV-301667 - Kernelarg gpuvm
Add aligned, nontemporal `memcpy` for kernarg.

Change-Id: I5d8ac76904feaf793b45ec2ea5fbd1069be20068


[ROCm/clr commit: 06df9e2efd]
2023-05-22 11:21:14 -04:00
kjayapra-amd 62302ac4fa SWDEV-393910 - Adding CAS expanders under gfx941 tag.
Change-Id: I2c4bcf56be419d1f037b8555ad254a2dc49d0c5b


[ROCm/clr commit: feb22250f3]
2023-05-22 09:59:05 -04:00
Ioannis Assiouras fce1d56110 SWDEV-401495 - Removed hip_hcc.def.in
Change-Id: Ia84503f0b9582f32de819f4ff95630fb024cd613


[ROCm/clr commit: 9d06e8c2d3]
2023-05-19 11:47:07 +01:00
sdashmiz a88c64f86d SWDEV-367877 - Detect cycle in graph
- detect cycle when graph is instantiated

- remove level calculation from add/remove node

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I3f7432f91f70aec8e4fd866b2766256f8a9a0cfe

graph-cycle-corrections

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I8a3cec9a5a503aac6ea1e85ff3dd2b972790fb1d


[ROCm/clr commit: 8578da8a3d]
2023-05-18 09:44:39 -04:00
Jaydeep Patel f7c3760996 SWDEV-389199 - Add GL Memory object to memory map so that it is acknowledged by getMemoryObject.
Change-Id: Ia58bc30ba8be016c1f004f6af01dd767f45f7905


[ROCm/clr commit: 97dc73f581]
2023-05-18 01:10:33 -04:00
pghafari 9eda493469 SWDEV-399734, SWDEV-399733 - update maxGridDim for y and z
Change-Id: Id5b1dd2491798dd20a0672362f445ab4de164f6d


[ROCm/clr commit: fb82b292c2]
2023-05-17 19:03:04 -04:00
Jacob Lambert 2e664d2492 SWDEV-371628 - Shift device lib linking into clang driver
Previously, we used the following approach and Comgr actions
for device lib linking:

    AMD_COMGR_COMPILE_SOURCE_TO_BC (compile with clang driver)
    AMD_COMGR_ADD_DEVICE_LIBRARIES (link in device libs with
      llvm-link API)

However, the clang driver can link in device libraries as part
of compilation, assuming a --rocm-path is set. In this context,
this is accomplished by using the following Comgr action instead:

  AMD_COMGR_COMPILE_SOURCE_WITH_DEVICE_LIBS_TO_BC (compile and
    link in device libs with clang driver)

Change-Id: I661465865365afecc44aa15d4df91bfab361af8d


[ROCm/clr commit: a4c5c44008]
2023-05-17 17:40:56 -04:00
sdashmiz 915677e9a0 SWDEV-396533 - correct _shfl function to match cuda
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I311419fd25c055339f25fe0c7a132ec9ee225600


[ROCm/clr commit: 23e99dbb07]
2023-05-17 14:19:46 -04:00
AravindanC 27616948ac SWDEV-336531 - Fix for relative path issue in debuginfo
Change-Id: Ie36d1a93f40ddc3a42823eb83e041af8c55bcdee


[ROCm/clr commit: 3f4b70cafd]
2023-05-17 13:06:26 -04:00
Cordell Bloor f7a7997c66 SWDEV-346940 - Remove default AMDGPU_TARGETS
hipcc and clang++ both have logic to detect the installed hardware
and to automatically select the appropriate AMDGPU target when it is
left unspecified. When the AMDGPU_TARGETS property is initialized with
a set of default values, it results in the addition of an explicit set
of --offload-arch flags being passed. These explicit architecture flags
disable the architecture autodetection in the compiler.

The resulting behaviour from setting fixed defaults makes it unpleasant
to compile with CMake because they increase the build times for projects
unless they are overriden (as most users do not need to build for all
five default architectures). The fixed defaults are also troublesome for
users with hardware not included in the default set (e.g., gfx1011,
gfx1031, gfx1100).

A possible alternative might be to detect the architecture within
hip-config.cmake rather than running the detection logic on each
compiler invocation. However, this approach is simpler.

Change-Id: I9495d766b7eed03852eb4dc72b0aabe4100bc32c
Signed-off-by: Cordell Bloor <Cordell.Bloor@amd.com>


[ROCm/clr commit: e1bed6f354]
2023-05-16 16:35:09 -06:00
Satyanvesh Dittakavi 82e044cb6a SWDEV-399851 - Add template overload for ToString in HIPRTC
HIPRTC_INIT_API can have nullptr in the arguments and ClPrint
can crash while printing

Change-Id: Iecade5c3867196509c8cc0647b9aa24be0960a02


[ROCm/clr commit: c98fad1edc]
2023-05-16 14:21:19 +00:00
Jatin Chaudhary 9e5b96d567 SWDEV-384013 - Print link options being passed to compiler with AMD_LOG_LEVEL
Change-Id: I556ed434587cff07a634796da51dcbb9ae78ef21


[ROCm/clr commit: 8753db5446]
2023-05-15 15:52:04 -04:00
Ioannis Assiouras ad30c947bf SWDEV-400119 - Return error from globalFreeMemory() if HSA available memory query fails
Change-Id: I33c069fa1e0d31891f64dc8167a24ef9d7ed1426


[ROCm/clr commit: a07e38be2f]
2023-05-15 11:54:25 +01:00
Jatin Chaudhary 8d0b25334b SWDEV-395773 - Fix unordered compare functions in fp16 header
Change-Id: I9158420e37cd3151159182223c6645ed84c4fddd


[ROCm/clr commit: 15bafc9a93]
2023-05-11 10:42:13 -04:00
Jatin Chaudhary 69d6e8873d SWDEV-367537 - Add __hip_bfloat16 and vector definitions to match __nv_bfloat16
Change-Id: I1c0f9f5f278c2c3b4e175d9f08831ba458ed856e


[ROCm/clr commit: 6c793ea206]
2023-05-11 04:45:13 -04:00
Saleel Kudchadker 0141e6809f SWDEV-398151 - Partly relax static engine allocation
Change-Id: I4903b51a34b597a2e84d771b52cf629f877dba05


[ROCm/clr commit: 0b475284e9]
2023-05-11 00:52:18 -04:00
Jaydeep Patel cb3642e26e SWDEV-398047 - Disable arena for XNACK.
Change-Id: If6c98490bd5b8146a357f45ed132f687def0877f


[ROCm/clr commit: ffa1deef27]
2023-05-10 23:28:05 -04:00
Satyanvesh Dittakavi 130f9c5117 SWDEV-399203 - Use clang pragma only with clang
Change-Id: I21d84ccde07e243b6bc06f1980c24b399e74d1c3


[ROCm/clr commit: a7041d15a7]
2023-05-10 04:19:51 -04:00
taosang2 8d054fa50e SWDEV-366528 – Fix image memory format updating issue
Add dstMemory format updating.
Separate format updating for srcMemory and dstMemory.

Change-Id: I1692b92d417bbd742d562679f218ebf8ca532e92


[ROCm/clr commit: 7624a48de9]
2023-05-08 21:43:42 -04:00
Jacob Lambert c22413efba SWDEV-380857 - Fix bug when reading bitcode file into vector<char> buffer
The previous implementation using std::copy() resulted in
differences between the in-memory and on-disk representations.
With the updated implementation, we get the same contents.

Change-Id: Iadfae3cd7f7ba99538da2ac4f11f30f5a78260d8


[ROCm/clr commit: b17056cb93]
2023-05-08 16:25:01 -04:00
Jaydeep Patel 34f9de0f7e SWDEV-397168 - Enable dynamic call stack size for PAL.
Change-Id: I8be51ffb48e6a742117491a4bf6f12f152e4a0b3


[ROCm/clr commit: 0eb96cbc59]
2023-05-07 23:26:28 -04:00
German 8d97827417 SWDEV-353281 - VM support in mempool for graphs
The change enables VM support in graphs on Windows. That allows
to avoid caching of all allocations at the cost of map/unmap
overhead during memory create/destroy.

Change-Id: I792be00fba099e5e5d3cd44a963e1dfd6976a86d


[ROCm/clr commit: 04b696abee]
2023-05-05 15:31:26 -04:00
kjayapra-amd 23584feba4 SWDEV-393910 - Port gfx94x changes to mainline.
Change-Id: Ibf727223bbe5230b132b47c39e0fc1d87cbd3b9c


[ROCm/clr commit: f14e8a2dba]
2023-05-05 15:16:23 -04:00
Ajay 2e9767d073 SWDEV-394488 - cudaStreamGetCaptureInfo_v2 is undefined in CUDA 12.0
hipStreamPerThrdCompilerOptn.cc test fails to build with cudaStreamGetCaptureInfo_v2
in CUDA 12.0.
fix was to change runtime API cudaStreamGetCaptureInfo_v2
to Driver cuStreamGetCaptureInfo_v2

Change-Id: I44a0110770d3246f5345092acae301c9a2f6d520


[ROCm/clr commit: 0aa70ee0e1]
2023-05-05 10:10:07 -04:00
Jaydeep Patel d43e7a2db1 SWDEV-396277 - Offset and bit size correction, Considaration of SA ID for Navi.
Correct sm id formula for GFX10 and GFX11.

Change-Id: I2883c139b8e684e0334ccc81f703a09e3ddec588


[ROCm/clr commit: a167abefe5]
2023-05-05 01:04:42 -04:00
Ioannis Assiouras 7adb0089c6 SWDEV-397356: Move HIP_API_PRINT after HIP_INIT so that the begin function call gets printed
Change-Id: I99b655596741c19bc483647794ae8b4f6e6f2843


[ROCm/clr commit: d88f0358b0]
2023-05-04 18:46:21 -04:00
Sourabh Betigeri d7275cb51c SWDEV-393362 - Move setData() from profilingEnd() to profilingBegin() to eliminate a possible data race
- Introduce a state variable to indicate if HwProfiling is enabled to
eliminate a possible data race of vector<> signals_.

Change-Id: Id504cc76d7fa9f7e6455587dd232b60ccbbb735b


[ROCm/clr commit: afa28cdf44]
2023-05-04 17:19:25 -04:00
Saleel Kudchadker 5487d0ed10 SWDEV-301667 - Better log
Change-Id: I7399de7fe7a8840568d02362d2d936173583030c


[ROCm/clr commit: 0a7708f982]
2023-05-04 16:29:02 -04:00
Ioannis Assiouras 145506e067 SWDEV-398511 - Removed hiprtc_internal.hpp
Change-Id: Ic0751dfeb4936b3e233b44e34d16e0a1605874d7


[ROCm/clr commit: 29e0d2f335]
2023-05-04 16:50:18 +01:00
Ioannis Assiouras 91f676904b SWDEV-393199 - Added new include file for opengl interop mappings for nvidia
Change-Id: I2e955a9dd06539939188a5f2e1dde4f173af1202


[ROCm/clr commit: 633e720da4]
2023-05-03 19:43:24 -04:00
kjayapra-amd 69406243e2 SWDEV-378006 - Adding a new guarantee macro to support printing args.
Change-Id: I2083ddaf8058759bdace7d3bb1e8e3670cbeddd2


[ROCm/clr commit: 7f41a96ffb]
2023-05-03 12:43:16 -04:00
Jaydeep Patel 23af1d59d3 SWDEV-392870 - Enable p2p support for image using PAL.
Change-Id: I3acbbc01ee11e488e93204814361b8c3431bca4f


[ROCm/clr commit: 82e4859e03]
2023-05-03 00:34:08 -04:00
Jatin Chaudhary 639656a09e SWDEV-376453 - Changes to fix crash seen with HIP and recent compiler change with -O0
Change-Id: I37de83b7449461ce2378741e97ff8fb8da736523


[ROCm/clr commit: c585aefdb7]
2023-05-02 17:46:38 -04:00
taosang2 ebc60d68a8 SWDEV-396574 – Refactor texture channel type checking
Create __hip_is_tex_surf_channel_type to replace
__hip_is_tex_channel_type
__hip_is_itex_channel_type
__hip_is_surf_channel_type .

Change-Id: I1692b92d417bad742d562679f218ebf8ca532e31


[ROCm/clr commit: 381e6520d0]
2023-05-02 10:01:13 -04:00
pghafari 505bce23be SWDEV-369567 - dimension check update
Change-Id: I31deb5070d4852db7dd6d37f9a514965f387698e


[ROCm/clr commit: 31b362bf6e]
2023-05-02 08:04:04 -04:00
Jatin Chaudhary 646af29dc5 SWDEV-1 - Fix incorrect SGPR usage in VGPR calculation. Pointed out by #58 issue on hipamd public repo
Change-Id: I8c27c0da254521714c92369990aa10f6fc09c246


[ROCm/clr commit: 54f73e430b]
2023-05-02 03:53:18 -04:00
Jaydeep Patel fa88970f6a SWDEV-395559 - Move unsigned short up.
Change-Id: I3373a7bc22e632dc10b94b93ff023444d456b2b7


[ROCm/clr commit: cbd91bdc19]
2023-05-02 00:43:36 -04:00
Saleel Kudchadker 44512e3228 SWDEV-301667 - Cleanup log
Change-Id: I9719ec8b19406a062727937471e7a845a522a0be


[ROCm/clr commit: 538d453b86]
2023-05-01 11:07:55 -07:00
Saleel Kudchadker a210535169 SWDEV-392427 - Invalidate Barrier Value AQL header
Change-Id: Id8e04ffe44da58641361468957d397af128443bb


[ROCm/clr commit: 9775c38725]
2023-05-01 14:06:05 -04:00
taosang2 3a37f33e4e SWDEV-396574 – Refactor device mapping functions
Refactor mapFrom(), mapTo(), mapElem(),
__hip_is_surf_channel_type() and
__hip_is_itex_channel_type()

Change-Id: I1692b92d407bad742d562678f218ebf8ca532e91


[ROCm/clr commit: 2f2d02649f]
2023-05-01 13:57:10 -04:00
sdashmiz ebcab6e910 SWDEV-360031 - Correct APIs behaviour
- correct error for hipStreamWaitEvent when event recorded before
  capture
- correct hipEventSync when event is synced during capture

Signed-off-by: sdashmiz <shadi.dashmiz@amd.com>
Change-Id: I7ecbed5621eaf323846d4ccb20ec112aaa8a5757


[ROCm/clr commit: 544318fffe]
2023-05-01 13:38:59 -04:00