Ioannis Assiouras
aaf1c19609
SWDEV-384362 - Added explicit cast from __fp16 to _Float16 in rcph
...
Change-Id: Ibbb11e928a80126ed7a8c7cd8bddebc74c38986d
2023-02-21 17:32:59 -05:00
Ajay
fc6a77a09f
SWDEV-384100 - HIP support for CUDA 12.0
...
Apps are failing to build due to undefined deprecated texture APIs
Change-Id: I1fb64adc4bc0ba6ee6ecaa65d54b34da0327e6a3
2023-02-21 12:23:48 -05:00
Rakesh Roy
287d07ccd4
SWDEV-338733 - Implement hipArrayGet* APIs
...
- Add implementation for hipArrayGetInfo, hipArrayGetDescriptor &
hipArray3DGetDescriptor APIs
Change-Id: I181a472066006bc3bd0d987408ea67e218310983
2023-02-16 23:04:29 -05:00
Matt Arsenault
eacf63673b
SWDEV-1 - Use proper __builtin_amdgcn_rcph builtin
...
Change-Id: Ic0b1b25a5ab7d132cb51425f2a17bc779c762668
2023-02-07 10:58:48 -05:00
Matt Arsenault
bebbf3c4ec
SWDEV-1 - Eliminate rcp_2f16 pseudo intrinsic
...
No such wrapper or intrinsic would ever exist because there is no such
underlying instruction.
Change-Id: I6c3f64cd2df2a58edf32037da8f5712868f296ea
2023-02-07 10:58:31 -05:00
Ioannis Assiouras
6704f3d0f0
SWDEV-380687 - Fixed typo in safe atomicAdd for gfx90a
...
Change-Id: I87bc6d9e0ef1f564d679c6280c71c4633d3aa619
2023-02-07 10:14:34 -05:00
Ioannis Assiouras
24d3d8216b
SWDEV-380466 - Fixed typo in unsafe atomicAdd for doubles and gfx940
...
Change-Id: I3b9ea4773f94bf6a9b9dfb655062f42aeb3b5eba
2023-02-07 10:06:17 -05:00
Matthew Arsenault
8c7391398a
SWDEV-1 - Reapply "SWDEV-1 - Directly call __builtin_amdgcn_fence"
...
This reverts commit 5b39c346ad .
Resubmit after https://github.com/ROCmSoftwarePlatform/rocPRIM/commit/ba8a86166ec9c4cf3a40ecde7017feeb1484e7f6
Change-Id: I194559484a5a0d7f4443381fe77218ea73d2def9
2023-02-03 10:54:27 -05:00
Matt Arsenault
14477c24d0
SWDEV-1 - Use proper builtins for math intrinsics
...
Change-Id: If0e420332c718abe0e9f6316c467b6b29b80b183
2023-02-01 07:19:45 -05:00
Rakesh Roy
8b00f56577
SWDEV-375004 - Fix rocprim test failure
...
- For !__HIPCC_RTC__ case, operator<<(std::ostream& os, const hip_bfloat16& bf16) calls itself in endless recursion
- Convert hip_bfloat16 to float to fix this
Change-Id: I252a656817550caf43c587cebf461474f12b8c0c
2023-01-25 00:01:02 -05:00
Satyanvesh Dittakavi
0ccd41050b
SWDEV-372396 - Address the errors in hiprtc headers with -Werror
...
Change-Id: Ied553216dfc2ff8b5c72de617cb47b6ca8f52063
2023-01-23 17:25:42 +00:00
Maneesh Gupta
5b39c346ad
SWDEV-378321 - Revert "SWDEV-1 - Directly call __builtin_amdgcn_fence"
...
This reverts commit 3e54bad5b3 .
Reason for revert: Root cause for regression SWDEV-378321
Change-Id: If3116c5ad1a23db1840e5fc019d2fe997b37d229
2023-01-19 13:01:27 -05:00
Satyanvesh Dittakavi
842b9d0b56
SWDEV-373231 - Support hip atomics header with HIPRTC
...
Change-Id: Ic500dfb66ed5ef0c4b672dfd0595409d0e8caaf5
2023-01-17 00:31:07 -05:00
Jaydeep Patel
8bd45ae5da
SWDEV-369961 - Add mappings for enum hipGraphInstantiateFlags and types.
...
Change-Id: I03fa8221684b08a0c44e12aaf2eb7ce5281be629
2023-01-15 12:17:21 +00:00
Matt Arsenault
8615a89f24
SWDEV-1 - Use proper builtin for groupstaticsize
...
Remove the now empty header for asm intrinsic wrappers.
Change-Id: I63495dc6ed899b1ea823062f2be2fb064af8d4f9
2023-01-13 13:14:10 -05:00
German Andryeyev
eef47ca24a
SWDEV-353281 - Initial support of memalloc in graph
...
Add memory allocation support in graph. Current implementation uses
cache from mempool to hold the allocations which belong to the graph.
Also the resource tracking is disabled at this moment because mempool
operates with hip::Stream objects, but graph has execution with
amd::HostQueue objects.
Change-Id: I54fe3250126d24f5a26ada975f37d429bb4ef17b
2023-01-13 13:06:59 -05:00
Satyanvesh Dittakavi
c193503722
SWDEV-376386 - Support __hip_move_dpp with hipRTC
...
Change-Id: If9a6fe23d6df143a7f522823fca7183416048c6a
2023-01-13 00:05:31 -05:00
Matt Arsenault
3e54bad5b3
SWDEV-1 - Directly call __builtin_amdgcn_fence
...
The device libraries provide hip.bc, which only contains
__atomic_work_item_fence. This is identical to OpenCL's
atomic_work_item_fence, except prefixed with __. This only existed to
switch over the unknown values the OpenCL function provides. There's
no need to go through the wrapper here since it's using constant
values, and without it we can drop one of the bitcode libraries. I'm
guessing this was a holdover from before the fence builtin existed.
Change-Id: I8b04614f96194eafddeba295215c12c3856e6c42
2023-01-12 12:04:09 -05:00
Matt Arsenault
91b851fad7
SWDEV-1 - Remove unused intrinsic declaration
...
The uses have moved to using the proper __builtin_amdgcn_ds_swizzle.
Change-Id: I82159ebf41fb91ac6a9d07b33311e569dce5e754
2023-01-12 12:04:09 -05:00
Ioannis Assiouras
e97dcfbe48
SWDEV-374405 - Added OpenGL interoperability APIs for nvidia
...
Change-Id: I1d6ad5c610ab6ee805bd07dacea316b3242ed129
2023-01-12 05:22:14 -05:00
Rakesh Roy
b321e2afc6
SWDEV-368881 - Implement hipModuleLaunchCooperative* APIs
...
- Add implementation for hipModuleLaunchCooperativeKernel, hipModuleLaunchCooperativeKernelMultiDevice APIs
Change-Id: I2a80264e458549211c34b413397a08b57d021147
2023-01-10 13:50:20 -05:00
Maneesh Gupta
6c53a670c1
SWDEV-345571 - Revert changes for warp shuffle functions
...
This reverts commit ad1fae7df4 and 695533d435 .
Change-Id: I548f15055417453a7fd323a481ed410598830521
2023-01-09 22:56:52 -05:00
kjayapra-amd
0c8c08cf51
SWDEV-361374 - Adding support for hipPointerSetAttributes
...
Change-Id: I56710b808854acefca07b3b5898deebee38063e9
2023-01-09 12:36:57 -05:00
Jaydeep Patel
695533d435
SWDEV-375137 - Bypass defination of half and half2 alias for internal include of hip_fp16.h
...
Change-Id: I5258a487d4c2d93e889a523fde81f1b3906d9318
2023-01-06 21:34:51 -05:00
pghafari
335a24d1a0
SWDEV-369042 - updating to GetError to match cuda
...
Change-Id: I0ec2330443b26cb1c8cedba942f31fb1267cd09d
2023-01-06 05:33:41 -05:00
Anusha GodavarthySurya
ad1fae7df4
SWDEV-345571 - Added support for half/half2 data types for warp shuffle functions
...
Change-Id: Ia0791760c1b0a35bbbf0b1f8435aa924af205169
2022-12-26 01:24:06 -05:00
Satyanvesh Dittakavi
e90721435b
SWDEV-372767 - Fix hipCUB and rocWMMA build failures
...
Change-Id: If3672fd50075c67cfce4d94e73b1376c4aa8d2ae
2022-12-16 03:08:18 -05:00
Satyanvesh Dittakavi
91291e48db
SWDEV-372032 - Support cooperative groups header with HIPRTC
...
Change-Id: Ia64652fd52b210a4eeee9d38fdfdfa6fdc7163ff
2022-12-09 04:45:59 -05:00
Satyanvesh Dittakavi
d413375d51
SWDEV-369955 - Support hipDrvGetErrorString and hipDrvGetErrorName on Nvidia headers
...
Change-Id: I54cfa8faed2f45f310682625f94952aa3c2ee27f
2022-12-02 11:52:07 -05:00
Satyanvesh Dittakavi
1deee433d0
SWDEV-287842 - Add int8_t and int16_t support in hiprtc
...
Change-Id: I03bb967068f1a29e85ce8d9410f78de4af9589c4
2022-12-01 11:58:46 -05:00
Your Name
3e47c052e0
SWDEV-368477 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags nvidia mapping
...
SWDEV-369618 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags invalid flag scenario
Change-Id: I2edbab5d3eeacf80bc215f0a77a46d733f4b4ec9
2022-11-29 12:38:39 -05:00
Rakesh Roy
cbc3a0a652
SWDEV-348820 - Fix hipPointerAttribute_t incompatibility
...
- Use hipPointerAttribute_t.type instead of hipPointerAttribute_t.memoryType
Change-Id: I3bf1c0758bdae987213ba7de62247dd420ef4cc9
2022-11-29 00:53:24 -05:00
Ioannis Assiouras
33cb388101
SWDEV-369557 - Added enum mapping for external[Semaphore/Memory]HandleType
...
Change-Id: If1b95b084f7fa312e6cdad4ccf433127671a35ce
2022-11-28 13:32:31 -05:00
Anusha GodavarthySurya
f7ca4b8fb9
SWDEV-365299 - update implementation __float2half_{rd,ru,rz} to call __ocml_cvt{rtn, rtp, rtz}_f16_f32
...
Change-Id: I6cd711fbeb0e02a1caa03ac7f3dd9c8f8fdbac01
2022-11-25 16:27:02 +00:00
Anusha GodavarthySurya
766c40aab8
SWDEV-325711 - Added few graph API support for nvidia path
...
Change-Id: I4eb6e77220e201c85fdaa3a004590c4c5d5ea6c6
2022-11-25 09:08:32 -05:00
Ioannis Assiouras
c5b08c621f
SWDEV-364950 - Remove Scalar_accessors from HIP_vector_type
...
Change-Id: I13ecc213c98a1c7b5febcd43c0a21ee6db725f60
2022-11-24 09:40:54 -05:00
Pierre van Houtryve
a312c0a956
Fix atomicInc/dec scope to "agent"
...
Addresses leftover comment of Icd1d9b9438cac4bef0f7c52d4cd341ac76500890
Change-Id: If9c395335497d634e4c702433d6e3681583bedef
2022-11-24 06:56:49 -05:00
Ioannis Assiouras
432f5fbc97
SWDEV-364950 - Add trivial copy assignment for HIP_vector_type under HIP_CLANG_ONLY path
...
Change-Id: I56a1003ba32c748631c7751b2e6a49e12e48afcb
2022-11-23 16:56:39 +00:00
Anusha GodavarthySurya
78eef7f173
SWDEV-366653 - Added new graph APIs - Fix windows build
...
Change-Id: I580133bddcbab5529aecfb70bc9776e8c40de697
2022-11-23 08:57:49 +00:00
Anusha Godavarthy Surya
93ce12c90d
SWDEV-325711 - Added Graph APIs hipGraphDebugDotPrint, hipGraphKernelNodeCopyAttributes
...
hipGraphNodeGetEnabled, hipGraphNodeSetEnabled and hipGraphUpload
Change-Id: Ib5a58fbc0a9b3f134eb56d42cd945f3d7f4922d8
2022-11-22 23:20:25 -05:00
Satyanvesh Dittakavi
5f4aa2e994
SWDEV-366306 - SWDEV-366314 - Support hip vector types and hip_bfloat16 with hiprtc
...
Also, moves the hip_bfloat16 header implemenation from hip
to hipamd
Change-Id: I9980126fa1ee8160ff9f5f8241f64699a9702c95
2022-11-18 06:58:31 -05:00
Julia Jiang
30c43056c1
SWDEV-340754 - Update coordinates in coperatative helper
...
Change-Id: Ibf4e833d10ea5c317900e83f020c9c23f55f3951
2022-11-17 14:17:58 -05:00
Jaydeep Patel
c800d2126d
SWDEV-366088 - Default case returns -1 as enum value.
...
Change-Id: I8dd5cac254c99e4de9f6d492e71060e39b069507
2022-11-15 22:12:12 -05:00
Daria Materikina
834d9d3a3c
make header intel-compatible
...
Change-Id: I8f8b68ba23e8f86dfeaf7c7b464f2b6ca6999971
2022-11-14 17:59:01 -05:00
kjayapra-amd
5ff27faa04
SWDEV-337452 - Adding__ockl_steadyctr_u64 option for wall_clock64.
...
Change-Id: I4ee6fd4d40d500363ef84975078185093dab5ddd
2022-11-07 08:47:30 -05:00
Anusha GodavarthySurya
c974cbcb3d
SWDEV-366153 - Fix __ocml_isnan_f16 to pass _Float16
...
Change-Id: I7628f21d000cec490f5fb2c91f1b99ca61fb46cf
2022-11-06 21:57:23 -05:00
pghafari
ac2948651e
SWDEV-362199 - add vulkan interop apis in nvidia
...
Change-Id: I6c5de6174a80be862b6d869927ed704dd1421b41
2022-11-04 16:49:21 -04:00
Anusha GodavarthySurya
07b3070e25
SWDEV-362611 - Added hmax and hmin
...
Change-Id: I133a80e997e39357693df7ab969425c0d5585607
2022-10-31 00:23:57 -04:00
Pierre van Houtryve
dcb5d29f1e
Replace LLVM atomic.inc intrinsics with Clang builtins
...
With opaque pointers, the suffix of those intrinsics changed. This caused
build failures that should be solved by just using the corresponding
Clang builtins instead of using intrinsics directly
See SWDEV-356581
Change-Id: Icd1d9b9438cac4bef0f7c52d4cd341ac76500890
2022-10-27 10:17:21 -04:00
Laurent Morichetti
19107858d6
SWDEV-361847 - Print 'char' parameters as 'integer'
...
When printing HIP API function parameters, use the integer format to
print 'char' arguments to avoid printing invalid ASCII characters
(value > 127).
Make sure the roctracer::hip_support::detail operator<< overloads are
used when printing arguments.
Change-Id: Id072c2ed19b1b4166108599e393d1cae6c54b6b0
2022-10-21 14:55:33 -04:00