Commit Graph

1566 Commits

Author SHA1 Message Date
Ioannis Assiouras 633e720da4 SWDEV-393199 - Added new include file for opengl interop mappings for nvidia
Change-Id: I2e955a9dd06539939188a5f2e1dde4f173af1202
2023-05-03 19:43:24 -04:00
taosang2 381e6520d0 SWDEV-396574 – Refactor texture channel type checking
Create __hip_is_tex_surf_channel_type to replace
__hip_is_tex_channel_type
__hip_is_itex_channel_type
__hip_is_surf_channel_type .

Change-Id: I1692b92d417bad742d562679f218ebf8ca532e31
2023-05-02 10:01:13 -04:00
Jaydeep Patel cbd91bdc19 SWDEV-395559 - Move unsigned short up.
Change-Id: I3373a7bc22e632dc10b94b93ff023444d456b2b7
2023-05-02 00:43:36 -04:00
taosang2 2f2d02649f SWDEV-396574 – Refactor device mapping functions
Refactor mapFrom(), mapTo(), mapElem(),
__hip_is_surf_channel_type() and
__hip_is_itex_channel_type()

Change-Id: I1692b92d407bad742d562678f218ebf8ca532e91
2023-05-01 13:57:10 -04:00
Satyanvesh Dittakavi a67f354cec SWDEV-395413 - Fix fp16 header errors with -Werror in hiprtc
Change-Id: I7875b91758e826b439c67ac9f211a20f9b6133f0
2023-04-26 13:05:20 -04:00
taosang2 650b6767c4 SWDEV-390626 - Fix wrong mapping functions
Remove wrong functions
__hipMapFromNativeFloat4()
__hipMapToNativeFloat4()

Replace them with mapFrom() and mapTo()

Change-Id: I1692b92d397bfd732d562778f918ebf8ca532e93
2023-04-25 13:19:00 -04:00
Ajay 9ca5e71127 SWDEV-390170 - guard deprecated runtime apis on CUDA 12.0
Change-Id: I4ab967c276dfa98f75d508b9484e8d1324b4e4a7
2023-04-05 16:36:53 +00:00
Julia Jiang 0442501849 SWDEV-389033 - Update header for cooperate group
Change-Id: Ica8f99c644a32835bf480b52a6a2af861f1526c0
2023-04-05 10:58:04 +00:00
Ioannis Assiouras 1d6c40b569 SWDEV-387173 - Added cuda mappings for hipGraphMemAttr* enums
Change-Id: Ie5485ec518c6d107901a429ac3614e22caa9196b
2023-03-09 08:08:06 -05:00
Anusha GodavarthySurya b689c56107 SWDEV-330658 - Added flag hipHostRegisterReadOnly
Change-Id: Idb59dc6187e99512546dfeafde44c08ae85f6057
2023-03-09 00:20:24 -05:00
Satyanvesh Dittakavi a8c2375fe1 SWDEV-384714 - Add hipRTC support for amd_hip_fp16
Change-Id: Ib6e2b46908d740909dfc15253965590f496a7068
2023-03-01 06:57:57 -05:00
Jatin Chaudhary a7049bf7a0 SWDEV-372153 - Add hipStreamGetDevice Implementation
Change-Id: Ifd1f13e311e8221ca6d94cf27f9131eb97678067
2023-03-01 04:35:49 -05:00
Sarbojit Sarkar d0920c5959 SWDEV-293749 - Fix for operator mixup
Change-Id: I61d1fea5a6ed2176dd92050c6d96cee1af3a39fb
2023-02-23 04:45:23 -05:00
Anusha GodavarthySurya 32a9249d3e SWDEV-345571 - Added support for half/half2 data types for warp shuffle functions
Change-Id: I276a55129d6527b73292d75eb18403acd2623c8a
2023-02-22 22:40:01 -05:00
Jaydeep Patel 0ad47eb3e5 SWDEV-319526 - SE ID size incresed.
Change-Id: Ia7df94d172102b321f12c8caa821e3a7e4447a21
2023-02-21 22:13:57 -05:00
Ioannis Assiouras aaf1c19609 SWDEV-384362 - Added explicit cast from __fp16 to _Float16 in rcph
Change-Id: Ibbb11e928a80126ed7a8c7cd8bddebc74c38986d
2023-02-21 17:32:59 -05:00
Ajay fc6a77a09f SWDEV-384100 - HIP support for CUDA 12.0
Apps are failing to build due to undefined deprecated texture APIs

Change-Id: I1fb64adc4bc0ba6ee6ecaa65d54b34da0327e6a3
2023-02-21 12:23:48 -05:00
Rakesh Roy 287d07ccd4 SWDEV-338733 - Implement hipArrayGet* APIs
- Add implementation for hipArrayGetInfo, hipArrayGetDescriptor &
  hipArray3DGetDescriptor APIs

Change-Id: I181a472066006bc3bd0d987408ea67e218310983
2023-02-16 23:04:29 -05:00
Matt Arsenault eacf63673b SWDEV-1 - Use proper __builtin_amdgcn_rcph builtin
Change-Id: Ic0b1b25a5ab7d132cb51425f2a17bc779c762668
2023-02-07 10:58:48 -05:00
Matt Arsenault bebbf3c4ec SWDEV-1 - Eliminate rcp_2f16 pseudo intrinsic
No such wrapper or intrinsic would ever exist because there is no such
underlying instruction.

Change-Id: I6c3f64cd2df2a58edf32037da8f5712868f296ea
2023-02-07 10:58:31 -05:00
Ioannis Assiouras 6704f3d0f0 SWDEV-380687 - Fixed typo in safe atomicAdd for gfx90a
Change-Id: I87bc6d9e0ef1f564d679c6280c71c4633d3aa619
2023-02-07 10:14:34 -05:00
Ioannis Assiouras 24d3d8216b SWDEV-380466 - Fixed typo in unsafe atomicAdd for doubles and gfx940
Change-Id: I3b9ea4773f94bf6a9b9dfb655062f42aeb3b5eba
2023-02-07 10:06:17 -05:00
Matthew Arsenault 8c7391398a SWDEV-1 - Reapply "SWDEV-1 - Directly call __builtin_amdgcn_fence"
This reverts commit 5b39c346ad.

Resubmit after https://github.com/ROCmSoftwarePlatform/rocPRIM/commit/ba8a86166ec9c4cf3a40ecde7017feeb1484e7f6

Change-Id: I194559484a5a0d7f4443381fe77218ea73d2def9
2023-02-03 10:54:27 -05:00
Matt Arsenault 14477c24d0 SWDEV-1 - Use proper builtins for math intrinsics
Change-Id: If0e420332c718abe0e9f6316c467b6b29b80b183
2023-02-01 07:19:45 -05:00
Rakesh Roy 8b00f56577 SWDEV-375004 - Fix rocprim test failure
- For !__HIPCC_RTC__ case, operator<<(std::ostream& os, const hip_bfloat16& bf16) calls itself in endless recursion
- Convert hip_bfloat16 to float to fix this

Change-Id: I252a656817550caf43c587cebf461474f12b8c0c
2023-01-25 00:01:02 -05:00
Satyanvesh Dittakavi 0ccd41050b SWDEV-372396 - Address the errors in hiprtc headers with -Werror
Change-Id: Ied553216dfc2ff8b5c72de617cb47b6ca8f52063
2023-01-23 17:25:42 +00:00
Maneesh Gupta 5b39c346ad SWDEV-378321 - Revert "SWDEV-1 - Directly call __builtin_amdgcn_fence"
This reverts commit 3e54bad5b3.

Reason for revert: Root cause for regression SWDEV-378321

Change-Id: If3116c5ad1a23db1840e5fc019d2fe997b37d229
2023-01-19 13:01:27 -05:00
Satyanvesh Dittakavi 842b9d0b56 SWDEV-373231 - Support hip atomics header with HIPRTC
Change-Id: Ic500dfb66ed5ef0c4b672dfd0595409d0e8caaf5
2023-01-17 00:31:07 -05:00
Jaydeep Patel 8bd45ae5da SWDEV-369961 - Add mappings for enum hipGraphInstantiateFlags and types.
Change-Id: I03fa8221684b08a0c44e12aaf2eb7ce5281be629
2023-01-15 12:17:21 +00:00
Matt Arsenault 8615a89f24 SWDEV-1 - Use proper builtin for groupstaticsize
Remove the now empty header for asm intrinsic wrappers.

Change-Id: I63495dc6ed899b1ea823062f2be2fb064af8d4f9
2023-01-13 13:14:10 -05:00
German Andryeyev eef47ca24a SWDEV-353281 - Initial support of memalloc in graph
Add memory allocation support in graph. Current implementation uses
cache from mempool  to hold the allocations which belong to the graph.
Also the resource tracking is disabled at this moment because mempool
operates with hip::Stream objects, but graph has execution with
amd::HostQueue objects.

Change-Id: I54fe3250126d24f5a26ada975f37d429bb4ef17b
2023-01-13 13:06:59 -05:00
Satyanvesh Dittakavi c193503722 SWDEV-376386 - Support __hip_move_dpp with hipRTC
Change-Id: If9a6fe23d6df143a7f522823fca7183416048c6a
2023-01-13 00:05:31 -05:00
Matt Arsenault 3e54bad5b3 SWDEV-1 - Directly call __builtin_amdgcn_fence
The device libraries provide hip.bc, which only contains
__atomic_work_item_fence. This is identical to OpenCL's
atomic_work_item_fence, except prefixed with __. This only existed to
switch over the unknown values the OpenCL function provides. There's
no need to go through the wrapper here since it's using constant
values, and without it we can drop one of the bitcode libraries. I'm
guessing this was a holdover from before the fence builtin existed.

Change-Id: I8b04614f96194eafddeba295215c12c3856e6c42
2023-01-12 12:04:09 -05:00
Matt Arsenault 91b851fad7 SWDEV-1 - Remove unused intrinsic declaration
The uses have moved to using the proper __builtin_amdgcn_ds_swizzle.

Change-Id: I82159ebf41fb91ac6a9d07b33311e569dce5e754
2023-01-12 12:04:09 -05:00
Ioannis Assiouras e97dcfbe48 SWDEV-374405 - Added OpenGL interoperability APIs for nvidia
Change-Id: I1d6ad5c610ab6ee805bd07dacea316b3242ed129
2023-01-12 05:22:14 -05:00
Rakesh Roy b321e2afc6 SWDEV-368881 - Implement hipModuleLaunchCooperative* APIs
- Add implementation for hipModuleLaunchCooperativeKernel, hipModuleLaunchCooperativeKernelMultiDevice APIs

Change-Id: I2a80264e458549211c34b413397a08b57d021147
2023-01-10 13:50:20 -05:00
Maneesh Gupta 6c53a670c1 SWDEV-345571 - Revert changes for warp shuffle functions
This reverts commit ad1fae7df4 and 695533d435.

Change-Id: I548f15055417453a7fd323a481ed410598830521
2023-01-09 22:56:52 -05:00
kjayapra-amd 0c8c08cf51 SWDEV-361374 - Adding support for hipPointerSetAttributes
Change-Id: I56710b808854acefca07b3b5898deebee38063e9
2023-01-09 12:36:57 -05:00
Jaydeep Patel 695533d435 SWDEV-375137 - Bypass defination of half and half2 alias for internal include of hip_fp16.h
Change-Id: I5258a487d4c2d93e889a523fde81f1b3906d9318
2023-01-06 21:34:51 -05:00
pghafari 335a24d1a0 SWDEV-369042 - updating to GetError to match cuda
Change-Id: I0ec2330443b26cb1c8cedba942f31fb1267cd09d
2023-01-06 05:33:41 -05:00
Anusha GodavarthySurya ad1fae7df4 SWDEV-345571 - Added support for half/half2 data types for warp shuffle functions
Change-Id: Ia0791760c1b0a35bbbf0b1f8435aa924af205169
2022-12-26 01:24:06 -05:00
Satyanvesh Dittakavi e90721435b SWDEV-372767 - Fix hipCUB and rocWMMA build failures
Change-Id: If3672fd50075c67cfce4d94e73b1376c4aa8d2ae
2022-12-16 03:08:18 -05:00
Satyanvesh Dittakavi 91291e48db SWDEV-372032 - Support cooperative groups header with HIPRTC
Change-Id: Ia64652fd52b210a4eeee9d38fdfdfa6fdc7163ff
2022-12-09 04:45:59 -05:00
Satyanvesh Dittakavi d413375d51 SWDEV-369955 - Support hipDrvGetErrorString and hipDrvGetErrorName on Nvidia headers
Change-Id: I54cfa8faed2f45f310682625f94952aa3c2ee27f
2022-12-02 11:52:07 -05:00
Satyanvesh Dittakavi 1deee433d0 SWDEV-287842 - Add int8_t and int16_t support in hiprtc
Change-Id: I03bb967068f1a29e85ce8d9410f78de4af9589c4
2022-12-01 11:58:46 -05:00
Your Name 3e47c052e0 SWDEV-368477 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags nvidia mapping
SWDEV-369618 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags invalid flag scenario
Change-Id: I2edbab5d3eeacf80bc215f0a77a46d733f4b4ec9
2022-11-29 12:38:39 -05:00
Rakesh Roy cbc3a0a652 SWDEV-348820 - Fix hipPointerAttribute_t incompatibility
- Use hipPointerAttribute_t.type instead of hipPointerAttribute_t.memoryType

Change-Id: I3bf1c0758bdae987213ba7de62247dd420ef4cc9
2022-11-29 00:53:24 -05:00
Ioannis Assiouras 33cb388101 SWDEV-369557 - Added enum mapping for external[Semaphore/Memory]HandleType
Change-Id: If1b95b084f7fa312e6cdad4ccf433127671a35ce
2022-11-28 13:32:31 -05:00
Anusha GodavarthySurya f7ca4b8fb9 SWDEV-365299 - update implementation __float2half_{rd,ru,rz} to call __ocml_cvt{rtn, rtp, rtz}_f16_f32
Change-Id: I6cd711fbeb0e02a1caa03ac7f3dd9c8f8fdbac01
2022-11-25 16:27:02 +00:00
Anusha GodavarthySurya 766c40aab8 SWDEV-325711 - Added few graph API support for nvidia path
Change-Id: I4eb6e77220e201c85fdaa3a004590c4c5d5ea6c6
2022-11-25 09:08:32 -05:00