Граф коммитов

1551 Коммитов

Автор SHA1 Сообщение Дата
Ioannis Assiouras aaf1c19609 SWDEV-384362 - Added explicit cast from __fp16 to _Float16 in rcph
Change-Id: Ibbb11e928a80126ed7a8c7cd8bddebc74c38986d
2023-02-21 17:32:59 -05:00
Ajay fc6a77a09f SWDEV-384100 - HIP support for CUDA 12.0
Apps are failing to build due to undefined deprecated texture APIs

Change-Id: I1fb64adc4bc0ba6ee6ecaa65d54b34da0327e6a3
2023-02-21 12:23:48 -05:00
Rakesh Roy 287d07ccd4 SWDEV-338733 - Implement hipArrayGet* APIs
- Add implementation for hipArrayGetInfo, hipArrayGetDescriptor &
  hipArray3DGetDescriptor APIs

Change-Id: I181a472066006bc3bd0d987408ea67e218310983
2023-02-16 23:04:29 -05:00
Matt Arsenault eacf63673b SWDEV-1 - Use proper __builtin_amdgcn_rcph builtin
Change-Id: Ic0b1b25a5ab7d132cb51425f2a17bc779c762668
2023-02-07 10:58:48 -05:00
Matt Arsenault bebbf3c4ec SWDEV-1 - Eliminate rcp_2f16 pseudo intrinsic
No such wrapper or intrinsic would ever exist because there is no such
underlying instruction.

Change-Id: I6c3f64cd2df2a58edf32037da8f5712868f296ea
2023-02-07 10:58:31 -05:00
Ioannis Assiouras 6704f3d0f0 SWDEV-380687 - Fixed typo in safe atomicAdd for gfx90a
Change-Id: I87bc6d9e0ef1f564d679c6280c71c4633d3aa619
2023-02-07 10:14:34 -05:00
Ioannis Assiouras 24d3d8216b SWDEV-380466 - Fixed typo in unsafe atomicAdd for doubles and gfx940
Change-Id: I3b9ea4773f94bf6a9b9dfb655062f42aeb3b5eba
2023-02-07 10:06:17 -05:00
Matthew Arsenault 8c7391398a SWDEV-1 - Reapply "SWDEV-1 - Directly call __builtin_amdgcn_fence"
This reverts commit 5b39c346ad.

Resubmit after https://github.com/ROCmSoftwarePlatform/rocPRIM/commit/ba8a86166ec9c4cf3a40ecde7017feeb1484e7f6

Change-Id: I194559484a5a0d7f4443381fe77218ea73d2def9
2023-02-03 10:54:27 -05:00
Matt Arsenault 14477c24d0 SWDEV-1 - Use proper builtins for math intrinsics
Change-Id: If0e420332c718abe0e9f6316c467b6b29b80b183
2023-02-01 07:19:45 -05:00
Rakesh Roy 8b00f56577 SWDEV-375004 - Fix rocprim test failure
- For !__HIPCC_RTC__ case, operator<<(std::ostream& os, const hip_bfloat16& bf16) calls itself in endless recursion
- Convert hip_bfloat16 to float to fix this

Change-Id: I252a656817550caf43c587cebf461474f12b8c0c
2023-01-25 00:01:02 -05:00
Satyanvesh Dittakavi 0ccd41050b SWDEV-372396 - Address the errors in hiprtc headers with -Werror
Change-Id: Ied553216dfc2ff8b5c72de617cb47b6ca8f52063
2023-01-23 17:25:42 +00:00
Maneesh Gupta 5b39c346ad SWDEV-378321 - Revert "SWDEV-1 - Directly call __builtin_amdgcn_fence"
This reverts commit 3e54bad5b3.

Reason for revert: Root cause for regression SWDEV-378321

Change-Id: If3116c5ad1a23db1840e5fc019d2fe997b37d229
2023-01-19 13:01:27 -05:00
Satyanvesh Dittakavi 842b9d0b56 SWDEV-373231 - Support hip atomics header with HIPRTC
Change-Id: Ic500dfb66ed5ef0c4b672dfd0595409d0e8caaf5
2023-01-17 00:31:07 -05:00
Jaydeep Patel 8bd45ae5da SWDEV-369961 - Add mappings for enum hipGraphInstantiateFlags and types.
Change-Id: I03fa8221684b08a0c44e12aaf2eb7ce5281be629
2023-01-15 12:17:21 +00:00
Matt Arsenault 8615a89f24 SWDEV-1 - Use proper builtin for groupstaticsize
Remove the now empty header for asm intrinsic wrappers.

Change-Id: I63495dc6ed899b1ea823062f2be2fb064af8d4f9
2023-01-13 13:14:10 -05:00
German Andryeyev eef47ca24a SWDEV-353281 - Initial support of memalloc in graph
Add memory allocation support in graph. Current implementation uses
cache from mempool  to hold the allocations which belong to the graph.
Also the resource tracking is disabled at this moment because mempool
operates with hip::Stream objects, but graph has execution with
amd::HostQueue objects.

Change-Id: I54fe3250126d24f5a26ada975f37d429bb4ef17b
2023-01-13 13:06:59 -05:00
Satyanvesh Dittakavi c193503722 SWDEV-376386 - Support __hip_move_dpp with hipRTC
Change-Id: If9a6fe23d6df143a7f522823fca7183416048c6a
2023-01-13 00:05:31 -05:00
Matt Arsenault 3e54bad5b3 SWDEV-1 - Directly call __builtin_amdgcn_fence
The device libraries provide hip.bc, which only contains
__atomic_work_item_fence. This is identical to OpenCL's
atomic_work_item_fence, except prefixed with __. This only existed to
switch over the unknown values the OpenCL function provides. There's
no need to go through the wrapper here since it's using constant
values, and without it we can drop one of the bitcode libraries. I'm
guessing this was a holdover from before the fence builtin existed.

Change-Id: I8b04614f96194eafddeba295215c12c3856e6c42
2023-01-12 12:04:09 -05:00
Matt Arsenault 91b851fad7 SWDEV-1 - Remove unused intrinsic declaration
The uses have moved to using the proper __builtin_amdgcn_ds_swizzle.

Change-Id: I82159ebf41fb91ac6a9d07b33311e569dce5e754
2023-01-12 12:04:09 -05:00
Ioannis Assiouras e97dcfbe48 SWDEV-374405 - Added OpenGL interoperability APIs for nvidia
Change-Id: I1d6ad5c610ab6ee805bd07dacea316b3242ed129
2023-01-12 05:22:14 -05:00
Rakesh Roy b321e2afc6 SWDEV-368881 - Implement hipModuleLaunchCooperative* APIs
- Add implementation for hipModuleLaunchCooperativeKernel, hipModuleLaunchCooperativeKernelMultiDevice APIs

Change-Id: I2a80264e458549211c34b413397a08b57d021147
2023-01-10 13:50:20 -05:00
Maneesh Gupta 6c53a670c1 SWDEV-345571 - Revert changes for warp shuffle functions
This reverts commit ad1fae7df4 and 695533d435.

Change-Id: I548f15055417453a7fd323a481ed410598830521
2023-01-09 22:56:52 -05:00
kjayapra-amd 0c8c08cf51 SWDEV-361374 - Adding support for hipPointerSetAttributes
Change-Id: I56710b808854acefca07b3b5898deebee38063e9
2023-01-09 12:36:57 -05:00
Jaydeep Patel 695533d435 SWDEV-375137 - Bypass defination of half and half2 alias for internal include of hip_fp16.h
Change-Id: I5258a487d4c2d93e889a523fde81f1b3906d9318
2023-01-06 21:34:51 -05:00
pghafari 335a24d1a0 SWDEV-369042 - updating to GetError to match cuda
Change-Id: I0ec2330443b26cb1c8cedba942f31fb1267cd09d
2023-01-06 05:33:41 -05:00
Anusha GodavarthySurya ad1fae7df4 SWDEV-345571 - Added support for half/half2 data types for warp shuffle functions
Change-Id: Ia0791760c1b0a35bbbf0b1f8435aa924af205169
2022-12-26 01:24:06 -05:00
Satyanvesh Dittakavi e90721435b SWDEV-372767 - Fix hipCUB and rocWMMA build failures
Change-Id: If3672fd50075c67cfce4d94e73b1376c4aa8d2ae
2022-12-16 03:08:18 -05:00
Satyanvesh Dittakavi 91291e48db SWDEV-372032 - Support cooperative groups header with HIPRTC
Change-Id: Ia64652fd52b210a4eeee9d38fdfdfa6fdc7163ff
2022-12-09 04:45:59 -05:00
Satyanvesh Dittakavi d413375d51 SWDEV-369955 - Support hipDrvGetErrorString and hipDrvGetErrorName on Nvidia headers
Change-Id: I54cfa8faed2f45f310682625f94952aa3c2ee27f
2022-12-02 11:52:07 -05:00
Satyanvesh Dittakavi 1deee433d0 SWDEV-287842 - Add int8_t and int16_t support in hiprtc
Change-Id: I03bb967068f1a29e85ce8d9410f78de4af9589c4
2022-12-01 11:58:46 -05:00
Your Name 3e47c052e0 SWDEV-368477 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags nvidia mapping
SWDEV-369618 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags invalid flag scenario
Change-Id: I2edbab5d3eeacf80bc215f0a77a46d733f4b4ec9
2022-11-29 12:38:39 -05:00
Rakesh Roy cbc3a0a652 SWDEV-348820 - Fix hipPointerAttribute_t incompatibility
- Use hipPointerAttribute_t.type instead of hipPointerAttribute_t.memoryType

Change-Id: I3bf1c0758bdae987213ba7de62247dd420ef4cc9
2022-11-29 00:53:24 -05:00
Ioannis Assiouras 33cb388101 SWDEV-369557 - Added enum mapping for external[Semaphore/Memory]HandleType
Change-Id: If1b95b084f7fa312e6cdad4ccf433127671a35ce
2022-11-28 13:32:31 -05:00
Anusha GodavarthySurya f7ca4b8fb9 SWDEV-365299 - update implementation __float2half_{rd,ru,rz} to call __ocml_cvt{rtn, rtp, rtz}_f16_f32
Change-Id: I6cd711fbeb0e02a1caa03ac7f3dd9c8f8fdbac01
2022-11-25 16:27:02 +00:00
Anusha GodavarthySurya 766c40aab8 SWDEV-325711 - Added few graph API support for nvidia path
Change-Id: I4eb6e77220e201c85fdaa3a004590c4c5d5ea6c6
2022-11-25 09:08:32 -05:00
Ioannis Assiouras c5b08c621f SWDEV-364950 - Remove Scalar_accessors from HIP_vector_type
Change-Id: I13ecc213c98a1c7b5febcd43c0a21ee6db725f60
2022-11-24 09:40:54 -05:00
Pierre van Houtryve a312c0a956 Fix atomicInc/dec scope to "agent"
Addresses leftover comment of Icd1d9b9438cac4bef0f7c52d4cd341ac76500890

Change-Id: If9c395335497d634e4c702433d6e3681583bedef
2022-11-24 06:56:49 -05:00
Ioannis Assiouras 432f5fbc97 SWDEV-364950 - Add trivial copy assignment for HIP_vector_type under HIP_CLANG_ONLY path
Change-Id: I56a1003ba32c748631c7751b2e6a49e12e48afcb
2022-11-23 16:56:39 +00:00
Anusha GodavarthySurya 78eef7f173 SWDEV-366653 - Added new graph APIs - Fix windows build
Change-Id: I580133bddcbab5529aecfb70bc9776e8c40de697
2022-11-23 08:57:49 +00:00
Anusha Godavarthy Surya 93ce12c90d SWDEV-325711 - Added Graph APIs hipGraphDebugDotPrint, hipGraphKernelNodeCopyAttributes
hipGraphNodeGetEnabled, hipGraphNodeSetEnabled and hipGraphUpload

Change-Id: Ib5a58fbc0a9b3f134eb56d42cd945f3d7f4922d8
2022-11-22 23:20:25 -05:00
Satyanvesh Dittakavi 5f4aa2e994 SWDEV-366306 - SWDEV-366314 - Support hip vector types and hip_bfloat16 with hiprtc
Also, moves the hip_bfloat16 header implemenation from hip
to hipamd

Change-Id: I9980126fa1ee8160ff9f5f8241f64699a9702c95
2022-11-18 06:58:31 -05:00
Julia Jiang 30c43056c1 SWDEV-340754 - Update coordinates in coperatative helper
Change-Id: Ibf4e833d10ea5c317900e83f020c9c23f55f3951
2022-11-17 14:17:58 -05:00
Jaydeep Patel c800d2126d SWDEV-366088 - Default case returns -1 as enum value.
Change-Id: I8dd5cac254c99e4de9f6d492e71060e39b069507
2022-11-15 22:12:12 -05:00
Daria Materikina 834d9d3a3c make header intel-compatible
Change-Id: I8f8b68ba23e8f86dfeaf7c7b464f2b6ca6999971
2022-11-14 17:59:01 -05:00
kjayapra-amd 5ff27faa04 SWDEV-337452 - Adding__ockl_steadyctr_u64 option for wall_clock64.
Change-Id: I4ee6fd4d40d500363ef84975078185093dab5ddd
2022-11-07 08:47:30 -05:00
Anusha GodavarthySurya c974cbcb3d SWDEV-366153 - Fix __ocml_isnan_f16 to pass _Float16
Change-Id: I7628f21d000cec490f5fb2c91f1b99ca61fb46cf
2022-11-06 21:57:23 -05:00
pghafari ac2948651e SWDEV-362199 - add vulkan interop apis in nvidia
Change-Id: I6c5de6174a80be862b6d869927ed704dd1421b41
2022-11-04 16:49:21 -04:00
Anusha GodavarthySurya 07b3070e25 SWDEV-362611 - Added hmax and hmin
Change-Id: I133a80e997e39357693df7ab969425c0d5585607
2022-10-31 00:23:57 -04:00
Pierre van Houtryve dcb5d29f1e Replace LLVM atomic.inc intrinsics with Clang builtins
With opaque pointers, the suffix of those intrinsics changed. This caused
build failures that should be solved by just using the corresponding
Clang builtins instead of using intrinsics directly

See SWDEV-356581

Change-Id: Icd1d9b9438cac4bef0f7c52d4cd341ac76500890
2022-10-27 10:17:21 -04:00
Laurent Morichetti 19107858d6 SWDEV-361847 - Print 'char' parameters as 'integer'
When printing HIP API function parameters, use the integer format to
print 'char' arguments to avoid printing invalid ASCII characters
(value > 127).

Make sure the roctracer::hip_support::detail operator<< overloads are
used when printing arguments.

Change-Id: Id072c2ed19b1b4166108599e393d1cae6c54b6b0
2022-10-21 14:55:33 -04:00