Jatin
19a30de0a6
SWDEV-245414 - fix nvcc CI breakage
...
Change-Id: I34ff533f34cfc2f60abefc0d8f6c6d8266d30f61
2021-06-26 13:31:10 +05:30
Sarbojit Sarkar
7e3c17c023
SWDEV-284435 - StreamOps mapping with cuda drv API
...
Change-Id: I6ed9196cfe59d9004338206f6dd8d6fb367b3ffa
2021-06-25 12:20:40 -04:00
Maneesh Gupta
f4680d98a9
SWDEV-285388 - Bring back the inline asm until we have proper compiler support
...
Change-Id: Ibafe33dd8298f6e19f8b6a6d2aa220b633b76862
2021-06-21 09:15:34 -04:00
Maneesh Gupta
a9b4df0ad0
SWDEV-285388 - Allow fallback to asm when builtin not available
...
Change-Id: Ifeee7f3ad1be9ce9b79cf26008b75102153143a5
2021-06-17 21:21:19 -07:00
Aaron En Ye Shi
3fa21c1d49
SWDEV-285388 - Clean up llvm intrinsics using __asm
...
Instead of using inline asm, use clang builtins
for llvm intrinsics.
Change-Id: I30287f5a8de035ccd7e48d10e559a8a2e1d389f7
2021-06-16 13:20:44 -04:00
Jatin Chaudhary
2223a508f5
SWDEV-290993 - Fix error in __launch_bounds__ seen with -Werror -Wpedantic
...
Change-Id: If89c681a5e383c30a9233a183db282787e2f0081
2021-06-15 00:36:35 -07:00
Julia Jiang
de150e1803
SWDEV-290704 - update hiprtc header with doxygen
...
Change-Id: I93b7ddd2a89f8a214bca7e67bd5b88f6144b70dd
2021-06-14 15:28:32 -04:00
Sarbojit Sarkar
7866ba9713
SWDEV-290278 - typedef for hiprtcResult
...
Change-Id: I93f637dd04ce31aee8ef0ba097f1232af9af8d2b
2021-06-11 11:52:38 -04:00
Julia Jiang
1936678084
SWDEV-290704 - Add doxygen in hiprtc
...
Change-Id: Ib1efb6874472601c0f45fc5c7f85a01e69b0a958
2021-06-10 17:21:25 -04:00
Rahul Garg
19c84bc604
ROCMOPS-1956 - Push restructured code to hipamd
...
hipamd will have AMD's ROCCLR based HIP backend implementation
Change-Id: Id7de9634519b4ce46fca71a1b61f3d5b1e3fc459
2021-06-07 21:42:44 +00:00
Rahul Garg
260a1fed06
SWDEV-288707 - Merge HIP refactored code to staging
...
These changes move:
- rocclr based implementation inside src/hipamd/src
- platform specific (both nvidia and amd) headers inside
src/hipamd/include/hip
Change-Id: Ia29791a727244952591fe1d813dcef0303b73a9e
2021-06-07 00:07:24 +00:00
German Andryeyev
2b3a4e711b
SWDEV-240804 - Add hipMemAdvise() extension
...
Add a coarse-grain memory extension. The new advice allows HMM
to disable the cache coherency policy to improve performance.
Change-Id: I3c792d6a96896b983a7ffccddaa0ded06d183212
2021-06-02 16:16:14 -04:00
Joseph Greathouse
3212fdf4d6
SWDEV-288361 - Implement funnel shift operators
...
Change-Id: I0c73809366f2d21e75f98fdeac7fed0c7591ee34
2021-06-01 11:10:11 -04:00
German Andryeyev
0e11c59272
SWDEV-240804 - Fix typos in the comments
...
Change-Id: Ic1385daf1e685772b02a11d43fb13f6cde74c4b7
2021-05-29 00:31:46 -04:00
Aaron En Ye Shi
414c80644b
SWDEV-283266 - Workaround RTC macro in vector types
...
Temporarily disable __HIPCC_RTC__ in hip_vector_types.h
while the upstream clang headers are outdated on mainline.
Once upstream patch has landed in mainline, revert this
change. This is a workaround for hiprtc testing.
Change-Id: Ib2cf6023b71431bbfbe3c699076caa4f90f7170c
2021-05-21 15:53:59 +00:00
Tao Sang
26235f49e6
SWDEV-282588 - Improve clock64()
...
Let clock64() read s_memtime to prevent rolling over
Add wall_clock64() to read s_memrealtime
Change-Id: I706387355add75852c570be2656131ef99e8968f
2021-05-17 13:19:04 -04:00
Aaron En Ye Shi
dd643ec517
SWDEV-263959 - Fix device decl with long and llong
...
Windows may expect long and ulong to be 4 bytes, while
Linux expects 8 bytes. Instead, use uint64_t for
unsigned long and unsigned long long, and use int64_t
for long and long long, so that widths are consistent.
Change-Id: I6ed1cdde43721bcaaab0245644d607b1adbf9884
2021-05-14 11:59:27 -04:00
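The width mismatch described in the commit above can be illustrated with a small sketch. This is not the HIP declaration itself: `kLongBits` and `widen_and_add` are hypothetical names used only to show why a fixed-width type keeps a signature stable across platforms.

```cpp
#include <climits>
#include <cstdint>

// Width of `long` in bits on the platform compiling this file: typically 32
// under the 64-bit Windows (LLP64) model and 64 under 64-bit Linux (LP64).
constexpr unsigned kLongBits = sizeof(long) * CHAR_BIT;

// Hypothetical helper, not a HIP API: declared with a fixed-width type so the
// parameter and return widths are the same on both platforms.
constexpr std::int64_t widen_and_add(std::int64_t a, std::int64_t b) {
    return a + b;
}

// The fixed-width types are 64 bits wherever they are provided.
static_assert(sizeof(std::int64_t) * CHAR_BIT == 64, "int64_t is 64 bits");
static_assert(sizeof(std::uint64_t) * CHAR_BIT == 64, "uint64_t is 64 bits");
```

A declaration written against `long` would silently change width between the two platforms, while one written against `std::int64_t` would not.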
pghafari
ccff5216f0
SWDEV-245532 - HIP - Vulkan interop semaphores
...
Change-Id: I89be5ee84d4728d76e1987d5016c944c0dbc9a96
2021-05-11 11:44:59 -04:00
Aaron En Ye Shi
688693e9fd
SWDEV-283266 - __HIPCC_RTC__ macro added to HIP headers
...
For hipRTC on Windows, add macro __HIPCC_RTC__ to allow
online compilation with device functions, excluding standard
C/C++ headers, system headers, and host HIP APIs.
Change-Id: I1d91f042baf1359856ec83ab7030dc58785e0334
2021-05-10 17:28:07 +00:00
Anusha Godavarthy Surya
8cc0e04239
SWDEV-240806 - Initial commit for hipGraph and stream capture infrastructure
...
After hipStreamBeginCapture, the parameters passed to APIs are captured, and the respective node is created and added to the graph.
All parameters are passed to the STREAM_CAPTURE macro, which checks whether the stream is in capture mode and, if so, redirects the call to the capture function and returns.
Updated hipStream and hipEvent with capture parameters.
Added handling for hipStreamBeginCapture and hipStreamEndCapture.
Change-Id: Ic8926a7b4336c2cc81f0b3a9a224aa392c474134
2021-05-07 17:38:16 -04:00
Anusha GodavarthySurya
e9cc056d25
SWDEV-271978 - Function __byte_perm to match CUDA behavior
...
Selector indices are as follows (the upper 16 bits of the selector are not used):
selector[0] = s<2:0>
selector[1] = s<6:4>
selector[2] = s<10:8>
selector[3] = s<14:12>
Change-Id: Ibf76c6ec2374f1f5b9bba8bd9dbd73660f830eea
2021-05-05 03:00:32 -04:00
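The selector layout given in the __byte_perm commit above can be modeled on the host as follows. This is a minimal sketch, assuming the CUDA convention that bytes 0-3 come from the first argument and bytes 4-7 from the second. `byte_perm_ref` is a hypothetical name and is not the HIP device implementation.

```cpp
#include <cstdint>

// Host-side reference model of the selector layout (hypothetical helper).
// Assumption: bytes 0-3 are taken from x and bytes 4-7 from y.
constexpr std::uint32_t byte_perm_ref(std::uint32_t x, std::uint32_t y,
                                      std::uint32_t s) {
    // Concatenate the two inputs into one 8-byte source value.
    const std::uint64_t bytes = (static_cast<std::uint64_t>(y) << 32) | x;
    std::uint32_t result = 0;
    for (int i = 0; i < 4; ++i) {
        // selector[i] is the low 3 bits of the i-th nibble of s;
        // bit 3 of each nibble and the upper 16 bits of s are ignored.
        const std::uint32_t sel = (s >> (4 * i)) & 0x7u;
        const std::uint32_t b =
            static_cast<std::uint32_t>((bytes >> (8 * sel)) & 0xFFu);
        result |= b << (8 * i);
    }
    return result;
}
```

With x = 0x33221100 and y = 0x77665544, the selector 0x3210 returns x unchanged, 0x7654 returns y, and 0x0123 reverses the bytes of x.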
kjayapra-amd
31d412c292
SWDEV-245533 - Implementing Mipmap array APIs
...
Change-Id: I222c8d0fcbd50cdefd702462e689463bb36366d2
2021-04-30 16:33:43 -04:00
Satyanvesh Dittakavi
cc39ceab25
SWDEV-283388 - Fix hipMemRangeGetAttributes on Nvidia Platform
...
Change-Id: I5daeacd9dd5c6ce7f914d6e6e45dd41fb2a675a5
hipMemRangeGetAttributes was returning hipErrorInvalidValue due to improper
mapping of the arguments to cudaMemRangeGetAttributes.
2021-04-29 06:06:03 -04:00
Satyanvesh Dittakavi
120436423c
SWDEV-281789 - Add missing HMM attributes on CUDA path
...
Change-Id: I11167eea006301e01a1f91708bf6d0ac832d8b7f
2021-04-26 13:54:01 -04:00
Yaxun (Sam) Liu
37b90c2f64
SWDEV-255234 - remove workaround for warpSize after clang fix is released
...
The workaround is no longer needed as the compiler emits the macro itself.
Change-Id: I6a06e73353628bcea71f6bd23503ab6d8e42a29d
2021-04-26 08:49:15 -04:00
Anusha GodavarthySurya
c23586440b
SWDEV-282361 - HIP support for NVRTC
...
Change-Id: I82869af8194b595f9a2fb64ad034630786189335
2021-04-20 09:28:58 -07:00
Tao Sang
20fd6b8a66
SWDEV-279657 - Fix hipMallocManaged-N256M failure
...
Add concurrentManagedAccess detection in hipMallocManaged test.
Skip test when device doesn't support concurrentManagedAccess.
Change-Id: Ie54046feef3baba857a7068972ec1fc1a60c2dfd
2021-04-20 09:07:31 -04:00
Sourabh Betigeri
0a0b026304
SWDEV-256723 - Added group partitioning feature in ROCm CG.
...
Change-Id: Ie54046feef3baba857a7068972ec1fc0a60c2df9
2021-04-19 18:53:17 -04:00
Julia Jiang
cb22874ebd
SWDEV-270961 - Update hip_programming_guide.md for event handling
...
Change-Id: Ieadac9972e5ee13c05ccb42a679866f494f96f47
2021-04-16 12:14:25 -04:00
Satyanvesh Dittakavi
009543ec7a
SWDEV-281504 - Rename structure names to use reserved names in HIP headers
...
HIP headers use a few structure names such as X, Y, and Z. This causes
compilation issues when applications define the same names as macros.
Renamed the structs to use reserved names such as
__X, __Y, and __Z.
Change-Id: I59416c3734f274e853c87d4856b7e616f6cee5f5
2021-04-15 09:52:58 -04:00
Tao Sang
2c0b63059b
SWDEV-273138 - Rearrange device attributes
...
Change-Id: Iab5902c9a2702456fbe437fdefc40541c48abb74
2021-04-12 17:05:27 -04:00
kjayapra-amd
61d1eaa522
SWDEV-274276 - Implement system scope atomics to _system builtins. Default atomics will map to agent.
...
Change-Id: I63b13063274418e96d4be0159c779127b166bea5
(cherry picked from commit 7c4ae8ca1daac0b75ac36c4e4a45b8d13f219d14)
2021-04-09 10:23:32 -04:00
Sarbojit Sarkar
daf0cbca3c
SWDEV-260454 - Fix for identifying bad streams
...
Change-Id: I122ff6f47535c3c76ab56ba49ab1450ef886b15f
2021-04-07 12:13:30 -04:00
Sarbojit Sarkar
9cbb283771
SWDEV-280131 - Fix __half2 member access bug
...
Change-Id: Ic11750d18e4ba47446a7e494c3089a19cb4df783
2021-04-07 12:12:02 -04:00
pghafari
a9304f6111
SWDEV-245532 - HIP - Vulkan interop
...
Change-Id: Iba1ef8112e318b4f099da5a4a4602e0dae7de9e3
2021-03-31 10:23:31 -04:00
Sarbojit Sarkar
b7a03695c3
SWDEV-277926 - Updated deprecated APIs in CUDA
...
Change-Id: I9ff655af227b245067eb760325c395bb562696ee
2021-03-29 01:19:19 -04:00
Satyanvesh Dittakavi
68abfb4aff
SWDEV-277152 - Add hipMemcpy2DToArrayAsync API in headers
...
Also address a few scenarios for hipMemcpy2DToArray/Async to return
proper error types.
Change-Id: Idbadf666a5e7bebc0f3ea8b7048b79208e04cf6b
2021-03-27 13:50:38 -04:00
jujiang
cc69e95c1a
SWDEV-277423 - Correct hipStreamWaitValue doxygen format
...
Change-Id: I8914c14cb7a05d5d0bdfa054af14765c872c70a6
2021-03-19 14:10:41 -04:00
Tao Sang
08625144b3
SWDEV-273138 - Update canUseStreamWaitValue retrieval
...
Remove canUseStreamWaitValue from hipDeviceProp_t.
Update canUseStreamWaitValue retrieval in hipDeviceGetAttribute().
Change-Id: I275bd42d8b03e141caaf9f1bc6b714e51ca72dfb
2021-03-18 22:42:35 -04:00
cjatin
06fa63df04
SWDEV-272811 - Using OCML functions for conversion of fp to int
...
Change-Id: Id0c02bf6a2e0553d522035b078a9db4cd3fad215
2021-03-18 11:43:30 -04:00
Ravi C Akkenapally
397bac37a3
SWDEV-276827 - Stream Operations: match API with CUDA
...
Change-Id: Ia79a950561e97df3b2229f76f5512dede2babf98
2021-03-18 07:52:29 -07:00
Tao Sang
629ea5fe19
SWDEV-273943 - Enable runtimeApi.event dtest
...
Add device_id_ in hip::event to match cuda behaviour in
hipEventQuery() and hipEventRecord().
Enable hipEventElapsedTime test on AMD platform.
Work around sporadic crash of hipEventIpc test due to
a bug in event IPC.
Add missing hipEventDestroy() in some event tests.
Fix some logic code errors.
Fix typo in comment.
Change-Id: I9ec74c475161b3e31df48d193449023e921f2924
2021-03-16 17:12:08 -04:00
jujiang
3126196cca
SWDEV-270961 - Add doxygen for Texture APIs
...
Change-Id: I4038320a638292440d9ca8c23ffcabffcd9a48c2
2021-03-09 16:23:56 -05:00
Satyanvesh Dittakavi
daafd303ec
SWDEV-275317 - Fix hipMemcpy2DFromArray to return proper error types
...
Change-Id: Id722865d81460bfd2fa6ad7a198b6e3c2a08265a
2021-03-09 05:22:23 -05:00
Satyanvesh Dittakavi
37d4677aba
SWDEV-274404 - Add hipDrvMemcpy3D* and hipMemcpy2DFromArray* APIs on HIP CUDA
...
Change-Id: I4aba2bff60a7bae6b01b6e471968227b0df8e192
2021-03-08 13:14:25 -05:00
Arya.Rafii
8cb232ddef
SWDEV-269246 - Implementation of hipDrvMemcpy2DUnaligned
...
Change-Id: I86fdd3b930cbd74c45cd31944f0ea52c0ff65a59
2021-03-04 14:38:06 -05:00
Arya.Rafii
9f142f988d
SWDEV-269246 - Added missing exports for hipArrayDestroy
...
Change-Id: I730814720bb618e895efbe2f412e437110bed8bc
2021-03-04 14:29:16 -05:00
Ravi C Akkenapally
fd0f49503f
SWDEV-179105 - Stream Operations: Add support for Wait and Write
...
Change-Id: I9d6d0665d12b62fe705ce1569a0e8264a4f23ab7
2021-03-02 12:15:46 -08:00
Vladislav Sytchenko
1f8bdcaf20
SWDEV-232428 - Enable device printf for Windows
...
Change-Id: Id72cfc34cfcfcbcb459bd8d84979637eaf7947dd
2021-02-26 11:46:38 -05:00
Satyanvesh Dittakavi
c2a767a66e
SWDEV-271491 - Fix flags for hipSetDeviceFlags on HIP-CUDA path
...
Change-Id: I29446d5cc5a26a4b83fa45175ccdf1d8f3a9ea40
2021-02-25 12:06:33 -05:00