Commit graph

910 commits

Author SHA1 Message Date
Maneesh Gupta 95fe4b770f Merge pull request #863 from aaronenyeshi/add-dot-instructions
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Aaron Enye Shi 2ab6fd374c Fix mixed dot for Jenkins rocm_2_0 2019-01-24 22:48:35 +00:00
Aaron Enye Shi b63597bb95 Use correct OCKL native vector types 2019-01-22 22:31:19 +00:00
Aaron Enye Shi b0e0ec3c67 Allow hip-clang to use amd_mixed_dot as well 2019-01-22 16:21:16 +00:00
Evgeny 47625cb8fd fixing HSA_INIT_API cid args 2019-01-16 23:45:44 -06:00
Maneesh Gupta f74816efe6 Map more instrincis to ocml native
Change-Id: I54e55d5902931bafdabd0956e4e8c1d7b39f7173
2019-01-15 19:40:37 +05:30
Aaron Enye Shi bdb84f3d9c Add backwards compatibility to dot funcs 2019-01-14 23:14:49 +00:00
Aaron Enye Shi 7ba7d9a0f6 Move fp16 dot functions to hip_fp16 2019-01-14 21:50:32 +00:00
Maneesh Gupta 54e4463d16 Switch to faster ocml variants for trig instrinsics
Change-Id: If62821e2fe1b0da91ad1b8c5580ebf1a009405e9
2019-01-14 13:01:57 +05:30
Aaron Enye Shi 84a028b9e1 Add Dot functions as amd_mixed_dot function
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Laurent Morichetti 167bbffc4f Fix compilation issues with MSVC 2019-01-09 09:32:50 -08:00
Alex Voicu 3470692998 Hook into the creaky lazy-reinit machinery. Try to minimise race-risk. 2019-01-01 11:01:16 +05:30
Alex Voicu 3e8f198b58 Start re-working 731 for 2.0. 2019-01-01 11:00:57 +05:30
Maneesh Gupta aa932d542a Merge pull request #818 from yxsamliu/fix-hip-config
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu 57d1f6e6cb Use __dparm for default parameter 2018-12-31 13:07:19 -05:00
Yaxun Sam Liu 02deeadaa4 Add default arguments for hipConfigureCall 2018-12-18 15:48:21 -05:00
Aaron Enye Shi 2d50392c17 Add *_rn functions back into HIP intrinsics
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan 4251a70bc0 Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
This reverts commit 299b9d1382.
2018-11-29 11:38:37 -05:00
Siu Chi Chan 30a9be5643 Revert "Handle (odd) corner case of argumentless __global__ function."
This reverts commit 927b06f8d4.
2018-11-29 11:38:34 -05:00
Maneesh Gupta 99bb89b756 Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron e9b88711e2 Merge branch 'master' into getsymboladdress 2018-11-20 12:03:22 +01:00
Michael Kuron f71b8cbc7d Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc 2018-11-15 09:48:00 +01:00
Maneesh Gupta 0c72d32b44 Merge pull request #756 from yxsamliu/fix-heap
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root 0774b275c8 rebase to master, tracer related changes 2018-11-13 15:50:34 +00:00
Evgeny 1e49cff87a specialized spawner object 2018-11-13 15:33:26 +00:00
Evgeny e362688adf renaming HIP_INIT_CB_API to HIP_INIT_API 2018-11-13 15:33:26 +00:00
Evgeny 8c88932f30 hip_prof_(api/str).h moving under hcc_detail dir 2018-11-13 15:33:26 +00:00
Evgeny d35567f40a compilation fix - struct name fix 2018-11-13 15:33:26 +00:00
Evgeny 2f2c955ae0 fix 2018-11-13 15:33:26 +00:00
Evgeny 084a68be63 adding activity prof layer 2018-11-13 15:33:26 +00:00
Yaxun Sam Liu c44f2795d5 Define __hip_device_heap in header for hip-clang only 2018-11-12 16:39:41 -05:00
Alex Voicu 927b06f8d4 Handle (odd) corner case of argumentless __global__ function. 2018-11-12 00:32:35 +00:00
Alex Voicu 392e2ba1ba Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-11-08 06:53:06 -08:00
Aaron Enye Shi 7b3bbc85c5 Guard rcp rounded implementation as well
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi 9aa92238ab Guard the OCML rounded operations instead
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron 4da2d92281 Introduce ihipModuleGetGlobal 2018-11-06 09:54:34 +01:00
Aaron Enye Shi cef6e8ef1f Remove non-working non-default-rounded math apis
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron 8049cdafab Document return values of hipMemcpyToSymbol, hipGetSymbolAddress 2018-11-04 11:47:17 +01:00
Michael Kuron 0b6f5791f8 Implement hipGetSymbolAddress and hipGetSymbolSize 2018-11-04 10:39:34 +01:00
Alex Voicu 1c525d7222 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-11-01 23:17:27 +00:00
Siu Chi Chan cdd0109e70 Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Jeff Daily 6eac038694 typedef struct hipFuncAttributes now C compatible.
Fixes #591.
Fixes #694.
2018-10-31 14:12:45 +05:30
Maneesh Gupta c7f2198c6e Merge pull request #714 from aaronenyeshi/remove-static-hip-device
Remove libhip_device.a static library
2018-10-31 14:08:29 +05:30
Maneesh Gupta 8f667984af Merge pull request #713 from scchan/master-hcc_early_finalize_support
Support more than one device code blobs in a single .kernel section
2018-10-31 14:04:06 +05:30
Alex Voicu 000766850d Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-10-30 23:34:46 +00:00
Alex Voicu b68caf7653 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-10-28 17:02:10 +00:00
Alex Voicu 299b9d1382 Rely on code object metadat for kernarg arguments alignof and sizeof. 2018-10-28 17:01:00 +00:00
Aaron Enye Shi ba43d1ce1b Remove libhip_device.a static library
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .
2018-10-25 19:52:35 +00:00
Alex Voicu a743939900 hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed. 2018-10-25 13:32:17 +01:00
Maneesh Gupta 05beeec84d Merge pull request #723 from mangupta/fix_double_shfl
Fix logic for double variants of __shfl*
2018-10-25 06:01:38 +05:30