Wykres commitów

905 Commity

Autor SHA1 Wiadomość Data
Aaron Enye Shi b63597bb95 Use correct OCKL native vector types 2019-01-22 22:31:19 +00:00
Aaron Enye Shi b0e0ec3c67 Allow hip-clang to use amd_mixed_dot as well 2019-01-22 16:21:16 +00:00
Aaron Enye Shi bdb84f3d9c Add backwards compatibility to dot funcs 2019-01-14 23:14:49 +00:00
Aaron Enye Shi 7ba7d9a0f6 Move fp16 dot functions to hip_fp16 2019-01-14 21:50:32 +00:00
Aaron Enye Shi 84a028b9e1 Add Dot functions as amd_mixed_dot function
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Laurent Morichetti 167bbffc4f Fix compilation issues with MSVC 2019-01-09 09:32:50 -08:00
Alex Voicu 3470692998 Hook into the creaky lazy-reinit machinery. Try to minimise race-risk. 2019-01-01 11:01:16 +05:30
Alex Voicu 3e8f198b58 Start re-working 731 for 2.0. 2019-01-01 11:00:57 +05:30
Maneesh Gupta aa932d542a Merge pull request #818 from yxsamliu/fix-hip-config
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu 57d1f6e6cb Use __dparm for default parameter 2018-12-31 13:07:19 -05:00
Yaxun Sam Liu 02deeadaa4 Add default arguments for hipConfigureCall 2018-12-18 15:48:21 -05:00
Aaron Enye Shi 2d50392c17 Add *_rn functions back into HIP intrinsics
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan 4251a70bc0 Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
This reverts commit 299b9d1382.
2018-11-29 11:38:37 -05:00
Siu Chi Chan 30a9be5643 Revert "Handle (odd) corner case of argumentless __global__ function."
This reverts commit 927b06f8d4.
2018-11-29 11:38:34 -05:00
Maneesh Gupta 99bb89b756 Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron e9b88711e2 Merge branch 'master' into getsymboladdress 2018-11-20 12:03:22 +01:00
Michael Kuron f71b8cbc7d Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc 2018-11-15 09:48:00 +01:00
Maneesh Gupta 0c72d32b44 Merge pull request #756 from yxsamliu/fix-heap
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root 0774b275c8 rebase to master, tracer related changes 2018-11-13 15:50:34 +00:00
Evgeny 1e49cff87a specialized spawner object 2018-11-13 15:33:26 +00:00
Evgeny e362688adf renaming HIP_INIT_CB_API to HIP_INIT_API 2018-11-13 15:33:26 +00:00
Evgeny 8c88932f30 hip_prof_(api/str).h moving under hcc_detail dir 2018-11-13 15:33:26 +00:00
Evgeny d35567f40a compilation fix - struct name fix 2018-11-13 15:33:26 +00:00
Evgeny 2f2c955ae0 fix 2018-11-13 15:33:26 +00:00
Evgeny 084a68be63 adding activity prof layer 2018-11-13 15:33:26 +00:00
Yaxun Sam Liu c44f2795d5 Define __hip_device_heap in header for hip-clang only 2018-11-12 16:39:41 -05:00
Alex Voicu 927b06f8d4 Handle (odd) corner case of argumentless __global__ function. 2018-11-12 00:32:35 +00:00
Alex Voicu 392e2ba1ba Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-11-08 06:53:06 -08:00
Aaron Enye Shi 7b3bbc85c5 Guard rcp rounded implementation as well
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi 9aa92238ab Guard the OCML rounded operations instead
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron 4da2d92281 Introduce ihipModuleGetGlobal 2018-11-06 09:54:34 +01:00
Aaron Enye Shi cef6e8ef1f Remove non-working non-default-rounded math apis
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron 8049cdafab Document return values of hipMemcpyToSymbol, hipGetSymbolAddress 2018-11-04 11:47:17 +01:00
Michael Kuron 0b6f5791f8 Implement hipGetSymbolAddress and hipGetSymbolSize 2018-11-04 10:39:34 +01:00
Alex Voicu 1c525d7222 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-11-01 23:17:27 +00:00
Siu Chi Chan cdd0109e70 Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Jeff Daily 6eac038694 typedef struct hipFuncAttributes now C compatible.
Fixes #591.
Fixes #694.
2018-10-31 14:12:45 +05:30
Maneesh Gupta c7f2198c6e Merge pull request #714 from aaronenyeshi/remove-static-hip-device
Remove libhip_device.a static library
2018-10-31 14:08:29 +05:30
Maneesh Gupta 8f667984af Merge pull request #713 from scchan/master-hcc_early_finalize_support
Support more than one device code blobs in a single .kernel section
2018-10-31 14:04:06 +05:30
Alex Voicu 000766850d Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-10-30 23:34:46 +00:00
Alex Voicu b68caf7653 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-10-28 17:02:10 +00:00
Alex Voicu 299b9d1382 Rely on code object metadat for kernarg arguments alignof and sizeof. 2018-10-28 17:01:00 +00:00
Aaron Enye Shi ba43d1ce1b Remove libhip_device.a static library
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .
2018-10-25 19:52:35 +00:00
Alex Voicu a743939900 hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed. 2018-10-25 13:32:17 +01:00
Maneesh Gupta 05beeec84d Merge pull request #723 from mangupta/fix_double_shfl
Fix logic for double variants of __shfl*
2018-10-25 06:01:38 +05:30
Maneesh Gupta 7b6d232672 Fix logic for double variants of __shfl*
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417
2018-10-24 12:39:09 +05:30
Maneesh Gupta b01ac26948 Make HIP functional again with HCC from ROCm 1.9.x
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3
2018-10-24 10:41:56 +05:30
Maneesh Gupta 407e092a13 Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
Feature minimal changes for hc next
2018-10-19 06:58:31 +05:30
Siu Chi Chan f39873c584 Support more than one bundles in a single .kernel section
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object now may
contain more than one device code bundles.  This is to
teach the HIP runtime to correctly extract all the
bundles from the .kernel section.
2018-10-18 17:00:27 -04:00
Alex Voicu 3678063598 Dumb workaround is still needed, so add it back. 2018-10-18 15:33:46 +01:00