Aaron Enye Shi
b63597bb95
Use correct OCKL native vector types
2019-01-22 22:31:19 +00:00
Aaron Enye Shi
b0e0ec3c67
Allow hip-clang to use amd_mixed_dot as well
2019-01-22 16:21:16 +00:00
Aaron Enye Shi
bdb84f3d9c
Add backwards compatibility to dot funcs
2019-01-14 23:14:49 +00:00
Aaron Enye Shi
7ba7d9a0f6
Move fp16 dot functions to hip_fp16
2019-01-14 21:50:32 +00:00
Aaron Enye Shi
84a028b9e1
Add Dot functions as amd_mixed_dot function
...
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
2019-01-11 23:31:02 +00:00
Laurent Morichetti
167bbffc4f
Fix compilation issues with MSVC
2019-01-09 09:32:50 -08:00
Alex Voicu
3470692998
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
2019-01-01 11:01:16 +05:30
Alex Voicu
3e8f198b58
Start re-working 731 for 2.0.
2019-01-01 11:00:57 +05:30
Maneesh Gupta
aa932d542a
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu
57d1f6e6cb
Use __dparm for default parameter
2018-12-31 13:07:19 -05:00
Yaxun Sam Liu
02deeadaa4
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
2d50392c17
Add *_rn functions back into HIP intrinsics
...
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan
4251a70bc0
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit 299b9d1382 .
2018-11-29 11:38:37 -05:00
Siu Chi Chan
30a9be5643
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit 927b06f8d4 .
2018-11-29 11:38:34 -05:00
Maneesh Gupta
99bb89b756
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron
e9b88711e2
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Michael Kuron
f71b8cbc7d
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
2018-11-15 09:48:00 +01:00
Maneesh Gupta
0c72d32b44
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root
0774b275c8
rebase to master, tracer related changes
2018-11-13 15:50:34 +00:00
Evgeny
1e49cff87a
specialized spawner object
2018-11-13 15:33:26 +00:00
Evgeny
e362688adf
renaming HIP_INIT_CB_API to HIP_INIT_API
2018-11-13 15:33:26 +00:00
Evgeny
8c88932f30
hip_prof_(api/str).h moving under hcc_detail dir
2018-11-13 15:33:26 +00:00
Evgeny
d35567f40a
compilation fix - struct name fix
2018-11-13 15:33:26 +00:00
Evgeny
2f2c955ae0
fix
2018-11-13 15:33:26 +00:00
Evgeny
084a68be63
adding activity prof layer
2018-11-13 15:33:26 +00:00
Yaxun Sam Liu
c44f2795d5
Define __hip_device_heap in header for hip-clang only
2018-11-12 16:39:41 -05:00
Alex Voicu
927b06f8d4
Handle (odd) corner case of argumentless __global__ function.
2018-11-12 00:32:35 +00:00
Alex Voicu
392e2ba1ba
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-08 06:53:06 -08:00
Aaron Enye Shi
7b3bbc85c5
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
9aa92238ab
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
4da2d92281
Introduce ihipModuleGetGlobal
2018-11-06 09:54:34 +01:00
Aaron Enye Shi
cef6e8ef1f
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron
8049cdafab
Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
2018-11-04 11:47:17 +01:00
Michael Kuron
0b6f5791f8
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-04 10:39:34 +01:00
Alex Voicu
1c525d7222
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-01 23:17:27 +00:00
Siu Chi Chan
cdd0109e70
Move the global arrays for hip malloc/free
...
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host
Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Jeff Daily
6eac038694
typedef struct hipFuncAttributes now C compatible.
...
Fixes #591 .
Fixes #694 .
2018-10-31 14:12:45 +05:30
Maneesh Gupta
c7f2198c6e
Merge pull request #714 from aaronenyeshi/remove-static-hip-device
...
Remove libhip_device.a static library
2018-10-31 14:08:29 +05:30
Maneesh Gupta
8f667984af
Merge pull request #713 from scchan/master-hcc_early_finalize_support
...
Support more than one device code blobs in a single .kernel section
2018-10-31 14:04:06 +05:30
Alex Voicu
000766850d
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-10-30 23:34:46 +00:00
Alex Voicu
b68caf7653
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-10-28 17:02:10 +00:00
Alex Voicu
299b9d1382
Rely on code object metadat for kernarg arguments alignof and sizeof.
2018-10-28 17:01:00 +00:00
Aaron Enye Shi
ba43d1ce1b
Remove libhip_device.a static library
...
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .
2018-10-25 19:52:35 +00:00
Alex Voicu
a743939900
hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed.
2018-10-25 13:32:17 +01:00
Maneesh Gupta
05beeec84d
Merge pull request #723 from mangupta/fix_double_shfl
...
Fix logic for double variants of __shfl*
2018-10-25 06:01:38 +05:30
Maneesh Gupta
7b6d232672
Fix logic for double variants of __shfl*
...
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417
2018-10-24 12:39:09 +05:30
Maneesh Gupta
b01ac26948
Make HIP functional again with HCC from ROCm 1.9.x
...
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3
2018-10-24 10:41:56 +05:30
Maneesh Gupta
407e092a13
Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
...
Feature minimal changes for hc next
2018-10-19 06:58:31 +05:30
Siu Chi Chan
f39873c584
Support more than one bundles in a single .kernel section
...
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object now may
contain more than one device code bundles. This is to
teach the HIP runtime to correctly extract all the
bundles from the .kernel section.
2018-10-18 17:00:27 -04:00
Alex Voicu
3678063598
Dumb workaround is still needed, so add it back.
2018-10-18 15:33:46 +01:00