Aaron Enye Shi
896b320c5f
Add Dot functions as amd_mixed_dot function
...
Introduce the Dot functions which are available in the device library. Forward those prototypes, and introduce HIP API to expose the usage of the dot functions.
[ROCm/clr commit: 84a028b9e1 ]
2019-01-11 23:31:02 +00:00
Laurent Morichetti
c292915466
Fix compilation issues with MSVC
...
[ROCm/clr commit: 167bbffc4f ]
2019-01-09 09:32:50 -08:00
Alex Voicu
cfed8191f1
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
...
[ROCm/clr commit: 3470692998 ]
2019-01-01 11:01:16 +05:30
Alex Voicu
eebc544c62
Start re-working 731 for 2.0.
...
[ROCm/clr commit: 3e8f198b58 ]
2019-01-01 11:00:57 +05:30
Maneesh Gupta
5b8b5b842b
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
[ROCm/clr commit: aa932d542a ]
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu
b2d9783c2f
Use __dparm for default parameter
...
[ROCm/clr commit: 57d1f6e6cb ]
2018-12-31 13:07:19 -05:00
Yaxun Sam Liu
51702bed29
Add default arguments for hipConfigureCall
...
[ROCm/clr commit: 02deeadaa4 ]
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
56262f0970
Add *_rn functions back into HIP intrinsics
...
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
[ROCm/clr commit: 2d50392c17 ]
2018-12-18 19:31:54 +00:00
Siu Chi Chan
bda6edc929
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit 1ec78f6d67 .
[ROCm/clr commit: 4251a70bc0 ]
2018-11-29 11:38:37 -05:00
Siu Chi Chan
732088e131
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit f843c8dd48 .
[ROCm/clr commit: 30a9be5643 ]
2018-11-29 11:38:34 -05:00
Maneesh Gupta
938c731b27
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
[ROCm/clr commit: 99bb89b756 ]
2018-11-23 11:08:25 +05:30
Michael Kuron
4b77b5c1ba
Merge branch 'master' into getsymboladdress
...
[ROCm/clr commit: e9b88711e2 ]
2018-11-20 12:03:22 +01:00
Michael Kuron
4a646eed6c
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
...
[ROCm/clr commit: f71b8cbc7d ]
2018-11-15 09:48:00 +01:00
Maneesh Gupta
9b07e74cd3
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
[ROCm/clr commit: 0c72d32b44 ]
2018-11-14 13:22:31 +05:30
root
873c51bdf5
rebase to master, tracer related changes
...
[ROCm/clr commit: 0774b275c8 ]
2018-11-13 15:50:34 +00:00
Evgeny
f6d8fa5487
specialized spawner object
...
[ROCm/clr commit: 1e49cff87a ]
2018-11-13 15:33:26 +00:00
Evgeny
96b7f0404e
renaming HIP_INIT_CB_API to HIP_INIT_API
...
[ROCm/clr commit: e362688adf ]
2018-11-13 15:33:26 +00:00
Evgeny
5aba14f601
hip_prof_(api/str).h moving under hcc_detail dir
...
[ROCm/clr commit: 8c88932f30 ]
2018-11-13 15:33:26 +00:00
Evgeny
eae2f875b6
compilation fix - struct name fix
...
[ROCm/clr commit: d35567f40a ]
2018-11-13 15:33:26 +00:00
Evgeny
3be7e99753
fix
...
[ROCm/clr commit: 2f2c955ae0 ]
2018-11-13 15:33:26 +00:00
Evgeny
7519b3f75e
adding activity prof layer
...
[ROCm/clr commit: 084a68be63 ]
2018-11-13 15:33:26 +00:00
Yaxun Sam Liu
857cfa32c7
Define __hip_device_heap in header for hip-clang only
...
[ROCm/clr commit: c44f2795d5 ]
2018-11-12 16:39:41 -05:00
Alex Voicu
f843c8dd48
Handle (odd) corner case of argumentless __global__ function.
...
[ROCm/clr commit: 927b06f8d4 ]
2018-11-12 00:32:35 +00:00
Alex Voicu
c4acf6e8cc
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/clr commit: 392e2ba1ba ]
2018-11-08 06:53:06 -08:00
Aaron Enye Shi
890beb81d6
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
[ROCm/clr commit: 7b3bbc85c5 ]
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
4480bb6d06
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
[ROCm/clr commit: 9aa92238ab ]
2018-11-06 16:32:14 +00:00
Michael Kuron
a857354072
Introduce ihipModuleGetGlobal
...
[ROCm/clr commit: 4da2d92281 ]
2018-11-06 09:54:34 +01:00
Aaron Enye Shi
5c1dc7a071
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
[ROCm/clr commit: cef6e8ef1f ]
2018-11-05 22:34:16 +00:00
Michael Kuron
15353a7b26
Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
...
[ROCm/clr commit: 8049cdafab ]
2018-11-04 11:47:17 +01:00
Michael Kuron
a38366c81e
Implement hipGetSymbolAddress and hipGetSymbolSize
...
[ROCm/clr commit: 0b6f5791f8 ]
2018-11-04 10:39:34 +01:00
Alex Voicu
e3c9869fbe
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/clr commit: 1c525d7222 ]
2018-11-01 23:17:27 +00:00
Siu Chi Chan
e26a728173
Move the global arrays for hip malloc/free
...
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host
Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
[ROCm/clr commit: cdd0109e70 ]
2018-11-01 16:20:35 -04:00
Jeff Daily
34c5d69f7c
typedef struct hipFuncAttributes now C compatible.
...
Fixes #591 .
Fixes #694 .
[ROCm/clr commit: 6eac038694 ]
2018-10-31 14:12:45 +05:30
Maneesh Gupta
130edf2c43
Merge pull request #714 from aaronenyeshi/remove-static-hip-device
...
Remove libhip_device.a static library
[ROCm/clr commit: c7f2198c6e ]
2018-10-31 14:08:29 +05:30
Maneesh Gupta
58aec2b859
Merge pull request #713 from scchan/master-hcc_early_finalize_support
...
Support more than one device code blobs in a single .kernel section
[ROCm/clr commit: 8f667984af ]
2018-10-31 14:04:06 +05:30
Alex Voicu
1c6b92bbfb
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/clr commit: 000766850d ]
2018-10-30 23:34:46 +00:00
Alex Voicu
44f0ee9cff
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/clr commit: b68caf7653 ]
2018-10-28 17:02:10 +00:00
Alex Voicu
1ec78f6d67
Rely on code object metadat for kernarg arguments alignof and sizeof.
...
[ROCm/clr commit: 299b9d1382 ]
2018-10-28 17:01:00 +00:00
Aaron Enye Shi
1c05b06bcd
Remove libhip_device.a static library
...
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .
[ROCm/clr commit: ba43d1ce1b ]
2018-10-25 19:52:35 +00:00
Alex Voicu
5a6601c62a
hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed.
...
[ROCm/clr commit: a743939900 ]
2018-10-25 13:32:17 +01:00
Maneesh Gupta
bd0e1c87b7
Merge pull request #723 from mangupta/fix_double_shfl
...
Fix logic for double variants of __shfl*
[ROCm/clr commit: 05beeec84d ]
2018-10-25 06:01:38 +05:30
Maneesh Gupta
b6f6734285
Fix logic for double variants of __shfl*
...
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417
[ROCm/clr commit: 7b6d232672 ]
2018-10-24 12:39:09 +05:30
Maneesh Gupta
127f7d0320
Make HIP functional again with HCC from ROCm 1.9.x
...
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3
[ROCm/clr commit: b01ac26948 ]
2018-10-24 10:41:56 +05:30
Maneesh Gupta
b859ab46df
Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
...
Feature minimal changes for hc next
[ROCm/clr commit: 407e092a13 ]
2018-10-19 06:58:31 +05:30
Siu Chi Chan
5d1a265efc
Support more than one bundles in a single .kernel section
...
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object now may
contain more than one device code bundles. This is to
teach the HIP runtime to correctly extract all the
bundles from the .kernel section.
[ROCm/clr commit: f39873c584 ]
2018-10-18 17:00:27 -04:00
Alex Voicu
35e9dfc593
Dumb workaround is still needed, so add it back.
...
[ROCm/clr commit: 3678063598 ]
2018-10-18 15:33:46 +01:00
Alex Voicu
c505c2be38
Re-sync with upstream.
...
[ROCm/clr commit: 602280eb8b ]
2018-10-18 12:27:03 +01:00
Maneesh Gupta
5070b00c83
Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
...
Use sinf and cosf from ocml device libs
[ROCm/clr commit: 30523b72a2 ]
2018-10-18 16:39:20 +05:30
Maneesh Gupta
cf95358003
Merge pull request #692 from whchung/hip-reinit-take2
...
HIP program state re-initialization logic (take 2)
[ROCm/clr commit: 9143ae6bdb ]
2018-10-18 12:06:41 +05:30
Maneesh Gupta
bb3706b6f6
Merge pull request #703 from mangupta/stream_create_with_priority
...
Implementation for stream priority
[ROCm/clr commit: 95be669f4a ]
2018-10-17 10:53:43 +05:30