Alex Voicu
ec14daa7ce
Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
2019-01-01 11:01:16 +05:30
Alex Voicu
25c7e5d609
Start re-working 731 for 2.0.
2019-01-01 11:00:57 +05:30
Maneesh Gupta
4d0266a495
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu
548f4dd4e5
Use __dparm for default parameter
2018-12-31 13:07:19 -05:00
Yaxun Sam Liu
ec03a8b352
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
58dfeff27a
Add *_rn functions back into HIP intrinsics
...
Add the round-to-nearest-even intrinsics back to the HIP math
intrinsics, as their removal caused a regression.
2018-12-18 19:31:54 +00:00
Siu Chi Chan
1fbf639962
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit fe1e963299.
2018-11-29 11:38:37 -05:00
Siu Chi Chan
aeca2c8cdc
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit c0bd1a5af8.
2018-11-29 11:38:34 -05:00
Maneesh Gupta
160c509e23
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Michael Kuron
999b62b711
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
2018-11-15 09:48:00 +01:00
Maneesh Gupta
0770c2b8da
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root
091124a766
rebase to master, tracer related changes
2018-11-13 15:50:34 +00:00
Evgeny
32c0008ef6
specialized spawner object
2018-11-13 15:33:26 +00:00
Evgeny
e5ba097afd
renaming HIP_INIT_CB_API to HIP_INIT_API
2018-11-13 15:33:26 +00:00
Evgeny
47f1d059d1
hip_prof_(api/str).h moving under hcc_detail dir
2018-11-13 15:33:26 +00:00
Evgeny
5d6a6a5c6c
compilation fix - struct name fix
2018-11-13 15:33:26 +00:00
Evgeny
ec989ffd96
fix
2018-11-13 15:33:26 +00:00
Evgeny
b8b1637ef7
adding activity prof layer
2018-11-13 15:33:26 +00:00
Yaxun Sam Liu
17ac81b69e
Define __hip_device_heap in header for hip-clang only
2018-11-12 16:39:41 -05:00
Alex Voicu
c0bd1a5af8
Handle (odd) corner case of argumentless __global__ function.
2018-11-12 00:32:35 +00:00
Alex Voicu
aa6815cdc8
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-08 06:53:06 -08:00
Aaron Enye Shi
0cfaa52d15
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
e59c33250a
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
31acf1c268
Introduce ihipModuleGetGlobal
2018-11-06 09:54:34 +01:00
Aaron Enye Shi
2b108a2cfd
Remove non-working non-default-rounded math apis
...
ROCm-Device-Libs has dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma, so OCML no longer provides the rte, rtp, rtn, and rtz counterparts. This commit removes the matching HIP math APIs: _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron
4390c82121
Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
2018-11-04 11:47:17 +01:00
Michael Kuron
73616582d6
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-04 10:39:34 +01:00
Alex Voicu
f28cde3f98
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-11-01 23:17:27 +00:00
Siu Chi Chan
0ff408a56c
Move the global arrays for hip malloc/free
...
from a header into a source file so that
there is only one copy in an executable,
which avoids wasting static memory on the host
Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Jeff Daily
bb447dd76e
typedef struct hipFuncAttributes now C compatible.
...
Fixes #591.
Fixes #694.
2018-10-31 14:12:45 +05:30
Maneesh Gupta
0b86b421c1
Merge pull request #714 from aaronenyeshi/remove-static-hip-device
...
Remove libhip_device.a static library
2018-10-31 14:08:29 +05:30
Maneesh Gupta
7f3e7b5334
Merge pull request #713 from scchan/master-hcc_early_finalize_support
...
Support more than one device code blobs in a single .kernel section
2018-10-31 14:04:06 +05:30
Alex Voicu
d035cb9000
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-10-30 23:34:46 +00:00
Alex Voicu
185fa122ed
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
2018-10-28 17:02:10 +00:00
Alex Voicu
fe1e963299
Rely on code object metadat for kernarg arguments alignof and sizeof.
2018-10-28 17:01:00 +00:00
Aaron Enye Shi
817b27d530
Remove libhip_device.a static library
...
Move the remaining function definitions from device_util.cpp to the hip_runtime.h header. The static library can now be removed completely, as device_util.cpp was the last file that was part of the target hip::hip_device.
2018-10-25 19:52:35 +00:00
Alex Voicu
e4181b85be
hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed.
2018-10-25 13:32:17 +01:00
Maneesh Gupta
f6f160fa6b
Merge pull request #723 from mangupta/fix_double_shfl
...
Fix logic for double variants of __shfl*
2018-10-25 06:01:38 +05:30
Maneesh Gupta
19404e603d
Fix logic for double variants of __shfl*
...
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417
2018-10-24 12:39:09 +05:30
Maneesh Gupta
0703a2d0f0
Make HIP functional again with HCC from ROCm 1.9.x
...
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3
2018-10-24 10:41:56 +05:30
Maneesh Gupta
4a00b244a3
Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
...
Feature minimal changes for hc next
2018-10-19 06:58:31 +05:30
Siu Chi Chan
30ce25e627
Support more than one bundle in a single .kernel section
...
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object may now
contain more than one device code bundle. This change
teaches the HIP runtime to correctly extract all of the
bundles from the .kernel section.
2018-10-18 17:00:27 -04:00
Alex Voicu
5ccaf2fa7d
Dumb workaround is still needed, so add it back.
2018-10-18 15:33:46 +01:00
Alex Voicu
fe959f7bd7
Re-sync with upstream.
2018-10-18 12:27:03 +01:00
Maneesh Gupta
1a5025c57e
Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
...
Use sinf and cosf from ocml device libs
2018-10-18 16:39:20 +05:30
Maneesh Gupta
d133493669
Merge pull request #692 from whchung/hip-reinit-take2
...
HIP program state re-initialization logic (take 2)
2018-10-18 12:06:41 +05:30
Maneesh Gupta
c24b06fa0a
Merge pull request #703 from mangupta/stream_create_with_priority
...
Implementation for stream priority
2018-10-17 10:53:43 +05:30
Maneesh Gupta
dbe4431d98
Merge pull request #702 from aaronenyeshi/fix-missing-irif-lib
...
Replace IRIF fences with atomic_work_item_fence
2018-10-17 10:53:27 +05:30
Maneesh Gupta
64d1cf86b7
Add missing hipHostRegister flags on nvcc path
...
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623
2018-10-15 15:30:24 +05:30