Michael Kuron
c35dfb71d5
Merge branch 'master' into getsymboladdress
...
[ROCm/hip commit: 8610128c3e ]
2018-11-20 12:03:22 +01:00
Michael Kuron
2e8a94e90b
Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
...
[ROCm/hip commit: 999b62b711 ]
2018-11-15 09:48:00 +01:00
Maneesh Gupta
bc85e694c5
Merge pull request #756 from yxsamliu/fix-heap
...
Define __hip_device_heap in header for hip-clang only
[ROCm/hip commit: 0770c2b8da ]
2018-11-14 13:22:31 +05:30
Yaxun Sam Liu
a30829a7bf
Define __hip_device_heap in header for hip-clang only
...
[ROCm/hip commit: 17ac81b69e ]
2018-11-12 16:39:41 -05:00
Alex Voicu
591d4d1b5f
Handle (odd) corner case of argumentless __global__ function.
...
[ROCm/hip commit: c0bd1a5af8 ]
2018-11-12 00:32:35 +00:00
Alex Voicu
9e2269c8db
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/hip commit: aa6815cdc8 ]
2018-11-08 06:53:06 -08:00
Aaron Enye Shi
8bf2429663
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
[ROCm/hip commit: 0cfaa52d15 ]
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
eb4053e3fe
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
[ROCm/hip commit: e59c33250a ]
2018-11-06 16:32:14 +00:00
Michael Kuron
cbba8221ee
Introduce ihipModuleGetGlobal
...
[ROCm/hip commit: 31acf1c268 ]
2018-11-06 09:54:34 +01:00
Aaron Enye Shi
1c6f47ef55
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
[ROCm/hip commit: 2b108a2cfd ]
2018-11-05 22:34:16 +00:00
Michael Kuron
da5df1fd1e
Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
...
[ROCm/hip commit: 4390c82121 ]
2018-11-04 11:47:17 +01:00
Michael Kuron
bc455ccf50
Implement hipGetSymbolAddress and hipGetSymbolSize
...
[ROCm/hip commit: 73616582d6 ]
2018-11-04 10:39:34 +01:00
Alex Voicu
9cd4a9dbb1
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/hip commit: f28cde3f98 ]
2018-11-01 23:17:27 +00:00
Siu Chi Chan
1159b4aa05
Move the global arrays for hip malloc/free
...
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host
Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
[ROCm/hip commit: 0ff408a56c ]
2018-11-01 16:20:35 -04:00
Jeff Daily
c28043bc4f
typedef struct hipFuncAttributes now C compatible.
...
Fixes #591 .
Fixes #694 .
[ROCm/hip commit: bb447dd76e ]
2018-10-31 14:12:45 +05:30
Maneesh Gupta
ac071f5390
Merge pull request #714 from aaronenyeshi/remove-static-hip-device
...
Remove libhip_device.a static library
[ROCm/hip commit: 0b86b421c1 ]
2018-10-31 14:08:29 +05:30
Maneesh Gupta
3986e47f8d
Merge pull request #713 from scchan/master-hcc_early_finalize_support
...
Support more than one device code blobs in a single .kernel section
[ROCm/hip commit: 7f3e7b5334 ]
2018-10-31 14:04:06 +05:30
Alex Voicu
726b654c6d
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/hip commit: d035cb9000 ]
2018-10-30 23:34:46 +00:00
Alex Voicu
c998b7dfc2
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
...
[ROCm/hip commit: 185fa122ed ]
2018-10-28 17:02:10 +00:00
Alex Voicu
6d64f5e112
Rely on code object metadat for kernarg arguments alignof and sizeof.
...
[ROCm/hip commit: fe1e963299 ]
2018-10-28 17:01:00 +00:00
Aaron Enye Shi
5bbd632888
Remove libhip_device.a static library
...
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .
[ROCm/hip commit: 817b27d530 ]
2018-10-25 19:52:35 +00:00
Alex Voicu
147a6bab43
hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed.
...
[ROCm/hip commit: e4181b85be ]
2018-10-25 13:32:17 +01:00
Maneesh Gupta
5333a22ccc
Merge pull request #723 from mangupta/fix_double_shfl
...
Fix logic for double variants of __shfl*
[ROCm/hip commit: f6f160fa6b ]
2018-10-25 06:01:38 +05:30
Maneesh Gupta
dd8ebc9a97
Fix logic for double variants of __shfl*
...
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417
[ROCm/hip commit: 19404e603d ]
2018-10-24 12:39:09 +05:30
Maneesh Gupta
90d15c13a1
Make HIP functional again with HCC from ROCm 1.9.x
...
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3
[ROCm/hip commit: 0703a2d0f0 ]
2018-10-24 10:41:56 +05:30
Maneesh Gupta
a648f1c48e
Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
...
Feature minimal changes for hc next
[ROCm/hip commit: 4a00b244a3 ]
2018-10-19 06:58:31 +05:30
Siu Chi Chan
7ce37f3a08
Support more than one bundles in a single .kernel section
...
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object now may
contain more than one device code bundles. This is to
teach the HIP runtime to correctly extract all the
bundles from the .kernel section.
[ROCm/hip commit: 30ce25e627 ]
2018-10-18 17:00:27 -04:00
Alex Voicu
b8b15ce792
Dumb workaround is still needed, so add it back.
...
[ROCm/hip commit: 5ccaf2fa7d ]
2018-10-18 15:33:46 +01:00
Alex Voicu
de15c6988d
Re-sync with upstream.
...
[ROCm/hip commit: fe959f7bd7 ]
2018-10-18 12:27:03 +01:00
Maneesh Gupta
0dff12a615
Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
...
Use sinf and cosf from ocml device libs
[ROCm/hip commit: 1a5025c57e ]
2018-10-18 16:39:20 +05:30
Maneesh Gupta
1a8b4ce037
Merge pull request #692 from whchung/hip-reinit-take2
...
HIP program state re-initialization logic (take 2)
[ROCm/hip commit: d133493669 ]
2018-10-18 12:06:41 +05:30
Maneesh Gupta
63e6cd228b
Merge pull request #703 from mangupta/stream_create_with_priority
...
Implementation for stream priority
[ROCm/hip commit: c24b06fa0a ]
2018-10-17 10:53:43 +05:30
Maneesh Gupta
a58a408e71
Merge pull request #702 from aaronenyeshi/fix-missing-irif-lib
...
Replace IRIF fences with atomic_work_item_fence
[ROCm/hip commit: dbe4431d98 ]
2018-10-17 10:53:27 +05:30
Maneesh Gupta
83e56b2ed4
Add missing hipHostRegister flags on nvcc path
...
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623
[ROCm/hip commit: 64d1cf86b7 ]
2018-10-15 15:30:24 +05:30
Alex Voicu
69b0baf37f
Minimal should mean minimal.
...
[ROCm/hip commit: 5312336ce2 ]
2018-10-11 00:21:41 +01:00
Alex Voicu
d68ed45708
Address Aaron's comments
...
[ROCm/hip commit: 3e4dbd32a1 ]
2018-10-11 00:03:01 +01:00
Alex Voicu
0d30e15646
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_minimal_changes_for_hc_next
...
[ROCm/hip commit: 4bc40551b5 ]
2018-10-10 11:44:09 +01:00
Alex Voicu
6d0b50e9b9
Re-sync with upstream.
...
[ROCm/hip commit: ca375cb8c5 ]
2018-10-10 11:43:49 +01:00
Maneesh Gupta
b35590db18
Implementation for stream priority
...
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged
Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca
[ROCm/hip commit: da64156fb2 ]
2018-10-05 16:27:46 +05:30
Aaron Enye Shi
188df279a9
Replace IRIF fences with atomic_work_item_fence
...
[ROCm/hip commit: 0787f74ac2 ]
2018-10-04 21:47:28 +00:00
Aaron Enye Shi
ff1c7e9d27
Fix hip_vector_types.h for long long vectors
...
There was a missing long in the declaration for [u]longlongN types.
[ROCm/hip commit: 5dd35576f6 ]
2018-10-03 13:57:52 -04:00
Wen-Heng (Jack) Chung
4e04cf3d2a
HIP program state re-initialization logic
...
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
[ROCm/hip commit: dab1a0f9db ]
2018-09-26 19:48:47 +00:00
Aaron Enye Shi
19d76382d6
Use sinf and cosf from ocml device libs
...
Using llvm_amdgcn builtin fails to produce accurate values, we should move to using the ocml device library versions.
[ROCm/hip commit: 77c07d4118 ]
2018-09-25 19:31:39 +00:00
Maneesh Gupta
70f9bd87c6
Merge pull request #614 from ROCm-Developer-Tools/fma
...
Add overloading resolution functions for fma
[ROCm/hip commit: 3d67c9f952 ]
2018-09-20 13:38:03 +05:30
Yaxun Sam Liu
f6318119a9
Silent warnings about duplicate static keyword
...
static is already in __DEVICE__, so should be removed.
[ROCm/hip commit: a5c961e26c ]
2018-09-19 10:39:45 -04:00
Yaxun Sam Liu
2d4040f75d
Add fma function with float and _Float16 arguments
...
[ROCm/hip commit: bd622a4b4a ]
2018-09-19 09:59:33 -04:00
Yaxun Sam Liu
dcc99ba5c6
Fix build failure of hipTestHalf and hipTestIncludeMath for hip-clang
...
[ROCm/hip commit: cf184460e9 ]
2018-09-18 21:00:15 -04:00
Maneesh Gupta
bec821768d
Merge pull request #672 from iotamudelta/fp16_fix
...
Only LLVM6 and higher contain the necessary intrinsics.
[ROCm/hip commit: 9ee70fca8a ]
2018-09-18 08:43:33 +05:30
Maneesh Gupta
4eba3cdf5c
Merge pull request #674 from mangupta/fix_dtests_on_nvcc
...
[dtests] Fix hipTestClock, hipTestNew, hipTestGlobalVariable, hipSimpleAtomicsTest & hipTestIncludeMath tests on nvcc path
[ROCm/hip commit: 32787fa1fc ]
2018-09-18 07:50:52 +05:30
Maneesh Gupta
e4f978c9d6
Merge pull request #677 from yxsamliu/fix-launch-decay
...
Fix hipLaunchKernelGGL for hip-clang
[ROCm/hip commit: 5cf281071d ]
2018-09-18 07:50:37 +05:30