Граф коммитов

892 Коммитов

Автор SHA1 Сообщение Дата
Siu Chi Chan aeca2c8cdc Revert "Handle (odd) corner case of argumentless __global__ function."
This reverts commit c0bd1a5af8.
2018-11-29 11:38:34 -05:00
Maneesh Gupta 160c509e23 Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Michael Kuron 8610128c3e Merge branch 'master' into getsymboladdress 2018-11-20 12:03:22 +01:00
Michael Kuron 999b62b711 Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc 2018-11-15 09:48:00 +01:00
Maneesh Gupta 0770c2b8da Merge pull request #756 from yxsamliu/fix-heap
Define __hip_device_heap in header for hip-clang only
2018-11-14 13:22:31 +05:30
root 091124a766 rebase to master, tracer related changes 2018-11-13 15:50:34 +00:00
Evgeny 32c0008ef6 specialized spawner object 2018-11-13 15:33:26 +00:00
Evgeny e5ba097afd renaming HIP_INIT_CB_API to HIP_INIT_API 2018-11-13 15:33:26 +00:00
Evgeny 47f1d059d1 hip_prof_(api/str).h moving under hcc_detail dir 2018-11-13 15:33:26 +00:00
Evgeny 5d6a6a5c6c compilation fix - struct name fix 2018-11-13 15:33:26 +00:00
Evgeny ec989ffd96 fix 2018-11-13 15:33:26 +00:00
Evgeny b8b1637ef7 adding activity prof layer 2018-11-13 15:33:26 +00:00
Yaxun Sam Liu 17ac81b69e Define __hip_device_heap in header for hip-clang only 2018-11-12 16:39:41 -05:00
Alex Voicu c0bd1a5af8 Handle (odd) corner case of argumentless __global__ function. 2018-11-12 00:32:35 +00:00
Alex Voicu aa6815cdc8 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-11-08 06:53:06 -08:00
Aaron Enye Shi 0cfaa52d15 Guard rcp rounded implementation as well
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi e59c33250a Guard the OCML rounded operations instead
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron 31acf1c268 Introduce ihipModuleGetGlobal 2018-11-06 09:54:34 +01:00
Aaron Enye Shi 2b108a2cfd Remove non-working non-default-rounded math apis
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Michael Kuron 4390c82121 Document return values of hipMemcpyToSymbol, hipGetSymbolAddress 2018-11-04 11:47:17 +01:00
Michael Kuron 73616582d6 Implement hipGetSymbolAddress and hipGetSymbolSize 2018-11-04 10:39:34 +01:00
Alex Voicu f28cde3f98 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-11-01 23:17:27 +00:00
Siu Chi Chan 0ff408a56c Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9
2018-11-01 16:20:35 -04:00
Jeff Daily bb447dd76e typedef struct hipFuncAttributes now C compatible.
Fixes #591.
Fixes #694.
2018-10-31 14:12:45 +05:30
Maneesh Gupta 0b86b421c1 Merge pull request #714 from aaronenyeshi/remove-static-hip-device
Remove libhip_device.a static library
2018-10-31 14:08:29 +05:30
Maneesh Gupta 7f3e7b5334 Merge pull request #713 from scchan/master-hcc_early_finalize_support
Support more than one device code blobs in a single .kernel section
2018-10-31 14:04:06 +05:30
Alex Voicu d035cb9000 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-10-30 23:34:46 +00:00
Alex Voicu 185fa122ed Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata 2018-10-28 17:02:10 +00:00
Alex Voicu fe1e963299 Rely on code object metadat for kernarg arguments alignof and sizeof. 2018-10-28 17:01:00 +00:00
Aaron Enye Shi 817b27d530 Remove libhip_device.a static library
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .
2018-10-25 19:52:35 +00:00
Alex Voicu e4181b85be hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed. 2018-10-25 13:32:17 +01:00
Maneesh Gupta f6f160fa6b Merge pull request #723 from mangupta/fix_double_shfl
Fix logic for double variants of __shfl*
2018-10-25 06:01:38 +05:30
Maneesh Gupta 19404e603d Fix logic for double variants of __shfl*
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417
2018-10-24 12:39:09 +05:30
Maneesh Gupta 0703a2d0f0 Make HIP functional again with HCC from ROCm 1.9.x
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3
2018-10-24 10:41:56 +05:30
Maneesh Gupta 4a00b244a3 Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
Feature minimal changes for hc next
2018-10-19 06:58:31 +05:30
Siu Chi Chan 30ce25e627 Support more than one bundles in a single .kernel section
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object now may
contain more than one device code bundles.  This is to
teach the HIP runtime to correctly extract all the
bundles from the .kernel section.
2018-10-18 17:00:27 -04:00
Alex Voicu 5ccaf2fa7d Dumb workaround is still needed, so add it back. 2018-10-18 15:33:46 +01:00
Alex Voicu fe959f7bd7 Re-sync with upstream. 2018-10-18 12:27:03 +01:00
Maneesh Gupta 1a5025c57e Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
Use sinf and cosf from ocml device libs
2018-10-18 16:39:20 +05:30
Maneesh Gupta d133493669 Merge pull request #692 from whchung/hip-reinit-take2
HIP program state re-initialization logic (take 2)
2018-10-18 12:06:41 +05:30
Maneesh Gupta c24b06fa0a Merge pull request #703 from mangupta/stream_create_with_priority
Implementation for stream priority
2018-10-17 10:53:43 +05:30
Maneesh Gupta dbe4431d98 Merge pull request #702 from aaronenyeshi/fix-missing-irif-lib
Replace IRIF fences with atomic_work_item_fence
2018-10-17 10:53:27 +05:30
Maneesh Gupta 64d1cf86b7 Add missing hipHostRegister flags on nvcc path
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623
2018-10-15 15:30:24 +05:30
Alex Voicu 5312336ce2 Minimal should mean minimal. 2018-10-11 00:21:41 +01:00
Alex Voicu 3e4dbd32a1 Address Aaron's comments 2018-10-11 00:03:01 +01:00
Alex Voicu 4bc40551b5 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_minimal_changes_for_hc_next 2018-10-10 11:44:09 +01:00
Alex Voicu ca375cb8c5 Re-sync with upstream. 2018-10-10 11:43:49 +01:00
Maneesh Gupta da64156fb2 Implementation for stream priority
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged

Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca
2018-10-05 16:27:46 +05:30
Aaron Enye Shi 0787f74ac2 Replace IRIF fences with atomic_work_item_fence 2018-10-04 21:47:28 +00:00
Aaron Enye Shi 5dd35576f6 Fix hip_vector_types.h for long long vectors
There was a missing long in the declaration for [u]longlongN types.
2018-10-03 13:57:52 -04:00