Wykres commitów

883 Commity

Autor SHA1 Wiadomość Data
Michael Kuron c35dfb71d5 Merge branch 'master' into getsymboladdress
[ROCm/hip commit: 8610128c3e]
2018-11-20 12:03:22 +01:00
Michael Kuron 2e8a94e90b Fix hipGetSymbolAddress/hipGetSymbolSize on nvcc
[ROCm/hip commit: 999b62b711]
2018-11-15 09:48:00 +01:00
Maneesh Gupta bc85e694c5 Merge pull request #756 from yxsamliu/fix-heap
Define __hip_device_heap in header for hip-clang only

[ROCm/hip commit: 0770c2b8da]
2018-11-14 13:22:31 +05:30
Yaxun Sam Liu a30829a7bf Define __hip_device_heap in header for hip-clang only
[ROCm/hip commit: 17ac81b69e]
2018-11-12 16:39:41 -05:00
Alex Voicu 591d4d1b5f Handle (odd) corner case of argumentless __global__ function.
[ROCm/hip commit: c0bd1a5af8]
2018-11-12 00:32:35 +00:00
Alex Voicu 9e2269c8db Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
[ROCm/hip commit: aa6815cdc8]
2018-11-08 06:53:06 -08:00
Aaron Enye Shi 8bf2429663 Guard rcp rounded implementation as well
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.


[ROCm/hip commit: 0cfaa52d15]
2018-11-06 19:53:28 +00:00
Aaron Enye Shi eb4053e3fe Guard the OCML rounded operations instead
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.


[ROCm/hip commit: e59c33250a]
2018-11-06 16:32:14 +00:00
Michael Kuron cbba8221ee Introduce ihipModuleGetGlobal
[ROCm/hip commit: 31acf1c268]
2018-11-06 09:54:34 +01:00
Aaron Enye Shi 1c6f47ef55 Remove non-working non-default-rounded math apis
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.


[ROCm/hip commit: 2b108a2cfd]
2018-11-05 22:34:16 +00:00
Michael Kuron da5df1fd1e Document return values of hipMemcpyToSymbol, hipGetSymbolAddress
[ROCm/hip commit: 4390c82121]
2018-11-04 11:47:17 +01:00
Michael Kuron bc455ccf50 Implement hipGetSymbolAddress and hipGetSymbolSize
[ROCm/hip commit: 73616582d6]
2018-11-04 10:39:34 +01:00
Alex Voicu 9cd4a9dbb1 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
[ROCm/hip commit: f28cde3f98]
2018-11-01 23:17:27 +00:00
Siu Chi Chan 1159b4aa05 Move the global arrays for hip malloc/free
from a header into a source file such that
there's only an unique copy in an executable
and prevent wasting static memory on the host

Change-Id: Id5b62766f77809c8d7b47892cb7149c490dcbdb9


[ROCm/hip commit: 0ff408a56c]
2018-11-01 16:20:35 -04:00
Jeff Daily c28043bc4f typedef struct hipFuncAttributes now C compatible.
Fixes #591.
Fixes #694.


[ROCm/hip commit: bb447dd76e]
2018-10-31 14:12:45 +05:30
Maneesh Gupta ac071f5390 Merge pull request #714 from aaronenyeshi/remove-static-hip-device
Remove libhip_device.a static library

[ROCm/hip commit: 0b86b421c1]
2018-10-31 14:08:29 +05:30
Maneesh Gupta 3986e47f8d Merge pull request #713 from scchan/master-hcc_early_finalize_support
Support more than one device code blobs in a single .kernel section

[ROCm/hip commit: 7f3e7b5334]
2018-10-31 14:04:06 +05:30
Alex Voicu 726b654c6d Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
[ROCm/hip commit: d035cb9000]
2018-10-30 23:34:46 +00:00
Alex Voicu c998b7dfc2 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_get_alignment_and_size_from_metadata
[ROCm/hip commit: 185fa122ed]
2018-10-28 17:02:10 +00:00
Alex Voicu 6d64f5e112 Rely on code object metadat for kernarg arguments alignof and sizeof.
[ROCm/hip commit: fe1e963299]
2018-10-28 17:01:00 +00:00
Aaron Enye Shi 5bbd632888 Remove libhip_device.a static library
Move remaining function definitions from device_util.cpp to hip_runtime.h header. We can now remove the static library completely as device_util.cpp was the last file part of target hip::hip_device .


[ROCm/hip commit: 817b27d530]
2018-10-25 19:52:35 +00:00
Alex Voicu 147a6bab43 hipLaunchKernel, hipLaunchParm are deprecated, and shall be removed.
[ROCm/hip commit: e4181b85be]
2018-10-25 13:32:17 +01:00
Maneesh Gupta 5333a22ccc Merge pull request #723 from mangupta/fix_double_shfl
Fix logic for double variants of __shfl*

[ROCm/hip commit: f6f160fa6b]
2018-10-25 06:01:38 +05:30
Maneesh Gupta dd8ebc9a97 Fix logic for double variants of __shfl*
Change-Id: I604f00b54cf4bd9c5f26ca6fa680fca5e9629417


[ROCm/hip commit: 19404e603d]
2018-10-24 12:39:09 +05:30
Maneesh Gupta 90d15c13a1 Make HIP functional again with HCC from ROCm 1.9.x
Change-Id: I214acdfd0b79dcf783993e44fe31baee64fd4dc3


[ROCm/hip commit: 0703a2d0f0]
2018-10-24 10:41:56 +05:30
Maneesh Gupta a648f1c48e Merge pull request #705 from ROCm-Developer-Tools/feature_minimal_changes_for_hc_next
Feature minimal changes for hc next

[ROCm/hip commit: 4a00b244a3]
2018-10-19 06:58:31 +05:30
Siu Chi Chan 7ce37f3a08 Support more than one bundles in a single .kernel section
When compiling with Early Finalization enabled in HCC,
the resulting .kernel section of the host object now may
contain more than one device code bundles.  This is to
teach the HIP runtime to correctly extract all the
bundles from the .kernel section.


[ROCm/hip commit: 30ce25e627]
2018-10-18 17:00:27 -04:00
Alex Voicu b8b15ce792 Dumb workaround is still needed, so add it back.
[ROCm/hip commit: 5ccaf2fa7d]
2018-10-18 15:33:46 +01:00
Alex Voicu de15c6988d Re-sync with upstream.
[ROCm/hip commit: fe959f7bd7]
2018-10-18 12:27:03 +01:00
Maneesh Gupta 0dff12a615 Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
Use sinf and cosf from ocml device libs

[ROCm/hip commit: 1a5025c57e]
2018-10-18 16:39:20 +05:30
Maneesh Gupta 1a8b4ce037 Merge pull request #692 from whchung/hip-reinit-take2
HIP program state re-initialization logic (take 2)

[ROCm/hip commit: d133493669]
2018-10-18 12:06:41 +05:30
Maneesh Gupta 63e6cd228b Merge pull request #703 from mangupta/stream_create_with_priority
Implementation for stream priority

[ROCm/hip commit: c24b06fa0a]
2018-10-17 10:53:43 +05:30
Maneesh Gupta a58a408e71 Merge pull request #702 from aaronenyeshi/fix-missing-irif-lib
Replace IRIF fences with atomic_work_item_fence

[ROCm/hip commit: dbe4431d98]
2018-10-17 10:53:27 +05:30
Maneesh Gupta 83e56b2ed4 Add missing hipHostRegister flags on nvcc path
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623


[ROCm/hip commit: 64d1cf86b7]
2018-10-15 15:30:24 +05:30
Alex Voicu 69b0baf37f Minimal should mean minimal.
[ROCm/hip commit: 5312336ce2]
2018-10-11 00:21:41 +01:00
Alex Voicu d68ed45708 Address Aaron's comments
[ROCm/hip commit: 3e4dbd32a1]
2018-10-11 00:03:01 +01:00
Alex Voicu 0d30e15646 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_minimal_changes_for_hc_next
[ROCm/hip commit: 4bc40551b5]
2018-10-10 11:44:09 +01:00
Alex Voicu 6d0b50e9b9 Re-sync with upstream.
[ROCm/hip commit: ca375cb8c5]
2018-10-10 11:43:49 +01:00
Maneesh Gupta b35590db18 Implementation for stream priority
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged

Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca


[ROCm/hip commit: da64156fb2]
2018-10-05 16:27:46 +05:30
Aaron Enye Shi 188df279a9 Replace IRIF fences with atomic_work_item_fence
[ROCm/hip commit: 0787f74ac2]
2018-10-04 21:47:28 +00:00
Aaron Enye Shi ff1c7e9d27 Fix hip_vector_types.h for long long vectors
There was a missing long in the declaration for [u]longlongN types.

[ROCm/hip commit: 5dd35576f6]
2018-10-03 13:57:52 -04:00
Wen-Heng (Jack) Chung 4e04cf3d2a HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.


[ROCm/hip commit: dab1a0f9db]
2018-09-26 19:48:47 +00:00
Aaron Enye Shi 19d76382d6 Use sinf and cosf from ocml device libs
Using llvm_amdgcn builtin fails to produce accurate values, we should move to using the ocml device library versions.


[ROCm/hip commit: 77c07d4118]
2018-09-25 19:31:39 +00:00
Maneesh Gupta 70f9bd87c6 Merge pull request #614 from ROCm-Developer-Tools/fma
Add overloading resolution functions for fma

[ROCm/hip commit: 3d67c9f952]
2018-09-20 13:38:03 +05:30
Yaxun Sam Liu f6318119a9 Silent warnings about duplicate static keyword
static is already in __DEVICE__, so should be removed.


[ROCm/hip commit: a5c961e26c]
2018-09-19 10:39:45 -04:00
Yaxun Sam Liu 2d4040f75d Add fma function with float and _Float16 arguments
[ROCm/hip commit: bd622a4b4a]
2018-09-19 09:59:33 -04:00
Yaxun Sam Liu dcc99ba5c6 Fix build failure of hipTestHalf and hipTestIncludeMath for hip-clang
[ROCm/hip commit: cf184460e9]
2018-09-18 21:00:15 -04:00
Maneesh Gupta bec821768d Merge pull request #672 from iotamudelta/fp16_fix
Only LLVM6 and higher contain the necessary intrinsics.

[ROCm/hip commit: 9ee70fca8a]
2018-09-18 08:43:33 +05:30
Maneesh Gupta 4eba3cdf5c Merge pull request #674 from mangupta/fix_dtests_on_nvcc
[dtests] Fix hipTestClock, hipTestNew, hipTestGlobalVariable, hipSimpleAtomicsTest & hipTestIncludeMath tests on nvcc path

[ROCm/hip commit: 32787fa1fc]
2018-09-18 07:50:52 +05:30
Maneesh Gupta e4f978c9d6 Merge pull request #677 from yxsamliu/fix-launch-decay
Fix hipLaunchKernelGGL for hip-clang

[ROCm/hip commit: 5cf281071d]
2018-09-18 07:50:37 +05:30