Commit Graph

841 Commits

Author SHA1 Message Date
Maneesh Gupta 83e56b2ed4 Add missing hipHostRegister flags on nvcc path
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623


[ROCm/hip commit: 64d1cf86b7]
2018-10-15 15:30:24 +05:30
Aaron Enye Shi ff1c7e9d27 Fix hip_vector_types.h for long long vectors
There was a missing long in the declaration for [u]longlongN types.

[ROCm/hip commit: 5dd35576f6]
2018-10-03 13:57:52 -04:00
Maneesh Gupta 70f9bd87c6 Merge pull request #614 from ROCm-Developer-Tools/fma
Add overloading resolution functions for fma

[ROCm/hip commit: 3d67c9f952]
2018-09-20 13:38:03 +05:30
Yaxun Sam Liu f6318119a9 Silence warnings about duplicate static keyword
static is already in __DEVICE__, so should be removed.


[ROCm/hip commit: a5c961e26c]
2018-09-19 10:39:45 -04:00
Yaxun Sam Liu 2d4040f75d Add fma function with float and _Float16 arguments
[ROCm/hip commit: bd622a4b4a]
2018-09-19 09:59:33 -04:00
Yaxun Sam Liu dcc99ba5c6 Fix build failure of hipTestHalf and hipTestIncludeMath for hip-clang
[ROCm/hip commit: cf184460e9]
2018-09-18 21:00:15 -04:00
Maneesh Gupta bec821768d Merge pull request #672 from iotamudelta/fp16_fix
Only LLVM6 and higher contain the necessary intrinsics.

[ROCm/hip commit: 9ee70fca8a]
2018-09-18 08:43:33 +05:30
Maneesh Gupta 4eba3cdf5c Merge pull request #674 from mangupta/fix_dtests_on_nvcc
[dtests] Fix hipTestClock, hipTestNew, hipTestGlobalVariable, hipSimpleAtomicsTest & hipTestIncludeMath tests on nvcc path

[ROCm/hip commit: 32787fa1fc]
2018-09-18 07:50:52 +05:30
Maneesh Gupta e4f978c9d6 Merge pull request #677 from yxsamliu/fix-launch-decay
Fix hipLaunchKernelGGL for hip-clang

[ROCm/hip commit: 5cf281071d]
2018-09-18 07:50:37 +05:30
Yaxun Sam Liu 3d60af4f35 Disable device code for gcc in hip_memory.h
This device code should only be seen by HCC or hip-clang. It caused build failures
for the HIP-VDI runtime and should be disabled for gcc.


[ROCm/hip commit: cdfd82f1de]
2018-09-17 16:50:42 -04:00
Yaxun Sam Liu 694b6e105a Fix hipLaunchKernelGGL for hip-clang
Do not decay function pointer type of the kernel argument passed to hipLaunchKernelGGL
and hipLaunchKernel, otherwise some type information is lost which may cause
type inference failure for the template.

This issue caused compilation error of FeatureLPPooling in Caffe2/PyTorch and this patch
fixes that.


[ROCm/hip commit: fc228c7ea6]
2018-09-17 11:20:41 -04:00
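The decay mentioned in the commit above can be illustrated with a small, generic C++ sketch. It is not the actual patch: `kernel_stub` and both helper templates are hypothetical names introduced here only to show how a function argument is deduced when taken by value versus by reference.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical stand-in for a kernel; not taken from the HIP sources.
void kernel_stub(int) {}

// Taking the argument by value makes the function decay to a pointer,
// so F is deduced as void (*)(int).
template <typename F>
constexpr bool deduced_as_pointer_by_value(F) {
    return std::is_pointer<F>::value;
}

// Taking the argument by reference suppresses the decay,
// so F is deduced as the function type void(int).
template <typename F>
constexpr bool deduced_as_function_by_ref(F&) {
    return std::is_function<F>::value;
}
```

Both helpers return true for `kernel_stub`: the by-value form sees a pointer, while the by-reference form keeps the original function type available to any further template deduction.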
Maneesh Gupta fc3c4aca00 Add mappings for __clock* in nvcc_detail/hip_runtime.h
Change-Id: Ibcecf52f3e69298268d921efc036090544fa0ed0


[ROCm/hip commit: cef5261fa9]
2018-09-17 15:23:30 +05:30
Maneesh Gupta 4a671abf13 Merge branch 'master' into support-malloc
[ROCm/hip commit: 66f863d1f3]
2018-09-17 10:17:25 +05:30
Maneesh Gupta aef0245f90 Merge pull request #650 from ROCm-Developer-Tools/hip-clang-new
Support placement new in hip-clang

[ROCm/hip commit: cb348421d7]
2018-09-15 11:21:01 +05:30
Maneesh Gupta c8a776bfba Merge pull request #665 from aaronenyeshi/fix-min-funcs
Use templates for min to prevent ambiguity

[ROCm/hip commit: 8fe4e22b19]
2018-09-14 13:21:38 +05:30
Aaron Enye Shi fd9194fb16 Fix Tensorflow ambiguous min issue
[ROCm/hip commit: 6b811ca6d1]
2018-09-13 23:16:20 +00:00
Johannes M Dieterich b0602d501a Only LLVM6 and higher contain the necessary intrinsics.
[ROCm/hip commit: cf12a9c049]
2018-09-13 13:55:43 -05:00
Maneesh Gupta 5b2ca452c1 Merge pull request #669 from ROCm-Developer-Tools/feature_automatic_cast
Remove potential for mismatch between runtime passed actuals and defined formals

[ROCm/hip commit: aed5ad31ba]
2018-09-13 07:54:22 +05:30
Maneesh Gupta 2b59aea015 Merge pull request #661 from yxsamliu/add-empty-printf
Add empty printf for hip-clang

[ROCm/hip commit: 411e53a665]
2018-09-13 07:54:03 +05:30
Aaron Enye Shi 4d265c6df0 Avoid AMP-restrict call to CPU-restrict
[ROCm/hip commit: 894cbdd749]
2018-09-12 14:54:31 +00:00
Alex Voicu f6100dae08 Remove potential for mismatch between runtime passed actuals and defined formals.
[ROCm/hip commit: cdfea3ef7b]
2018-09-12 10:30:48 +01:00
Maneesh Gupta c1d8dc86ff Merge pull request #664 from lcskrishna/master
added __host__ to float2half and half2float functions.

[ROCm/hip commit: 8249cf037b]
2018-09-12 14:50:01 +05:30
Maneesh Gupta 3015572a69 Merge pull request #663 from yxsamliu/fix-launch
Use template for hipLaunchKernelGGL for hip-clang

[ROCm/hip commit: 133d665a88]
2018-09-12 14:49:38 +05:30
carlushuang 7c4aaeff95 fix __longlong_as_double() problem, return the double value
The previous version returned the long long value converted *as* a double, so we could get the wrong result.
This also affects atomicAdd(double * ...), which uses a long long pointer to mimic a double pointer.

Signed-off-by: carlushuang <carlus.huang@amd.com>


[ROCm/hip commit: d577f27d1a]
2018-09-12 13:25:00 +08:00
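The difference the commit above describes, converting a long long value to double versus reinterpreting its bits as a double, can be sketched in host-side C++. This is only an illustration under the assumption of 64-bit IEEE 754 doubles; the function names are hypothetical and are not the HIP device functions.

```cpp
#include <cassert>
#include <cstring>

// Numeric conversion: the integer value 1 becomes the double 1.0.
inline double value_cast(long long x) {
    return static_cast<double>(x);
}

// Bit reinterpretation: the 64 bits of x are read back as a double.
inline double bits_as_double(long long x) {
    static_assert(sizeof(double) == sizeof(long long),
                  "sketch assumes 64-bit double and long long");
    double d;
    std::memcpy(&d, &x, sizeof d);
    return d;
}

// Inverse of bits_as_double: store a double's bit pattern in a long long.
inline long long double_as_bits(double d) {
    long long x;
    std::memcpy(&x, &d, sizeof x);
    return x;
}
```

A round trip through `double_as_bits` and `bits_as_double` returns the original double, whereas `value_cast` on the same bit pattern yields an unrelated large number. An atomicAdd on double emulated through a long long pointer depends on the round-trip behaviour.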
Aaron Enye Shi d335be17e4 Avoid host min func conflict with gcc min
[ROCm/hip commit: ffd89dde9c]
2018-09-11 18:48:31 +00:00
Aaron Enye Shi b98941d0f2 Use templates for min to prevent ambiguity
[ROCm/hip commit: 0121ec13aa]
2018-09-11 18:21:54 +00:00
Yaxun Sam Liu 7e51c5c48d Use template for hipLaunchKernelGGL for hip-clang
[ROCm/hip commit: 9e9a93e10a]
2018-09-07 16:20:00 -04:00
Yaxun Sam Liu 04c5bab9c6 Add empty printf for hip-clang
[ROCm/hip commit: 3e1833ca52]
2018-09-07 16:15:08 -04:00
lcskrishna 95bb14cebc added host for float2half and half2float
[ROCm/hip commit: 2984c020b8]
2018-09-06 15:02:52 -07:00
Yaxun (Sam) Liu 747af80a5b Support placement new in hip-clang
[ROCm/hip commit: 338eaefa84]
2018-09-04 16:53:46 -04:00
Maneesh Gupta 489b696bae Merge pull request #654 from ROCm-Developer-Tools/fix-force-inline
__forceinline__ needs inline and always_inline

[ROCm/hip commit: 52b8879f12]
2018-09-03 16:35:34 +05:30
Maneesh Gupta be7f89f25e Merge pull request #657 from yxsamliu/fix-clz
Fix ambiguity of __clz and __clzll

[ROCm/hip commit: a3a93a68d4]
2018-09-03 16:34:42 +05:30
Maneesh Gupta ac4b243c2d Merge pull request #656 from searlmc1/msearles-fixes
Add keyword 'explicit' to avoid ambiguity

[ROCm/hip commit: 6b21d77a86]
2018-09-03 16:33:41 +05:30
Maneesh Gupta f219b3fa35 Merge pull request #651 from yxsamliu/hip-clang-clock
Add clock() and clock64()

[ROCm/hip commit: 7459defec1]
2018-09-03 16:32:30 +05:30
Yaxun Sam Liu 275d66917e Fix ambiguity of __clz and __clzll
[ROCm/hip commit: 11c91c2fdd]
2018-08-31 18:07:57 -04:00
Mark Searles 7c101b2e20 Add keyword 'explicit' to avoid ambiguity
Older gcc, e.g., 5.x, sees an ambiguity in some calls. Example error
output as seen with gcc 5.5:

In file included from benchmark_wino.cpp:1:
In file included from ./miopen.hpp:13:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/regex:38:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/algorithm:62:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/bits/stl_algo.h:66:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/random:51:
/usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/bits/random.tcc:1324:27: error: call to 'abs' is ambiguous
                    const double __y = -std::abs(__n) * __param._M_sm - 1;
                                        ^~~~~~~~
/opt/rocm/hip/include/hip/hcc_detail/hip_complex.h:345:31: note: candidate function
__DEFINE_HIP_COMPLEX_REAL_FUN(abs, hipCabs)


[ROCm/hip commit: aa7ce2ef44]
2018-08-31 11:10:58 -07:00
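How `explicit` can remove a candidate from overload resolution is sketched below in generic C++. The two wrapper types are hypothetical and do not reproduce the definitions in hip_complex.h; the sketch only shows that an `explicit` constructor no longer takes part in implicit conversions, which is the mechanism that can make a call such as `abs(x)` ambiguous.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical wrapper with a converting constructor: a double can be
// turned into it silently, so functions taking ImplicitReal become
// candidates for calls made with a plain double.
struct ImplicitReal {
    double re;
    ImplicitReal(double r) : re(r) {}
};

// Same wrapper, but the constructor is explicit: direct construction
// still works, implicit conversion from double does not.
struct ExplicitReal {
    double re;
    explicit ExplicitReal(double r) : re(r) {}
};

constexpr bool kImplicitConverts = std::is_convertible<double, ImplicitReal>::value;
constexpr bool kExplicitConverts = std::is_convertible<double, ExplicitReal>::value;
constexpr bool kExplicitConstructs = std::is_constructible<ExplicitReal, double>::value;
```

Here `kImplicitConverts` is true, `kExplicitConverts` is false, and `kExplicitConstructs` is true, so code that builds the type deliberately keeps working while accidental conversions are ruled out.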
Mark Searles ff4df50cd5 Add keyword 'explicit' to avoid ambiguity
Older gcc, e.g., 5.x, sees an ambiguity in some calls. Example error
output as seen with gcc 5.5:

In file included from benchmark_wino.cpp:1:
In file included from ./miopen.hpp:13:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/regex:38:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/algorithm:62:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/bits/stl_algo.h:66:
In file included from /usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/random:51:
/usr/lib/gcc/x86_64-linux-gnu/5.5.0/../../../../include/c++/5.5.0/bits/random.tcc:1324:27: error: call to 'abs' is ambiguous
                    const double __y = -std::abs(__n) * __param._M_sm - 1;
                                        ^~~~~~~~
/opt/rocm/hip/include/hip/hcc_detail/hip_complex.h:345:31: note: candidate function
__DEFINE_HIP_COMPLEX_REAL_FUN(abs, hipCabs)


[ROCm/hip commit: f0183dfaac]
2018-08-31 10:36:05 -07:00
Aaron Enye Shi 8a602184bb __forceinline__ needs inline and always_inline
Related to SWDEV-163084


[ROCm/hip commit: e7352a1f4a]
2018-08-29 18:33:35 +00:00
Yaxun Sam Liu c1ab349fc7 Add clock() and clock64()
[ROCm/hip commit: 1f74a66bea]
2018-08-29 10:51:00 -04:00
Aaron Enye Shi 9619457b1f Add bitinsert64 to device functions
[ROCm/hip commit: ba9fc6f355]
2018-08-28 20:01:14 +00:00
Aaron Enye Shi eb2e7e0671 Clean up and add meaningful variables to bit funcs
[ROCm/hip commit: b939a3c6ed]
2018-08-28 18:19:48 +00:00
Aaron Enye Shi 1d29819850 Add bitextract and bitinsert functions
[ROCm/hip commit: b445e45007]
2018-08-20 16:46:12 +00:00
Maneesh Gupta 236c7002b4 Merge pull request #623 from ROCm-Developer-Tools/tf18
Fix declaration conflict when hip/math_functions.h is included first

[ROCm/hip commit: d8d45a906c]
2018-08-10 09:36:27 +05:30
Maneesh Gupta 28739aa597 Merge pull request #613 from ROCm-Developer-Tools/hcc-compat
Add HCC compatibility mode for hip-clang

[ROCm/hip commit: 11cccaa5d0]
2018-08-10 09:32:31 +05:30
Yaxun Sam Liu 9c2a33b5c9 Fix __HIP_ARCH_* not defined after including math_functions.h
hcc_detail/math_functions.h used to include hcc_detail/hip_runtime.h.

Removing it has caused regression in TensorFlow 1.8.

Put it back for backward compatibility.


[ROCm/hip commit: b8a0c14171]
2018-08-08 08:55:28 -04:00
Yaxun Sam Liu 735e7ac097 Fix __HIP_DEVICE_COMPILE__ not defined when hip/math_functions.h is included
This fixes build failure in TensorFlow 1.8 for HCC


[ROCm/hip commit: 450ea49df0]
2018-08-07 17:14:39 -04:00
Yaxun Sam Liu fd1b704358 Fix declaration conflict when hip/math_functions.h is included first
This fixes build failure in TensorFlow 1.8 for HCC


[ROCm/hip commit: dacb18414e]
2018-08-07 15:44:59 -04:00
Wen-Heng (Jack) Chung 38b6402860 Revert "HIP program state re-initialization logic"
This reverts commit e5ca9eb081.


[ROCm/hip commit: 2604f33930]
2018-08-03 17:03:04 -05:00
Maneesh Gupta 9ddcc6e6c6 Merge pull request #603 from b-sumner/master
Drop use of NVCC_COMPAT since it is always set

[ROCm/hip commit: 78ca12d583]
2018-08-03 08:45:18 +05:30
Maneesh Gupta f0dfb51e39 Merge pull request #610 from aaronenyeshi/enable-hcc-module-launch
Enable hipHccModuleLaunchKernel on HIP-Clang

[ROCm/hip commit: d0cdc69676]
2018-08-02 08:08:46 +05:30