Graf commitů

2903 Commity

Autor SHA1 Zpráva Datum
Maneesh Gupta 9ee70fca8a Merge pull request #672 from iotamudelta/fp16_fix
Only LLVM6 and higher contain the necessary intrinsics.
2018-09-18 08:43:33 +05:30
Maneesh Gupta 32787fa1fc Merge pull request #674 from mangupta/fix_dtests_on_nvcc
[dtests] Fix hipTestClock, hipTestNew, hipTestGlobalVariable, hipSimpleAtomicsTest & hipTestIncludeMath tests on nvcc path
2018-09-18 07:50:52 +05:30
Maneesh Gupta 5cf281071d Merge pull request #677 from yxsamliu/fix-launch-decay
Fix hipLaunchKernelGGL for hip-clang
2018-09-18 07:50:37 +05:30
Maneesh Gupta d28ce807c7 Merge pull request #676 from VincentSC/patch-1
Update install.sh
2018-09-18 07:50:28 +05:30
Maneesh Gupta 47bd84974b Merge pull request #678 from yxsamliu/fix-memory
Disable device code for gcc in hip_memory.h
2018-09-18 07:50:18 +05:30
Yaxun Sam Liu cdfd82f1de Disable device code for gcc in hip_memory.h
These device code should only be seen by HCC or hip-clang. They causd build failure
for HIP-VDI runtime and should be disabled for gcc.
2018-09-17 16:50:42 -04:00
Yaxun Sam Liu fc228c7ea6 Fix hipLaunchKernelGGL for hip-clang
Do not decay function pointer type of the kernel argument passed to hipLaunchKernelGGL
and hipLaunchKernel, otherwise some type information is lost which may cause
type inference failure for the template.

This issue caused compilation error of FeatureLPPooling in Caffe2/PyTorch and this patch
fixes that.
2018-09-17 11:20:41 -04:00
VincentSC 30acc99e13 Update install.sh
'rename' added as a Linux prerequisite.
2018-09-17 13:25:12 +02:00
Maneesh Gupta cca2c5afc8 [dests] Fix hipTestClock, hipTestNew & hipTestGlobalVariable tests for nvcc
nvcc does not support global kernels in struct/class

Change-Id: I2d7297e0c3725564215e20dbdd31c0bb8d7a07de
2018-09-17 15:32:05 +05:30
Maneesh Gupta 94a0589eb5 [dtests] Fix deviceLib/hipSimpleAtomicsTest for nvcc
Change-Id: I23cbea2820d41da6f6a1bcab4bc3b59ac21799dd
2018-09-17 15:26:45 +05:30
Maneesh Gupta cef5261fa9 Add mappings for __clock* in nvcc_detail/hip_runtime.h
Change-Id: Ibcecf52f3e69298268d921efc036090544fa0ed0
2018-09-17 15:23:30 +05:30
Maneesh Gupta 0b211bb429 [dtests] Fix deviceLib/hipTestIncludeMath on nvcc path
Change-Id: Ifb121886bec6c1134e98bd194ff1713f28454d33
2018-09-17 15:15:49 +05:30
Maneesh Gupta 30b1494c3d Merge pull request #606 from ROCm-Developer-Tools/add-test-global-var
Add HIP directed test hipTestGlobalVariable.cpp
2018-09-17 11:11:51 +05:30
Maneesh Gupta 319d6dce5e Merge pull request #598 from ROCm-Developer-Tools/support-malloc
Support device-side malloc/free for hip-clang
2018-09-17 11:11:36 +05:30
Maneesh Gupta 66f863d1f3 Merge branch 'master' into support-malloc 2018-09-17 10:17:25 +05:30
Maneesh Gupta 48cc4f0d9e Merge pull request #639 from ROCm-Developer-Tools/skudchad-hip
Check for hipEnvVar at the same level if directed_tests location fails
2018-09-16 18:31:37 +05:30
Maneesh Gupta 9b2107749c Revert changes to runtime/stream/hipStreamSync2 2018-09-15 13:23:38 +05:30
Maneesh Gupta cb348421d7 Merge pull request #650 from ROCm-Developer-Tools/hip-clang-new
Support placement new in hip-clang
2018-09-15 11:21:01 +05:30
Maneesh Gupta 233ac1e0f0 Merge pull request #636 from ROCm-Developer-Tools/db-fatbin
Add HIP_DB=fatbin for debugging fat binary issues
2018-09-14 13:21:58 +05:30
Maneesh Gupta d6ce39c1b7 Merge pull request #637 from ROCm-Developer-Tools/add-pthread
Let hipcc link -lpthread -lm by default
2018-09-14 13:21:47 +05:30
Maneesh Gupta 8fe4e22b19 Merge pull request #665 from aaronenyeshi/fix-min-funcs
Use templates for min to prevent ambiguity
2018-09-14 13:21:38 +05:30
Maneesh Gupta 93a07ae6cb Merge pull request #642 from ROCm-Developer-Tools/handle-obj
Let hipcc handle obj files in linker response file for hip-clang
2018-09-14 08:59:53 +05:30
Maneesh Gupta 87bf9aa9d1 Merge pull request #658 from phani544/hipClangFrndClass
[HIP-Clang]Add friend class/function test
2018-09-14 08:59:40 +05:30
Aaron Enye Shi 6b811ca6d1 Fix Tensorflow ambiguous min issue 2018-09-13 23:16:20 +00:00
Johannes M Dieterich cf12a9c049 Only LLVM6 and higher contain the necessary intrinsics. 2018-09-13 13:55:43 -05:00
Maneesh Gupta 5cb819b34e Merge pull request #633 from aaronenyeshi/hip-clang-remove-am
Remove few hcc specific cmake for hip-clang
2018-09-13 10:20:12 +05:30
Maneesh Gupta aed5ad31ba Merge pull request #669 from ROCm-Developer-Tools/feature_automatic_cast
Remove potential for mismatch between runtime passed actuals and defined formals
2018-09-13 07:54:22 +05:30
Maneesh Gupta c44906011d Merge pull request #670 from gargrahul/remove_adipose_launchkernelhcc
Remove adipose extn from launchKernelHcc sample
2018-09-13 07:54:11 +05:30
Maneesh Gupta 411e53a665 Merge pull request #661 from yxsamliu/add-empty-printf
Add empty printf for hip-clang
2018-09-13 07:54:03 +05:30
Maneesh Gupta 48f3d77185 Merge pull request #648 from ROCm-Developer-Tools/yxsamliu-patch-2
Document kernel launching for hip-clang
2018-09-13 07:53:50 +05:30
Aaron Enye Shi 894cbdd749 Avoid AMP-retrict call to CPU-restrict 2018-09-12 14:54:31 +00:00
Rahul Garg fb706902cc Remove adipose extn from launchKernelHcc sample 2018-09-12 16:41:24 +05:30
Alex Voicu cdfea3ef7b Remove potential for mismatch between runtime passed actuals and defined formals. 2018-09-12 10:30:48 +01:00
Maneesh Gupta 8249cf037b Merge pull request #664 from lcskrishna/master
added __host__ to float2half and half2float functions.
2018-09-12 14:50:01 +05:30
Maneesh Gupta 133d665a88 Merge pull request #663 from yxsamliu/fix-launch
Use template for hipLaunchKernelGGL for hip-clang
2018-09-12 14:49:38 +05:30
Maneesh Gupta fefb2b2e68 Merge pull request #667 from carlushuang/longlong_as_double_fix
fix __longlong_as_double() problem, return the double value
2018-09-12 12:53:57 +05:30
Maneesh Gupta e63205fad8 Merge pull request #666 from ROCm-Developer-Tools/mangupta-patch-2
[ci] Disable hipVectorTypes* tests in CI
2018-09-12 12:52:15 +05:30
carlushuang d577f27d1a fix __longlong_as_double() problem, return the double value
previous version return a long long valus *as* double, hence we may get the wrong result.
this also affect atomicAdd(double * ...), which use long long pointer to mimic double pointer.

Signed-off-by: carlushuang <carlus.huang@amd.com>
2018-09-12 13:25:00 +08:00
Maneesh Gupta cb6cf6584e [ci] Disable hipVectorTypes* tests in CI
Disable directed_tests/deviceLib/hipVectorTypes.tst & directed_tests/deviceLib/hipVectorTypesDevice.tst in CI due to HCC regressions. Once HCC fixes are in, the tests can be re-enabled in CI.
2018-09-12 10:32:15 +05:30
Aaron Enye Shi ffd89dde9c Avoid host min func conflict with gcc min 2018-09-11 18:48:31 +00:00
Aaron Enye Shi 0121ec13aa Use templates for min to prevent ambiguity 2018-09-11 18:21:54 +00:00
Yaxun Sam Liu 9e9a93e10a Use template for hipLaunchKernelGGL for hip-clang 2018-09-07 16:20:00 -04:00
Yaxun Sam Liu 3e1833ca52 Add empty printf for hip-clang 2018-09-07 16:15:08 -04:00
lcskrishna 2984c020b8 added host for float2half and hlaf2float 2018-09-06 15:02:52 -07:00
fpadmin a8bc26344f [HIP-Clang]Add friend class/function test 2018-09-05 11:48:49 +05:30
Yaxun Sam Liu 4f17b4877e Let hipcc link -lpthread -lm by default 2018-09-04 17:11:47 -04:00
Yaxun Sam Liu 1587b18a91 Let hipcc handle obj files in linker response file for hip-clang
If obj files in linker response file contains device code, pass them to hip-clang, otherwise keep them in the linker
response file.
2018-09-04 16:57:55 -04:00
Yaxun (Sam) Liu 1bb28375bc Document kernel launching for hip-clang 2018-09-04 16:56:53 -04:00
Yaxun (Sam) Liu 338eaefa84 Support placement new in hip-clang 2018-09-04 16:53:46 -04:00
Maneesh Gupta 52b8879f12 Merge pull request #654 from ROCm-Developer-Tools/fix-force-inline
__forceinline__ needs inline and always_inline
2018-09-03 16:35:34 +05:30