커밋 그래프

104 커밋

작성자 SHA1 메시지 날짜
Alex Voicu 9d91b802a5 Let's try this again... 2018-06-25 17:49:50 +01:00
Alex Voicu 859133a045 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types 2018-06-22 12:19:32 +01:00
Maneesh Gupta 946c8da88a Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang  builtins.
2018-06-20 14:16:43 +05:30
Alex Voicu 28a1aef8a1 Revert "Revert "Switch over to using native vector types, for better codegen. Remove noise.""
This reverts commit 7a4aace13d.
2018-06-16 22:59:36 +01:00
Siu Chi Chan 7a4aace13d Revert "Switch over to using native vector types, for better codegen. Remove noise." 2018-06-08 16:48:22 -04:00
Alex Voicu 089ab3b947 Switch the atomic implementation to use Clang builtins. 2018-06-02 12:27:17 +01:00
Alex Voicu 417869821d Re-sync with upstream. 2018-06-01 15:49:05 +01:00
Maneesh Gupta 68c36f9125 Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.
2018-06-01 13:58:07 +05:30
Alex Voicu ab4b2a650b Re-sync with upstream. Add integer abs. 2018-05-31 16:38:00 +01:00
Alex Voicu 0108819e2a Add missing interop with volatile. Fix unit tests. 2018-05-31 15:27:12 +01:00
Alex Voicu 59db16fd36 Switch to using ROCDL directly, as opposed to via HC. Add missing bits. 2018-05-31 03:17:26 +01:00
Alex Voicu 0eec115bcd Move converting constructor from _Float16 under macro guard. Refactor. 2018-05-25 19:46:41 +01:00
Alex Voicu 9948b5961e Update hipTestHalf to actually test behaviour. Add missing hipHostfree. 2018-05-24 13:55:30 +01:00
Alex Voicu 6f819f226b Missing commit. 2018-05-23 17:57:47 +01:00
Maneesh Gupta 305592d622 Disable incomplete unit tests that don't work on nvcc path
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
2018-05-21 11:35:03 +05:30
Maneesh Gupta 67d45164fa Merge pull request #444 from aaronenyeshi/vg20-initial
initial gfx906 support
2018-05-18 13:18:07 +05:30
Alex Voicu b8c98d2357 Switch over to using native vector types, for better codegen. Remove noise. 2018-05-12 00:14:17 +01:00
Aaron Enye Shi 848a24b524 Fix hipMathFunction for gfx906 2018-05-11 10:53:07 -04:00
Siu Chi Chan b898049412 initial gfx906 support 2018-05-10 19:28:00 +00:00
Siu Chi Chan b285145966 add intrinsics mbcnt_lo, mbcnt_hi, lane_id 2018-05-08 13:43:53 -04:00
Deven Desai 76a7d7e374 Fixing a copy/paste error in my previous checkin 2018-04-27 12:59:51 +00:00
Deven Desai 4be4cf644f Checkin to add unit tests for fixes in my previous commit
This change adds unit tests for

1. __hisinf
2. __hisnan
2. abs(long long)
3. lgamma(double)
2018-04-25 17:53:21 +00:00
Maneesh Gupta 1ba06f63c4 Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Phaneendr-kumar Lanka bd60ef9031 [nvccWarnings] Fix -Wno-deprecated-declarations in hip_anyall and hip_ballot 2017-12-20 12:05:21 +05:30
Phaneendr-kumar Lanka eea7d495c7 [nvccWarnings] Fix warnings seen with dtests on nvcc path 2017-12-14 14:10:37 +05:30
Ben Sander 62cbe22d50 Temporarily disable a couple tests pending some HCC work 2017-12-01 21:46:28 +00:00
Alex Voicu 32e11e7dc6 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
2017-11-29 21:49:10 +00:00
Alex Voicu fbaf729f88 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Alex Voicu 4131b47134 Modify the set component of the memcpy test (unclear why there is a memset component to begin with). 2017-11-21 17:52:01 +00:00
Alex Voicu c5f2b22d0d Re-sync with upstream. 2017-11-20 15:34:50 +00:00
Maneesh Gupta db378fbc9e Merge pull request #266 from gargrahul/fix_half2_gfx900
Fixed half2 issue on gfx900
2017-11-20 07:28:41 +05:30
Alex Voicu cffd0e14eb This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility. 2017-11-19 01:54:12 +00:00
Rahul Garg 9af0f9cbc1 Fixed test case for GFX900 2017-11-16 09:34:52 +05:30
Rahul Garg fef496d4f1 Fixed half2 issue on gfx900 2017-11-15 18:52:59 +05:30
Alex Voicu a619ac6f92 This corrects some interesting choices that were present in the HIP
unit tests such as e.g. de-allocating memory allocated with new[] using
free. All of these were identified via cppcheck.
2017-11-01 22:51:22 +00:00
Alex Voicu c2482d1255 This switches HIP from its currently convoluted macro + pfe based dispatch mechanism to a more natural one partially based on the existing module API. The basic idea is that HCC will always correctly emit __global__ functions: as empty-bodied stubs, on host, and as kernels, on device. It then becomes trivial to obtain the mangled name on host, at dispatch, from the function's address, and then to use the mangled name to retrieve the kernel. This should address all problems stemming from serialisation, dubious mismatches due to the manufactured functor, macro-isms et al. It also immediately enables support for generalised globals as a consequence of that being available in the module API. Finally, it will make debug much easier, since the actual names of the __global__ functions will automatically be used in traces etc. One detail is that due to how dispatch works now (hipLaunchKernel and hipLaunchKernelGGL are themselves variadic function templates which deduce the function type of the callee), in certain cases it may be necesssary to insert explicit casts to ensure that the variadic argument list selects a viable overload - this can be observed in some unit tests. Eventually we may be able to remove this limitation, but for now it does not appear terribly onerous. The code is not extremely HIPpie, nor is it fully optimised, but rather is intended as a starting point for the HIP team to make its own. 2017-11-01 15:09:59 +00:00
Maneesh Gupta 4bcb3372b5 [dtests] Fix hip_threadfence_system test's BUILD command
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan 9a04c5cdd4 add C++11 compilation flags and minor bug fixes 2017-08-10 17:58:36 +00:00
Maneesh Gupta 69e431f035 [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Siu Chi Chan 96e79f97e2 implement __threadfence_system 2017-08-02 08:50:18 +00:00
Maneesh Gupta 7a446dcbc5 Renable frexp(f) device math function
Change-Id: I53c022b8ddf38cd17ddb42eba457b9020db66395
2017-07-20 14:41:30 +05:30
Rahul Garg 039f62213b Enable memcpytosymbol family functions test 2017-07-18 12:37:37 +05:30
Wen-Heng (Jack) Chung 30ee797dd4 Remove explicit address spaces declarations 2017-07-07 12:46:29 -05:00
Maneesh Gupta e6cafbf342 hipDeviceMemcpy test: make it functional on nvcc path
Change-Id: Id10c79b48747ed701adbd0a233c53cd60cfa743b
2017-06-07 15:24:44 +05:30
Maneesh Gupta 5cdd1b2bf5 Disable rcbrtf, scalblnf, scalbnf in single precision device test
Change-Id: I8a250a64a0cb05132d022a11d9766ced9cdf11a7
2017-05-31 10:16:19 +05:30
Maneesh Gupta 4919863d3e Disable rcbrt, scalbln and scalbn double precision device test
Change-Id: I46bd895701c46d3592b553090cafba99e41a2e2d
2017-05-31 10:15:41 +05:30
Aditya Atluri 6dff2714e9 added __host__ for complex functions and corrected memset and memcpy test
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
2017-04-06 09:29:44 -05:00
Sun, Peng c865151e50 fix hipVectorTypesDevice direct test with GGL enabled
Change-Id: I7a63b87348f08f094cd709e87397d9e0fc24e4c2
2017-03-30 17:14:55 -05:00
Rahul Garg ec0d334354 Update hipTestDeviceSymbol sample
Change-Id: If5ba99c60cd30c4491ca3a4856764224163d3ddf
2017-03-24 10:39:11 +05:30
Ben Sander e85c167171 Add USE_PROMOTE_FREE_HCC for smooth transition to new HCC caps.
ADDRESS_SPACE_1 defines
2017-03-17 11:04:39 -05:00