Alex Voicu
9d91b802a5
Let's try this again...
2018-06-25 17:49:50 +01:00
Alex Voicu
859133a045
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types
2018-06-22 12:19:32 +01:00
Maneesh Gupta
946c8da88a
Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
...
Switch the atomic implementation to use Clang builtins.
2018-06-20 14:16:43 +05:30
Alex Voicu
28a1aef8a1
Revert "Revert "Switch over to using native vector types, for better codegen. Remove noise.""
...
This reverts commit 7a4aace13d .
2018-06-16 22:59:36 +01:00
Siu Chi Chan
7a4aace13d
Revert "Switch over to using native vector types, for better codegen. Remove noise."
2018-06-08 16:48:22 -04:00
Alex Voicu
089ab3b947
Switch the atomic implementation to use Clang builtins.
2018-06-02 12:27:17 +01:00
Alex Voicu
417869821d
Re-sync with upstream.
2018-06-01 15:49:05 +01:00
Maneesh Gupta
68c36f9125
Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
...
Switch over to using native vector types, for better codegen. Remove noise.
2018-06-01 13:58:07 +05:30
Alex Voicu
ab4b2a650b
Re-sync with upstream. Add integer abs.
2018-05-31 16:38:00 +01:00
Alex Voicu
0108819e2a
Add missing interop with volatile. Fix unit tests.
2018-05-31 15:27:12 +01:00
Alex Voicu
59db16fd36
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-05-31 03:17:26 +01:00
Alex Voicu
0eec115bcd
Move converting constructor from _Float16 under macro guard. Refactor.
2018-05-25 19:46:41 +01:00
Alex Voicu
9948b5961e
Update hipTestHalf to actually test behaviour. Add missing hipHostfree.
2018-05-24 13:55:30 +01:00
Alex Voicu
6f819f226b
Missing commit.
2018-05-23 17:57:47 +01:00
Maneesh Gupta
305592d622
Disable incomplete unit tests that don't work on nvcc path
...
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
2018-05-21 11:35:03 +05:30
Maneesh Gupta
67d45164fa
Merge pull request #444 from aaronenyeshi/vg20-initial
...
initial gfx906 support
2018-05-18 13:18:07 +05:30
Alex Voicu
b8c98d2357
Switch over to using native vector types, for better codegen. Remove noise.
2018-05-12 00:14:17 +01:00
Aaron Enye Shi
848a24b524
Fix hipMathFunction for gfx906
2018-05-11 10:53:07 -04:00
Siu Chi Chan
b898049412
initial gfx906 support
2018-05-10 19:28:00 +00:00
Siu Chi Chan
b285145966
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-08 13:43:53 -04:00
Deven Desai
76a7d7e374
Fixing a copy/paste error in my previous checkin
2018-04-27 12:59:51 +00:00
Deven Desai
4be4cf644f
Checkin to add unit tests for fixes in my previous commit
...
This change adds unit tests for
1. __hisinf
2. __hisnan
2. abs(long long)
3. lgamma(double)
2018-04-25 17:53:21 +00:00
Maneesh Gupta
1ba06f63c4
Apply .clangformat to all repo source files
...
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Phaneendr-kumar Lanka
bd60ef9031
[nvccWarnings] Fix -Wno-deprecated-declarations in hip_anyall and hip_ballot
2017-12-20 12:05:21 +05:30
Phaneendr-kumar Lanka
eea7d495c7
[nvccWarnings] Fix warnings seen with dtests on nvcc path
2017-12-14 14:10:37 +05:30
Ben Sander
62cbe22d50
Temporarily disable a couple tests pending some HCC work
2017-12-01 21:46:28 +00:00
Alex Voicu
32e11e7dc6
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit d2fd1f5
2017-11-29 21:49:10 +00:00
Alex Voicu
fbaf729f88
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Alex Voicu
4131b47134
Modify the set component of the memcpy test (unclear why there is a memset component to begin with).
2017-11-21 17:52:01 +00:00
Alex Voicu
c5f2b22d0d
Re-sync with upstream.
2017-11-20 15:34:50 +00:00
Maneesh Gupta
db378fbc9e
Merge pull request #266 from gargrahul/fix_half2_gfx900
...
Fixed half2 issue on gfx900
2017-11-20 07:28:41 +05:30
Alex Voicu
cffd0e14eb
This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility.
2017-11-19 01:54:12 +00:00
Rahul Garg
9af0f9cbc1
Fixed test case for GFX900
2017-11-16 09:34:52 +05:30
Rahul Garg
fef496d4f1
Fixed half2 issue on gfx900
2017-11-15 18:52:59 +05:30
Alex Voicu
a619ac6f92
This corrects some interesting choices that were present in the HIP
...
unit tests such as e.g. de-allocating memory allocated with new[] using
free. All of these were identified via cppcheck.
2017-11-01 22:51:22 +00:00
Alex Voicu
c2482d1255
This switches HIP from its currently convoluted macro + pfe based dispatch mechanism to a more natural one partially based on the existing module API. The basic idea is that HCC will always correctly emit __global__ functions: as empty-bodied stubs, on host, and as kernels, on device. It then becomes trivial to obtain the mangled name on host, at dispatch, from the function's address, and then to use the mangled name to retrieve the kernel. This should address all problems stemming from serialisation, dubious mismatches due to the manufactured functor, macro-isms et al. It also immediately enables support for generalised globals as a consequence of that being available in the module API. Finally, it will make debug much easier, since the actual names of the __global__ functions will automatically be used in traces etc. One detail is that due to how dispatch works now (hipLaunchKernel and hipLaunchKernelGGL are themselves variadic function templates which deduce the function type of the callee), in certain cases it may be necesssary to insert explicit casts to ensure that the variadic argument list selects a viable overload - this can be observed in some unit tests. Eventually we may be able to remove this limitation, but for now it does not appear terribly onerous. The code is not extremely HIPpie, nor is it fully optimised, but rather is intended as a starting point for the HIP team to make its own.
2017-11-01 15:09:59 +00:00
Maneesh Gupta
4bcb3372b5
[dtests] Fix hip_threadfence_system test's BUILD command
...
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan
9a04c5cdd4
add C++11 compilation flags and minor bug fixes
2017-08-10 17:58:36 +00:00
Maneesh Gupta
69e431f035
[dtests] Disable hip_threadfence_system.tst on nvcc
...
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.
Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Siu Chi Chan
96e79f97e2
implement __threadfence_system
2017-08-02 08:50:18 +00:00
Maneesh Gupta
7a446dcbc5
Renable frexp(f) device math function
...
Change-Id: I53c022b8ddf38cd17ddb42eba457b9020db66395
2017-07-20 14:41:30 +05:30
Rahul Garg
039f62213b
Enable memcpytosymbol family functions test
2017-07-18 12:37:37 +05:30
Wen-Heng (Jack) Chung
30ee797dd4
Remove explicit address spaces declarations
2017-07-07 12:46:29 -05:00
Maneesh Gupta
e6cafbf342
hipDeviceMemcpy test: make it functional on nvcc path
...
Change-Id: Id10c79b48747ed701adbd0a233c53cd60cfa743b
2017-06-07 15:24:44 +05:30
Maneesh Gupta
5cdd1b2bf5
Disable rcbrtf, scalblnf, scalbnf in single precision device test
...
Change-Id: I8a250a64a0cb05132d022a11d9766ced9cdf11a7
2017-05-31 10:16:19 +05:30
Maneesh Gupta
4919863d3e
Disable rcbrt, scalbln and scalbn double precision device test
...
Change-Id: I46bd895701c46d3592b553090cafba99e41a2e2d
2017-05-31 10:15:41 +05:30
Aditya Atluri
6dff2714e9
added __host__ for complex functions and corrected memset and memcpy test
...
Change-Id: I9ffefb7a0025aa111a54d20d2766982df15532e7
2017-04-06 09:29:44 -05:00
Sun, Peng
c865151e50
fix hipVectorTypesDevice direct test with GGL enabled
...
Change-Id: I7a63b87348f08f094cd709e87397d9e0fc24e4c2
2017-03-30 17:14:55 -05:00
Rahul Garg
ec0d334354
Update hipTestDeviceSymbol sample
...
Change-Id: If5ba99c60cd30c4491ca3a4856764224163d3ddf
2017-03-24 10:39:11 +05:30
Ben Sander
e85c167171
Add USE_PROMOTE_FREE_HCC for smooth transition to new HCC caps.
...
ADDRESS_SPACE_1 defines
2017-03-17 11:04:39 -05:00