Maneesh Gupta
1fc3ef3cd4
Merge branch 'master' into move-memcpy
2018-07-17 10:51:42 +05:30
Maneesh Gupta
9e142ea66a
Merge branch 'master' into feature_native_vector_types
...
Change-Id: I943b94ee230871a960d8f9c321ad5e096183b415
2018-07-17 09:51:22 +05:30
Maneesh Gupta
7d8c1ae761
Merge pull request #515 from ROCm-Developer-Tools/hipclang-add-amdgcn-funcs
...
Add hipclang amdgcn functions
2018-07-17 09:25:09 +05:30
Alex Voicu
a2b6508912
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types
2018-07-15 11:44:48 +01:00
Evgeny Mankov
4e0d07f07a
[HIPIFY] Support of cudaComputeMode / CUcomputemode
...
+ update docs
+ fix typo in hip_runtime_api.h
2018-07-14 16:08:19 +03:00
Maneesh Gupta
56984d04b0
Merge pull request #563 from gargrahul/add_gettexrefoffset_nvcc
...
Add hipGetTextureAlignmentOffset and hipGetChanDesc on NVCC path
2018-07-11 15:01:20 +05:30
Maneesh Gupta
afe1396b35
Merge branch 'master' into add_bindtextoarrayref_overload_api
2018-07-11 12:55:49 +05:30
Maneesh Gupta
8820eee3a2
Merge pull request #549 from gargrahul/correct_bindtex_syntax_nvcc
...
Corrected enum type for TextureReadMode
2018-07-11 12:52:50 +05:30
Rahul Garg
bab48b86a6
Added hipGetChanDesc for NVCC path
2018-07-11 12:37:07 +05:30
Rahul Garg
5b53e278a5
Add hipGetTextureAlignmentOffset on NVCC path
2018-07-11 12:17:33 +05:30
Yaxun (Sam) Liu
2531815dfe
Include stddef.h in device_functions.h and remove use of uint8_t
2018-07-10 18:12:41 -04:00
Yaxun (Sam) Liu
903dfa94eb
Move __hip_hc_memcpy and __hip_hc_memset from device_utils.cpp to device_functions.h as inline functions
2018-07-10 18:12:41 -04:00
Aaron Enye Shi
9f3404cfea
Fix min/max, icmp asm and add comment for conversion functions
2018-07-10 18:56:48 +00:00
Aaron Enye Shi
bc8bc8cac4
Add func attributes to match ocml and ockl
2018-07-10 18:27:32 +00:00
Aaron Enye Shi
f8746ecc64
Remove activelanemask asm using ockl and llvm instrinsics
...
Replace implementation of __any and __all functions using OCKL functions and replaced __ballot implementation to use llvm intrinsic llvm.amdgcn.icmp.i32 instead of calls to __activelanemask_v4_b64_b1 which is not convergent.
2018-07-10 17:56:57 +00:00
Yaxun (Sam) Liu
9aaa792286
Add workaround __local_to_generic
2018-07-09 17:56:33 +00:00
Yaxun (Sam) Liu
1f34993ac1
Add declare of __get_dynamicgroupbaseptr for host compilation
2018-07-09 17:56:25 +00:00
Aaron Enye Shi
5862ae4b6a
Enable surface functions on HIP clang path
...
Fix surface test on HIP clang path.
2018-07-05 20:49:47 +00:00
Aaron Enye Shi
d39508b4fd
Implement hip_ldg Functions into HIP header
...
Move all the function definitions for hip_ldg.cpp into hip_ldg.h header and enable for HIP clang path.
2018-07-05 20:38:46 +00:00
Aaron Enye Shi
b362c53475
Implement min/max functions in HIP header
...
Remove using hc::precise_math min and max. Instead we can use ocml directly for device and std:: for host.
2018-07-05 20:15:41 +00:00
Rahul Garg
017f668a72
Added another variant of bindtextoarray for direct porting
2018-07-05 23:41:31 +05:30
Rahul Garg
b15b08d0db
Corrected enum type
2018-07-05 23:11:39 +05:30
Rahul Garg
f080abe5a5
Added tex2dlayered mapping for HIP/NVCC
2018-07-05 23:00:41 +05:30
Aaron Enye Shi
96ab7c7b25
Implement Memory Fence Functions in header
...
Enabled __llvm_fence_* functions for seq_cst.
2018-07-04 23:35:24 +00:00
Aaron Enye Shi
ce9ca744dc
Workaround cast warning of smaller integer type for __to_local
...
For now, guard the __to_local function for device compile only since a local pointer should be same size as unsigned int on GPU compile. Also change to void* instead of char*.
2018-07-04 20:52:55 +00:00
Aaron Enye Shi
f8109c5b06
Merge branch 'master' into hipclang-add-amdgcn-funcs
2018-07-04 17:36:08 +00:00
Maneesh Gupta
e54ffe52a0
Merge pull request #530 from ROCm-Developer-Tools/hipclang-compute-mode
...
Add HIP Compute Mode
2018-07-04 10:55:48 +05:30
Alex Voicu
f931980a6a
Add scalar operands and fix C implementation.
2018-06-29 05:23:49 +01:00
Alex Voicu
c061d885f6
Re-sync with upstream.
2018-06-29 00:16:34 +01:00
Maneesh Gupta
1340adce1a
Merge pull request #503 from ROCm-Developer-Tools/fix-complex
...
Fix hip_complex.h
2018-06-28 12:15:22 +05:30
Maneesh Gupta
b6f5c61a2c
Merge pull request #495 from ROCm-Developer-Tools/hip-clang-upstream
...
add more device functions for hip-clang
2018-06-28 11:42:01 +05:30
Maneesh Gupta
f426491f17
Fix typo
2018-06-28 11:19:22 +05:30
Maneesh Gupta
06c1c93945
Merge pull request #536 from ROCm-Developer-Tools/hotfix_remove_unimplemented_math
...
Removes use of unimplemented OCML functionality.
2018-06-28 11:15:57 +05:30
Aaron Enye Shi
e44eaa1a1e
Implement __shfl_* funcs into HIP headers
2018-06-26 18:32:11 +00:00
Alex Voicu
2acf7f033c
Existence is a complex affair.
2018-06-26 00:41:35 +01:00
Alex Voicu
b9ead38478
Be nice to GCC, it is old and worthy of respect.
2018-06-25 22:59:07 +01:00
Yaxun (Sam) Liu
c74682ef1b
Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes
2018-06-25 15:34:34 -04:00
Alex Voicu
9452298fbe
Removes use of unimplemented OCML functionality.
2018-06-25 19:16:27 +01:00
Alex Voicu
3ff1d4081e
Let's try this again...
2018-06-25 17:49:50 +01:00
Aaron Enye Shi
c73cd64f5b
Add HIP Compute Mode
2018-06-22 14:37:19 +00:00
Alex Voicu
0e0d43bfe1
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types
2018-06-22 12:19:32 +01:00
Aaron Enye Shi
5d4986d470
Replace __hip_hc_ir_ inline asm with __ockl_* functions
2018-06-20 20:40:14 +00:00
Aaron Enye Shi
9a295a732d
Implement __ballot, __any, __all into HIP headers
2018-06-20 17:39:39 +00:00
Aaron Enye Shi
175c87f2bf
Implement hip_hc.ll into HIP headers
...
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.
2018-06-20 17:39:31 +00:00
Aaron Enye Shi
ad5f1dcbf2
Implement device_functions.cpp into HIP headers
...
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.
2018-06-20 17:39:23 +00:00
Aaron Enye Shi
e8a18a5473
Move hipclang funcs into corresponding headers
2018-06-20 17:39:15 +00:00
Aaron Enye Shi
366207f8af
Split __llvm and device lib funcs into new headers
2018-06-20 17:39:06 +00:00
Aaron Enye Shi
0694d0300a
Add get_dynamicgroupbaseptr def and remove hc_
2018-06-20 17:38:59 +00:00
Aaron Enye Shi
7cade79d98
Add prefix __ to memory scope and order
2018-06-20 17:38:52 +00:00
Aaron Enye Shi
4298ed308e
Add __llvm_fence funcs and __ prefixes
2018-06-20 17:38:45 +00:00