Yaxun Sam Liu
da2fd69c44
Fix thread index calculation in __hip_malloc
...
[ROCm/hip commit: 03320890de ]
2018-07-27 16:24:51 -04:00
Yaxun Sam Liu
289bfe5f95
Support malloc/free for hip-clang
...
[ROCm/hip commit: 02d0e93601 ]
2018-07-27 16:24:51 -04:00
Maneesh Gupta
f87491f61c
Merge pull request #584 from ROCm-Developer-Tools/fix-vector-explicit-cast
...
Fix explicit cast required
[ROCm/hip commit: 06db076b10 ]
2018-07-23 10:13:05 +05:30
Aaron Enye Shi
89a00bc280
Fix explicit cast required
...
This is required for PyTorch, which runs into an issue with narrowing types; an explicit cast is required.
[ROCm/hip commit: 4ddb81424f ]
2018-07-20 16:34:15 -04:00
Aaron Enye Shi
3408e6440d
[HIPClang] Switch ordering on math functions
...
Header math_functions.h should be included after __clang_cuda_math_forward_declares.h to avoid the warning "attribute declaration must precede definition".
[ROCm/hip commit: ffefae77d7 ]
2018-07-20 13:25:04 -04:00
Maneesh Gupta
9648844caf
Merge pull request #543 from ROCm-Developer-Tools/add-std-complex
...
Support std::complex for hip-clang
[ROCm/hip commit: 12202dae90 ]
2018-07-19 07:14:45 +05:30
Yaxun (Sam) Liu
30e14f699b
Support std::complex for hip-clang
...
[ROCm/hip commit: 80438035c2 ]
2018-07-18 00:08:04 -04:00
Alex Voicu
4f0111994e
Update hip_vector_types.h
...
[ROCm/hip commit: 487bb6f2ef ]
2018-07-17 17:28:48 +01:00
Alex Voicu
45018c46cb
Not yet.
...
[ROCm/hip commit: 73d0091328 ]
2018-07-17 17:21:10 +01:00
Alex Voicu
6005a9d8d6
Constrain variadic constructor to ranks > 1 and add missing __device__s
...
[ROCm/hip commit: 3e2755adab ]
2018-07-17 17:17:48 +01:00
Maneesh Gupta
c86d32350a
Merge branch 'master' into move-memcpy
...
[ROCm/hip commit: b5cfa773ef ]
2018-07-17 10:51:42 +05:30
Maneesh Gupta
d24617e118
Merge branch 'master' into feature_native_vector_types
...
Change-Id: I943b94ee230871a960d8f9c321ad5e096183b415
[ROCm/hip commit: fb5199dee6 ]
2018-07-17 09:51:22 +05:30
Maneesh Gupta
d655537bc6
Merge pull request #515 from ROCm-Developer-Tools/hipclang-add-amdgcn-funcs
...
Add hipclang amdgcn functions
[ROCm/hip commit: fbbe2599dd ]
2018-07-17 09:25:09 +05:30
Alex Voicu
b47b4c771d
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types
...
[ROCm/hip commit: fe289dbebe ]
2018-07-15 11:44:48 +01:00
Evgeny Mankov
b063b3eb26
[HIPIFY] Support of cudaComputeMode / CUcomputemode
...
+ update docs
+ fix typo in hip_runtime_api.h
[ROCm/hip commit: 55e21055e3 ]
2018-07-14 16:08:19 +03:00
Maneesh Gupta
0d92d4d9e3
Merge pull request #563 from gargrahul/add_gettexrefoffset_nvcc
...
Add hipGetTextureAlignmentOffset and hipGetChanDesc on NVCC path
[ROCm/hip commit: 4aef1351fc ]
2018-07-11 15:01:20 +05:30
Maneesh Gupta
4a8751b6e7
Merge branch 'master' into add_bindtextoarrayref_overload_api
...
[ROCm/hip commit: 65e82f785d ]
2018-07-11 12:55:49 +05:30
Maneesh Gupta
12569a1aa9
Merge pull request #549 from gargrahul/correct_bindtex_syntax_nvcc
...
Corrected enum type for TextureReadMode
[ROCm/hip commit: 583b6b270a ]
2018-07-11 12:52:50 +05:30
Rahul Garg
00f7f9488b
Added hipGetChanDesc for NVCC path
...
[ROCm/hip commit: 6bf979d9e6 ]
2018-07-11 12:37:07 +05:30
Rahul Garg
54881b7a12
Add hipGetTextureAlignmentOffset on NVCC path
...
[ROCm/hip commit: c93f216fe8 ]
2018-07-11 12:17:33 +05:30
Yaxun (Sam) Liu
9de153ce91
Include stddef.h in device_functions.h and remove use of uint8_t
...
[ROCm/hip commit: e22c35e124 ]
2018-07-10 18:12:41 -04:00
Yaxun (Sam) Liu
244c9875e2
Move __hip_hc_memcpy and __hip_hc_memset from device_utils.cpp to device_functions.h as inline functions
...
[ROCm/hip commit: 8136a348ab ]
2018-07-10 18:12:41 -04:00
Aaron Enye Shi
2b89bb572a
Fix min/max, icmp asm and add comment for conversion functions
...
[ROCm/hip commit: 4a7ad93655 ]
2018-07-10 18:56:48 +00:00
Aaron Enye Shi
a13caad541
Add func attributes to match ocml and ockl
...
[ROCm/hip commit: 1f871736e5 ]
2018-07-10 18:27:32 +00:00
Aaron Enye Shi
e0f0c2c312
Remove activelanemask asm using ockl and llvm intrinsics
...
Replace the implementation of the __any and __all functions with OCKL functions, and change the __ballot implementation to use the LLVM intrinsic llvm.amdgcn.icmp.i32 instead of calls to __activelanemask_v4_b64_b1, which is not convergent.
[ROCm/hip commit: d1323f4f9b ]
2018-07-10 17:56:57 +00:00
Yaxun (Sam) Liu
c2350905f1
Add workaround __local_to_generic
...
[ROCm/hip commit: 5ec7973a6f ]
2018-07-09 17:56:33 +00:00
Yaxun (Sam) Liu
e97e1fd176
Add declaration of __get_dynamicgroupbaseptr for host compilation
...
[ROCm/hip commit: e0390ddc9f ]
2018-07-09 17:56:25 +00:00
Aaron Enye Shi
0dde5be16b
Enable surface functions on HIP clang path
...
Fix surface test on HIP clang path.
[ROCm/hip commit: 22b60afa0d ]
2018-07-05 20:49:47 +00:00
Aaron Enye Shi
17453a6278
Implement hip_ldg Functions into HIP header
...
Move all the function definitions from hip_ldg.cpp into the hip_ldg.h header and enable them for the HIP clang path.
[ROCm/hip commit: 76f86ef097 ]
2018-07-05 20:38:46 +00:00
Aaron Enye Shi
8e1a7fdd09
Implement min/max functions in HIP header
...
Stop using hc::precise_math min and max. Instead we can use ocml directly for device and std:: for host.
[ROCm/hip commit: 47d78e372e ]
2018-07-05 20:15:41 +00:00
Rahul Garg
9735598cd1
Added another variant of bindtextoarray for direct porting
...
[ROCm/hip commit: bd4816fc85 ]
2018-07-05 23:41:31 +05:30
Rahul Garg
399ec56015
Corrected enum type
...
[ROCm/hip commit: 4acf489bd5 ]
2018-07-05 23:11:39 +05:30
Rahul Garg
0d40be6866
Added tex2dlayered mapping for HIP/NVCC
...
[ROCm/hip commit: 19bae58e48 ]
2018-07-05 23:00:41 +05:30
Aaron Enye Shi
0ea959ba9d
Implement Memory Fence Functions in header
...
Enabled __llvm_fence_* functions for seq_cst.
[ROCm/hip commit: 930a16bccd ]
2018-07-04 23:35:24 +00:00
Aaron Enye Shi
6bcb3cca86
Workaround cast warning of smaller integer type for __to_local
...
For now, guard the __to_local function for device compilation only, since a local pointer should be the same size as unsigned int when compiling for the GPU. Also change to void* instead of char*.
[ROCm/hip commit: 07de5cb334 ]
2018-07-04 20:52:55 +00:00
Aaron Enye Shi
e8fac0449e
Merge branch 'master' into hipclang-add-amdgcn-funcs
...
[ROCm/hip commit: 2975f2a10a ]
2018-07-04 17:36:08 +00:00
Maneesh Gupta
a4eb081242
Merge pull request #530 from ROCm-Developer-Tools/hipclang-compute-mode
...
Add HIP Compute Mode
[ROCm/hip commit: 4b4d78fdb1 ]
2018-07-04 10:55:48 +05:30
Alex Voicu
d66f337df9
Add scalar operands and fix C implementation.
...
[ROCm/hip commit: b3e6fcdf18 ]
2018-06-29 05:23:49 +01:00
Alex Voicu
1320c586e0
Re-sync with upstream.
...
[ROCm/hip commit: fdb3e3e4a6 ]
2018-06-29 00:16:34 +01:00
Maneesh Gupta
a31dc511f9
Merge pull request #503 from ROCm-Developer-Tools/fix-complex
...
Fix hip_complex.h
[ROCm/hip commit: 2ce48fbc05 ]
2018-06-28 12:15:22 +05:30
Maneesh Gupta
e3996a5e16
Merge pull request #495 from ROCm-Developer-Tools/hip-clang-upstream
...
add more device functions for hip-clang
[ROCm/hip commit: 76b2d0f82d ]
2018-06-28 11:42:01 +05:30
Maneesh Gupta
fe5afd44f4
Fix typo
...
[ROCm/hip commit: dde875f23b ]
2018-06-28 11:19:22 +05:30
Maneesh Gupta
bf2a6352b7
Merge pull request #536 from ROCm-Developer-Tools/hotfix_remove_unimplemented_math
...
Removes use of unimplemented OCML functionality.
[ROCm/hip commit: 525716bf91 ]
2018-06-28 11:15:57 +05:30
Aaron Enye Shi
6e2ee0f857
Implement __shfl_* funcs into HIP headers
...
[ROCm/hip commit: 9ac31e0bb6 ]
2018-06-26 18:32:11 +00:00
Alex Voicu
158e2ed16d
Existence is a complex affair.
...
[ROCm/hip commit: 3c83e047df ]
2018-06-26 00:41:35 +01:00
Alex Voicu
e2fef7283b
Be nice to GCC, it is old and worthy of respect.
...
[ROCm/hip commit: 99c61ce7e4 ]
2018-06-25 22:59:07 +01:00
Yaxun (Sam) Liu
c205c090f4
Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes
...
[ROCm/hip commit: 02b160491d ]
2018-06-25 15:34:34 -04:00
Alex Voicu
9fed6fed2b
Removes use of unimplemented OCML functionality.
...
[ROCm/hip commit: 6c7a64efa2 ]
2018-06-25 19:16:27 +01:00
Alex Voicu
51c5249b40
Let's try this again...
...
[ROCm/hip commit: 9d91b802a5 ]
2018-06-25 17:49:50 +01:00
Aaron Enye Shi
260e50769b
Add HIP Compute Mode
...
[ROCm/hip commit: ff924ecb3d ]
2018-06-22 14:37:19 +00:00