Commit Graph

749 commits

Author SHA1 Message Date
Yaxun (Sam) Liu 5ec7973a6f Add workaround __local_to_generic 2018-07-09 17:56:33 +00:00
Yaxun (Sam) Liu e0390ddc9f Add declare of __get_dynamicgroupbaseptr for host compilation 2018-07-09 17:56:25 +00:00
Aaron Enye Shi 22b60afa0d Enable surface functions on HIP clang path
Fix surface test on HIP clang path.
2018-07-05 20:49:47 +00:00
Aaron Enye Shi 76f86ef097 Implement hip_ldg Functions into HIP header
Move all the function definitions for hip_ldg.cpp into hip_ldg.h header and enable for HIP clang path.
2018-07-05 20:38:46 +00:00
Aaron Enye Shi 47d78e372e Implement min/max functions in HIP header
Remove using hc::precise_math min and max. Instead we can use ocml directly for device and std:: for host.
2018-07-05 20:15:41 +00:00
Aaron Enye Shi 930a16bccd Implement Memory Fence Functions in header
Enabled __llvm_fence_* functions for seq_cst.
2018-07-04 23:35:24 +00:00
Aaron Enye Shi 07de5cb334 Workaround cast warning of smaller integer type for __to_local
For now, guard the __to_local function for device compile only since a local pointer should be same size as unsigned int on GPU compile. Also change to void* instead of char*.
2018-07-04 20:52:55 +00:00
Aaron Enye Shi 2975f2a10a Merge branch 'master' into hipclang-add-amdgcn-funcs 2018-07-04 17:36:08 +00:00
Maneesh Gupta 4b4d78fdb1 Merge pull request #530 from ROCm-Developer-Tools/hipclang-compute-mode
Add HIP Compute Mode
2018-07-04 10:55:48 +05:30
Maneesh Gupta 2ce48fbc05 Merge pull request #503 from ROCm-Developer-Tools/fix-complex
Fix hip_complex.h
2018-06-28 12:15:22 +05:30
Maneesh Gupta 76b2d0f82d Merge pull request #495 from ROCm-Developer-Tools/hip-clang-upstream
add more device functions for hip-clang
2018-06-28 11:42:01 +05:30
Maneesh Gupta dde875f23b Fix typo 2018-06-28 11:19:22 +05:30
Maneesh Gupta 525716bf91 Merge pull request #536 from ROCm-Developer-Tools/hotfix_remove_unimplemented_math
Removes use of unimplemented OCML functionality.
2018-06-28 11:15:57 +05:30
Aaron Enye Shi 9ac31e0bb6 Implement __shfl_* funcs into HIP headers 2018-06-26 18:32:11 +00:00
Yaxun (Sam) Liu 02b160491d Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes 2018-06-25 15:34:34 -04:00
Alex Voicu 6c7a64efa2 Removes use of unimplemented OCML functionality. 2018-06-25 19:16:27 +01:00
Aaron Enye Shi ff924ecb3d Add HIP Compute Mode 2018-06-22 14:37:19 +00:00
Aaron Enye Shi 8ac864c2e3 Replace __hip_hc_ir_ inline asm with __ockl_* functions 2018-06-20 20:40:14 +00:00
Aaron Enye Shi 6dc16bbf04 Implement __ballot, __any, __all into HIP headers 2018-06-20 17:39:39 +00:00
Aaron Enye Shi 2142eb4d12 Implement hip_hc.ll into HIP headers
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.
2018-06-20 17:39:31 +00:00
Aaron Enye Shi e02fc7e680 Implement device_functions.cpp into HIP headers
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.
2018-06-20 17:39:23 +00:00
Aaron Enye Shi fe4e6c53fc Move hipclang funcs into corresponding headers 2018-06-20 17:39:15 +00:00
Aaron Enye Shi cfe37484c9 Split __llvm and device lib funcs into new headers 2018-06-20 17:39:06 +00:00
Aaron Enye Shi 27f600b425 Add get_dynamicgroupbaseptr def and remove hc_ 2018-06-20 17:38:59 +00:00
Aaron Enye Shi 871cfc6fd2 Add prefix __ to memory scope and order 2018-06-20 17:38:52 +00:00
Aaron Enye Shi cfa8fc1ca5 Add __llvm_fence funcs and __ prefixes 2018-06-20 17:38:45 +00:00
Aaron Enye Shi c453b42bff Add hipclang amdgcn functions
These are moving from hipclang in device library to hip headers. These are required for the functionality of HIPclang project.
2018-06-20 17:38:37 +00:00
Maneesh Gupta fc80fb4ab3 Merge pull request #507 from ROCm-Developer-Tools/fix-forward
Add __device__ to device functions in hip_fp16_math_fwd.h
2018-06-20 14:21:46 +05:30
Maneesh Gupta cffc5ad273 Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
Fix channel_descriptor.h about vector 3 for gcc
2018-06-20 14:20:29 +05:30
Maneesh Gupta 946c8da88a Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang  builtins.
2018-06-20 14:16:43 +05:30
Maneesh Gupta 836627279f Merge pull request #457 from whchung/hip-reinit
HIP program state re-initialization logic
2018-06-20 09:37:27 +05:30
Yaxun (Sam) Liu 84da72dae8 Add conj, operator-,==,!= for hipFloatComplex/hipDoubleComplex 2018-06-19 10:49:59 -04:00
Yaxun (Sam) Liu 9181fbb0b7 Add abs/real/imag functions for hipFloatComplex/hipDoubleComplex 2018-06-18 11:57:57 -04:00
Yaxun (Sam) Liu 7a5605d006 Add missing __device__ __host__ to complex constructor
Also add missing typedef value_type
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 2523c39a37 Includes <cmath> or <math.h> by __cplusplus in hip_complex.h 2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 325cf3ccf0 Include cmath instead of math.h in hip_complex.h 2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 5eeb57b0a6 Add missing macro MAKE_COMPONENT_CONSTRUCTOR_TWO_COMPONENT to hip_complex.h 2018-06-17 20:20:32 -04:00
Nico Trost 0b1e698e74 added missing hipCmul() to nvcc_detail/hip_complex.h 2018-06-14 21:49:54 +02:00
Wen-Heng (Jack) Chung 379b7a2241 HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
2018-06-14 15:46:49 +00:00
Maneesh Gupta e0400674fd Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang
2018-06-11 11:59:58 +05:30
Siu Chi Chan 7a4aace13d Revert "Switch over to using native vector types, for better codegen. Remove noise." 2018-06-08 16:48:22 -04:00
Yaxun (Sam) Liu 17e3093f0e Add __device__ to device functions in hip_fp16_math_fwd.h 2018-06-08 11:23:52 -04:00
Yaxun (Sam) Liu d726ba1d9a Add more function declarations for hip-clang 2018-06-08 11:20:42 -04:00
Yaxun (Sam) Liu 9141037105 Fix channel_descriptor.h about vector 3 for gcc 2018-06-08 11:18:41 -04:00
Yaxun (Sam) Liu cc14ed0981 Add support of extern __shared__ for hip-clang 2018-06-08 11:17:25 -04:00
Maneesh Gupta 203dd6cb70 Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-06-06 16:07:22 +05:30
Maneesh Gupta 02ea7f13b3 Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC
2018-06-06 15:12:11 +05:30
Maneesh Gupta de5043c47c Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC
2018-06-06 15:11:40 +05:30
Rahul Garg 17bb8dbe86 Add getTextureResourceDescriptor on NVCC 2018-06-05 18:46:25 +05:30
Alex Voicu 23f5feaf13 Fix hideous typos. 2018-06-03 03:03:55 +01:00