Aaron Enye Shi
8e1a7fdd09
Implement min/max functions in HIP header
...
Remove using hc::precise_math min and max. Instead we can use ocml directly for device and std:: for host.
[ROCm/hip commit: 47d78e372e ]
2018-07-05 20:15:41 +00:00
Aaron Enye Shi
0ea959ba9d
Implement Memory Fence Functions in header
...
Enabled __llvm_fence_* functions for seq_cst.
[ROCm/hip commit: 930a16bccd ]
2018-07-04 23:35:24 +00:00
Aaron Enye Shi
6bcb3cca86
Workaround cast warning of smaller integer type for __to_local
...
For now, guard the __to_local function for device compile only since a local pointer should be same size as unsigned int on GPU compile. Also change to void* instead of char*.
[ROCm/hip commit: 07de5cb334 ]
2018-07-04 20:52:55 +00:00
Aaron Enye Shi
e8fac0449e
Merge branch 'master' into hipclang-add-amdgcn-funcs
...
[ROCm/hip commit: 2975f2a10a ]
2018-07-04 17:36:08 +00:00
Maneesh Gupta
f9ea86e749
Merge pull request #539 from ROCm-Developer-Tools/fix-tf-link
...
Add workaround to hipcc for build failure in tensorflow due to missin…
[ROCm/hip commit: 66f12daaa4 ]
2018-07-04 11:08:52 +05:30
Maneesh Gupta
a4eb081242
Merge pull request #530 from ROCm-Developer-Tools/hipclang-compute-mode
...
Add HIP Compute Mode
[ROCm/hip commit: 4b4d78fdb1 ]
2018-07-04 10:55:48 +05:30
Maneesh Gupta
86abb2f7b8
Merge pull request #517 from ROCm-Developer-Tools/fix-hipcc-link
...
Let hipcc handle static library for hip-clang only if it contains bun…
[ROCm/hip commit: 7927f66326 ]
2018-07-04 10:51:44 +05:30
Maneesh Gupta
19cb14d54c
Update hip_hcc_internal.h
...
Adding missing include for hip_hcc_internal in order to build with HCC
[ROCm/hip commit: 0c2f985553 ]
2018-07-04 09:33:51 +05:30
Maneesh Gupta
b7367558ef
Update hip-targets.cmake for hip::host & hip::device
...
[ROCm/hip commit: 4b660c8382 ]
2018-07-02 10:37:20 +05:30
Maneesh Gupta
54d7f22c92
Merge pull request #528 from ROCm-Developer-Tools/fix-hipclang-hipcc
...
HIPCC - Use clang if clang++ is not compiled
[ROCm/hip commit: b326b8a0a6 ]
2018-06-28 12:36:49 +05:30
Maneesh Gupta
a31dc511f9
Merge pull request #503 from ROCm-Developer-Tools/fix-complex
...
Fix hip_complex.h
[ROCm/hip commit: 2ce48fbc05 ]
2018-06-28 12:15:22 +05:30
Maneesh Gupta
e3996a5e16
Merge pull request #495 from ROCm-Developer-Tools/hip-clang-upstream
...
add more device functions for hip-clang
[ROCm/hip commit: 76b2d0f82d ]
2018-06-28 11:42:01 +05:30
Maneesh Gupta
47804939a0
Merge pull request #522 from pradeepisro/cmake_verbose_fix
...
added changes to FindHIP.cmake which would allow us to build using ninja #467
[ROCm/hip commit: c004b15fc2 ]
2018-06-28 11:39:03 +05:30
Maneesh Gupta
fe5afd44f4
Fix typo
...
[ROCm/hip commit: dde875f23b ]
2018-06-28 11:19:22 +05:30
Maneesh Gupta
bf2a6352b7
Merge pull request #536 from ROCm-Developer-Tools/hotfix_remove_unimplemented_math
...
Removes use of unimplemented OCML functionality.
[ROCm/hip commit: 525716bf91 ]
2018-06-28 11:15:57 +05:30
Aaron Enye Shi
6e2ee0f857
Implement __shfl_* funcs into HIP headers
...
[ROCm/hip commit: 9ac31e0bb6 ]
2018-06-26 18:32:11 +00:00
Yaxun (Sam) Liu
af0e9b54de
Add workaround to hipcc for build failure in tensorflow due to missing symbol __cpu_model
...
https://github.com/tensorflow/tensorflow/issues/9593
[ROCm/hip commit: 021728cd86 ]
2018-06-25 18:14:51 -04:00
Yaxun (Sam) Liu
c205c090f4
Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes
...
[ROCm/hip commit: 02b160491d ]
2018-06-25 15:34:34 -04:00
Alex Voicu
9fed6fed2b
Removes use of unimplemented OCML functionality.
...
[ROCm/hip commit: 6c7a64efa2 ]
2018-06-25 19:16:27 +01:00
Aaron Enye Shi
260e50769b
Add HIP Compute Mode
...
[ROCm/hip commit: ff924ecb3d ]
2018-06-22 14:37:19 +00:00
Aaron Enye Shi
a7baff62e7
HIPCC - Use clang if clang++ is not compiled
...
[ROCm/hip commit: 73023e0a54 ]
2018-06-21 18:12:55 +00:00
Aaron Enye Shi
20674cfcb2
Replace __hip_hc_ir_ inline asm with __ockl_* functions
...
[ROCm/hip commit: 8ac864c2e3 ]
2018-06-20 20:40:14 +00:00
Aaron Enye Shi
e2f8af2f3d
Implement __ballot, __any, __all into HIP headers
...
[ROCm/hip commit: 6dc16bbf04 ]
2018-06-20 17:39:39 +00:00
Aaron Enye Shi
2a2f34b42d
Implement hip_hc.ll into HIP headers
...
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.
[ROCm/hip commit: 2142eb4d12 ]
2018-06-20 17:39:31 +00:00
Aaron Enye Shi
0fca41a7ce
Implement device_functions.cpp into HIP headers
...
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.
[ROCm/hip commit: e02fc7e680 ]
2018-06-20 17:39:23 +00:00
Aaron Enye Shi
14f721702a
Move hipclang funcs into corresponding headers
...
[ROCm/hip commit: fe4e6c53fc ]
2018-06-20 17:39:15 +00:00
Aaron Enye Shi
8d65776127
Split __llvm and device lib funcs into new headers
...
[ROCm/hip commit: cfe37484c9 ]
2018-06-20 17:39:06 +00:00
Aaron Enye Shi
cb46df1654
Add get_dynamicgroupbaseptr def and remove hc_
...
[ROCm/hip commit: 27f600b425 ]
2018-06-20 17:38:59 +00:00
Aaron Enye Shi
9b03a036c3
Add prefix __ to memory scope and order
...
[ROCm/hip commit: 871cfc6fd2 ]
2018-06-20 17:38:52 +00:00
Aaron Enye Shi
cdd7931ac1
Add __llvm_fence funcs and __ prefixes
...
[ROCm/hip commit: cfa8fc1ca5 ]
2018-06-20 17:38:45 +00:00
Aaron Enye Shi
b34868005a
Add hipclang amdgcn functions
...
These are moving from hipclang in device library to hip headers. These are required for the functionality of HIPclang project.
[ROCm/hip commit: c453b42bff ]
2018-06-20 17:38:37 +00:00
Maneesh Gupta
a4b9e5305f
Merge pull request #507 from ROCm-Developer-Tools/fix-forward
...
Add __device__ to device functions in hip_fp16_math_fwd.h
[ROCm/hip commit: fc80fb4ab3 ]
2018-06-20 14:21:46 +05:30
Maneesh Gupta
6932d45c14
Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
...
Fix channel_descriptor.h about vector 3 for gcc
[ROCm/hip commit: cffc5ad273 ]
2018-06-20 14:20:29 +05:30
Maneesh Gupta
27e2af1773
Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
...
Switch the atomic implementation to use Clang builtins.
[ROCm/hip commit: 946c8da88a ]
2018-06-20 14:16:43 +05:30
Maneesh Gupta
4cdc20a6ce
Merge pull request #457 from whchung/hip-reinit
...
HIP program state re-initialization logic
[ROCm/hip commit: 836627279f ]
2018-06-20 09:37:27 +05:30
Yaxun (Sam) Liu
8696038043
Let hipcc handle library with extension lo for hip-clang
...
[ROCm/hip commit: 46d3c1d51e ]
2018-06-19 10:51:08 -04:00
Yaxun (Sam) Liu
b8ae2784e9
Add conj, operator-,==,!= for hipFloatComplex/hipDoubleComplex
...
[ROCm/hip commit: 84da72dae8 ]
2018-06-19 10:49:59 -04:00
pradeepisro
4c9fad371f
added option HIP_BUILD_VERBOSE to enable verbosity in HIP build
...
[ROCm/hip commit: f6938961b9 ]
2018-06-19 13:50:24 +05:30
pradeepisro
1eb91f1bab
added changes to FindHIP.cmake which would allow us to build using ninja #467
...
[ROCm/hip commit: 4f0aefbd63 ]
2018-06-19 12:26:18 +05:30
Maneesh Gupta
163e55fe10
Merge pull request #516 from bddppq/empty-generator-expression
...
Properly handle (empty) cmake generator expression
[ROCm/hip commit: 1b88a2ce2f ]
2018-06-19 09:38:21 +05:30
Maneesh Gupta
3475fe6efa
Merge pull request #520 from ntrost57/master
...
added missing hipCmul() to nvcc_detail/hip_complex.h
[ROCm/hip commit: 523e7fd9b2 ]
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung
d214b045c3
Keep the map which tracks GPU kernel symbols to grow monotonically
...
[ROCm/hip commit: 32789a8b7d ]
2018-06-18 16:54:18 -05:00
Yaxun (Sam) Liu
a46f62a5c0
Add abs/real/imag functions for hipFloatComplex/hipDoubleComplex
...
[ROCm/hip commit: 9181fbb0b7 ]
2018-06-18 11:57:57 -04:00
Yaxun (Sam) Liu
fa9e73ccdc
Add missing __device__ __host__ to complex constructor
...
Also add missing typedef value_type
[ROCm/hip commit: 7a5605d006 ]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
0e29f327e2
Includes <cmath> or <math.h> by __cplusplus in hip_complex.h
...
[ROCm/hip commit: 2523c39a37 ]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
cf6cdab029
Include cmath instead of math.h in hip_complex.h
...
[ROCm/hip commit: 325cf3ccf0 ]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
0b10ab2b7e
Add missing macro MAKE_COMPONENT_CONSTRUCTOR_TWO_COMPONENT to hip_complex.h
...
[ROCm/hip commit: 5eeb57b0a6 ]
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
acaf783f52
Fix handling of static library in hipcc for hip-clang
...
[ROCm/hip commit: 2b32dbd414 ]
2018-06-17 20:11:10 -04:00
Yaxun (Sam) Liu
0573ee79de
Let hipcc handle static library for hip-clang only if it contains bundles
...
[ROCm/hip commit: f0935e6d4f ]
2018-06-17 20:11:10 -04:00
Wen-Heng (Jack) Chung
c6b975bf13
Improve performance of re-initialization logic
...
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
[ROCm/hip commit: ece4539c1d ]
2018-06-15 18:07:33 -05:00