Yaxun (Sam) Liu
021728cd86
Add workaround to hipcc for build failure in tensorflow due to missing symbol __cpu_model
...
https://github.com/tensorflow/tensorflow/issues/9593
2018-06-25 18:14:51 -04:00
Alex Voicu
99c61ce7e4
Be nice to GCC, it is old and worthy of respect.
2018-06-25 22:59:07 +01:00
Yaxun (Sam) Liu
02b160491d
Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes
2018-06-25 15:34:34 -04:00
Alex Voicu
6c7a64efa2
Removes use of unimplemented OCML functionality.
2018-06-25 19:16:27 +01:00
foreman
3f0b25f6bf
P4 to Git Change 1572348 by skudchad@skudchad_test2_win_opencl on 2018/06/25 13:21:41
...
SWDEV-145570 - [HIP] - Fix hipHostGetDevicePointer. hipHostRegister tests now passes
ReviewBoardURL = http://ocltc.amd.com/reviews/r/15261/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#34 edit
2018-06-25 13:29:56 -04:00
Alex Voicu
9d91b802a5
Let's try this again...
2018-06-25 17:49:50 +01:00
Srinivasuluch
d8290f5a7b
Update hipLaunchParm.cpp
...
Added validation part for struct, added two struct padding tests
2018-06-25 16:53:12 +05:30
Aaron Enye Shi
ff924ecb3d
Add HIP Compute Mode
2018-06-22 14:37:19 +00:00
Alex Voicu
859133a045
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types
2018-06-22 12:19:32 +01:00
Aaron Enye Shi
73023e0a54
HIPCC - Use clang if clang++ is not compiled
2018-06-21 18:12:55 +00:00
Aaron Enye Shi
8ac864c2e3
Replace __hip_hc_ir_ inline asm with __ockl_* functions
2018-06-20 20:40:14 +00:00
Aaron Enye Shi
6dc16bbf04
Implement __ballot, __any, __all into HIP headers
2018-06-20 17:39:39 +00:00
Aaron Enye Shi
2142eb4d12
Implement hip_hc.ll into HIP headers
...
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.
2018-06-20 17:39:31 +00:00
Aaron Enye Shi
e02fc7e680
Implement device_functions.cpp into HIP headers
...
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.
2018-06-20 17:39:23 +00:00
Aaron Enye Shi
fe4e6c53fc
Move hipclang funcs into corresponding headers
2018-06-20 17:39:15 +00:00
Aaron Enye Shi
cfe37484c9
Split __llvm and device lib funcs into new headers
2018-06-20 17:39:06 +00:00
Aaron Enye Shi
27f600b425
Add get_dynamicgroupbaseptr def and remove hc_
2018-06-20 17:38:59 +00:00
Aaron Enye Shi
871cfc6fd2
Add prefix __ to memory scope and order
2018-06-20 17:38:52 +00:00
Aaron Enye Shi
cfa8fc1ca5
Add __llvm_fence funcs and __ prefixes
2018-06-20 17:38:45 +00:00
Aaron Enye Shi
c453b42bff
Add hipclang amdgcn functions
...
These are moving from hipclang in device library to hip headers. These are required for the functionality of HIPclang project.
2018-06-20 17:38:37 +00:00
Jenkins
9d56f7c1cd
Merge 'master' into 'amd-master'
...
Change-Id: Ia897c694ffbbce2ab01493d3ea60e42701ce021e
2018-06-20 04:09:40 -05:00
Maneesh Gupta
fc80fb4ab3
Merge pull request #507 from ROCm-Developer-Tools/fix-forward
...
Add __device__ to device functions in hip_fp16_math_fwd.h
2018-06-20 14:21:46 +05:30
Maneesh Gupta
cffc5ad273
Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
...
Fix channel_descriptor.h about vector 3 for gcc
2018-06-20 14:20:29 +05:30
Maneesh Gupta
946c8da88a
Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
...
Switch the atomic implementation to use Clang builtins.
2018-06-20 14:16:43 +05:30
Maneesh Gupta
836627279f
Merge pull request #457 from whchung/hip-reinit
...
HIP program state re-initialization logic
2018-06-20 09:37:27 +05:30
Yaxun (Sam) Liu
46d3c1d51e
Let hipcc handle library with extension lo for hip-clang
2018-06-19 10:51:08 -04:00
Yaxun (Sam) Liu
84da72dae8
Add conj, operator-,==,!= for hipFloatComplex/hipDoubleComplex
2018-06-19 10:49:59 -04:00
Jenkins
168fca2193
Merge 'master' into 'amd-master'
...
Change-Id: I539a79195161876b0a6eae952f64af85d4c47b28
2018-06-19 04:09:57 -05:00
pradeepisro
f6938961b9
added option HIP_BUILD_VERBOSE to enable verbosity in HIP build
2018-06-19 13:50:24 +05:30
pradeepisro
4f0aefbd63
added changes to FindHIP.cmake which would allow us to build using ninja #467
2018-06-19 12:26:18 +05:30
Maneesh Gupta
1b88a2ce2f
Merge pull request #516 from bddppq/empty-generator-expression
...
Properly handle (empty) cmake generator expression
2018-06-19 09:38:21 +05:30
Maneesh Gupta
523e7fd9b2
Merge pull request #520 from ntrost57/master
...
added missing hipCmul() to nvcc_detail/hip_complex.h
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung
32789a8b7d
Keep the map which tracks GPU kernel symbols to grow monotonically
2018-06-18 16:54:18 -05:00
Yaxun (Sam) Liu
9181fbb0b7
Add abs/real/imag functions for hipFloatComplex/hipDoubleComplex
2018-06-18 11:57:57 -04:00
Maneesh Gupta
3d8317a50d
Merge pull request #521 from gargrahul/temp_fixmemcpy2dasync_trsmissue
...
Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync
2018-06-18 09:34:02 +05:30
Yaxun (Sam) Liu
7a5605d006
Add missing __device__ __host__ to complex constructor
...
Also add missing typedef value_type
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
2523c39a37
Includes <cmath> or <math.h> by __cplusplus in hip_complex.h
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
325cf3ccf0
Include cmath instead of math.h in hip_complex.h
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
5eeb57b0a6
Add missing macro MAKE_COMPONENT_CONSTRUCTOR_TWO_COMPONENT to hip_complex.h
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu
2b32dbd414
Fix handling of static library in hipcc for hip-clang
2018-06-17 20:11:10 -04:00
Yaxun (Sam) Liu
f0935e6d4f
Let hipcc handle static library for hip-clang only if it contains bundles
2018-06-17 20:11:10 -04:00
Alex Voicu
28a1aef8a1
Revert "Revert "Switch over to using native vector types, for better codegen. Remove noise.""
...
This reverts commit 7a4aace13d .
2018-06-16 22:59:36 +01:00
Wen-Heng (Jack) Chung
ece4539c1d
Improve performance of re-initialization logic
...
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
2018-06-15 18:07:33 -05:00
Rahul Garg
cd23905897
TEMP- fix memcpy2dAsync for trsm issue
2018-06-15 16:08:29 +05:30
Lakhan Singh
90c8230cfa
Made changes as suggested.
2018-06-15 11:30:02 +05:30
Nico Trost
0b1e698e74
added missing hipCmul() to nvcc_detail/hip_complex.h
2018-06-14 21:49:54 +02:00
Wen-Heng (Jack) Chung
379b7a2241
HIP program state re-initialization logic
...
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
2018-06-14 15:46:49 +00:00
Jenkins
9720ce390b
Merge 'master' into 'amd-master'
...
Change-Id: Ife0db35b29ac732ba661499de009ada0456df724
2018-06-14 04:09:37 -05:00
Maneesh Gupta
bae22822f2
Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
...
Fix stream resolution in memcpy2dasync
2018-06-14 12:34:32 +05:30
Rahul Garg
069e2c34c9
Fix stream resolution in memcpy2dasync
2018-06-14 11:58:56 +05:30