Wykres commitów

6428 Commity

Autor SHA1 Wiadomość Data
Yaxun (Sam) Liu 021728cd86 Add workaround to hipcc for build failure in tensorflow due to missing symbol __cpu_model
https://github.com/tensorflow/tensorflow/issues/9593
2018-06-25 18:14:51 -04:00
Alex Voicu 99c61ce7e4 Be nice to GCC, it is old and worthy of respect. 2018-06-25 22:59:07 +01:00
Yaxun (Sam) Liu 02b160491d Include host_defines.h in hip_fp16.h since it uses __host__ __device__ attributes 2018-06-25 15:34:34 -04:00
Alex Voicu 6c7a64efa2 Removes use of unimplemented OCML functionality. 2018-06-25 19:16:27 +01:00
foreman 3f0b25f6bf P4 to Git Change 1572348 by skudchad@skudchad_test2_win_opencl on 2018/06/25 13:21:41
SWDEV-145570 - [HIP] - Fix hipHostGetDevicePointer. hipHostRegister tests now passes

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/15261/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#34 edit
2018-06-25 13:29:56 -04:00
Alex Voicu 9d91b802a5 Let's try this again... 2018-06-25 17:49:50 +01:00
Srinivasuluch d8290f5a7b Update hipLaunchParm.cpp
Added validation part for struct, added two struct padding tests
2018-06-25 16:53:12 +05:30
Aaron Enye Shi ff924ecb3d Add HIP Compute Mode 2018-06-22 14:37:19 +00:00
Alex Voicu 859133a045 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types 2018-06-22 12:19:32 +01:00
Aaron Enye Shi 73023e0a54 HIPCC - Use clang if clang++ is not compiled 2018-06-21 18:12:55 +00:00
Aaron Enye Shi 8ac864c2e3 Replace __hip_hc_ir_ inline asm with __ockl_* functions 2018-06-20 20:40:14 +00:00
Aaron Enye Shi 6dc16bbf04 Implement __ballot, __any, __all into HIP headers 2018-06-20 17:39:39 +00:00
Aaron Enye Shi 2142eb4d12 Implement hip_hc.ll into HIP headers
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.
2018-06-20 17:39:31 +00:00
Aaron Enye Shi e02fc7e680 Implement device_functions.cpp into HIP headers
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.
2018-06-20 17:39:23 +00:00
Aaron Enye Shi fe4e6c53fc Move hipclang funcs into corresponding headers 2018-06-20 17:39:15 +00:00
Aaron Enye Shi cfe37484c9 Split __llvm and device lib funcs into new headers 2018-06-20 17:39:06 +00:00
Aaron Enye Shi 27f600b425 Add get_dynamicgroupbaseptr def and remove hc_ 2018-06-20 17:38:59 +00:00
Aaron Enye Shi 871cfc6fd2 Add prefix __ to memory scope and order 2018-06-20 17:38:52 +00:00
Aaron Enye Shi cfa8fc1ca5 Add __llvm_fence funcs and __ prefixes 2018-06-20 17:38:45 +00:00
Aaron Enye Shi c453b42bff Add hipclang amdgcn functions
These are moving from hipclang in device library to hip headers. These are required for the functionality of HIPclang project.
2018-06-20 17:38:37 +00:00
Jenkins 9d56f7c1cd Merge 'master' into 'amd-master'
Change-Id: Ia897c694ffbbce2ab01493d3ea60e42701ce021e
2018-06-20 04:09:40 -05:00
Maneesh Gupta fc80fb4ab3 Merge pull request #507 from ROCm-Developer-Tools/fix-forward
Add __device__ to device functions in hip_fp16_math_fwd.h
2018-06-20 14:21:46 +05:30
Maneesh Gupta cffc5ad273 Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
Fix channel_descriptor.h about vector 3 for gcc
2018-06-20 14:20:29 +05:30
Maneesh Gupta 946c8da88a Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang  builtins.
2018-06-20 14:16:43 +05:30
Maneesh Gupta 836627279f Merge pull request #457 from whchung/hip-reinit
HIP program state re-initialization logic
2018-06-20 09:37:27 +05:30
Yaxun (Sam) Liu 46d3c1d51e Let hipcc handle library with extension lo for hip-clang 2018-06-19 10:51:08 -04:00
Yaxun (Sam) Liu 84da72dae8 Add conj, operator-,==,!= for hipFloatComplex/hipDoubleComplex 2018-06-19 10:49:59 -04:00
Jenkins 168fca2193 Merge 'master' into 'amd-master'
Change-Id: I539a79195161876b0a6eae952f64af85d4c47b28
2018-06-19 04:09:57 -05:00
pradeepisro f6938961b9 added option HIP_BUILD_VERBOSE to enable verbosity in HIP build 2018-06-19 13:50:24 +05:30
pradeepisro 4f0aefbd63 added changes to FindHIP.cmake which would allow us to build using ninja #467 2018-06-19 12:26:18 +05:30
Maneesh Gupta 1b88a2ce2f Merge pull request #516 from bddppq/empty-generator-expression
Properly handle (empty) cmake generator expression
2018-06-19 09:38:21 +05:30
Maneesh Gupta 523e7fd9b2 Merge pull request #520 from ntrost57/master
added missing hipCmul() to nvcc_detail/hip_complex.h
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung 32789a8b7d Keep the map which tracks GPU kernel symbols to grow monotonically 2018-06-18 16:54:18 -05:00
Yaxun (Sam) Liu 9181fbb0b7 Add abs/real/imag functions for hipFloatComplex/hipDoubleComplex 2018-06-18 11:57:57 -04:00
Maneesh Gupta 3d8317a50d Merge pull request #521 from gargrahul/temp_fixmemcpy2dasync_trsmissue
Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync
2018-06-18 09:34:02 +05:30
Yaxun (Sam) Liu 7a5605d006 Add missing __device__ __host__ to complex constructor
Also add missing typedef value_type
2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 2523c39a37 Includes <cmath> or <math.h> by __cplusplus in hip_complex.h 2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 325cf3ccf0 Include cmath instead of math.h in hip_complex.h 2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 5eeb57b0a6 Add missing macro MAKE_COMPONENT_CONSTRUCTOR_TWO_COMPONENT to hip_complex.h 2018-06-17 20:20:32 -04:00
Yaxun (Sam) Liu 2b32dbd414 Fix handling of static library in hipcc for hip-clang 2018-06-17 20:11:10 -04:00
Yaxun (Sam) Liu f0935e6d4f Let hipcc handle static library for hip-clang only if it contains bundles 2018-06-17 20:11:10 -04:00
Alex Voicu 28a1aef8a1 Revert "Revert "Switch over to using native vector types, for better codegen. Remove noise.""
This reverts commit 7a4aace13d.
2018-06-16 22:59:36 +01:00
Wen-Heng (Jack) Chung ece4539c1d Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
2018-06-15 18:07:33 -05:00
Rahul Garg cd23905897 TEMP- fix memcpy2dAsync for trsm issue 2018-06-15 16:08:29 +05:30
Lakhan Singh 90c8230cfa Made changes as suggested. 2018-06-15 11:30:02 +05:30
Nico Trost 0b1e698e74 added missing hipCmul() to nvcc_detail/hip_complex.h 2018-06-14 21:49:54 +02:00
Wen-Heng (Jack) Chung 379b7a2241 HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
2018-06-14 15:46:49 +00:00
Jenkins 9720ce390b Merge 'master' into 'amd-master'
Change-Id: Ife0db35b29ac732ba661499de009ada0456df724
2018-06-14 04:09:37 -05:00
Maneesh Gupta bae22822f2 Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
Fix stream resolution in memcpy2dasync
2018-06-14 12:34:32 +05:30
Rahul Garg 069e2c34c9 Fix stream resolution in memcpy2dasync 2018-06-14 11:58:56 +05:30