Граф коммитов

2601 Коммитов

Автор SHA1 Сообщение Дата
Alex Voicu 3c83e047df Existence is a complex affair. 2018-06-26 00:41:35 +01:00
Alex Voicu 99c61ce7e4 Be nice to GCC, it is old and worthy of respect. 2018-06-25 22:59:07 +01:00
Alex Voicu 9d91b802a5 Let's try this again... 2018-06-25 17:49:50 +01:00
Alex Voicu 859133a045 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types 2018-06-22 12:19:32 +01:00
Maneesh Gupta fc80fb4ab3 Merge pull request #507 from ROCm-Developer-Tools/fix-forward
Add __device__ to device functions in hip_fp16_math_fwd.h
2018-06-20 14:21:46 +05:30
Maneesh Gupta cffc5ad273 Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
Fix channel_descriptor.h about vector 3 for gcc
2018-06-20 14:20:29 +05:30
Maneesh Gupta 946c8da88a Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang  builtins.
2018-06-20 14:16:43 +05:30
Maneesh Gupta 836627279f Merge pull request #457 from whchung/hip-reinit
HIP program state re-initialization logic
2018-06-20 09:37:27 +05:30
Maneesh Gupta 1b88a2ce2f Merge pull request #516 from bddppq/empty-generator-expression
Properly handle (empty) cmake generator expression
2018-06-19 09:38:21 +05:30
Maneesh Gupta 523e7fd9b2 Merge pull request #520 from ntrost57/master
added missing hipCmul() to nvcc_detail/hip_complex.h
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung 32789a8b7d Keep the map which tracks GPU kernel symbols to grow monotonically 2018-06-18 16:54:18 -05:00
Maneesh Gupta 3d8317a50d Merge pull request #521 from gargrahul/temp_fixmemcpy2dasync_trsmissue
Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync
2018-06-18 09:34:02 +05:30
Alex Voicu 28a1aef8a1 Revert "Revert "Switch over to using native vector types, for better codegen. Remove noise.""
This reverts commit 7a4aace13d.
2018-06-16 22:59:36 +01:00
Wen-Heng (Jack) Chung ece4539c1d Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
2018-06-15 18:07:33 -05:00
Rahul Garg cd23905897 TEMP- fix memcpy2dAsync for trsm issue 2018-06-15 16:08:29 +05:30
Nico Trost 0b1e698e74 added missing hipCmul() to nvcc_detail/hip_complex.h 2018-06-14 21:49:54 +02:00
Wen-Heng (Jack) Chung 379b7a2241 HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
2018-06-14 15:46:49 +00:00
Maneesh Gupta bae22822f2 Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
Fix stream resolution in memcpy2dasync
2018-06-14 12:34:32 +05:30
Rahul Garg 069e2c34c9 Fix stream resolution in memcpy2dasync 2018-06-14 11:58:56 +05:30
Maneesh Gupta e96c67228f Merge pull request #518 from gargrahul/fix_pr_484_devptroffset
Fix retrieved locked ptr offset
2018-06-14 07:37:34 +05:30
Rahul Garg 00f8a36bc7 Fix retrieved locked ptr offset 2018-06-13 23:10:05 +05:30
Junjie Bai 03d3c6eaed Properly handle (empty) cmake generator expression 2018-06-12 23:53:18 -07:00
Maneesh Gupta f865341cd9 Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
Let hipcc handle library in linker response file for hip-clang
2018-06-11 12:01:45 +05:30
Maneesh Gupta e0400674fd Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang
2018-06-11 11:59:58 +05:30
Maneesh Gupta 848b27642d Merge pull request #509 from odellus/patch-1
Update hip_porting_guide.md
2018-06-11 09:32:36 +05:30
Tomas Wood ed7dee4d19 Update hip_porting_guide.md
use ".hip.cpp" for *source* files
2018-06-09 17:16:08 -07:00
Siu Chi Chan 9484e0f4c0 Merge pull request #508 from ROCm-Developer-Tools/revert-447-feature_native_vector_types
Revert "Switch over to using native vector types, for better codegen. Remove noise."
2018-06-08 18:11:41 -04:00
Siu Chi Chan 7a4aace13d Revert "Switch over to using native vector types, for better codegen. Remove noise." 2018-06-08 16:48:22 -04:00
Yaxun (Sam) Liu 17e3093f0e Add __device__ to device functions in hip_fp16_math_fwd.h 2018-06-08 11:23:52 -04:00
Yaxun (Sam) Liu 9141037105 Fix channel_descriptor.h about vector 3 for gcc 2018-06-08 11:18:41 -04:00
Yaxun (Sam) Liu cc14ed0981 Add support of extern __shared__ for hip-clang 2018-06-08 11:17:25 -04:00
Yaxun (Sam) Liu 04a0f9bd81 Let hipcc handle library in linker response file for hip-clang 2018-06-08 11:14:26 -04:00
Maneesh Gupta 203dd6cb70 Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-06-06 16:07:22 +05:30
Maneesh Gupta 02ea7f13b3 Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC
2018-06-06 15:12:11 +05:30
Maneesh Gupta de5043c47c Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC
2018-06-06 15:11:40 +05:30
Maneesh Gupta 9e9c039ee4 Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path
2018-06-06 14:44:02 +05:30
Maneesh Gupta 216f34eea8 Merge pull request #492 from gargrahul/fix_depth_3d_alloc
Fix depth value for 3D allocations
2018-06-06 14:41:23 +05:30
Maneesh Gupta 7311b60220 Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready
2018-06-06 14:38:25 +05:30
Maneesh Gupta 391ff1c949 Merge pull request #489 from gargrahul/add_dev_prop_integrated
Add integrated device property
2018-06-06 14:31:30 +05:30
Maneesh Gupta 916fedee16 Merge pull request #488 from gargrahul/fix_surface2dobj_test
Fix surface 2d object test for testResult
2018-06-06 14:31:00 +05:30
Maneesh Gupta 18cb0485f1 Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
Add documentation for compiling HIP program with hip-clang
2018-06-06 13:04:23 +05:30
Rahul Garg a46ff2afd5 Fix hipMemcpy3D for fast path 2018-06-05 18:54:33 +05:30
Rahul Garg 17bb8dbe86 Add getTextureResourceDescriptor on NVCC 2018-06-05 18:46:25 +05:30
Siu Chi Chan a1f3b587fb remove the _ready flag in ihipStreamCallback_t and the mutex that protects it. 2018-06-04 17:29:04 -04:00
Rahul Garg 276c948a16 Fix depth value for 3D allocations 2018-06-04 18:00:22 +05:30
Alex Voicu 23f5feaf13 Fix hideous typos. 2018-06-03 03:03:55 +01:00
Siu Chi Chan d3a9985f10 callback handler: don't need to wait for the thread to become ready 2018-06-02 17:55:37 -04:00
Alex Voicu 59adb5e52a Add missing __device__ for forward declares. 2018-06-02 17:46:37 +01:00
Alex Voicu 089ab3b947 Switch the atomic implementation to use Clang builtins. 2018-06-02 12:27:17 +01:00
Alex Voicu 14e449b5bb Remove vestigial implementations. 2018-06-02 11:37:08 +01:00