Alex Voicu
|
3c83e047df
|
Existence is a complex affair.
|
2018-06-26 00:41:35 +01:00 |
|
Alex Voicu
|
99c61ce7e4
|
Be nice to GCC, it is old and worthy of respect.
|
2018-06-25 22:59:07 +01:00 |
|
Alex Voicu
|
9d91b802a5
|
Let's try this again...
|
2018-06-25 17:49:50 +01:00 |
|
Alex Voicu
|
859133a045
|
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_native_vector_types
|
2018-06-22 12:19:32 +01:00 |
|
Maneesh Gupta
|
fc80fb4ab3
|
Merge pull request #507 from ROCm-Developer-Tools/fix-forward
Add __device__ to device functions in hip_fp16_math_fwd.h
|
2018-06-20 14:21:46 +05:30 |
|
Maneesh Gupta
|
cffc5ad273
|
Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
Fix channel_descriptor.h about vector 3 for gcc
|
2018-06-20 14:20:29 +05:30 |
|
Maneesh Gupta
|
946c8da88a
|
Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang builtins.
|
2018-06-20 14:16:43 +05:30 |
|
Maneesh Gupta
|
836627279f
|
Merge pull request #457 from whchung/hip-reinit
HIP program state re-initialization logic
|
2018-06-20 09:37:27 +05:30 |
|
Maneesh Gupta
|
1b88a2ce2f
|
Merge pull request #516 from bddppq/empty-generator-expression
Properly handle (empty) cmake generator expression
|
2018-06-19 09:38:21 +05:30 |
|
Maneesh Gupta
|
523e7fd9b2
|
Merge pull request #520 from ntrost57/master
added missing hipCmul() to nvcc_detail/hip_complex.h
|
2018-06-19 09:37:50 +05:30 |
|
Wen-Heng (Jack) Chung
|
32789a8b7d
|
Keep the map which tracks GPU kernel symbols to grow monotonically
|
2018-06-18 16:54:18 -05:00 |
|
Maneesh Gupta
|
3d8317a50d
|
Merge pull request #521 from gargrahul/temp_fixmemcpy2dasync_trsmissue
Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync
|
2018-06-18 09:34:02 +05:30 |
|
Alex Voicu
|
28a1aef8a1
|
Revert "Revert "Switch over to using native vector types, for better codegen. Remove noise.""
This reverts commit 7a4aace13d.
|
2018-06-16 22:59:36 +01:00 |
|
Wen-Heng (Jack) Chung
|
ece4539c1d
|
Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
|
2018-06-15 18:07:33 -05:00 |
|
Rahul Garg
|
cd23905897
|
TEMP- fix memcpy2dAsync for trsm issue
|
2018-06-15 16:08:29 +05:30 |
|
Nico Trost
|
0b1e698e74
|
added missing hipCmul() to nvcc_detail/hip_complex.h
|
2018-06-14 21:49:54 +02:00 |
|
Wen-Heng (Jack) Chung
|
379b7a2241
|
HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
|
2018-06-14 15:46:49 +00:00 |
|
Maneesh Gupta
|
bae22822f2
|
Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
Fix stream resolution in memcpy2dasync
|
2018-06-14 12:34:32 +05:30 |
|
Rahul Garg
|
069e2c34c9
|
Fix stream resolution in memcpy2dasync
|
2018-06-14 11:58:56 +05:30 |
|
Maneesh Gupta
|
e96c67228f
|
Merge pull request #518 from gargrahul/fix_pr_484_devptroffset
Fix retrieved locked ptr offset
|
2018-06-14 07:37:34 +05:30 |
|
Rahul Garg
|
00f8a36bc7
|
Fix retrieved locked ptr offset
|
2018-06-13 23:10:05 +05:30 |
|
Junjie Bai
|
03d3c6eaed
|
Properly handle (empty) cmake generator expression
|
2018-06-12 23:53:18 -07:00 |
|
Maneesh Gupta
|
f865341cd9
|
Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
Let hipcc handle library in linker response file for hip-clang
|
2018-06-11 12:01:45 +05:30 |
|
Maneesh Gupta
|
e0400674fd
|
Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang
|
2018-06-11 11:59:58 +05:30 |
|
Maneesh Gupta
|
848b27642d
|
Merge pull request #509 from odellus/patch-1
Update hip_porting_guide.md
|
2018-06-11 09:32:36 +05:30 |
|
Tomas Wood
|
ed7dee4d19
|
Update hip_porting_guide.md
use ".hip.cpp" for *source* files
|
2018-06-09 17:16:08 -07:00 |
|
Siu Chi Chan
|
9484e0f4c0
|
Merge pull request #508 from ROCm-Developer-Tools/revert-447-feature_native_vector_types
Revert "Switch over to using native vector types, for better codegen. Remove noise."
|
2018-06-08 18:11:41 -04:00 |
|
Siu Chi Chan
|
7a4aace13d
|
Revert "Switch over to using native vector types, for better codegen. Remove noise."
|
2018-06-08 16:48:22 -04:00 |
|
Yaxun (Sam) Liu
|
17e3093f0e
|
Add __device__ to device functions in hip_fp16_math_fwd.h
|
2018-06-08 11:23:52 -04:00 |
|
Yaxun (Sam) Liu
|
9141037105
|
Fix channel_descriptor.h about vector 3 for gcc
|
2018-06-08 11:18:41 -04:00 |
|
Yaxun (Sam) Liu
|
cc14ed0981
|
Add support of extern __shared__ for hip-clang
|
2018-06-08 11:17:25 -04:00 |
|
Yaxun (Sam) Liu
|
04a0f9bd81
|
Let hipcc handle library in linker response file for hip-clang
|
2018-06-08 11:14:26 -04:00 |
|
Maneesh Gupta
|
203dd6cb70
|
Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
|
2018-06-06 16:07:22 +05:30 |
|
Maneesh Gupta
|
02ea7f13b3
|
Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC
|
2018-06-06 15:12:11 +05:30 |
|
Maneesh Gupta
|
de5043c47c
|
Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC
|
2018-06-06 15:11:40 +05:30 |
|
Maneesh Gupta
|
9e9c039ee4
|
Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path
|
2018-06-06 14:44:02 +05:30 |
|
Maneesh Gupta
|
216f34eea8
|
Merge pull request #492 from gargrahul/fix_depth_3d_alloc
Fix depth value for 3D allocations
|
2018-06-06 14:41:23 +05:30 |
|
Maneesh Gupta
|
7311b60220
|
Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready
|
2018-06-06 14:38:25 +05:30 |
|
Maneesh Gupta
|
391ff1c949
|
Merge pull request #489 from gargrahul/add_dev_prop_integrated
Add integrated device property
|
2018-06-06 14:31:30 +05:30 |
|
Maneesh Gupta
|
916fedee16
|
Merge pull request #488 from gargrahul/fix_surface2dobj_test
Fix surface 2d object test for testResult
|
2018-06-06 14:31:00 +05:30 |
|
Maneesh Gupta
|
18cb0485f1
|
Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
Add documentation for compiling HIP program with hip-clang
|
2018-06-06 13:04:23 +05:30 |
|
Rahul Garg
|
a46ff2afd5
|
Fix hipMemcpy3D for fast path
|
2018-06-05 18:54:33 +05:30 |
|
Rahul Garg
|
17bb8dbe86
|
Add getTextureResourceDescriptor on NVCC
|
2018-06-05 18:46:25 +05:30 |
|
Siu Chi Chan
|
a1f3b587fb
|
remove the _ready flag in ihipStreamCallback_t and the mutex that protects it.
|
2018-06-04 17:29:04 -04:00 |
|
Rahul Garg
|
276c948a16
|
Fix depth value for 3D allocations
|
2018-06-04 18:00:22 +05:30 |
|
Alex Voicu
|
23f5feaf13
|
Fix hideous typos.
|
2018-06-03 03:03:55 +01:00 |
|
Siu Chi Chan
|
d3a9985f10
|
callback handler: don't need to wait for the thread to become ready
|
2018-06-02 17:55:37 -04:00 |
|
Alex Voicu
|
59adb5e52a
|
Add missing __device__ for forward declares.
|
2018-06-02 17:46:37 +01:00 |
|
Alex Voicu
|
089ab3b947
|
Switch the atomic implementation to use Clang builtins.
|
2018-06-02 12:27:17 +01:00 |
|
Alex Voicu
|
14e449b5bb
|
Remove vestigial implementations.
|
2018-06-02 11:37:08 +01:00 |
|