Wen-Heng (Jack) Chung
|
b883ea759d
|
Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
|
2018-06-15 18:07:33 -05:00 |
|
Wen-Heng (Jack) Chung
|
04640992dc
|
HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
|
2018-06-14 15:46:49 +00:00 |
|
Maneesh Gupta
|
edab13da7d
|
Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
Fix stream resolution in memcpy2dasync
|
2018-06-14 12:34:32 +05:30 |
|
Rahul Garg
|
2ae3be9773
|
Fix stream resolution in memcpy2dasync
|
2018-06-14 11:58:56 +05:30 |
|
Maneesh Gupta
|
f5fccba009
|
Merge pull request #518 from gargrahul/fix_pr_484_devptroffset
Fix retrieved locked ptr offset
|
2018-06-14 07:37:34 +05:30 |
|
Rahul Garg
|
68554e155b
|
Fix retrieved locked ptr offset
|
2018-06-13 23:10:05 +05:30 |
|
Maneesh Gupta
|
359e6609bc
|
Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
Let hipcc handle library in linker response file for hip-clang
|
2018-06-11 12:01:45 +05:30 |
|
Maneesh Gupta
|
181cde1899
|
Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang
|
2018-06-11 11:59:58 +05:30 |
|
Maneesh Gupta
|
7122376751
|
Merge pull request #509 from odellus/patch-1
Update hip_porting_guide.md
|
2018-06-11 09:32:36 +05:30 |
|
Tomas Wood
|
683894c698
|
Update hip_porting_guide.md
use ".hip.cpp" for *source* files
|
2018-06-09 17:16:08 -07:00 |
|
Siu Chi Chan
|
2fbe29093d
|
Merge pull request #508 from ROCm-Developer-Tools/revert-447-feature_native_vector_types
Revert "Switch over to using native vector types, for better codegen. Remove noise."
|
2018-06-08 18:11:41 -04:00 |
|
Siu Chi Chan
|
d137271083
|
Revert "Switch over to using native vector types, for better codegen. Remove noise."
|
2018-06-08 16:48:22 -04:00 |
|
Yaxun (Sam) Liu
|
9398c9c927
|
Add support of extern __shared__ for hip-clang
|
2018-06-08 11:17:25 -04:00 |
|
Yaxun (Sam) Liu
|
8d3e4b4475
|
Let hipcc handle library in linker response file for hip-clang
|
2018-06-08 11:14:26 -04:00 |
|
Maneesh Gupta
|
cb642f14ab
|
Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
|
2018-06-06 16:07:22 +05:30 |
|
Maneesh Gupta
|
a1ad2b9c65
|
Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC
|
2018-06-06 15:12:11 +05:30 |
|
Maneesh Gupta
|
a432847b1b
|
Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC
|
2018-06-06 15:11:40 +05:30 |
|
Maneesh Gupta
|
53037472ff
|
Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path
|
2018-06-06 14:44:02 +05:30 |
|
Maneesh Gupta
|
28c5b15d88
|
Merge pull request #492 from gargrahul/fix_depth_3d_alloc
Fix depth value for 3D allocations
|
2018-06-06 14:41:23 +05:30 |
|
Maneesh Gupta
|
f7c49dde38
|
Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready
|
2018-06-06 14:38:25 +05:30 |
|
Maneesh Gupta
|
cd20a370eb
|
Merge pull request #489 from gargrahul/add_dev_prop_integrated
Add integrated device property
|
2018-06-06 14:31:30 +05:30 |
|
Maneesh Gupta
|
eab2c3f248
|
Merge pull request #488 from gargrahul/fix_surface2dobj_test
Fix surface 2d object test for testResult
|
2018-06-06 14:31:00 +05:30 |
|
Maneesh Gupta
|
95270bcdca
|
Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
Add documentation for compiling HIP program with hip-clang
|
2018-06-06 13:04:23 +05:30 |
|
Rahul Garg
|
163d4a5b03
|
Fix hipMemcpy3D for fast path
|
2018-06-05 18:54:33 +05:30 |
|
Rahul Garg
|
6f8bcf53b0
|
Add getTextureResourceDescriptor on NVCC
|
2018-06-05 18:46:25 +05:30 |
|
Siu Chi Chan
|
0d719c514f
|
remove the _ready flag in ihipStreamCallback_t and the mutex that protects it.
|
2018-06-04 17:29:04 -04:00 |
|
Rahul Garg
|
fa6ce7a724
|
Fix depth value for 3D allocations
|
2018-06-04 18:00:22 +05:30 |
|
Siu Chi Chan
|
e21e6ed3a0
|
callback handler: don't need to wait for the thread to become ready
|
2018-06-02 17:55:37 -04:00 |
|
Alex Voicu
|
68a0dd826d
|
Remove vestigial implementations.
|
2018-06-02 11:37:08 +01:00 |
|
Rahul Garg
|
94f086e9cd
|
Add integrated device property
|
2018-06-02 13:11:16 +05:30 |
|
Rahul Garg
|
b94f18767f
|
Fix surface 2d object test for testResult
|
2018-06-02 10:58:03 +05:30 |
|
Alex Voicu
|
3a17e2ad06
|
Rename for minimal confusion.
|
2018-06-01 22:55:33 +01:00 |
|
Alex Voicu
|
b72d82f982
|
Missing __device__.
|
2018-06-01 19:48:36 +01:00 |
|
Alex Voicu
|
91d9ec75d7
|
Fix typos / address review comments.
|
2018-06-01 16:20:21 +01:00 |
|
Alex Voicu
|
f2d7f112ab
|
Re-sync with upstream.
|
2018-06-01 15:49:05 +01:00 |
|
Yaxun (Sam) Liu
|
bb9ad15c34
|
Update INSTALL.md
|
2018-06-01 10:19:02 -04:00 |
|
Rahul Garg
|
b1b9a477a2
|
Map hipArray alloc flags on NVCC
|
2018-06-01 17:28:43 +05:30 |
|
Maneesh Gupta
|
095d4dd91e
|
Merge pull request #484 from gargrahul/fix_malloc_hiphostreg
Fix memcpy2D for malloc+ hostRegister
|
2018-06-01 16:53:25 +05:30 |
|
Maneesh Gupta
|
3fec282097
|
Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.
|
2018-06-01 13:58:07 +05:30 |
|
Maneesh Gupta
|
8ecb3eeb55
|
Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
Feature use _Float16 and match CUDA __half behaviour.
|
2018-06-01 13:50:12 +05:30 |
|
Yaxun (Sam) Liu
|
3ed6fff0db
|
Update INSTALL.md
|
2018-05-31 23:55:42 -04:00 |
|
Alex Voicu
|
e03ca1a72e
|
Re-sync with upstream. Add integer abs.
|
2018-05-31 16:38:00 +01:00 |
|
Alex Voicu
|
e20380319a
|
Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16
|
2018-05-31 15:27:31 +01:00 |
|
Alex Voicu
|
208f5a41c6
|
Add missing interop with volatile. Fix unit tests.
|
2018-05-31 15:27:12 +01:00 |
|
Rahul Garg
|
a3609eaf61
|
Fix memcpy2D for malloc+ hostRegister
|
2018-05-31 13:14:27 +05:30 |
|
Maneesh Gupta
|
2271d26cda
|
Merge pull request #480 from yxsamliu/add-fun
Add more function declarations for hip-clang
|
2018-05-31 09:27:54 +05:30 |
|
Maneesh Gupta
|
f815552066
|
Merge pull request #481 from gargrahul/fix_texobj1dfetch_test
Fixed texture obj 1Dfetch test
|
2018-05-31 09:14:31 +05:30 |
|
Maneesh Gupta
|
c264dcb715
|
Merge pull request #479 from yxsamliu/fix-hipcc
Drop --amdgpu-target= options for hip-clang
|
2018-05-31 09:12:36 +05:30 |
|
Alex Voicu
|
14e6a04387
|
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
|
2018-05-31 03:17:26 +01:00 |
|
Yaxun (Sam) Liu
|
0e0b028846
|
Fix __syncthreads for hip-clang
|
2018-05-30 16:33:18 -04:00 |
|