Gráfico de commits

2580 Commits

Autor SHA1 Mensaje Fecha
Wen-Heng (Jack) Chung b883ea759d Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.
2018-06-15 18:07:33 -05:00
Wen-Heng (Jack) Chung 04640992dc HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.
2018-06-14 15:46:49 +00:00
Maneesh Gupta edab13da7d Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
Fix stream resolution in memcpy2dasync
2018-06-14 12:34:32 +05:30
Rahul Garg 2ae3be9773 Fix stream resolution in memcpy2dasync 2018-06-14 11:58:56 +05:30
Maneesh Gupta f5fccba009 Merge pull request #518 from gargrahul/fix_pr_484_devptroffset
Fix retrieved locked ptr offset
2018-06-14 07:37:34 +05:30
Rahul Garg 68554e155b Fix retrieved locked ptr offset 2018-06-13 23:10:05 +05:30
Maneesh Gupta 359e6609bc Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
Let hipcc handle library in linker response file for hip-clang
2018-06-11 12:01:45 +05:30
Maneesh Gupta 181cde1899 Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang
2018-06-11 11:59:58 +05:30
Maneesh Gupta 7122376751 Merge pull request #509 from odellus/patch-1
Update hip_porting_guide.md
2018-06-11 09:32:36 +05:30
Tomas Wood 683894c698 Update hip_porting_guide.md
use ".hip.cpp" for *source* files
2018-06-09 17:16:08 -07:00
Siu Chi Chan 2fbe29093d Merge pull request #508 from ROCm-Developer-Tools/revert-447-feature_native_vector_types
Revert "Switch over to using native vector types, for better codegen. Remove noise."
2018-06-08 18:11:41 -04:00
Siu Chi Chan d137271083 Revert "Switch over to using native vector types, for better codegen. Remove noise." 2018-06-08 16:48:22 -04:00
Yaxun (Sam) Liu 9398c9c927 Add support of extern __shared__ for hip-clang 2018-06-08 11:17:25 -04:00
Yaxun (Sam) Liu 8d3e4b4475 Let hipcc handle library in linker response file for hip-clang 2018-06-08 11:14:26 -04:00
Maneesh Gupta cb642f14ab Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-06-06 16:07:22 +05:30
Maneesh Gupta a1ad2b9c65 Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC
2018-06-06 15:12:11 +05:30
Maneesh Gupta a432847b1b Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC
2018-06-06 15:11:40 +05:30
Maneesh Gupta 53037472ff Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path
2018-06-06 14:44:02 +05:30
Maneesh Gupta 28c5b15d88 Merge pull request #492 from gargrahul/fix_depth_3d_alloc
Fix depth value for 3D allocations
2018-06-06 14:41:23 +05:30
Maneesh Gupta f7c49dde38 Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready
2018-06-06 14:38:25 +05:30
Maneesh Gupta cd20a370eb Merge pull request #489 from gargrahul/add_dev_prop_integrated
Add integrated device property
2018-06-06 14:31:30 +05:30
Maneesh Gupta eab2c3f248 Merge pull request #488 from gargrahul/fix_surface2dobj_test
Fix surface 2d object test for testResult
2018-06-06 14:31:00 +05:30
Maneesh Gupta 95270bcdca Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
Add documentation for compiling HIP program with hip-clang
2018-06-06 13:04:23 +05:30
Rahul Garg 163d4a5b03 Fix hipMemcpy3D for fast path 2018-06-05 18:54:33 +05:30
Rahul Garg 6f8bcf53b0 Add getTextureResourceDescriptor on NVCC 2018-06-05 18:46:25 +05:30
Siu Chi Chan 0d719c514f remove the _ready flag in ihipStreamCallback_t and the mutex that protects it. 2018-06-04 17:29:04 -04:00
Rahul Garg fa6ce7a724 Fix depth value for 3D allocations 2018-06-04 18:00:22 +05:30
Siu Chi Chan e21e6ed3a0 callback handler: don't need to wait for the thread to become ready 2018-06-02 17:55:37 -04:00
Alex Voicu 68a0dd826d Remove vestigial implementations. 2018-06-02 11:37:08 +01:00
Rahul Garg 94f086e9cd Add integrated device property 2018-06-02 13:11:16 +05:30
Rahul Garg b94f18767f Fix surface 2d object test for testResult 2018-06-02 10:58:03 +05:30
Alex Voicu 3a17e2ad06 Rename for minimal confusion. 2018-06-01 22:55:33 +01:00
Alex Voicu b72d82f982 Missing __device__. 2018-06-01 19:48:36 +01:00
Alex Voicu 91d9ec75d7 Fix typos / address review comments. 2018-06-01 16:20:21 +01:00
Alex Voicu f2d7f112ab Re-sync with upstream. 2018-06-01 15:49:05 +01:00
Yaxun (Sam) Liu bb9ad15c34 Update INSTALL.md 2018-06-01 10:19:02 -04:00
Rahul Garg b1b9a477a2 Map hipArray alloc flags on NVCC 2018-06-01 17:28:43 +05:30
Maneesh Gupta 095d4dd91e Merge pull request #484 from gargrahul/fix_malloc_hiphostreg
Fix memcpy2D for malloc+ hostRegister
2018-06-01 16:53:25 +05:30
Maneesh Gupta 3fec282097 Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.
2018-06-01 13:58:07 +05:30
Maneesh Gupta 8ecb3eeb55 Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
Feature use _Float16 and match CUDA __half behaviour.
2018-06-01 13:50:12 +05:30
Yaxun (Sam) Liu 3ed6fff0db Update INSTALL.md 2018-05-31 23:55:42 -04:00
Alex Voicu e03ca1a72e Re-sync with upstream. Add integer abs. 2018-05-31 16:38:00 +01:00
Alex Voicu e20380319a Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16 2018-05-31 15:27:31 +01:00
Alex Voicu 208f5a41c6 Add missing interop with volatile. Fix unit tests. 2018-05-31 15:27:12 +01:00
Rahul Garg a3609eaf61 Fix memcpy2D for malloc+ hostRegister 2018-05-31 13:14:27 +05:30
Maneesh Gupta 2271d26cda Merge pull request #480 from yxsamliu/add-fun
Add more function declarations for hip-clang
2018-05-31 09:27:54 +05:30
Maneesh Gupta f815552066 Merge pull request #481 from gargrahul/fix_texobj1dfetch_test
Fixed texture obj 1Dfetch test
2018-05-31 09:14:31 +05:30
Maneesh Gupta c264dcb715 Merge pull request #479 from yxsamliu/fix-hipcc
Drop --amdgpu-target= options for hip-clang
2018-05-31 09:12:36 +05:30
Alex Voicu 14e6a04387 Switch to using ROCDL directly, as opposed to via HC. Add missing bits. 2018-05-31 03:17:26 +01:00
Yaxun (Sam) Liu 0e0b028846 Fix __syncthreads for hip-clang 2018-05-30 16:33:18 -04:00