Граф коммитов

2406 Коммитов

Автор SHA1 Сообщение Дата
Wen-Heng (Jack) Chung c6b975bf13 Improve performance of re-initialization logic
Keep track of shared libaries already discovered. Do not build HSA executables
for them.


[ROCm/hip commit: ece4539c1d]
2018-06-15 18:07:33 -05:00
Wen-Heng (Jack) Chung e5ca9eb081 HIP program state re-initialization logic
This commit is to support kernels dynamically loaded thru means such as
dlopen() after HIP runtime initializes.


[ROCm/hip commit: 379b7a2241]
2018-06-14 15:46:49 +00:00
Rahul Garg 312999de41 Fix stream resolution in memcpy2dasync
[ROCm/hip commit: 069e2c34c9]
2018-06-14 11:58:56 +05:30
Rahul Garg 1d6396dfb9 Fix retrieved locked ptr offset
[ROCm/hip commit: 00f8a36bc7]
2018-06-13 23:10:05 +05:30
Maneesh Gupta b8293d0176 Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
Let hipcc handle library in linker response file for hip-clang

[ROCm/hip commit: f865341cd9]
2018-06-11 12:01:45 +05:30
Maneesh Gupta a6fb892e43 Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang

[ROCm/hip commit: e0400674fd]
2018-06-11 11:59:58 +05:30
Tomas Wood d6c5a2539f Update hip_porting_guide.md
use ".hip.cpp" for *source* files

[ROCm/hip commit: ed7dee4d19]
2018-06-09 17:16:08 -07:00
Siu Chi Chan 3792417463 Revert "Switch over to using native vector types, for better codegen. Remove noise."
[ROCm/hip commit: 7a4aace13d]
2018-06-08 16:48:22 -04:00
Yaxun (Sam) Liu 2f017c1200 Add support of extern __shared__ for hip-clang
[ROCm/hip commit: cc14ed0981]
2018-06-08 11:17:25 -04:00
Yaxun (Sam) Liu 07bb1c44df Let hipcc handle library in linker response file for hip-clang
[ROCm/hip commit: 04a0f9bd81]
2018-06-08 11:14:26 -04:00
Maneesh Gupta b54be20b05 Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.

[ROCm/hip commit: 203dd6cb70]
2018-06-06 16:07:22 +05:30
Maneesh Gupta 52eee341cc Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC

[ROCm/hip commit: 02ea7f13b3]
2018-06-06 15:12:11 +05:30
Maneesh Gupta ddbf7f1028 Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC

[ROCm/hip commit: de5043c47c]
2018-06-06 15:11:40 +05:30
Maneesh Gupta ac027e4092 Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path

[ROCm/hip commit: 9e9c039ee4]
2018-06-06 14:44:02 +05:30
Maneesh Gupta f4dd7fd056 Merge pull request #492 from gargrahul/fix_depth_3d_alloc
Fix depth value for 3D allocations

[ROCm/hip commit: 216f34eea8]
2018-06-06 14:41:23 +05:30
Maneesh Gupta f44e77944d Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready

[ROCm/hip commit: 7311b60220]
2018-06-06 14:38:25 +05:30
Maneesh Gupta bcab16f2c9 Merge pull request #489 from gargrahul/add_dev_prop_integrated
Add integrated device property

[ROCm/hip commit: 391ff1c949]
2018-06-06 14:31:30 +05:30
Maneesh Gupta 05b6bcb5dc Merge pull request #488 from gargrahul/fix_surface2dobj_test
Fix surface 2d object test for testResult

[ROCm/hip commit: 916fedee16]
2018-06-06 14:31:00 +05:30
Maneesh Gupta 54a90e2038 Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
Add documentation for compiling HIP program with hip-clang

[ROCm/hip commit: 18cb0485f1]
2018-06-06 13:04:23 +05:30
Rahul Garg e7bc68d347 Fix hipMemcpy3D for fast path
[ROCm/hip commit: a46ff2afd5]
2018-06-05 18:54:33 +05:30
Rahul Garg 522677e2dd Add getTextureResourceDescriptor on NVCC
[ROCm/hip commit: 17bb8dbe86]
2018-06-05 18:46:25 +05:30
Siu Chi Chan 417dde9d73 remove the _ready flag in ihipStreamCallback_t and the mutex that protects it.
[ROCm/hip commit: a1f3b587fb]
2018-06-04 17:29:04 -04:00
Rahul Garg 6592b35c39 Fix depth value for 3D allocations
[ROCm/hip commit: 276c948a16]
2018-06-04 18:00:22 +05:30
Siu Chi Chan 4b25b76898 callback handler: don't need to wait for the thread to become ready
[ROCm/hip commit: d3a9985f10]
2018-06-02 17:55:37 -04:00
Alex Voicu 980fa8050d Remove vestigial implementations.
[ROCm/hip commit: 14e449b5bb]
2018-06-02 11:37:08 +01:00
Rahul Garg 07115e0c02 Add integrated device property
[ROCm/hip commit: 1a02bc364f]
2018-06-02 13:11:16 +05:30
Rahul Garg 49e6de2f3e Fix surface 2d object test for testResult
[ROCm/hip commit: 7a091ed8e9]
2018-06-02 10:58:03 +05:30
Alex Voicu 3e8b76ba55 Rename for minimal confusion.
[ROCm/hip commit: 8bfcb22c57]
2018-06-01 22:55:33 +01:00
Alex Voicu 6f29707dd3 Missing __device__.
[ROCm/hip commit: 5e90c6a081]
2018-06-01 19:48:36 +01:00
Alex Voicu 82e4fecfeb Fix typos / address review comments.
[ROCm/hip commit: ebd6d49cd2]
2018-06-01 16:20:21 +01:00
Alex Voicu b9bf931765 Re-sync with upstream.
[ROCm/hip commit: 417869821d]
2018-06-01 15:49:05 +01:00
Yaxun (Sam) Liu b46a961f1d Update INSTALL.md
[ROCm/hip commit: 8ccb069226]
2018-06-01 10:19:02 -04:00
Rahul Garg 17ff646d18 Map hipArray alloc flags on NVCC
[ROCm/hip commit: d3621a2500]
2018-06-01 17:28:43 +05:30
Maneesh Gupta eded0da7b5 Merge pull request #484 from gargrahul/fix_malloc_hiphostreg
Fix memcpy2D for malloc+ hostRegister

[ROCm/hip commit: df450c6680]
2018-06-01 16:53:25 +05:30
Maneesh Gupta 121993cae8 Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.

[ROCm/hip commit: 68c36f9125]
2018-06-01 13:58:07 +05:30
Maneesh Gupta 7f9b00ba19 Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
Feature use _Float16 and match CUDA __half behaviour.

[ROCm/hip commit: bdf2645713]
2018-06-01 13:50:12 +05:30
Yaxun (Sam) Liu d7737975e9 Update INSTALL.md
[ROCm/hip commit: 98491cd759]
2018-05-31 23:55:42 -04:00
Alex Voicu 63c8aa6fcb Re-sync with upstream. Add integer abs.
[ROCm/hip commit: ab4b2a650b]
2018-05-31 16:38:00 +01:00
Alex Voicu 54024c3dc1 Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16
[ROCm/hip commit: b9b1e0b703]
2018-05-31 15:27:31 +01:00
Alex Voicu 90e7799bf4 Add missing interop with volatile. Fix unit tests.
[ROCm/hip commit: 0108819e2a]
2018-05-31 15:27:12 +01:00
Rahul Garg 46e623fb31 Fix memcpy2D for malloc+ hostRegister
[ROCm/hip commit: 8d6357669d]
2018-05-31 13:14:27 +05:30
Maneesh Gupta 6f20a8cc27 Merge pull request #480 from yxsamliu/add-fun
Add more function declarations for hip-clang

[ROCm/hip commit: 449ddba0aa]
2018-05-31 09:27:54 +05:30
Maneesh Gupta 8174643aca Merge pull request #481 from gargrahul/fix_texobj1dfetch_test
Fixed texture obj 1Dfetch test

[ROCm/hip commit: 3f87d4efb2]
2018-05-31 09:14:31 +05:30
Alex Voicu 1d220f8867 Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
[ROCm/hip commit: 59db16fd36]
2018-05-31 03:17:26 +01:00
Yaxun (Sam) Liu 2bd034a0d9 Fix __syncthreads for hip-clang
[ROCm/hip commit: b7641cf835]
2018-05-30 16:33:18 -04:00
Rahul Garg c7d6ef2219 Fixed texture obj 1Dfetch test
[ROCm/hip commit: 5ae6d93526]
2018-05-30 21:54:51 +05:30
Yaxun (Sam) Liu de13fa6e8b Add more function declarations for hip-clang
[ROCm/hip commit: 6b8218e5a0]
2018-05-30 06:51:03 -04:00
Yaxun (Sam) Liu ef02210768 Drop --amdgpu-target= options for hip-clang
They are replaced by --cuda-gpu-arch= options elsewhere


[ROCm/hip commit: 9008a3ec56]
2018-05-30 06:47:44 -04:00
Maneesh Gupta b92ace4cdb Merge pull request #469 from yxsamliu/fun
Add math function declaration for hip-clang

[ROCm/hip commit: a1fea02c6e]
2018-05-30 15:01:27 +05:30
Maneesh Gupta c9b0c7620c Merge pull request #477 from mangupta/hipfuncgetattr_nvcc
Add signature for hipFuncGetAttributes to nvcc_details

[ROCm/hip commit: 59d849a643]
2018-05-30 12:29:03 +05:30