Aaron Enye Shi
2a2f34b42d
Implement hip_hc.ll into HIP headers
...
Move all __hip_hc_ir_* functions from hip_hc.ll into the HIP header as inline asm. Remove hip_hc.ll and the associated build dependencies from HIP.
[ROCm/hip commit: 2142eb4d12 ]
2018-06-20 17:39:31 +00:00
Aaron Enye Shi
0fca41a7ce
Implement device_functions.cpp into HIP headers
...
Move all integer intrinsics, device_functions.cpp definitions, and HIP-specific device functions into the HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp, since everything is now defined in the headers.
[ROCm/hip commit: e02fc7e680 ]
2018-06-20 17:39:23 +00:00
Aaron Enye Shi
14f721702a
Move hipclang funcs into corresponding headers
...
[ROCm/hip commit: fe4e6c53fc ]
2018-06-20 17:39:15 +00:00
Aaron Enye Shi
8d65776127
Split __llvm and device lib funcs into new headers
...
[ROCm/hip commit: cfe37484c9 ]
2018-06-20 17:39:06 +00:00
Aaron Enye Shi
cb46df1654
Add get_dynamicgroupbaseptr def and remove hc_
...
[ROCm/hip commit: 27f600b425 ]
2018-06-20 17:38:59 +00:00
Aaron Enye Shi
9b03a036c3
Add prefix __ to memory scope and order
...
[ROCm/hip commit: 871cfc6fd2 ]
2018-06-20 17:38:52 +00:00
Aaron Enye Shi
cdd7931ac1
Add __llvm_fence funcs and __ prefixes
...
[ROCm/hip commit: cfa8fc1ca5 ]
2018-06-20 17:38:45 +00:00
Aaron Enye Shi
b34868005a
Add hipclang amdgcn functions
...
These functions move from hipclang in the device library to the HIP headers. They are required for the HIPclang project to function.
[ROCm/hip commit: c453b42bff ]
2018-06-20 17:38:37 +00:00
Maneesh Gupta
a4b9e5305f
Merge pull request #507 from ROCm-Developer-Tools/fix-forward
...
Add __device__ to device functions in hip_fp16_math_fwd.h
[ROCm/hip commit: fc80fb4ab3 ]
2018-06-20 14:21:46 +05:30
Maneesh Gupta
6932d45c14
Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
...
Fix channel_descriptor.h about vector 3 for gcc
[ROCm/hip commit: cffc5ad273 ]
2018-06-20 14:20:29 +05:30
Maneesh Gupta
27e2af1773
Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
...
Switch the atomic implementation to use Clang builtins.
[ROCm/hip commit: 946c8da88a ]
2018-06-20 14:16:43 +05:30
Maneesh Gupta
4cdc20a6ce
Merge pull request #457 from whchung/hip-reinit
...
HIP program state re-initialization logic
[ROCm/hip commit: 836627279f ]
2018-06-20 09:37:27 +05:30
Maneesh Gupta
163e55fe10
Merge pull request #516 from bddppq/empty-generator-expression
...
Properly handle (empty) cmake generator expression
[ROCm/hip commit: 1b88a2ce2f ]
2018-06-19 09:38:21 +05:30
Maneesh Gupta
3475fe6efa
Merge pull request #520 from ntrost57/master
...
added missing hipCmul() to nvcc_detail/hip_complex.h
[ROCm/hip commit: 523e7fd9b2 ]
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung
d214b045c3
Keep the map which tracks GPU kernel symbols to grow monotonically
...
[ROCm/hip commit: 32789a8b7d ]
2018-06-18 16:54:18 -05:00
Maneesh Gupta
411414acc5
Merge pull request #521 from gargrahul/temp_fixmemcpy2dasync_trsmissue
...
Use memcpy kernel for all pinned memory cases in hipMemcpy2DAsync
[ROCm/hip commit: 3d8317a50d ]
2018-06-18 09:34:02 +05:30
Wen-Heng (Jack) Chung
c6b975bf13
Improve performance of re-initialization logic
...
Keep track of shared libraries already discovered. Do not build HSA executables
for them.
[ROCm/hip commit: ece4539c1d ]
2018-06-15 18:07:33 -05:00
Rahul Garg
388679efc8
TEMP- fix memcpy2dAsync for trsm issue
...
[ROCm/hip commit: cd23905897 ]
2018-06-15 16:08:29 +05:30
Nico Trost
494d34f4ee
added missing hipCmul() to nvcc_detail/hip_complex.h
...
[ROCm/hip commit: 0b1e698e74 ]
2018-06-14 21:49:54 +02:00
Wen-Heng (Jack) Chung
e5ca9eb081
HIP program state re-initialization logic
...
This commit supports kernels that are loaded dynamically, through means such as
dlopen(), after the HIP runtime initializes.
[ROCm/hip commit: 379b7a2241 ]
2018-06-14 15:46:49 +00:00
Maneesh Gupta
94b64ea44b
Merge pull request #519 from gargrahul/fix_memcpy2dasync_streamres
...
Fix stream resolution in memcpy2dasync
[ROCm/hip commit: bae22822f2 ]
2018-06-14 12:34:32 +05:30
Rahul Garg
312999de41
Fix stream resolution in memcpy2dasync
...
[ROCm/hip commit: 069e2c34c9 ]
2018-06-14 11:58:56 +05:30
Maneesh Gupta
34c6d24305
Merge pull request #518 from gargrahul/fix_pr_484_devptroffset
...
Fix retrieved locked ptr offset
[ROCm/hip commit: e96c67228f ]
2018-06-14 07:37:34 +05:30
Rahul Garg
1d6396dfb9
Fix retrieved locked ptr offset
...
[ROCm/hip commit: 00f8a36bc7 ]
2018-06-13 23:10:05 +05:30
Junjie Bai
53a09e7bb8
Properly handle (empty) cmake generator expression
...
[ROCm/hip commit: 03d3c6eaed ]
2018-06-12 23:53:18 -07:00
Maneesh Gupta
b8293d0176
Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
...
Let hipcc handle library in linker response file for hip-clang
[ROCm/hip commit: f865341cd9 ]
2018-06-11 12:01:45 +05:30
Maneesh Gupta
a6fb892e43
Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
...
Add support of extern __shared__ for hip-clang
[ROCm/hip commit: e0400674fd ]
2018-06-11 11:59:58 +05:30
Maneesh Gupta
9343e6581e
Merge pull request #509 from odellus/patch-1
...
Update hip_porting_guide.md
[ROCm/hip commit: 848b27642d ]
2018-06-11 09:32:36 +05:30
Tomas Wood
d6c5a2539f
Update hip_porting_guide.md
...
use ".hip.cpp" for *source* files
[ROCm/hip commit: ed7dee4d19 ]
2018-06-09 17:16:08 -07:00
Siu Chi Chan
b6c7b32b12
Merge pull request #508 from ROCm-Developer-Tools/revert-447-feature_native_vector_types
...
Revert "Switch over to using native vector types, for better codegen. Remove noise."
[ROCm/hip commit: 9484e0f4c0 ]
2018-06-08 18:11:41 -04:00
Siu Chi Chan
3792417463
Revert "Switch over to using native vector types, for better codegen. Remove noise."
...
[ROCm/hip commit: 7a4aace13d ]
2018-06-08 16:48:22 -04:00
Yaxun (Sam) Liu
8e6b72b04d
Add __device__ to device functions in hip_fp16_math_fwd.h
...
[ROCm/hip commit: 17e3093f0e ]
2018-06-08 11:23:52 -04:00
Yaxun (Sam) Liu
a96a56a986
Fix channel_descriptor.h about vector 3 for gcc
...
[ROCm/hip commit: 9141037105 ]
2018-06-08 11:18:41 -04:00
Yaxun (Sam) Liu
2f017c1200
Add support of extern __shared__ for hip-clang
...
[ROCm/hip commit: cc14ed0981 ]
2018-06-08 11:17:25 -04:00
Yaxun (Sam) Liu
07bb1c44df
Let hipcc handle library in linker response file for hip-clang
...
[ROCm/hip commit: 04a0f9bd81 ]
2018-06-08 11:14:26 -04:00
Maneesh Gupta
b54be20b05
Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
...
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
[ROCm/hip commit: 203dd6cb70 ]
2018-06-06 16:07:22 +05:30
Maneesh Gupta
52eee341cc
Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
...
Add getTextureResourceDescriptor on NVCC
[ROCm/hip commit: 02ea7f13b3 ]
2018-06-06 15:12:11 +05:30
Maneesh Gupta
ddbf7f1028
Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
...
Map hipArray alloc flags on NVCC
[ROCm/hip commit: de5043c47c ]
2018-06-06 15:11:40 +05:30
Maneesh Gupta
ac027e4092
Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
...
Fix hipMemcpy3D for fast path
[ROCm/hip commit: 9e9c039ee4 ]
2018-06-06 14:44:02 +05:30
Maneesh Gupta
f4dd7fd056
Merge pull request #492 from gargrahul/fix_depth_3d_alloc
...
Fix depth value for 3D allocations
[ROCm/hip commit: 216f34eea8 ]
2018-06-06 14:41:23 +05:30
Maneesh Gupta
f44e77944d
Merge pull request #491 from scchan/fix_wait
...
callback handling: don't need to wait for the thread to become ready
[ROCm/hip commit: 7311b60220 ]
2018-06-06 14:38:25 +05:30
Maneesh Gupta
bcab16f2c9
Merge pull request #489 from gargrahul/add_dev_prop_integrated
...
Add integrated device property
[ROCm/hip commit: 391ff1c949 ]
2018-06-06 14:31:30 +05:30
Maneesh Gupta
05b6bcb5dc
Merge pull request #488 from gargrahul/fix_surface2dobj_test
...
Fix surface 2d object test for testResult
[ROCm/hip commit: 916fedee16 ]
2018-06-06 14:31:00 +05:30
Maneesh Gupta
54a90e2038
Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
...
Add documentation for compiling HIP program with hip-clang
[ROCm/hip commit: 18cb0485f1 ]
2018-06-06 13:04:23 +05:30
Rahul Garg
e7bc68d347
Fix hipMemcpy3D for fast path
...
[ROCm/hip commit: a46ff2afd5 ]
2018-06-05 18:54:33 +05:30
Rahul Garg
522677e2dd
Add getTextureResourceDescriptor on NVCC
...
[ROCm/hip commit: 17bb8dbe86 ]
2018-06-05 18:46:25 +05:30
Siu Chi Chan
417dde9d73
remove the _ready flag in ihipStreamCallback_t and the mutex that protects it.
...
[ROCm/hip commit: a1f3b587fb ]
2018-06-04 17:29:04 -04:00
Rahul Garg
6592b35c39
Fix depth value for 3D allocations
...
[ROCm/hip commit: 276c948a16 ]
2018-06-04 18:00:22 +05:30
Alex Voicu
6b3d95c2f6
Fix hideous typos.
...
[ROCm/hip commit: 23f5feaf13 ]
2018-06-03 03:03:55 +01:00
Siu Chi Chan
4b25b76898
callback handler: don't need to wait for the thread to become ready
...
[ROCm/hip commit: d3a9985f10 ]
2018-06-02 17:55:37 -04:00