Commit Graph

2429 Commits

Author SHA1 Message Date
Aaron Enye Shi 2a2f34b42d Implement hip_hc.ll into HIP headers
Move all __hip_hc_ir_* functions from hip_hc.ll into HIP header as inline asm. Remove hip_hc.ll and build dependencies from HIP.


[ROCm/hip commit: 2142eb4d12]
2018-06-20 17:39:31 +00:00
Aaron Enye Shi 0fca41a7ce Implement device_functions.cpp into HIP headers
Move all Integer Intrinsics, device_functions.cpp definitions and HIP specific device functions into HIP headers. Implement the device functions using llvm_intrinsics and device-libs functions instead of calling hc::__* functions. Remove device_functions.cpp since everything is now defined in header.


[ROCm/hip commit: e02fc7e680]
2018-06-20 17:39:23 +00:00
Aaron Enye Shi 14f721702a Move hipclang funcs into corresponding headers
[ROCm/hip commit: fe4e6c53fc]
2018-06-20 17:39:15 +00:00
Aaron Enye Shi 8d65776127 Split __llvm and device lib funcs into new headers
[ROCm/hip commit: cfe37484c9]
2018-06-20 17:39:06 +00:00
Aaron Enye Shi cb46df1654 Add get_dynamicgroupbaseptr def and remove hc_
[ROCm/hip commit: 27f600b425]
2018-06-20 17:38:59 +00:00
Aaron Enye Shi 9b03a036c3 Add prefix __ to memory scope and order
[ROCm/hip commit: 871cfc6fd2]
2018-06-20 17:38:52 +00:00
Aaron Enye Shi cdd7931ac1 Add __llvm_fence funcs and __ prefixes
[ROCm/hip commit: cfa8fc1ca5]
2018-06-20 17:38:45 +00:00
Aaron Enye Shi b34868005a Add hipclang amdgcn functions
These are moving from hipclang in device library to hip headers. These are required for the functionality of HIPclang project.


[ROCm/hip commit: c453b42bff]
2018-06-20 17:38:37 +00:00
Maneesh Gupta a4b9e5305f Merge pull request #507 from ROCm-Developer-Tools/fix-forward
Add __device__ to device functions in hip_fp16_math_fwd.h

[ROCm/hip commit: fc80fb4ab3]
2018-06-20 14:21:46 +05:30
Maneesh Gupta 6932d45c14 Merge pull request #504 from ROCm-Developer-Tools/fix-vector3
Fix channel_descriptor.h about vector 3 for gcc

[ROCm/hip commit: cffc5ad273]
2018-06-20 14:20:29 +05:30
Maneesh Gupta 27e2af1773 Merge pull request #490 from ROCm-Developer-Tools/feature_decouple_atomics_from_hc
Switch the atomic implementation to use Clang builtins.

[ROCm/hip commit: 946c8da88a]
2018-06-20 14:16:43 +05:30
Maneesh Gupta 4cdc20a6ce Merge pull request #457 from whchung/hip-reinit
HIP program state re-initialization logic

[ROCm/hip commit: 836627279f]
2018-06-20 09:37:27 +05:30
Maneesh Gupta 163e55fe10 Merge pull request #516 from bddppq/empty-generator-expression
Properly handle (empty) cmake generator expression

[ROCm/hip commit: 1b88a2ce2f]
2018-06-19 09:38:21 +05:30
Maneesh Gupta 3475fe6efa Merge pull request #520 from ntrost57/master
added missing hipCmul() to nvcc_detail/hip_complex.h

[ROCm/hip commit: 523e7fd9b2]
2018-06-19 09:37:50 +05:30
Wen-Heng (Jack) Chung d214b045c3 Keep the map which tracks GPU kernel symbols to grow monotonically
[ROCm/hip commit: 32789a8b7d]
2018-06-18 16:54:18 -05:00
Wen-Heng (Jack) Chung c6b975bf13 Improve performance of re-initialization logic
Keep track of shared libraries already discovered. Do not build HSA executables
for them.


[ROCm/hip commit: ece4539c1d]
2018-06-15 18:07:33 -05:00
Rahul Garg 388679efc8 TEMP- fix memcpy2dAsync for trsm issue
[ROCm/hip commit: cd23905897]
2018-06-15 16:08:29 +05:30
Nico Trost 494d34f4ee added missing hipCmul() to nvcc_detail/hip_complex.h
[ROCm/hip commit: 0b1e698e74]
2018-06-14 21:49:54 +02:00
Wen-Heng (Jack) Chung e5ca9eb081 HIP program state re-initialization logic
This commit supports kernels that are dynamically loaded through means such as
dlopen() after the HIP runtime initializes.


[ROCm/hip commit: 379b7a2241]
2018-06-14 15:46:49 +00:00
Rahul Garg 312999de41 Fix stream resolution in memcpy2dasync
[ROCm/hip commit: 069e2c34c9]
2018-06-14 11:58:56 +05:30
Rahul Garg 1d6396dfb9 Fix retrieved locked ptr offset
[ROCm/hip commit: 00f8a36bc7]
2018-06-13 23:10:05 +05:30
Junjie Bai 53a09e7bb8 Properly handle (empty) cmake generator expression
[ROCm/hip commit: 03d3c6eaed]
2018-06-12 23:53:18 -07:00
Maneesh Gupta b8293d0176 Merge pull request #505 from ROCm-Developer-Tools/fix-hipcc-linker
Let hipcc handle library in linker response file for hip-clang

[ROCm/hip commit: f865341cd9]
2018-06-11 12:01:45 +05:30
Maneesh Gupta a6fb892e43 Merge pull request #506 from ROCm-Developer-Tools/fix-extern-shared
Add support of extern __shared__ for hip-clang

[ROCm/hip commit: e0400674fd]
2018-06-11 11:59:58 +05:30
Tomas Wood d6c5a2539f Update hip_porting_guide.md
use ".hip.cpp" for *source* files

[ROCm/hip commit: ed7dee4d19]
2018-06-09 17:16:08 -07:00
Siu Chi Chan 3792417463 Revert "Switch over to using native vector types, for better codegen. Remove noise."
[ROCm/hip commit: 7a4aace13d]
2018-06-08 16:48:22 -04:00
Yaxun (Sam) Liu 8e6b72b04d Add __device__ to device functions in hip_fp16_math_fwd.h
[ROCm/hip commit: 17e3093f0e]
2018-06-08 11:23:52 -04:00
Yaxun (Sam) Liu a96a56a986 Fix channel_descriptor.h about vector 3 for gcc
[ROCm/hip commit: 9141037105]
2018-06-08 11:18:41 -04:00
Yaxun (Sam) Liu 2f017c1200 Add support of extern __shared__ for hip-clang
[ROCm/hip commit: cc14ed0981]
2018-06-08 11:17:25 -04:00
Yaxun (Sam) Liu 07bb1c44df Let hipcc handle library in linker response file for hip-clang
[ROCm/hip commit: 04a0f9bd81]
2018-06-08 11:14:26 -04:00
Maneesh Gupta b54be20b05 Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.

[ROCm/hip commit: 203dd6cb70]
2018-06-06 16:07:22 +05:30
Maneesh Gupta 52eee341cc Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC

[ROCm/hip commit: 02ea7f13b3]
2018-06-06 15:12:11 +05:30
Maneesh Gupta ddbf7f1028 Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC

[ROCm/hip commit: de5043c47c]
2018-06-06 15:11:40 +05:30
Maneesh Gupta ac027e4092 Merge pull request #497 from gargrahul/fix_memcpy3d_fastpath
Fix hipMemcpy3D for fast path

[ROCm/hip commit: 9e9c039ee4]
2018-06-06 14:44:02 +05:30
Maneesh Gupta f4dd7fd056 Merge pull request #492 from gargrahul/fix_depth_3d_alloc
Fix depth value for 3D allocations

[ROCm/hip commit: 216f34eea8]
2018-06-06 14:41:23 +05:30
Maneesh Gupta f44e77944d Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready

[ROCm/hip commit: 7311b60220]
2018-06-06 14:38:25 +05:30
Maneesh Gupta bcab16f2c9 Merge pull request #489 from gargrahul/add_dev_prop_integrated
Add integrated device property

[ROCm/hip commit: 391ff1c949]
2018-06-06 14:31:30 +05:30
Maneesh Gupta 05b6bcb5dc Merge pull request #488 from gargrahul/fix_surface2dobj_test
Fix surface 2d object test for testResult

[ROCm/hip commit: 916fedee16]
2018-06-06 14:31:00 +05:30
Maneesh Gupta 54a90e2038 Merge pull request #486 from ROCm-Developer-Tools/yxsamliu-patch-1
Add documentation for compiling HIP program with hip-clang

[ROCm/hip commit: 18cb0485f1]
2018-06-06 13:04:23 +05:30
Rahul Garg e7bc68d347 Fix hipMemcpy3D for fast path
[ROCm/hip commit: a46ff2afd5]
2018-06-05 18:54:33 +05:30
Rahul Garg 522677e2dd Add getTextureResourceDescriptor on NVCC
[ROCm/hip commit: 17bb8dbe86]
2018-06-05 18:46:25 +05:30
Siu Chi Chan 417dde9d73 remove the _ready flag in ihipStreamCallback_t and the mutex that protects it.
[ROCm/hip commit: a1f3b587fb]
2018-06-04 17:29:04 -04:00
Rahul Garg 6592b35c39 Fix depth value for 3D allocations
[ROCm/hip commit: 276c948a16]
2018-06-04 18:00:22 +05:30
Alex Voicu 6b3d95c2f6 Fix hideous typos.
[ROCm/hip commit: 23f5feaf13]
2018-06-03 03:03:55 +01:00
Siu Chi Chan 4b25b76898 callback handler: don't need to wait for the thread to become ready
[ROCm/hip commit: d3a9985f10]
2018-06-02 17:55:37 -04:00
Alex Voicu 192839c71b Add missing __device__ for forward declares.
[ROCm/hip commit: 59adb5e52a]
2018-06-02 17:46:37 +01:00
Alex Voicu f7fd20ec17 Switch the atomic implementation to use Clang builtins.
[ROCm/hip commit: 089ab3b947]
2018-06-02 12:27:17 +01:00
Alex Voicu 980fa8050d Remove vestigial implementations.
[ROCm/hip commit: 14e449b5bb]
2018-06-02 11:37:08 +01:00
Rahul Garg 07115e0c02 Add integrated device property
[ROCm/hip commit: 1a02bc364f]
2018-06-02 13:11:16 +05:30
Rahul Garg 49e6de2f3e Fix surface 2d object test for testResult
[ROCm/hip commit: 7a091ed8e9]
2018-06-02 10:58:03 +05:30