Evgeny Mankov
23c8a3e18f
Merge pull request #712 from emankov/master
...
[HIPIFY] CUDA Driver API data types total revise
2018-10-18 18:52:41 +03:00
Evgeny Mankov
865c6f23c7
[HIPIFY] CUDA Driver API data types total revise
...
+ for all CUDA versions
+ add missing types
+ fix typos
+ sync with HIP
+ update CUDA_Driver_API_functions_supported_by_HIP.md
+ formatting, annotating
2018-10-18 18:50:24 +03:00
Maneesh Gupta
30523b72a2
Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
...
Use sinf and cosf from ocml device libs
2018-10-18 16:39:20 +05:30
Maneesh Gupta
9143ae6bdb
Merge pull request #692 from whchung/hip-reinit-take2
...
HIP program state re-initialization logic (take 2)
2018-10-18 12:06:41 +05:30
Maneesh Gupta
c7a147f109
Merge pull request #711 from nicholasmalaya/patch-1
...
Updates to HIP porting guide
2018-10-18 12:06:31 +05:30
Maneesh Gupta
3f87e33c9a
Merge pull request #710 from mangupta/use_hipLaunchKernelGGL
...
Replace hipLaunchKernel -> hipLaunchKernelGGL
2018-10-18 08:36:59 +05:30
Nicholas Malaya
bfbc1298d1
Update hip_porting_guide.md
2018-10-17 14:27:11 -05:00
Nicholas Malaya
adc7741441
Fixing link
2018-10-17 14:26:49 -05:00
Nicholas Malaya
319f7232c2
Fixing a link
2018-10-17 14:25:54 -05:00
Nicholas Malaya
4c2e4ead8d
Adding library equivalent section
2018-10-17 14:25:07 -05:00
Nicholas Malaya
41279dcb38
Small editing changes to clean up document
2018-10-17 14:11:25 -05:00
Nicholas Malaya
4939444a51
Fixing a broken indentation
...
Minor (cosmetic) edit to make items appear in ordered bulleted list
2018-10-17 13:56:51 -05:00
Nicholas Malaya
20bb485199
Update hip_porting_guide.md
...
Adding hyperlink to bin/hipconvertinplace.sh
2018-10-17 13:49:47 -05:00
Maneesh Gupta
e1fe095471
Replace hipLaunchKernel -> hipLaunchKernelGGL
...
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Maneesh Gupta
3bf8e9ad8e
Fix typos in Jenkinsfile
2018-10-17 11:13:37 +05:30
Maneesh Gupta
f41924d42e
Update Jenkinsfile
...
Disable rocm-1.9x testing in CI due to incompatible changes with HCC from ROCm 1.9.x
2018-10-17 11:03:43 +05:30
Maneesh Gupta
95be669f4a
Merge pull request #703 from mangupta/stream_create_with_priority
...
Implementation for stream priority
2018-10-17 10:53:43 +05:30
Maneesh Gupta
3485d86746
Merge pull request #702 from aaronenyeshi/fix-missing-irif-lib
...
Replace IRIF fences with atomic_work_item_fence
2018-10-17 10:53:27 +05:30
Maneesh Gupta
6a4aaed7f3
Merge pull request #698 from yxsamliu/compile-flags
...
Add HIPCC_COMPILE_FLAGS_APPEND
2018-10-17 10:53:17 +05:30
Maneesh Gupta
5ca415a172
Merge pull request #696 from gargrahul/fix_texrestypelin_size
...
Fixed image width for linear resource type texture
2018-10-17 06:11:23 +05:30
Maneesh Gupta
ae4e24dc21
Merge pull request #708 from mangupta/swdev-125523
...
Add missing hipHostRegister flags on nvcc path
2018-10-17 06:10:25 +05:30
Evgeny Mankov
3f4f090fd9
Merge pull request #709 from emankov/master
...
[HIPIFY] Code cleanup and formatting
2018-10-15 15:37:38 +03:00
Evgeny Mankov
9a1a511c84
[HIPIFY] Code cleanup and formatting
2018-10-15 15:27:37 +03:00
Maneesh Gupta
d71006eb99
Add missing hipHostRegister flags on nvcc path
...
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623
2018-10-15 15:30:24 +05:30
Maneesh Gupta
07ee1f07d8
Implementation for stream priority
...
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged
Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca
2018-10-05 16:27:46 +05:30
Aaron Enye Shi
03822afaa9
Replace IRIF fences with atomic_work_item_fence
2018-10-04 21:47:28 +00:00
Maneesh Gupta
e7d3f3860e
Merge pull request #697 from yxsamliu/dev-lib-path
...
Let hipcc add --hip-device-lib-path by default for hip-clang
2018-10-04 07:40:37 +05:30
Maneesh Gupta
ed51044502
Merge pull request #700 from ROCm-Developer-Tools/fix-long-long-decl
...
Fix hip_vector_types.h for long long vectors
2018-10-04 07:39:30 +05:30
Aaron Enye Shi
6a71cf17a1
Fix hip_vector_types.h for long long vectors
...
There was a missing long in the declaration for [u]longlongN types.
2018-10-03 13:57:52 -04:00
Evgeny Mankov
bde7c600c4
Merge pull request #699 from emankov/master
...
[HIPIFY] CUDA 10.0 Driver API initial support
2018-10-03 20:33:37 +03:00
Evgeny Mankov
cb4eb94174
[HIPIFY] CUDA 10.0 Driver API initial support
2018-10-03 20:29:22 +03:00
Rahul Garg
a78d68d639
Corrected the width calculation logic to accommodate multiple channels
2018-10-03 12:07:38 +05:30
Yaxun Sam Liu
88768895bc
Let hipcc add --hip-device-lib-path by default for hip-clang
...
hip-clang assumes -fno-gpu-rdc by default and therefore requires
--hip-device-lib-path by default.
2018-10-01 15:14:54 -04:00
Yaxun Sam Liu
2c361906fa
Add HIPCC_COMPILE_FLAGS_APPEND
2018-10-01 14:51:29 -04:00
Rahul Garg
167265c279
Fixed image width for linear resource type texture
2018-10-01 15:28:34 +05:30
Evgeny Mankov
28e5a5ced7
Merge pull request #695 from emankov/master
...
[HIPIFY][cmake] CUDA 10.0 is not supported.
2018-09-27 19:08:58 +03:00
Evgeny Mankov
4e7b330037
[HIPIFY][cmake] CUDA 10.0 is not supported.
2018-09-27 19:05:22 +03:00
Wen-Heng (Jack) Chung
e257de95f3
Keep the map that tracks GPU kernel symbols growing monotonically
2018-09-26 19:49:02 +00:00
Wen-Heng (Jack) Chung
060b3c0bf8
Improve performance of re-initialization logic
...
Keep track of shared libraries already discovered. Do not build HSA executables
for them.
2018-09-26 19:48:56 +00:00
Wen-Heng (Jack) Chung
319f007bf1
HIP program state re-initialization logic
...
This commit supports kernels loaded dynamically through means such as
dlopen() after the HIP runtime initializes.
2018-09-26 19:48:47 +00:00
Evgeny Mankov
c778463625
Merge pull request #691 from emankov/docs
...
[HIPIFY][docs] Fix typos in Readme.md
2018-09-26 17:27:18 +03:00
Evgeny Mankov
6056686796
[HIPIFY][docs] Fix typos in Readme.md
2018-09-26 17:26:25 +03:00
Evgeny Mankov
e879118d87
Merge pull request #690 from emankov/hipBLAS
...
[HIPIFY][doc] Update README.md due to new LLVM 7.0.0 and CUDA 10.0 re…
2018-09-26 17:05:47 +03:00
Evgeny Mankov
a7e75c72ee
[HIPIFY][doc] Update README.md due to new LLVM 7.0.0 and CUDA 10.0 releases.
2018-09-26 17:01:59 +03:00
Maneesh Gupta
5bd2219f9d
Merge pull request #689 from gargrahul/hipmemset_ret_success_sizezero
...
Return hipSuccess when sizeBytes=0 in hipMemset
2018-09-26 13:57:44 +05:30
Rahul Garg
bd27310127
Return hipSuccess when sizeBytes=0 in hipMemset
2018-09-26 12:47:36 +05:30
Maneesh Gupta
802520f5f1
Merge pull request #685 from ROCm-Developer-Tools/hip-trig-return
...
Improve hip_trig test case
2018-09-26 09:50:48 +05:30
Aaron Enye Shi
eeb6c11050
Use sinf and cosf from ocml device libs
...
Using the llvm_amdgcn builtin fails to produce accurate values; we should move to the ocml device library versions.
2018-09-25 19:31:39 +00:00
Aaron Enye Shi
04ed44f074
Use trig functions from ocml instead
2018-09-25 15:58:36 +00:00
Evgeny Mankov
d12c1f7eff
Merge pull request #687 from emankov/hipBLAS
...
[HIPIFY][BLAS] Add support of hipblasGemmEx and corresponding types
2018-09-25 18:48:06 +03:00