Maneesh Gupta
|
30523b72a2
|
Merge pull request #688 from aaronenyeshi/fix-sinf-cosf-ocml
Use sinf and cosf from ocml device libs
|
2018-10-18 16:39:20 +05:30 |
|
Maneesh Gupta
|
9143ae6bdb
|
Merge pull request #692 from whchung/hip-reinit-take2
HIP program state re-initialization logic (take 2)
|
2018-10-18 12:06:41 +05:30 |
|
Maneesh Gupta
|
c7a147f109
|
Merge pull request #711 from nicholasmalaya/patch-1
Updates to HIP porting guide
|
2018-10-18 12:06:31 +05:30 |
|
Maneesh Gupta
|
3f87e33c9a
|
Merge pull request #710 from mangupta/use_hipLaunchKernelGGL
Replace hipLaunchKernel -> hipLaunchKernelGGL
|
2018-10-18 08:36:59 +05:30 |
|
Nicholas Malaya
|
bfbc1298d1
|
Update hip_porting_guide.md
|
2018-10-17 14:27:11 -05:00 |
|
Nicholas Malaya
|
adc7741441
|
Fixing link
|
2018-10-17 14:26:49 -05:00 |
|
Nicholas Malaya
|
319f7232c2
|
Fixing a link
|
2018-10-17 14:25:54 -05:00 |
|
Nicholas Malaya
|
4c2e4ead8d
|
Adding library equivalent section
|
2018-10-17 14:25:07 -05:00 |
|
Nicholas Malaya
|
41279dcb38
|
Small editing changes to clean up document
|
2018-10-17 14:11:25 -05:00 |
|
Nicholas Malaya
|
4939444a51
|
Fixing a broken indentation
Minor (cosmetic) edit to make items appear in ordered bulleted list
|
2018-10-17 13:56:51 -05:00 |
|
Nicholas Malaya
|
20bb485199
|
Update hip_porting_guide.md
Adding hyperlink to bin/hipconvertinplace.sh
|
2018-10-17 13:49:47 -05:00 |
|
Maneesh Gupta
|
e1fe095471
|
Replace hipLaunchKernel -> hipLaunchKernelGGL
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
|
2018-10-17 14:32:25 +05:30 |
|
Maneesh Gupta
|
3bf8e9ad8e
|
Fix typos in Jenkinsfile
|
2018-10-17 11:13:37 +05:30 |
|
Maneesh Gupta
|
f41924d42e
|
Update Jenkinsfile
Disable rocm-1.9x testing in CI due to incompatible changes with HCC from ROCm 1.9.x
|
2018-10-17 11:03:43 +05:30 |
|
Maneesh Gupta
|
95be669f4a
|
Merge pull request #703 from mangupta/stream_create_with_priority
Implementation for stream priority
|
2018-10-17 10:53:43 +05:30 |
|
Maneesh Gupta
|
3485d86746
|
Merge pull request #702 from aaronenyeshi/fix-missing-irif-lib
Replace IRIF fences with atomic_work_item_fence
|
2018-10-17 10:53:27 +05:30 |
|
Maneesh Gupta
|
6a4aaed7f3
|
Merge pull request #698 from yxsamliu/compile-flags
Add HIPCC_COMPILE_FLAGS_APPEND
|
2018-10-17 10:53:17 +05:30 |
|
Maneesh Gupta
|
5ca415a172
|
Merge pull request #696 from gargrahul/fix_texrestypelin_size
Fixed image width for linear resource type texture
|
2018-10-17 06:11:23 +05:30 |
|
Maneesh Gupta
|
ae4e24dc21
|
Merge pull request #708 from mangupta/swdev-125523
Add missing hipHostRegister flags on nvcc path
|
2018-10-17 06:10:25 +05:30 |
|
Evgeny Mankov
|
3f4f090fd9
|
Merge pull request #709 from emankov/master
[HIPIFY] Code cleanup and formatting
|
2018-10-15 15:37:38 +03:00 |
|
Evgeny Mankov
|
9a1a511c84
|
[HIPIFY] Code cleanup and formatting
|
2018-10-15 15:27:37 +03:00 |
|
Maneesh Gupta
|
d71006eb99
|
Add missing hipHostRegister flags on nvcc path
Change-Id: I69f09204d9c544935104d4168ab8d3626666a623
|
2018-10-15 15:30:24 +05:30 |
|
Maneesh Gupta
|
07ee1f07d8
|
Implementation for stream priority
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged
Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca
|
2018-10-05 16:27:46 +05:30 |
|
Aaron Enye Shi
|
03822afaa9
|
Replace IRIF fences with atomic_work_item_fence
|
2018-10-04 21:47:28 +00:00 |
|
Maneesh Gupta
|
e7d3f3860e
|
Merge pull request #697 from yxsamliu/dev-lib-path
Let hipcc add --hip-device-lib-path by default for hip-clang
|
2018-10-04 07:40:37 +05:30 |
|
Maneesh Gupta
|
ed51044502
|
Merge pull request #700 from ROCm-Developer-Tools/fix-long-long-decl
Fix hip_vector_types.h for long long vectors
|
2018-10-04 07:39:30 +05:30 |
|
Aaron Enye Shi
|
6a71cf17a1
|
Fix hip_vector_types.h for long long vectors
There was a missing long in the declaration for [u]longlongN types.
|
2018-10-03 13:57:52 -04:00 |
|
Evgeny Mankov
|
bde7c600c4
|
Merge pull request #699 from emankov/master
[HIPIFY] CUDA 10.0 Driver API initial support
|
2018-10-03 20:33:37 +03:00 |
|
Evgeny Mankov
|
cb4eb94174
|
[HIPIFY] CUDA 10.0 Driver API initial support
|
2018-10-03 20:29:22 +03:00 |
|
Rahul Garg
|
a78d68d639
|
Corrected the width calculation logic to accommodate multiple channels
|
2018-10-03 12:07:38 +05:30 |
|
Yaxun Sam Liu
|
88768895bc
|
Let hipcc add --hip-device-lib-path by default for hip-clang
hip-clang assumes -fno-gpu-rdc by default and therefore requires
--hip-device-lib-path by default.
|
2018-10-01 15:14:54 -04:00 |
|
Yaxun Sam Liu
|
2c361906fa
|
Add HIPCC_COMPILE_FLAGS_APPEND
|
2018-10-01 14:51:29 -04:00 |
|
Rahul Garg
|
167265c279
|
Fixed image width for linear resource type texture
|
2018-10-01 15:28:34 +05:30 |
|
Evgeny Mankov
|
28e5a5ced7
|
Merge pull request #695 from emankov/master
[HIPIFY][cmake] CUDA 10.0 is not supported.
|
2018-09-27 19:08:58 +03:00 |
|
Evgeny Mankov
|
4e7b330037
|
[HIPIFY][cmake] CUDA 10.0 is not supported.
|
2018-09-27 19:05:22 +03:00 |
|
Wen-Heng (Jack) Chung
|
e257de95f3
|
Keep the map which tracks GPU kernel symbols to grow monotonically
|
2018-09-26 19:49:02 +00:00 |
|
Wen-Heng (Jack) Chung
|
060b3c0bf8
|
Improve performance of re-initialization logic
Keep track of shared libraries already discovered. Do not build HSA executables
for them.
|
2018-09-26 19:48:56 +00:00 |
|
Wen-Heng (Jack) Chung
|
319f007bf1
|
HIP program state re-initialization logic
This commit supports kernels that are dynamically loaded through means such as
dlopen() after the HIP runtime initializes.
|
2018-09-26 19:48:47 +00:00 |
|
Evgeny Mankov
|
c778463625
|
Merge pull request #691 from emankov/docs
[HIPIFY][docs] Fix typos in Readme.md
|
2018-09-26 17:27:18 +03:00 |
|
Evgeny Mankov
|
6056686796
|
[HIPIFY][docs] Fix typos in Readme.md
|
2018-09-26 17:26:25 +03:00 |
|
Evgeny Mankov
|
e879118d87
|
Merge pull request #690 from emankov/hipBLAS
[HIPIFY][doc] Update README.md due to new LLVM 7.0.0 and CUDA 10.0 re…
|
2018-09-26 17:05:47 +03:00 |
|
Evgeny Mankov
|
a7e75c72ee
|
[HIPIFY][doc] Update README.md due to new LLVM 7.0.0 and CUDA 10.0 releases.
|
2018-09-26 17:01:59 +03:00 |
|
Maneesh Gupta
|
5bd2219f9d
|
Merge pull request #689 from gargrahul/hipmemset_ret_success_sizezero
Return hipSuccess when sizeBytes=0 in hipMemset
|
2018-09-26 13:57:44 +05:30 |
|
Rahul Garg
|
bd27310127
|
Return hipSuccess when sizeBytes=0 in hipMemset
|
2018-09-26 12:47:36 +05:30 |
|
Maneesh Gupta
|
802520f5f1
|
Merge pull request #685 from ROCm-Developer-Tools/hip-trig-return
Improve hip_trig test case
|
2018-09-26 09:50:48 +05:30 |
|
Aaron Enye Shi
|
eeb6c11050
|
Use sinf and cosf from ocml device libs
Using the llvm_amdgcn builtin fails to produce accurate values; we should move to the ocml device library versions.
|
2018-09-25 19:31:39 +00:00 |
|
Aaron Enye Shi
|
04ed44f074
|
Use trig functions from ocml instead
|
2018-09-25 15:58:36 +00:00 |
|
Evgeny Mankov
|
d12c1f7eff
|
Merge pull request #687 from emankov/hipBLAS
[HIPIFY][BLAS] Add support of hipblasGemmEx and corresponding types
|
2018-09-25 18:48:06 +03:00 |
|
Evgeny Mankov
|
65035fa485
|
[HIPIFY][BLAS] Add support of hipblasGemmEx and corresponding types
TODO (hipBLAS/HIP): rename hipblasDatatype_t to hipDataType_t and move it from hipBLAS to HIP, as data types are used not only in the BLAS library.
|
2018-09-25 18:46:23 +03:00 |
|
Evgeny Mankov
|
fae1de41e2
|
Merge pull request #686 from emankov/docs
[HIPIFY][docs] Update CUDNN_API_supported_by_HIP.md
|
2018-09-25 16:52:19 +03:00 |
|