Evgeny Mankov
0206fadc14
[HIPIFY] option cuda-path might be used with '--' as well as with a single dash: '-'
...
+ Update README.md
2019-01-10 18:01:33 +03:00
Evgeny Mankov
fbfe005e4e
[HIPIFY] Introduce CUDA installation path option '-cuda-path'
...
Repeats clang's '--cuda-path' option.
[Reason]
In case of absence of any other clang's options setting '-cuda-path' allows not to specify separator '--' before clang's '--cuda-path'.
+ Tests and scripts are updated accordingly.
2019-01-09 20:18:36 +03:00
Maneesh Gupta
580c808400
Update hipAsynchronousStreams.cpp
...
This test needs to be excluded from nvcc platform
2019-01-07 09:26:21 +05:30
Sameer Sahasrabuddhe
8fe2a4b167
introduce test for asynchronous streams
...
This test is identical to the HCC tests under
tests/Unit/OCKL/AsyncStreams, but only for synchronous operation.
2019-01-04 05:51:50 -05:00
emankov
9abcad9407
[HIPIFY][tests] CUDA 7.0 is also supported
...
+ Exclude 1 test for CUDA 7.0
+ Update README.md
2019-01-04 01:32:30 +03:00
emankov
65155c7159
[HIPIFY][tests] Exclude CUDA 8.0 (9.0) tests if CUDA < 8.0 (9.0)
2019-01-03 18:06:49 +03:00
emankov
7fb0f4ee2a
[HIPIFY][tests] Reduce cudaRegister.cu test
2019-01-03 17:17:21 +03:00
Maneesh Gupta
4d0266a495
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Maneesh Gupta
cb8e28144a
Merge pull request #821 from gargrahul/remove_redundancy_memset3d_test
...
[dtest] Remove redundant arguments from hipMemset3D test
2018-12-31 15:32:19 +05:30
emankov
dec459efca
[HIPIFY] Get rid of setting '--cuda-gpu-arch='
...
[Reasons]
+ We don't compile kernel code at least for now as HIP kernel syntax is almost equal CUDA's;
+ clang always includes PTX in its binaries, so e.g. a binary compiled with --cuda-gpu-arch= would be forwards-compatible with e.g. sm_35 GPUs.
2018-12-29 19:48:28 +03:00
emankov
7ea586c323
[HIPIFY] Starts implicitly setting '-x cuda' by hipify-clang itself
...
+ No need in setting '-x cuda' for the user anymore;
+ Testing and Readme updated accordingly.
2018-12-29 17:04:59 +03:00
emankov
2d3f62c3e1
[HIPIFY][win] Set -std=c++14 if MSVC
...
+ Due to latest Windows SDK implementation based on c++14 features, '-std+c++14' is a must;
+ Doesn't affect older versions of MSVC 2017, 2015 (the latest supported is MSVC 2015);
+ On linux still '-std=c++11' is a must;
+ Exclude '-std=' from lit config.
2018-12-29 15:51:06 +03:00
Rahul Garg
4272119eeb
Remove redundant arguments from hipMemset3D test
2018-12-20 06:51:22 +05:30
Yaxun Sam Liu
ec03a8b352
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
58dfeff27a
Add *_rn functions back into HIP intrinsics
...
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Evgeny Mankov
3d528cf491
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Just do load_config for lit.site.cfg earlier in lit.cfg to get initialized cuda_version
+ Remove lit.cfg.in for CMake variables substitution
+ Return back lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-12 17:05:01 +03:00
Evgeny Mankov
b955e317a7
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Add lit.cfg.in for CMake variables substitution to now generated lit.cfg
+ Exclude lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-11 22:03:11 +03:00
Evgeny Mankov
09612ac03f
[HIPIFY][tests] Reorganization
2018-12-04 20:47:34 +03:00
Evgeny Mankov
72d40db358
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
...
+ lit update
+ fix typos
2018-12-04 19:24:29 +03:00
Evgeny Mankov
116b9191f7
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
...
+ cuSPARSE is supported up to CUDA 10.0
+ cuSPARSE_08 test (CUDA 10.0)
+ update CUSPARSE_API_supported_by_HIP.md
+ lit: add a rule for CUDA 10.0 tests excluding
2018-11-30 15:33:57 +03:00
Evgeny Mankov
8a84c665e2
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
...
+ Tridiagonal Solve, Batched Tridiagonal and Pentadiagonal Solve
+ cuSPARSE_07 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 18:46:51 +03:00
Evgeny Mankov
6c085c83a2
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
...
+ cuSPARSE_06 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 15:59:58 +03:00
Evgeny Mankov
b39bd8c9a9
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
...
+ cuSPARSE_05 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-28 20:10:30 +03:00
Evgeny Mankov
11fedfbff6
[HIPIFY][SPARSE] Extra functions
...
+ cuSPARSE_04 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 12:41:50 +03:00
Evgeny Mankov
82bbaf0b70
[HIPIFY][SPARSE] Level 3 functions
...
+ cuSPARSE_03 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 11:57:25 +03:00
Evgeny Mankov
c7f6266824
[HIPIFY][SPARSE] Helper and Level 1,2 functions
...
+ 2 cuSPARSE tests
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-26 15:11:52 +03:00
Maneesh Gupta
bcea027bf1
Merge pull request #748 from mkuron/getsymboladdress
...
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-21 10:32:01 +05:30
emankov
30c387a811
[HIPIFY][SPARSE] Initial support
2018-11-20 17:58:05 +03:00
Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Aaron Enye Shi
0cfaa52d15
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
e59c33250a
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
49cc703d7a
Test for hipGetSymbolSize and hipGetSymbolAddress
2018-11-06 11:39:34 +01:00
Aaron Enye Shi
2b108a2cfd
Remove non-working non-default-rounded math apis
...
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.
2018-11-05 22:34:16 +00:00
Alex Voicu
fe959f7bd7
Re-sync with upstream.
2018-10-18 12:27:03 +01:00
Maneesh Gupta
52e320f396
Replace hipLaunchKernel -> hipLaunchKernelGGL
...
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Alex Voicu
5312336ce2
Minimal should mean minimal.
2018-10-11 00:21:41 +01:00
Alex Voicu
50265028fc
Trailing whitespace is still not the final frontier.
2018-10-11 00:08:09 +01:00
Alex Voicu
069bbbd7ed
Use HIP math functions, stop using hipLaunchParm.
2018-10-10 11:56:54 +01:00
Alex Voicu
ca375cb8c5
Re-sync with upstream.
2018-10-10 11:43:49 +01:00
Maneesh Gupta
da64156fb2
Implementation for stream priority
...
- Requires ROCm 1.9.x or higher
- Requires HCC with PR#886 merged
Change-Id: Id7c95ea091ee610e80c9ad815f1cb989cba570ca
2018-10-05 16:27:46 +05:30
Maneesh Gupta
5c665acf74
Merge pull request #685 from ROCm-Developer-Tools/hip-trig-return
...
Improve hip_trig test case
2018-09-26 09:50:48 +05:30
Aaron Enye Shi
5179a72cda
Use trig functions from ocml instead
2018-09-25 15:58:36 +00:00
Aaron Enye Shi
3d3820272a
Fix missing HIP_ASSERT in hip_trig
2018-09-24 15:01:24 +00:00
Maneesh Gupta
ca6b916d15
Merge pull request #683 from ROCm-Developer-Tools/hip-float-rounded
...
Disable non-default-rounded functions
2018-09-24 09:32:52 +05:30
Aaron Enye Shi
357b6844fa
Improve hip_trig test case
...
Hip_trig is failing but returning a passing value on HIP-Clang. Also update with debug output and free arrays used.
2018-09-21 15:28:10 -04:00
Maneesh Gupta
255589ae15
Update hipTestFMA.cpp
2018-09-21 11:00:08 +05:30
Aaron Enye Shi
c11220f224
Disable non-default-rounded functions
...
Device library has removed the non-default-rounded functions, so hipFloatMath will fail to build. These include the removal of __ocml_sqrt_rte, __ocml_sqrt_rtn, __ocml_sqrt_rtp, and __ocml_sqrt_rtz. As seen here: https://github.com/RadeonOpenCompute/ROCm-Device-Libs/commit/2fc04e10e1354edee331ce700f98a60f8255effb . Disable these function tests for now, until they are re-enabled, or deleted completely.
2018-09-20 16:33:32 -04:00
Maneesh Gupta
ecd6a212c7
Update hipTestFMA.cpp
...
Fix the test so that it works on nvcc path as well.
2018-09-20 11:23:51 +05:30
Yaxun Sam Liu
bd622a4b4a
Add fma function with float and _Float16 arguments
2018-09-19 09:59:33 -04:00
Rahul Garg
8da3a96f2b
Fixed get alignment offset test for NVCC
...
- Removed dtest hipGetAlgntoffset2D.cpp
- hipGetTextureAlignmentOffset is being tested now in hipBindTexRef1DFetch.cpp
2018-09-18 14:10:03 +05:30