Rahul Garg
131c0ea326
Fix review comments
2019-03-02 23:38:37 +05:30
Rahul Garg
77c6d5451f
Fix hipMemset test for HIP/NVCC
2019-03-01 03:46:57 +05:30
Rahul Garg
026dc9457e
Update hipMemset test
2019-02-28 06:54:49 +05:30
Evgeny Mankov
3a8ef9c8a2
[HIPIFY][tests] caffe2 test fix
2019-02-25 17:12:32 +03:00
Evgeny Mankov
2619f22e5c
[HIPIFY][Caffe2] Initial Caffe2 support
2019-02-23 20:46:22 +03:00
Evgeny Mankov
d26dc3cea7
[HIPIFY][DNN] Revise and sync with CUDNN 10 and hipDNN
...
+ CUDNN_API_supported_by_HIP.md update
+ CUDA_Runtime_API_functions_supported_by_HIP.md update
+ libraryPropertyType_t is moved to Runtime
+ test update
2019-02-13 19:40:59 +03:00
Evgeny Mankov
7ba8c440e6
[HIPIFY][RAND] revise
...
+ add missing functions
+ add minimum rocRAND support
+ updated CURAND_API_supported_by_HIP.md
2019-02-08 17:54:28 +03:00
Evgeny Mankov
385fe36f2a
[HIPIFY][BLAS][tests] Add tests on hipifying to 'roc'
2019-02-07 19:25:23 +03:00
Maneesh Gupta
50a12f1ce1
Merge pull request #863 from aaronenyeshi/add-dot-instructions
...
Add Dot functions as amd_mixed_dot function
2019-01-29 06:01:01 +05:30
Aaron Enye Shi
ede494bd5f
Fix mixed dot for Jenkins rocm_2_0
2019-01-24 22:48:35 +00:00
Aaron Enye Shi
7e46f0639c
Exclude mixed dot functions from nvcc path
2019-01-24 21:00:45 +00:00
Evgeny Mankov
4ff355357f
[HIPIFY] CUDA Runtime API functions total revise (up to CUDA 10.0)
...
+ for all CUDA versions starting from 7.0
+ add missing functions
+ sync with HIP
+ update CUDA_Runtime_API_functions_supported_by_HIP.md
+ update affected tests
+ fix typos
+ formatting, annotating
2019-01-24 14:05:36 +03:00
Aaron Enye Shi
f31aa946e7
Fix mixed dot for Jenkins rocm_head
2019-01-23 16:39:25 +00:00
Aaron Enye Shi
1d70f4217e
Add tests for dot functions
2019-01-22 22:32:28 +00:00
Evgeny Mankov
841e056935
[HIPIFY][tests] Exclude one more test, which requires CUDA 7.5, if CUDA == 7.0
2019-01-22 22:26:43 +03:00
Yaxun Sam Liu
aa6abb0dcf
Fix get_time in tests for nvcc
2019-01-21 14:07:50 -05:00
Yaxun Sam Liu
05b5e30187
Use chrono instead of sys/time.h in test
...
sys/time.h is not available on Windows. Use C++11 <chrono> instead so that the tests
compile on Windows.
2019-01-20 00:05:27 -05:00
Evgeny Mankov
0206fadc14
[HIPIFY] option cuda-path might be used with '--' as well as with a single dash: '-'
...
+ Update README.md
2019-01-10 18:01:33 +03:00
Evgeny Mankov
fbfe005e4e
[HIPIFY] Introduce CUDA installation path option '-cuda-path'
...
Repeats clang's '--cuda-path' option.
[Reason]
When no other clang options are given, setting '-cuda-path' makes it unnecessary to specify the '--' separator before clang's '--cuda-path'.
+ Tests and scripts are updated accordingly.
2019-01-09 20:18:36 +03:00
Maneesh Gupta
580c808400
Update hipAsynchronousStreams.cpp
...
This test needs to be excluded from nvcc platform
2019-01-07 09:26:21 +05:30
Sameer Sahasrabuddhe
8fe2a4b167
introduce test for asynchronous streams
...
This test is identical to the HCC tests under
tests/Unit/OCKL/AsyncStreams, but only for synchronous operation.
2019-01-04 05:51:50 -05:00
emankov
9abcad9407
[HIPIFY][tests] CUDA 7.0 is also supported
...
+ Exclude 1 test for CUDA 7.0
+ Update README.md
2019-01-04 01:32:30 +03:00
emankov
65155c7159
[HIPIFY][tests] Exclude CUDA 8.0 (9.0) tests if CUDA < 8.0 (9.0)
2019-01-03 18:06:49 +03:00
emankov
7fb0f4ee2a
[HIPIFY][tests] Reduce cudaRegister.cu test
2019-01-03 17:17:21 +03:00
Maneesh Gupta
4d0266a495
Merge pull request #818 from yxsamliu/fix-hip-config
...
Add default arguments for hipConfigureCall
2019-01-01 10:25:25 +05:30
Maneesh Gupta
cb8e28144a
Merge pull request #821 from gargrahul/remove_redundancy_memset3d_test
...
[dtest] Remove redundant arguments from hipMemset3D test
2018-12-31 15:32:19 +05:30
emankov
dec459efca
[HIPIFY] Get rid of setting '--cuda-gpu-arch='
...
[Reasons]
+ We don't compile kernel code, at least for now, as HIP kernel syntax is almost identical to CUDA's;
+ clang always includes PTX in its binaries, so e.g. a binary compiled with --cuda-gpu-arch= would be forwards-compatible with e.g. sm_35 GPUs.
2018-12-29 19:48:28 +03:00
emankov
7ea586c323
[HIPIFY] hipify-clang now sets '-x cuda' implicitly
...
+ The user no longer needs to set '-x cuda';
+ Testing and Readme updated accordingly.
2018-12-29 17:04:59 +03:00
emankov
2d3f62c3e1
[HIPIFY][win] Set -std=c++14 if MSVC
...
+ Because the latest Windows SDK implementation relies on C++14 features, '-std=c++14' is a must;
+ Doesn't affect older versions of MSVC 2017, 2015 (the latest supported is MSVC 2015);
+ On Linux, '-std=c++11' is still a must;
+ Exclude '-std=' from lit config.
2018-12-29 15:51:06 +03:00
Rahul Garg
4272119eeb
Remove redundant arguments from hipMemset3D test
2018-12-20 06:51:22 +05:30
Yaxun Sam Liu
ec03a8b352
Add default arguments for hipConfigureCall
2018-12-18 15:48:21 -05:00
Aaron Enye Shi
58dfeff27a
Add *_rn functions back into HIP intrinsics
...
Add the round-to-nearest-even intrinsics back to the HIP math
intrinsics, as their removal caused a regression.
2018-12-18 19:31:54 +00:00
Evgeny Mankov
3d528cf491
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Just do load_config for lit.site.cfg earlier in lit.cfg to get initialized cuda_version
+ Remove lit.cfg.in for CMake variables substitution
+ Restore lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-12 17:05:01 +03:00
Evgeny Mankov
b955e317a7
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Add lit.cfg.in for CMake variable substitution into the now-generated lit.cfg
+ Exclude lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-11 22:03:11 +03:00
Evgeny Mankov
09612ac03f
[HIPIFY][tests] Reorganization
2018-12-04 20:47:34 +03:00
Evgeny Mankov
72d40db358
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
...
+ lit update
+ fix typos
2018-12-04 19:24:29 +03:00
Evgeny Mankov
116b9191f7
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
...
+ cuSPARSE is supported up to CUDA 10.0
+ cuSPARSE_08 test (CUDA 10.0)
+ update CUSPARSE_API_supported_by_HIP.md
+ lit: add a rule for excluding CUDA 10.0 tests
2018-11-30 15:33:57 +03:00
Evgeny Mankov
8a84c665e2
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
...
+ Tridiagonal Solve, Batched Tridiagonal and Pentadiagonal Solve
+ cuSPARSE_07 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 18:46:51 +03:00
Evgeny Mankov
6c085c83a2
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
...
+ cuSPARSE_06 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 15:59:58 +03:00
Evgeny Mankov
b39bd8c9a9
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
...
+ cuSPARSE_05 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-28 20:10:30 +03:00
Evgeny Mankov
11fedfbff6
[HIPIFY][SPARSE] Extra functions
...
+ cuSPARSE_04 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 12:41:50 +03:00
Evgeny Mankov
82bbaf0b70
[HIPIFY][SPARSE] Level 3 functions
...
+ cuSPARSE_03 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 11:57:25 +03:00
Evgeny Mankov
c7f6266824
[HIPIFY][SPARSE] Helper and Level 1,2 functions
...
+ 2 cuSPARSE tests
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-26 15:11:52 +03:00
Maneesh Gupta
bcea027bf1
Merge pull request #748 from mkuron/getsymboladdress
...
Implement hipGetSymbolAddress and hipGetSymbolSize
2018-11-21 10:32:01 +05:30
emankov
30c387a811
[HIPIFY][SPARSE] Initial support
2018-11-20 17:58:05 +03:00
Michael Kuron
8610128c3e
Merge branch 'master' into getsymboladdress
2018-11-20 12:03:22 +01:00
Aaron Enye Shi
0cfaa52d15
Guard rcp rounded implementation as well
...
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.
2018-11-06 19:53:28 +00:00
Aaron Enye Shi
e59c33250a
Guard the OCML rounded operations instead
...
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.
2018-11-06 16:32:14 +00:00
Michael Kuron
49cc703d7a
Test for hipGetSymbolSize and hipGetSymbolAddress
2018-11-06 11:39:34 +01:00
Aaron Enye Shi
2b108a2cfd
Remove non-working non-default-rounded math apis
...
ROCm-Device-Libs has dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma, so OCML has removed the rte, rtp, rtn, and rtz counterparts. This commit removes the corresponding HIP math APIs with the _ru, _rd, _rn, and _rz suffixes.
2018-11-05 22:34:16 +00:00