Maneesh Gupta
274db66ff2
Merge pull request #817 from aaronenyeshi/fix-rn-intrinsics
...
Add *_rn functions back into HIP intrinsics
2018-12-27 16:07:27 +05:30
Evgeny Mankov
8ffd7e5fa4
Merge pull request #831 from emankov/master
...
[HIPIFY] Introduce option -o-dir
2018-12-27 13:36:12 +03:00
emankov
c65338fbf7
[HIPIFY] Introduce option -o-dir
...
Option -o-dir for output directory:
+ if not specified source file(s) directory is used;
+ creates the directory if the directory doesn't exist (only one level in a tree);
+ if -o and -o-dir both are specified the hipified file is placed to "-o-dir" + "-o";
+ reports an error in case of a wrong directory specified, in case of necessity of creating a tree of subfolders, or in case of a filename specified.
2018-12-26 16:30:41 +03:00
Evgeny Mankov
3cf4df4571
Merge pull request #829 from emankov/master
...
[HIPIFY][#812 ] Finishing with temps
2018-12-25 18:39:12 +03:00
emankov
7cac7dbfea
[HIPIFY][ #812 ] Finishing with temps
...
1. Option -temp-dir for temporary directory:
+ if not specified system temp is used;
+ creates the directory if the directory doesn't exist (only one level in a tree);
+ reports an error in case of a wrong directory specified, in case of necessity of creating a tree of subfolders, or in case of a filename specified.
2. Option -save-temps for preserving temporary files:
+ if specified temporary files are not being deleted from system temps and user temps as well.
3. Work with files in terms of calculated absolute paths by collapsing all '.' and '..' patterns, resolving symlinks and expanding '~' expression to the user's home directory:
+ to produce correct include paths;
+ to avoid possible errors on file routines.
2018-12-22 00:47:19 +03:00
Evgeny Mankov
33d4018c67
Merge pull request #825 from emankov/master
...
[HIPIFY][fix][#824 ] Fix the bug with -inplace option
2018-12-21 18:37:15 +03:00
emankov
cb598445f5
[HIPIFY][fix][ #824 ] Fix the bug with -inplace option
...
+ Do temp file copying instead of renaming
+ Replace all file routine functions with LLVM's
+ Do not output hipified file in case of errors
2018-12-20 20:43:50 +03:00
Evgeny Mankov
10faf729fa
Merge pull request #822 from emankov/master
...
[HIPIFY] Add the legal right comment to all the sources
2018-12-20 18:53:09 +03:00
emankov
340550cc56
[HIPIFY] Add the legal right comment to all the sources
2018-12-20 18:47:57 +03:00
Evgeny Mankov
41867c6753
Merge pull request #816 from emankov/master
...
[HIPIFY] Temporary files in system temps introducing
2018-12-20 18:28:53 +03:00
Aaron Enye Shi
2d50392c17
Add *_rn functions back into HIP intrinsics
...
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.
2018-12-18 19:31:54 +00:00
Evgeny Mankov
1180928408
[HIPIFY] Temporary files in system temps introducing
...
+ Files are not being placed near original source before hipification anymore, system tmp is used for them now.
+ Additional include (-I) to absolute original source dir is added
+ Partially implements #812
+ Tested on windows/linux, single/multiple inputs, with and without includes to user header files
2018-12-17 20:48:48 +03:00
Evgeny Mankov
637bf7eb6f
Merge pull request #809 from emankov/master
...
[HIPIFY][fix] Fix for the rest of found bugs in Statistics
2018-12-14 15:21:53 +03:00
Evgeny Mankov
1e30cd3ea7
[HIPIFY][fix] Fix for the rest of found bugs in Statistics
...
+ Signs of the converted file are extended
+ Total converted lines and total elapsed time are fixed
+ Zero rates are excluded from statistics
2018-12-13 18:35:45 +03:00
Evgeny Mankov
a5d25a9c5d
Merge pull request #802 from emankov/master
...
[HIPIFY][fix] Fix for #801
2018-12-13 14:40:00 +03:00
Evgeny Mankov
8d28db9c8d
[HIPIFY][fix] Fixer for #801
...
[issue #801 ] Errors in hipification statistics
2018-12-12 21:33:00 +03:00
Evgeny Mankov
80d5861d78
Merge pull request #800 from emankov/master
...
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
2018-12-12 18:30:03 +03:00
Evgeny Mankov
a5c66018b6
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Just do load_config for lit.site.cfg earlier in lit.cfg to get initialized cuda_version
+ Remove lit.cfg.in for CMake variables substitution
+ Return back lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-12 17:05:01 +03:00
Maneesh Gupta
a778f7cdf7
Merge pull request #797 from gargrahul/fixhipPointerGetAttributes
...
Fixed hipPointerGetAttributes for hostmalloced ptr
2018-12-12 10:16:07 +05:30
Maneesh Gupta
6ce99b066c
Merge pull request #608 from gargrahul/add_pinned_2d_sdma_copy
...
Added support for pinned 2D SDMA copy
2018-12-12 07:44:16 +05:30
Evgeny Mankov
2b01f1eb6f
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Add lit.cfg.in for CMake variables substitution to now generated lit.cfg
+ Exclude lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-11 22:03:11 +03:00
Evgeny Mankov
f2afe92844
Merge pull request #789 from emankov/master
...
[HIPIFY] Initial support of translation to ROCm
2018-12-11 10:56:41 +03:00
Rahul Garg
77fd517e09
Fixed hipPointerGetAttributes for hostmalloced ptr
2018-12-08 01:42:08 +05:30
Evgeny Mankov
747eb2c11c
[HIPIFY] Initial support of translation to ROCm
...
+ Option '-roc' is added to translate to ROCm instead of HIP
+ Update hipification mechanism accordingly
+ Populate all translation maps with additional field for ROCm identifiers
+ There are 4 degrees of CUDA support now: full support, only by HIP, only by ROCm, unsupported
2018-12-06 21:52:54 +03:00
Evgeny Mankov
32455eddaf
Merge pull request #787 from emankov/master
...
[HIPIFY][tests] Reorganization
2018-12-04 20:48:48 +03:00
Evgeny Mankov
b76cf94562
[HIPIFY][tests] Reorganization
2018-12-04 20:47:34 +03:00
Evgeny Mankov
f368ab4a32
Merge pull request #786 from emankov/master
...
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
2018-12-04 19:30:26 +03:00
Evgeny Mankov
a421aff96d
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
...
+ lit update
+ fix typos
2018-12-04 19:24:29 +03:00
Maneesh Gupta
f6ec2236cd
Merge pull request #785 from mangupta/ci-add-rocm_2_0
...
[ci] Add rocm-2.0.x to CI test infrastructure
2018-12-03 10:41:43 +05:30
Maneesh Gupta
94292df570
[ci] Add rocm-2.0.x to CI test infrastructure
...
Change-Id: I7fc0c40d1bf50a90ce3e210f2c8e83d1f4bf6d5c
2018-12-03 08:54:13 +05:30
Evgeny Mankov
7de2948334
Merge pull request #784 from emankov/master
...
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
2018-11-30 15:35:14 +03:00
Evgeny Mankov
424e4c8229
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
...
+ cuSPARSE is supported up to CUDA 10.0
+ cuSPARSE_08 test (CUDA 10.0)
+ update CUSPARSE_API_supported_by_HIP.md
+ lit: add a rule for CUDA 10.0 tests excluding
2018-11-30 15:33:57 +03:00
Maneesh Gupta
940c5c2b26
Merge pull request #783 from scchan/revert_metadata_for_kerarg
...
Revert metadata for kerarg
2018-11-30 09:39:42 +05:30
Siu Chi Chan
4251a70bc0
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit 299b9d1382 .
2018-11-29 11:38:37 -05:00
Siu Chi Chan
a41bce5163
Revert "Handle the very confusing dual encoding of the symbol name."
...
This reverts commit 1fe33b4738 .
2018-11-29 11:38:36 -05:00
Siu Chi Chan
dc36c49212
Revert "If we've already seen a __global__ function we do not need to re-parse"
...
This reverts commit 1621deee71 .
2018-11-29 11:38:35 -05:00
Siu Chi Chan
30a9be5643
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit 927b06f8d4 .
2018-11-29 11:38:34 -05:00
Siu Chi Chan
c5df445d1f
Revert "Missing handling nullary __global__ functions for mixed arity cases."
...
This reverts commit 1e64813716 .
2018-11-29 11:38:31 -05:00
Siu Chi Chan
adbf0c444b
Revert "Fix issue in kernarg metadata parsing due to early finalization"
...
This reverts commit ba2b5ff9da .
2018-11-29 11:35:00 -05:00
Evgeny Mankov
c0b3e59fcf
Merge pull request #782 from emankov/master
...
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
2018-11-29 18:49:34 +03:00
Evgeny Mankov
e8054fe485
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
...
+ Tridiagonal Solve, Batched Tridiagonal and Pentadiagonal Solve
+ cuSPARSE_07 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 18:46:51 +03:00
Evgeny Mankov
7317dce7bd
Merge pull request #781 from emankov/master
...
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
2018-11-29 16:01:44 +03:00
Evgeny Mankov
ef99d6dee4
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
...
+ cuSPARSE_06 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 15:59:58 +03:00
Evgeny Mankov
df81e8dcd8
Merge pull request #780 from emankov/master
...
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
2018-11-28 20:12:21 +03:00
Evgeny Mankov
e90373c927
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
...
+ cuSPARSE_05 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-28 20:10:30 +03:00
Maneesh Gupta
a2de2a40d9
Merge pull request #776 from yxsamliu/fix-vdi-path
...
Let hipcc handle HIP_VDI_HOME without x86_64
2018-11-28 09:33:03 +05:30
Maneesh Gupta
3dcf68fe79
Merge pull request #761 from yxsamliu/genco
...
Let hip-clang support --genco
2018-11-28 09:32:56 +05:30
Yaxun Sam Liu
988dcd1e4a
Let hip-clang support --genco
2018-11-27 15:55:50 -05:00
Evgeny Mankov
2cbf618191
Merge pull request #779 from emankov/master
...
[HIPIFY][SPARSE] Extra functions
2018-11-27 12:43:30 +03:00
Evgeny Mankov
c75b3c444b
[HIPIFY][SPARSE] Extra functions
...
+ cuSPARSE_04 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 12:41:50 +03:00