Evgeny Mankov
bffacb6c72
[HIPIFY][fix] Fix for the rest of found bugs in Statistics
...
+ Signs of the converted file are extended
+ Total converted lines and total elapsed time are fixed
+ Zero rates are excluded from statistics
2018-12-13 18:35:45 +03:00
Evgeny Mankov
4a58c0480b
Merge pull request #802 from emankov/master
...
[HIPIFY][fix] Fix for #801
2018-12-13 14:40:00 +03:00
Evgeny Mankov
9d223f0f65
[HIPIFY][fix] Fixer for #801
...
[issue #801 ] Errors in hipification statistics
2018-12-12 21:33:00 +03:00
Evgeny Mankov
7741457dea
Merge pull request #800 from emankov/master
...
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
2018-12-12 18:30:03 +03:00
Evgeny Mankov
3d528cf491
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Just do load_config for lit.site.cfg earlier in lit.cfg to get initialized cuda_version
+ Remove lit.cfg.in for CMake variables substitution
+ Return back lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-12 17:05:01 +03:00
Maneesh Gupta
56ce3e37d5
Merge pull request #797 from gargrahul/fixhipPointerGetAttributes
...
Fixed hipPointerGetAttributes for hostmalloced ptr
2018-12-12 10:16:07 +05:30
Maneesh Gupta
0dd26b4f63
Merge pull request #608 from gargrahul/add_pinned_2d_sdma_copy
...
Added support for pinned 2D SDMA copy
2018-12-12 07:44:16 +05:30
Evgeny Mankov
b955e317a7
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
...
+ Add lit.cfg.in for CMake variables substitution to now generated lit.cfg
+ Exclude lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-11 22:03:11 +03:00
Evgeny Mankov
f3d8d3d989
Merge pull request #789 from emankov/master
...
[HIPIFY] Initial support of translation to ROCm
2018-12-11 10:56:41 +03:00
Rahul Garg
5f12067708
Fixed hipPointerGetAttributes for hostmalloced ptr
2018-12-08 01:42:08 +05:30
Evgeny Mankov
c5ed25da22
[HIPIFY] Initial support of translation to ROCm
...
+ Option '-roc' is added to translate to ROCm instead of HIP
+ Update hipification mechanism accordingly
+ Populate all translation maps with additional field for ROCm identifiers
+ There are 4 degrees of CUDA support now: full support, only by HIP, only by ROCm, unsupported
2018-12-06 21:52:54 +03:00
Evgeny Mankov
d3654e12e4
Merge pull request #787 from emankov/master
...
[HIPIFY][tests] Reorganization
2018-12-04 20:48:48 +03:00
Evgeny Mankov
09612ac03f
[HIPIFY][tests] Reorganization
2018-12-04 20:47:34 +03:00
Evgeny Mankov
e083cc64eb
Merge pull request #786 from emankov/master
...
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
2018-12-04 19:30:26 +03:00
Evgeny Mankov
72d40db358
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
...
+ lit update
+ fix typos
2018-12-04 19:24:29 +03:00
Maneesh Gupta
0c8ba46b9f
Merge pull request #785 from mangupta/ci-add-rocm_2_0
...
[ci] Add rocm-2.0.x to CI test infrastructure
2018-12-03 10:41:43 +05:30
Maneesh Gupta
dc5b0a1fe0
[ci] Add rocm-2.0.x to CI test infrastructure
...
Change-Id: I7fc0c40d1bf50a90ce3e210f2c8e83d1f4bf6d5c
2018-12-03 08:54:13 +05:30
Evgeny Mankov
1ecfec2264
Merge pull request #784 from emankov/master
...
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
2018-11-30 15:35:14 +03:00
Evgeny Mankov
116b9191f7
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
...
+ cuSPARSE is supported up to CUDA 10.0
+ cuSPARSE_08 test (CUDA 10.0)
+ update CUSPARSE_API_supported_by_HIP.md
+ lit: add a rule for CUDA 10.0 tests excluding
2018-11-30 15:33:57 +03:00
Maneesh Gupta
dbe34ac6ac
Merge pull request #783 from scchan/revert_metadata_for_kerarg
...
Revert metadata for kerarg
2018-11-30 09:39:42 +05:30
Siu Chi Chan
1fbf639962
Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
...
This reverts commit fe1e963299 .
2018-11-29 11:38:37 -05:00
Siu Chi Chan
71189c10c1
Revert "Handle the very confusing dual encoding of the symbol name."
...
This reverts commit bce3de8162 .
2018-11-29 11:38:36 -05:00
Siu Chi Chan
09f87e41d9
Revert "If we've already seen a __global__ function we do not need to re-parse"
...
This reverts commit f7ba987038 .
2018-11-29 11:38:35 -05:00
Siu Chi Chan
aeca2c8cdc
Revert "Handle (odd) corner case of argumentless __global__ function."
...
This reverts commit c0bd1a5af8 .
2018-11-29 11:38:34 -05:00
Siu Chi Chan
8eb9b38e76
Revert "Missing handling nullary __global__ functions for mixed arity cases."
...
This reverts commit 4ebc229b9a .
2018-11-29 11:38:31 -05:00
Siu Chi Chan
909609773c
Revert "Fix issue in kernarg metadata parsing due to early finalization"
...
This reverts commit 19acf86cef .
2018-11-29 11:35:00 -05:00
Evgeny Mankov
c80c6886e2
Merge pull request #782 from emankov/master
...
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
2018-11-29 18:49:34 +03:00
Evgeny Mankov
8a84c665e2
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
...
+ Tridiagonal Solve, Batched Tridiagonal and Pentadiagonal Solve
+ cuSPARSE_07 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 18:46:51 +03:00
Evgeny Mankov
d09e6dfb68
Merge pull request #781 from emankov/master
...
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
2018-11-29 16:01:44 +03:00
Evgeny Mankov
6c085c83a2
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
...
+ cuSPARSE_06 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 15:59:58 +03:00
Evgeny Mankov
eafd40f904
Merge pull request #780 from emankov/master
...
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
2018-11-28 20:12:21 +03:00
Evgeny Mankov
b39bd8c9a9
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
...
+ cuSPARSE_05 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-28 20:10:30 +03:00
Maneesh Gupta
491caacff1
Merge pull request #776 from yxsamliu/fix-vdi-path
...
Let hipcc handle HIP_VDI_HOME without x86_64
2018-11-28 09:33:03 +05:30
Maneesh Gupta
9f649333ab
Merge pull request #761 from yxsamliu/genco
...
Let hip-clang support --genco
2018-11-28 09:32:56 +05:30
Yaxun Sam Liu
450f093231
Let hip-clang support --genco
2018-11-27 15:55:50 -05:00
Evgeny Mankov
11c094bedc
Merge pull request #779 from emankov/master
...
[HIPIFY][SPARSE] Extra functions
2018-11-27 12:43:30 +03:00
Evgeny Mankov
11fedfbff6
[HIPIFY][SPARSE] Extra functions
...
+ cuSPARSE_04 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 12:41:50 +03:00
Evgeny Mankov
7d5d598288
Merge pull request #778 from emankov/master
...
[HIPIFY][SPARSE] Level 3 functions
2018-11-27 11:59:09 +03:00
Evgeny Mankov
82bbaf0b70
[HIPIFY][SPARSE] Level 3 functions
...
+ cuSPARSE_03 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 11:57:25 +03:00
Yaxun Sam Liu
9d46966a59
Let hipcc handle HIP_VDI_HOME without x86_64
2018-11-26 17:00:37 -05:00
Evgeny Mankov
dc5aea4b03
Merge pull request #775 from emankov/master
...
[HIPIFY][SPARSE] Helper and Level 1,2 functions
2018-11-26 15:14:40 +03:00
Evgeny Mankov
c7f6266824
[HIPIFY][SPARSE] Helper and Level 1,2 functions
...
+ 2 cuSPARSE tests
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-26 15:11:52 +03:00
Maneesh Gupta
9a1d8137eb
Merge pull request #772 from scchan/kernarg_metadata_early_finalize
...
Fix issue in kernarg metadata parsing due to early finalization
2018-11-26 08:55:19 +05:30
Maneesh Gupta
7ce082415b
Merge pull request #773 from fronteer/master
...
Support of printing process ID for HIP tracing
2018-11-23 11:16:22 +05:30
Maneesh Gupta
160c509e23
Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
...
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Evgeny
8a4354bdc0
FATAL_ERRROR if Profiling API header not found
2018-11-22 21:51:20 -06:00
Evgeny
3bcf2fcd18
fixing the adding of USE_PROF_API define only if the tracer header was found
2018-11-22 21:12:08 -06:00
Qianfeng Zhang
91e8ec9e4b
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP
2018-11-22 19:11:49 +08:00
Qianfeng Zhang
81cf7cabfa
Add support of printing process ID for HIP Tracing
2018-11-22 18:58:06 +08:00
Siu Chi Chan
19acf86cef
Fix issue in kernarg metadata parsing due to early finalization
...
The logic to parse the kernel metadata is unaware that enabling
of early finalization could result in multiple code blobs in a
single .kernel section. This teaches the HIP runtime to handle
that.
Change-Id: I1581b42f0da8b30233d7898014f7468728c1d489
2018-11-21 12:09:46 -05:00