Граф коммитов

3140 Коммитов

Автор SHA1 Сообщение Дата
Evgeny Mankov bffacb6c72 [HIPIFY][fix] Fix for the rest of found bugs in Statistics
+ Signs of the converted file are extended
+ Total converted lines and total elapsed time are fixed
+ Zero rates are excluded from statistics
2018-12-13 18:35:45 +03:00
Evgeny Mankov 4a58c0480b Merge pull request #802 from emankov/master
[HIPIFY][fix] Fix for #801
2018-12-13 14:40:00 +03:00
Evgeny Mankov 9d223f0f65 [HIPIFY][fix] Fixer for #801
[issue #801] Errors in hipification statistics
2018-12-12 21:33:00 +03:00
Evgeny Mankov 7741457dea Merge pull request #800 from emankov/master
[HIPIFY][tests][fix] Add correct CUDA version detection while testing
2018-12-12 18:30:03 +03:00
Evgeny Mankov 3d528cf491 [HIPIFY][tests][fix] Add correct CUDA version detection while testing
+ Just do load_config for lit.site.cfg earlier in lit.cfg to get initialized cuda_version
+ Remove lit.cfg.in for CMake variables substitution
+ Return back lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-12 17:05:01 +03:00
Maneesh Gupta 56ce3e37d5 Merge pull request #797 from gargrahul/fixhipPointerGetAttributes
Fixed hipPointerGetAttributes for hostmalloced ptr
2018-12-12 10:16:07 +05:30
Maneesh Gupta 0dd26b4f63 Merge pull request #608 from gargrahul/add_pinned_2d_sdma_copy
Added support for pinned 2D SDMA copy
2018-12-12 07:44:16 +05:30
Evgeny Mankov b955e317a7 [HIPIFY][tests][fix] Add correct CUDA version detection while testing
+ Add lit.cfg.in for CMake variables substitution to now generated lit.cfg
+ Exclude lit.cfg
+ Exclude CUDA 9.0 tests if CUDA < 9.0
+ Report CUDA version before running tests
2018-12-11 22:03:11 +03:00
Evgeny Mankov f3d8d3d989 Merge pull request #789 from emankov/master
[HIPIFY] Initial support of translation to ROCm
2018-12-11 10:56:41 +03:00
Rahul Garg 5f12067708 Fixed hipPointerGetAttributes for hostmalloced ptr 2018-12-08 01:42:08 +05:30
Evgeny Mankov c5ed25da22 [HIPIFY] Initial support of translation to ROCm
+ Option '-roc' is added to translate to ROCm instead of HIP
+ Update hipification mechanism accordingly
+ Populate all translation maps with additional field for ROCm identifiers
+ There are 4 degrees of CUDA support now: full support, only by HIP, only by ROCm, unsupported
2018-12-06 21:52:54 +03:00
Evgeny Mankov d3654e12e4 Merge pull request #787 from emankov/master
[HIPIFY][tests] Reorganization
2018-12-04 20:48:48 +03:00
Evgeny Mankov 09612ac03f [HIPIFY][tests] Reorganization 2018-12-04 20:47:34 +03:00
Evgeny Mankov e083cc64eb Merge pull request #786 from emankov/master
[HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
2018-12-04 19:30:26 +03:00
Evgeny Mankov 72d40db358 [HIPIFY][SPARSE] Add 3 more CUDA 10.0 tests
+ lit update
+ fix typos
2018-12-04 19:24:29 +03:00
Maneesh Gupta 0c8ba46b9f Merge pull request #785 from mangupta/ci-add-rocm_2_0
[ci] Add rocm-2.0.x to CI test infrastructure
2018-12-03 10:41:43 +05:30
Maneesh Gupta dc5b0a1fe0 [ci] Add rocm-2.0.x to CI test infrastructure
Change-Id: I7fc0c40d1bf50a90ce3e210f2c8e83d1f4bf6d5c
2018-12-03 08:54:13 +05:30
Evgeny Mankov 1ecfec2264 Merge pull request #784 from emankov/master
[HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
2018-11-30 15:35:14 +03:00
Evgeny Mankov 116b9191f7 [HIPIFY][SPARSE] Matrix Reorderings and Format Conversion Reference
+ cuSPARSE is supported up to CUDA 10.0
+ cuSPARSE_08 test (CUDA 10.0)
+ update CUSPARSE_API_supported_by_HIP.md
+ lit: add a rule for CUDA 10.0 tests excluding
2018-11-30 15:33:57 +03:00
Maneesh Gupta dbe34ac6ac Merge pull request #783 from scchan/revert_metadata_for_kerarg
Revert metadata for kerarg
2018-11-30 09:39:42 +05:30
Siu Chi Chan 1fbf639962 Revert "Rely on code object metadat for kernarg arguments alignof and sizeof."
This reverts commit fe1e963299.
2018-11-29 11:38:37 -05:00
Siu Chi Chan 71189c10c1 Revert "Handle the very confusing dual encoding of the symbol name."
This reverts commit bce3de8162.
2018-11-29 11:38:36 -05:00
Siu Chi Chan 09f87e41d9 Revert "If we've already seen a __global__ function we do not need to re-parse"
This reverts commit f7ba987038.
2018-11-29 11:38:35 -05:00
Siu Chi Chan aeca2c8cdc Revert "Handle (odd) corner case of argumentless __global__ function."
This reverts commit c0bd1a5af8.
2018-11-29 11:38:34 -05:00
Siu Chi Chan 8eb9b38e76 Revert "Missing handling nullary __global__ functions for mixed arity cases."
This reverts commit 4ebc229b9a.
2018-11-29 11:38:31 -05:00
Siu Chi Chan 909609773c Revert "Fix issue in kernarg metadata parsing due to early finalization"
This reverts commit 19acf86cef.
2018-11-29 11:35:00 -05:00
Evgeny Mankov c80c6886e2 Merge pull request #782 from emankov/master
[HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
2018-11-29 18:49:34 +03:00
Evgeny Mankov 8a84c665e2 [HIPIFY][SPARSE] Preconditioners Reference: Tridiagonal and Pentadiagonal solves
+ Tridiagonal Solve, Batched Tridiagonal and Pentadiagonal Solve
+ cuSPARSE_07 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 18:46:51 +03:00
Evgeny Mankov d09e6dfb68 Merge pull request #781 from emankov/master
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
2018-11-29 16:01:44 +03:00
Evgeny Mankov 6c085c83a2 [HIPIFY][SPARSE] Preconditioners Reference: Incomplete LU Factorization: level 0
+ cuSPARSE_06 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-29 15:59:58 +03:00
Evgeny Mankov eafd40f904 Merge pull request #780 from emankov/master
[HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
2018-11-28 20:12:21 +03:00
Evgeny Mankov b39bd8c9a9 [HIPIFY][SPARSE] Preconditioners Reference: Incomplete Cholesky Factorization: level 0
+ cuSPARSE_05 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-28 20:10:30 +03:00
Maneesh Gupta 491caacff1 Merge pull request #776 from yxsamliu/fix-vdi-path
Let hipcc handle HIP_VDI_HOME without x86_64
2018-11-28 09:33:03 +05:30
Maneesh Gupta 9f649333ab Merge pull request #761 from yxsamliu/genco
Let hip-clang support --genco
2018-11-28 09:32:56 +05:30
Yaxun Sam Liu 450f093231 Let hip-clang support --genco 2018-11-27 15:55:50 -05:00
Evgeny Mankov 11c094bedc Merge pull request #779 from emankov/master
[HIPIFY][SPARSE] Extra functions
2018-11-27 12:43:30 +03:00
Evgeny Mankov 11fedfbff6 [HIPIFY][SPARSE] Extra functions
+ cuSPARSE_04 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 12:41:50 +03:00
Evgeny Mankov 7d5d598288 Merge pull request #778 from emankov/master
[HIPIFY][SPARSE] Level 3 functions
2018-11-27 11:59:09 +03:00
Evgeny Mankov 82bbaf0b70 [HIPIFY][SPARSE] Level 3 functions
+ cuSPARSE_03 test
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-27 11:57:25 +03:00
Yaxun Sam Liu 9d46966a59 Let hipcc handle HIP_VDI_HOME without x86_64 2018-11-26 17:00:37 -05:00
Evgeny Mankov dc5aea4b03 Merge pull request #775 from emankov/master
[HIPIFY][SPARSE] Helper and Level 1,2 functions
2018-11-26 15:14:40 +03:00
Evgeny Mankov c7f6266824 [HIPIFY][SPARSE] Helper and Level 1,2 functions
+ 2 cuSPARSE tests
+ update CUSPARSE_API_supported_by_HIP.md
2018-11-26 15:11:52 +03:00
Maneesh Gupta 9a1d8137eb Merge pull request #772 from scchan/kernarg_metadata_early_finalize
Fix issue in kernarg metadata parsing due to early finalization
2018-11-26 08:55:19 +05:30
Maneesh Gupta 7ce082415b Merge pull request #773 from fronteer/master
Support of printing process ID for HIP tracing
2018-11-23 11:16:22 +05:30
Maneesh Gupta 160c509e23 Merge pull request #760 from eshcherb/roctracer-hip-frontend-181113
Roctracer hip frontend 181113
2018-11-23 11:08:25 +05:30
Evgeny 8a4354bdc0 FATAL_ERRROR if Profiling API header not found 2018-11-22 21:51:20 -06:00
Evgeny 3bcf2fcd18 fixing the adding of USE_PROF_API define only if the tracer header was found 2018-11-22 21:12:08 -06:00
Qianfeng Zhang 91e8ec9e4b Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP 2018-11-22 19:11:49 +08:00
Qianfeng Zhang 81cf7cabfa Add support of printing process ID for HIP Tracing 2018-11-22 18:58:06 +08:00
Siu Chi Chan 19acf86cef Fix issue in kernarg metadata parsing due to early finalization
The logic to parse the kernel metadata is unaware that enabling
of early finalization could result in multiple code blobs in a
single .kernel section.  This teaches the HIP runtime to handle
that.

Change-Id: I1581b42f0da8b30233d7898014f7468728c1d489
2018-11-21 12:09:46 -05:00