Commit graph

4249 commits

Author SHA1 Message Date
Alex Voicu be70b9f7e7 static inline in a header, just like excess sugar in a diet, causes bloat (#1692) 2019-12-23 19:09:38 +05:30
Evgeny Mankov 9c91632194 Merge pull request #1734 from vsytch/win10-hipMemcpy-fixes
Fix OOM issues with hipMemcpy test on Windows
2019-12-21 22:02:34 +03:00
Maneesh Gupta 18c1003469 bump version to 3.1 (#1756)
* Bump version to 3.1

* [ci] Enable tests on ROCm 3.0
2019-12-20 12:10:25 -08:00
Evgeny Mankov 5678cf450a Merge pull request #1757 from emankov/master
[HIPIFY][doc] Update Readme.md: versioning
2019-12-20 20:57:48 +03:00
Evgeny Mankov dd0bf8c7ff [HIPIFY][doc] Update Readme.md: versioning 2019-12-20 20:55:19 +03:00
Sarbojit2019 e2fc00da65 Fix for windows dtest build failure (#1742) 2019-12-19 13:10:43 -08:00
mhbliao 99a3b66110 [hip] Add macro guarding the enum conversion for scalar accessor. (#1748)
- That's a high overhead part, which needs enabling ONLY if necessary.
2019-12-19 10:08:37 -08:00
Evgeny Mankov 191808fe72 Merge pull request #1739 from emankov/hipify-clang
[HIP] Unify hipError_t (Step 1)
2019-12-19 07:57:02 +03:00
Evgeny Mankov 664a4b5ce1 Merge pull request #1743 from emankov/master
[HIPIFY] Revise HIP runtime
2019-12-16 20:11:19 +03:00
Evgeny Mankov 3b00057922 [HIPIFY] Revise HIP runtime
+ hipMemAllocHost is deprecated, use hipHostMalloc instead
+ hipMemAllocManaged -> hipMallocManaged
+ hipFreeHost -> hipHostFree
+ cudaMemset3D(Async) -> hipMemset3D(Async)
+ update docs and hipify-perl accordingly
2019-12-16 20:09:55 +03:00
Vladislav Sytchenko bf3df9d7c0 Add comments explaining choice on the cap for system memory on Windows. 2019-12-13 21:38:27 -05:00
Vladislav Sytchenko 12634879e2 Add explicit cast when computing the available amount of system memory. 2019-12-13 21:37:30 -05:00
Evgeny Mankov d8737ba50c [HIP] Unify hipError_t (Step 1)
Step 1. Set the same values for RT error codes as for analogous Driver's

[Reason] RT's and Driver's error codes unification in CUDA 10.2
2019-12-13 19:40:16 +03:00
Evgeny Mankov 651a91b5e0 Merge pull request #1738 from emankov/hipify-clang
[HIPIFY][build] Fix building with LLVM trunk
2019-12-13 15:43:24 +03:00
Evgeny Mankov 96839c82ae [HIPIFY][build] Fix building with LLVM trunk 2019-12-13 15:42:27 +03:00
Evgeny Mankov f53aae50ff Merge pull request #1736 from emankov/hipify
[HIPIFY][BLAS] Sync with cuBlas/rocblas/hipBlas
2019-12-12 19:56:08 +03:00
Evgeny Mankov 396a8a7a00 [HIPIFY][BLAS] Sync with cuBlas/rocblas/hipBlas 2019-12-12 19:52:18 +03:00
Vladislav Sytchenko b12c53cceb Reduce the amount of free host memory to 40% of what is reported on Windows, otherwise we can run into OOM situations. 2019-12-11 20:21:12 -05:00
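The 40% cap described in the commit above can be sketched as follows. This is only an illustration: the function name `usableHostMemory`, the `onWindows` flag, and the choice to leave the value unchanged on other platforms are assumptions, not code from the HIP test sources.

```cpp
#include <cstdint>

// Hypothetical helper, not taken from the HIP tests.
// The 40% figure comes from the commit message; everything else is assumed.
constexpr std::uint64_t kWindowsHostMemPercent = 40;

inline std::uint64_t usableHostMemory(std::uint64_t reportedFreeBytes, bool onWindows) {
    if (!onWindows) {
        return reportedFreeBytes;  // assumption: no cap on other platforms
    }
    // Divide before multiplying so very large byte counts cannot overflow.
    return reportedFreeBytes / 100 * kWindowsHostMemPercent;
}
```

With this sketch, `usableHostMemory(1000, true)` yields 400, while `usableHostMemory(1000, false)` returns the reported value unchanged.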
mhbliao 444c931641 Only add hipExtLaunchMultiKernelMultiDevice for non-HCC compilers. (#1729) 2019-12-10 10:32:25 -08:00
jglaser 00d735cdc9 fix linking of vector types with gcc (#1690)
* fix linking of vector types when linking hipcc objects with gcc

* use __attribute__((vector_size)) with both clang and gcc

and reinstate nonaligned n=3 vector type

* use implicit conversion to value and ext_vector_type when available

* Alternate formulation for GCC compatibility

* Built-in arrays don't mix well with placement new

* Fix typo

* Add conversions to enum

* Fix Scalar_accessor assignment.

* Update hip_vector_types.h

* stir up the underlying_type hideous mess

This fixes the HIP build issue "error: only enumeration types have underlying types".
2019-12-10 09:40:15 +05:30
Evgeny Mankov a1ff68dbd6 Merge pull request #1725 from emankov/hipify-clang
[HIPIFY] Add cleanupHipifyOptions functionality
2019-12-08 15:15:57 +03:00
Evgeny Mankov 40bac6b144 [HIPIFY] Add cleanupHipifyOptions functionality
Needed for sorting input files to avoid clang errors on unsupported options
2019-12-08 15:13:28 +03:00
Evgeny Mankov f5f18efad8 Merge pull request #1723 from emankov/hipify-clang
[HIPIFY] main source file refactoring
2019-12-07 14:07:23 +03:00
Evgeny Mankov b472f41896 [HIPIFY] main source file refactoring 2019-12-07 13:50:13 +03:00
Evgeny Mankov c14026b056 Merge pull request #1722 from emankov/hipify-clang
[HIPIFY][fix][#1246][#1655] Sort input files based on their dependency graph
2019-12-07 11:23:01 +03:00
Evgeny Mankov 6868b38d1b [HIPIFY][fix][#1246][#1655] Sort input files based on their dependency graph
+ Implemented by using clang Driver infrastructure

[ToDo] Add tests for in-place hipification of multiple files
2019-12-07 11:20:17 +03:00
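Ordering files by their dependency graph can be illustrated with a generic topological sort. The commit states that the real change uses the clang Driver infrastructure; the sketch below does not, and swaps in Kahn's algorithm over a hypothetical `deps` map (file name to the files it depends on), assuming dependencies should come first.

```cpp
#include <cstddef>
#include <map>
#include <queue>
#include <string>
#include <vector>

// Generic Kahn's-algorithm sketch; NOT the clang Driver-based implementation.
// `deps` (hypothetical format) maps each input file to the files it depends on.
inline std::vector<std::string> sortByDependencies(
        const std::map<std::string, std::vector<std::string>>& deps) {
    std::map<std::string, std::size_t> pending;             // unmet dependencies per file
    std::map<std::string, std::vector<std::string>> users;  // reverse edges
    for (const auto& entry : deps) {
        pending[entry.first];  // make sure every input file has a counter
        for (const auto& dep : entry.second) {
            if (deps.count(dep) == 0) continue;  // ignore files outside the input set
            ++pending[entry.first];
            users[dep].push_back(entry.first);
        }
    }
    std::queue<std::string> ready;
    for (const auto& entry : pending) {
        if (entry.second == 0) ready.push(entry.first);
    }
    std::vector<std::string> order;
    while (!ready.empty()) {
        const std::string file = ready.front();
        ready.pop();
        order.push_back(file);
        for (const auto& user : users[file]) {
            if (--pending[user] == 0) ready.push(user);
        }
    }
    return order;  // holds fewer entries than `deps` if the graph has a cycle
}
```

Files caught in a dependency cycle never reach a count of zero, so a caller can detect a cycle by comparing the result size with the input size.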
mhbliao e9da934ac6 Fix hipExtLaunchMultiKernelMultiDevice refactoring. (#1714)
- Use the correct condition for HIP VDI runtime.
2019-12-06 09:49:17 -08:00
Rahul Garg e53fc316f1 Revert - Changes related to hipMemcpyWithStream (#1718)
Reverting #1673, #1697 and #1707.
Support for hipMemcpyWithStream and the memcpy optimizations will be brought in again once the issues seen with them are resolved independently.
2019-12-06 09:51:53 +05:30
Evgeny Mankov 496f0c53b3 Merge pull request #1712 from emankov/hipify
[HIPIFY] CUDA 10.2 (functions only)
2019-12-04 18:39:10 +03:00
Evgeny Mankov ec9b6a95a3 [HIPIFY] CUDA 10.2 (functions only)
+ Both Driver API and RT API are supported and synced with each other
+ Update *.md docs and hipify-perl accordingly
+ Add new conversion type "virtual_memory", introduced in Driver API
2019-12-04 18:37:31 +03:00
Evgeny Mankov c518929fbc Merge pull request #1710 from emankov/hipify
[HIPIFY] CUDA 10.2 fp_16 support
2019-12-04 13:28:43 +03:00
Evgeny Mankov e3c3c6d86e [HIPIFY] CUDA 10.2 fp_16 support 2019-12-04 13:27:46 +03:00
Maneesh Gupta 1f6b3e2aa5 [packaging] Rename hip packages to conform to debian package renaming rules (#1708) 2019-12-04 15:55:37 +05:30
Evgeny Mankov 09998838e4 Merge pull request #1709 from emankov/hipify
[HIPIFY] CUDA 10.2 support (data types only)
2019-12-04 12:26:51 +03:00
Evgeny Mankov 5a03e8ccb9 [HIPIFY] CUDA 10.2 support (data types only)
+ Both Driver API and RT API are supported and synced with each other
+ Update *.md docs and hipify-perl accordingly
2019-12-04 12:25:20 +03:00
Rahul Garg 892d2a562e Fix hipMemcpyWithStream (#1707)
* Fix hipMemcpyWithStream
* Add a dtest for the same.
2019-12-04 11:56:23 +05:30
Aryan Salmanpour 68cc787781 [hip] refactoring hipExtLaunchMultiKernelMultiDevice API (#1698)
[Background] It was found that if lazy linking is used for a library that calls the hipExtLaunchMultiKernelMultiDevice API, the API can get the wrong program_state object when looking up device kernels, leading to a "No device code available" error.

To fix this issue, the API was refactored to be inline and get and pass the correct program_state to an internal hip API to request a multi-device kernel launch.
2019-12-04 11:50:51 +05:30
Maneesh Gupta 4585b7f6f3 Revert "[packaging] Conform to debian package naming rules (#1647)" (#1688)
This reverts commit 2ca1a82036.
2019-12-01 14:47:43 +05:30
Evgeny Mankov 1e9ac70f3b Merge pull request #1702 from emankov/hipify
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
2019-11-29 19:31:28 +03:00
Evgeny Mankov d33dd71ed3 [HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
+ Update *.md docs and hipify-perl accordingly

[Reason]
Starting with CUDA 10.1 all error codes are merged between Driver and RT APIs

[ToDo]
Do the same merge in the HIP API, as there is no need to distinguish return codes by API
2019-11-29 19:29:11 +03:00
Maneesh Gupta 32442c6506 Revert changes for atomic FADD support when address is in LDS (#1701)
This reverts PR #1591 and follow-on PR #1695
2019-11-29 11:58:12 +05:30
Alex Voicu b6514fffb9 Uniform is_shared query. (#1695) 2019-11-28 13:39:05 +05:30
Jeff Daily 299bbe2325 fix hipMemcpy precondition, test for zero size first (#1697) 2019-11-25 19:03:03 -08:00
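The idea of testing for zero size before any other precondition can be sketched in plain C++. The names `CopyStatus` and `checkedCopy` are hypothetical, and the assumption that a zero-byte copy succeeds regardless of the pointer values is an interpretation of the commit title, not a statement about the HIP runtime.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical status codes and function, for illustration only;
// this is not the HIP runtime implementation.
enum class CopyStatus { Success, InvalidValue };

inline CopyStatus checkedCopy(void* dst, const void* src, std::size_t sizeBytes) {
    // Zero-size check first: a zero-byte copy is treated as a successful no-op,
    // so the pointer checks below never reject it.
    if (sizeBytes == 0) {
        return CopyStatus::Success;
    }
    if (dst == nullptr || src == nullptr) {
        return CopyStatus::InvalidValue;
    }
    std::memcpy(dst, src, sizeBytes);
    return CopyStatus::Success;
}
```

If the pointer check ran first, a call such as `checkedCopy(nullptr, nullptr, 0)` would be rejected even though there is nothing to copy.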
Alex Voicu aaf31b6b96 Unary operators were too restrictive in the type of their argument. (#1683) 2019-11-22 07:54:53 +05:30
Shyam Saladi 70f457d7ea [HIPIFY] fix typo (#1681) 2019-11-22 07:54:37 +05:30
ansurya e60dec51da Fix rocBLAS compilation failure (#1677)
SWDEV-212749:
o Recent changes to “add support for extended launch” require hip_runtime.h to be included in hip_ext.h
o Order in which external applications include hip_hcc.h/hip_runtime.h causes compilation failure
2019-11-22 07:54:17 +05:30
Alex Voicu d597e7ca20 Use native support for atomic FADD when address is in LDS (#1591) 2019-11-22 07:53:48 +05:30
satyanveshd 6b06911ef1 fixed directed tests fail when hcc bumped to 3.0 (#1678)
Handled the HCC version check appropriately, as a few of the directed tests (SWDEV-212161) were failing when hcc was bumped to 3.0.
2019-11-20 21:37:52 +05:30
Alex Voicu 5a1f823739 General sync memcpy improvements. Add hipMemcpyWithStream (#1673)
* General sync memcpy improvements. Add `hipMemcpyWithStream`

* Update hip_memory.cpp
2019-11-20 21:36:37 +05:30
Rahul Garg b3161e9fa0 Update error codes for hipGetDevice for doxygen and move up null check (#1668)
* [docs] Update error codes for hipGetDevice

* Move up out ptr check
2019-11-20 21:35:27 +05:30