Граф коммитов

4224 Коммитов

Автор SHA1 Сообщение Дата
Evgeny Mankov 15d0c4b58b [HIPIFY][fix][#1246][#1655] Sort input files based on their dependency graph
+ Implemented by using clang Driver infrastructure

[ToDo] Add tests for in-place hipification of multiple files


[ROCm/clr commit: ef405daaee]
2019-12-07 11:20:17 +03:00
mhbliao 6643c80e8e Fix hipExtLaunchMultiKernelMultiDevice refactoring. (#1714)
- Use the correct condition for HIP VDI runtime.

[ROCm/clr commit: 209f31525f]
2019-12-06 09:49:17 -08:00
Rahul Garg 48faf7423a Revert - Changes related to hipMemcpyWithStream (#1718)
Reverting #1673, #1697 and #1707.
Support for hipMemcpyWithStream and memcpy optimizations, will be brought in again once issues seen with these are resolved independently.

[ROCm/clr commit: a369bd4418]
2019-12-06 09:51:53 +05:30
Evgeny Mankov d56471b77b Merge pull request #1712 from emankov/hipify
[HIPIFY] CUDA 10.2 (functions only)

[ROCm/clr commit: c3076a2a07]
2019-12-04 18:39:10 +03:00
Evgeny Mankov 8477ab5d86 [HIPIFY] CUDA 10.2 (functions only)
+ Both Driver API and RT API are supported and synced with each other
+ Update *.md docs and hipify-perl accordingly
+ Add new conversion type "virtual_memory", introduced in Dirver API


[ROCm/clr commit: 43441eb99c]
2019-12-04 18:37:31 +03:00
Evgeny Mankov 03250ea190 Merge pull request #1710 from emankov/hipify
[HIPIFY] CUDA 10.2 fp_16 support

[ROCm/clr commit: 187f28e26b]
2019-12-04 13:28:43 +03:00
Evgeny Mankov f6da983c96 [HIPIFY] CUDA 10.2 fp_16 support
[ROCm/clr commit: 721a8540c0]
2019-12-04 13:27:46 +03:00
Maneesh Gupta ecbc77d921 [packaging] Rename hip packages to conform to debian package renaming rules (#1708)
[ROCm/clr commit: 59b3bd0048]
2019-12-04 15:55:37 +05:30
Evgeny Mankov 9976235fb4 Merge pull request #1709 from emankov/hipify
[HIPIFY] CUDA 10.2 support (data types only)

[ROCm/clr commit: 89703e4ea9]
2019-12-04 12:26:51 +03:00
Evgeny Mankov deaf478ced [HIPIFY] CUDA 10.2 support (data types only)
+ Both Driver API and RT API are supported and synced with each other
+ Update *.md docs and hipify-perl accordingly


[ROCm/clr commit: 33f81effb1]
2019-12-04 12:25:20 +03:00
Rahul Garg 6c7a338471 Fix hipMemcpyWithStream (#1707)
* Fix hipMemcpyWithStream
* Add a dtest for the same.


[ROCm/clr commit: 261a04580f]
2019-12-04 11:56:23 +05:30
Aryan Salmanpour 2471c1950c [hip] refactoring hipExtLaunchMultiKernelMultiDevice API (#1698)
[Background] it was found that if lazy linking used for a library that calls hipExtLaunchMultiKernelMultiDevice API then this API can get the wrong program_state object for looking up device kernels leading to a "No device code available" error in this API.

To fix this issue, the API was refactored to be inline and get and pass the correct program_state to an internal hip API to request a multi-device kernel launch.

[ROCm/clr commit: 8eaea4d114]
2019-12-04 11:50:51 +05:30
Maneesh Gupta 9e9d49c891 Revert "[packaging] Conform to debian package naming rules (#1647)" (#1688)
This reverts commit 71be7baece.

[ROCm/clr commit: 38e971b645]
2019-12-01 14:47:43 +05:30
Evgeny Mankov 2c2a726726 Merge pull request #1702 from emankov/hipify
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP

[ROCm/clr commit: da468e181f]
2019-11-29 19:31:28 +03:00
Evgeny Mankov 5317d618d1 [HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
+ Update *.md docs and hipify-perl accordingly

[Reason]
Starting with CUDA 10.1 all error codes are merged between Driver and RT APIs

[ToDo]
Do the same merge in HIP API as there is no need in distinguishing return codes by API


[ROCm/clr commit: 636108aaa9]
2019-11-29 19:29:11 +03:00
Maneesh Gupta 97850ae9a8 Revert changes for atomic FADD support when address is in LDS (#1701)
This reverts PR #1591 and follow-on PR #1695

[ROCm/clr commit: 4c92bd50c4]
2019-11-29 11:58:12 +05:30
Alex Voicu b26aa2378e Uniform is_shared query. (#1695)
[ROCm/clr commit: 17a4780dc6]
2019-11-28 13:39:05 +05:30
Jeff Daily 22c07e2331 fix hipMempcy precondition, test for zero size first (#1697)
[ROCm/clr commit: 6df73e1f12]
2019-11-25 19:03:03 -08:00
Alex Voicu 82b55e6502 Unary operators were too restrictive in the type of their argument. (#1683)
[ROCm/clr commit: 306d50291e]
2019-11-22 07:54:53 +05:30
Shyam Saladi 84953da127 [HIPIFY] fix typo (#1681)
[ROCm/clr commit: 562336e733]
2019-11-22 07:54:37 +05:30
ansurya 0ece8d0f9b Fix rocBLAS compilation failure (#1677)
SWDEV-212749:
o Recent changes to “add support for extended launch” require hip_runtime.h to be include in hip_ext.h
o Order in which external applications include hip_hcc.h/hip_runtime.h causes compilation failure

[ROCm/clr commit: e5fc5aa41c]
2019-11-22 07:54:17 +05:30
Alex Voicu e1dca187c9 Use native support for atomic FADD when address is in LDS (#1591)
[ROCm/clr commit: 2ed3a0873c]
2019-11-22 07:53:48 +05:30
satyanveshd e2a0a547ff fixed directed tests fail when hcc bumped to 3.0 (#1678)
Handled the HCC version check appropriately as few of the directed tests (SWDEV-212161) were failing when hcc was bumped to 3.0.

[ROCm/clr commit: d4dde7a27d]
2019-11-20 21:37:52 +05:30
Alex Voicu 553a7c2ce3 General sync memcpy improvements. Add hipMemcpyWithStream (#1673)
* General sync memcpy improvements. Add `hipMemcpyWithStream`

* Update hip_memory.cpp


[ROCm/clr commit: 022ac3cb0a]
2019-11-20 21:36:37 +05:30
Rahul Garg 81dd3751a6 Update error codes for hipGetDevice for doxygen and move up null check (#1668)
* [docs] Update error codes for hipGetDevice

* Move up out ptr check


[ROCm/clr commit: 13c2a31d7e]
2019-11-20 21:35:27 +05:30
Paul Fultz II 29bb65f6e0 Fix helper header when using c++17 (#1666)
This will fix issue #1621. It also adds tests for is_callable with c++11, c++14, and c++17.

The fallback implementation was completely broken so I rewrote it so it pass the tests as well. This should be used instead of PR #1631.

[ROCm/clr commit: 57b1b03261]
2019-11-20 21:33:42 +05:30
Alex Voicu 5c27a4f570 Extend vector type capabilities and add tests to reflect it. (#1656)
[ROCm/clr commit: c383f20691]
2019-11-20 21:32:32 +05:30
Sarbojit2019 2c00d66995 Revert [HIP] Fixed hipStreamAddCallback (#1674)
This reverts commit 4d0b7e65bb.
Addresses SWDEV#212675.

[ROCm/clr commit: 8a3c36a424]
2019-11-20 11:55:46 +05:30
Jeff Daily 58d1ae32ee hipStreamSynchronize can skip marker if stream is empty (#1667)
[ROCm/clr commit: dfaaf04611]
2019-11-19 09:42:43 -08:00
Rahul Garg 6813f0436c [dtest] Rename hipExtLaunchMultiKernelMultiDevice dtest (#1671)
[ROCm/clr commit: fa1335f419]
2019-11-19 09:41:13 -08:00
Sameer Sahasrabuddhe 82aa679a6d hostcall: add tests that are only used by the VDI build (#1664)
[ROCm/clr commit: e158b0b255]
2019-11-18 19:52:40 -08:00
mhbliao e9f9ecd519 Fix mathlib and app builds with hip-clang. (#1665)
[ROCm/clr commit: a45de95113]
2019-11-18 08:18:20 -08:00
Rahul Garg f524f75b69 Fix gcc build on NVCC path (#1661)
* Fix gcc build on NVCC path

* Fix CI build errors

* [dtest] Fix texture and surface obj2D tests


[ROCm/clr commit: ff31f734fe]
2019-11-18 12:19:22 +05:30
Nick Curtis 5875d98e6f fix complex conjugate for double-complex (#1659)
The sign in the y component returned from hipConj incorrect for double-complex. Fix to match as in hipConjf above.

[ROCm/clr commit: cae9b13020]
2019-11-18 12:19:12 +05:30
Sarbojit2019 7d5bc56ab7 Added null check in hipEventSynchronize (#1627)
* Added missing null check in hipEventSynchronize
* Minor correction in the Event API description


[ROCm/clr commit: 7985ad218f]
2019-11-18 12:18:55 +05:30
jiabaxie de260a6d60 Updated hipEnvVarDriver to work with Windows (#1614)
* Updated hipEnvVarDriver to work with Windows

* Cleaned up a bit of code

* Fixed a part where putenv was used for both win and linux

* Defines moved to test_common.h and cleaned up code

* Cleaned up some macro defines and used const char instead

* Got rid of some excess commenting

* directory paths are unconditional

* Cleaned some duplicate code, and variables are now declared and defined together


[ROCm/clr commit: 7bbd420068]
2019-11-18 12:18:29 +05:30
kjayapra-amd fffdd3da8d [dtests] Fixing compilation error in saxpy.cpp (#1608)
[ROCm/clr commit: 99da905509]
2019-11-18 12:18:11 +05:30
Alex Voicu f48b38b0ed Add support for extended launch syntax. (#1530)
* Add support for extended launch syntax.

* Add unit test.

* Fix typo

* hipExtLaunchKernelGGL lives in hip_ext.h

Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff

* [dtest] remove redundant include from hipModuleGetGlobal dtest


[ROCm/clr commit: 355d0bdf95]
2019-11-16 22:24:07 -08:00
Maneesh Gupta 57225de63f [ci] Fix failures on nvcc path (#1654)
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78

[ROCm/clr commit: 4d0a234183]
2019-11-15 07:58:40 +05:30
Maneesh Gupta 71be7baece [packaging] Conform to debian package naming rules (#1647)
* [packaging] Conform to debian package naming rules

* Add hip_hcc metapackage for backward compatibility


[ROCm/clr commit: 5a03482690]
2019-11-14 11:40:34 +05:30
Evgeny Mankov 5cd24367d5 Merge pull request #1651 from emankov/hipify
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md

[ROCm/clr commit: cf79be0f64]
2019-11-12 16:06:25 +03:00
Evgeny Mankov ed11ec3fa9 [HIPIFY][doc] A few words about LLVM's FileCheck in README.md
[ROCm/clr commit: bb6f1648ec]
2019-11-12 16:04:28 +03:00
Maneesh Gupta ca9eb34000 Bump version to 3.0 (#1648)
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab

[ROCm/clr commit: 0a2682625f]
2019-11-10 20:23:58 -08:00
Rahul Garg c03f67dfbf Merge pull request #1643 from mhbliao/hliao/master/remove.redundant.decl
Remove redundant declarations.

[ROCm/clr commit: 4c73950e31]
2019-11-07 11:16:40 -08:00
Michael LIAO a10e227b56 Remove redundant declarations.
- The revised `hip/hip_ext.h` have that declarations.


[ROCm/clr commit: d28ad401c9]
2019-11-07 10:11:22 -05:00
Evgeny Mankov cdff675646 Merge pull request #1642 from emankov/hipify
[HIPIFY] Clang style formatting

[ROCm/clr commit: 806ae1f43e]
2019-11-07 12:49:52 +03:00
Evgeny Mankov 62b86d1c76 [HIPIFY] Clang style formatting
[ROCm/clr commit: 7f143585f6]
2019-11-07 12:48:05 +03:00
Evgeny Mankov 5561c2390d Merge pull request #1641 from emankov/hipify
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option

[ROCm/clr commit: 367261ea0e]
2019-11-07 11:35:15 +03:00
Evgeny Mankov db5f0b0237 [HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well
[ROCm/clr commit: 4d2424ad36]
2019-11-07 11:32:53 +03:00
Evgeny Mankov be483c4b3e [HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
[Reason] It turned out that it is not so: 2 jobs are always


[ROCm/clr commit: 21c3d25403]
2019-11-07 11:30:40 +03:00