Evgeny Mankov
15d0c4b58b
[HIPIFY][fix][ #1246 ][ #1655 ] Sort input files based on their dependency graph
...
+ Implemented by using clang Driver infrastructure
[ToDo] Add tests for in-place hipification of multiple files
[ROCm/clr commit: ef405daaee ]
2019-12-07 11:20:17 +03:00
mhbliao
6643c80e8e
Fix hipExtLaunchMultiKernelMultiDevice refactoring. ( #1714 )
...
- Use the correct condition for HIP VDI runtime.
[ROCm/clr commit: 209f31525f ]
2019-12-06 09:49:17 -08:00
Rahul Garg
48faf7423a
Revert - Changes related to hipMemcpyWithStream ( #1718 )
...
Reverting #1673 , #1697 and #1707 .
Support for hipMemcpyWithStream and memcpy optimizations, will be brought in again once issues seen with these are resolved independently.
[ROCm/clr commit: a369bd4418 ]
2019-12-06 09:51:53 +05:30
Evgeny Mankov
d56471b77b
Merge pull request #1712 from emankov/hipify
...
[HIPIFY] CUDA 10.2 (functions only)
[ROCm/clr commit: c3076a2a07 ]
2019-12-04 18:39:10 +03:00
Evgeny Mankov
8477ab5d86
[HIPIFY] CUDA 10.2 (functions only)
...
+ Both Driver API and RT API are supported and synced with each other
+ Update *.md docs and hipify-perl accordingly
+ Add new conversion type "virtual_memory", introduced in Dirver API
[ROCm/clr commit: 43441eb99c ]
2019-12-04 18:37:31 +03:00
Evgeny Mankov
03250ea190
Merge pull request #1710 from emankov/hipify
...
[HIPIFY] CUDA 10.2 fp_16 support
[ROCm/clr commit: 187f28e26b ]
2019-12-04 13:28:43 +03:00
Evgeny Mankov
f6da983c96
[HIPIFY] CUDA 10.2 fp_16 support
...
[ROCm/clr commit: 721a8540c0 ]
2019-12-04 13:27:46 +03:00
Maneesh Gupta
ecbc77d921
[packaging] Rename hip packages to conform to debian package renaming rules ( #1708 )
...
[ROCm/clr commit: 59b3bd0048 ]
2019-12-04 15:55:37 +05:30
Evgeny Mankov
9976235fb4
Merge pull request #1709 from emankov/hipify
...
[HIPIFY] CUDA 10.2 support (data types only)
[ROCm/clr commit: 89703e4ea9 ]
2019-12-04 12:26:51 +03:00
Evgeny Mankov
deaf478ced
[HIPIFY] CUDA 10.2 support (data types only)
...
+ Both Driver API and RT API are supported and synced with each other
+ Update *.md docs and hipify-perl accordingly
[ROCm/clr commit: 33f81effb1 ]
2019-12-04 12:25:20 +03:00
Rahul Garg
6c7a338471
Fix hipMemcpyWithStream ( #1707 )
...
* Fix hipMemcpyWithStream
* Add a dtest for the same.
[ROCm/clr commit: 261a04580f ]
2019-12-04 11:56:23 +05:30
Aryan Salmanpour
2471c1950c
[hip] refactoring hipExtLaunchMultiKernelMultiDevice API ( #1698 )
...
[Background] it was found that if lazy linking used for a library that calls hipExtLaunchMultiKernelMultiDevice API then this API can get the wrong program_state object for looking up device kernels leading to a "No device code available" error in this API.
To fix this issue, the API was refactored to be inline and get and pass the correct program_state to an internal hip API to request a multi-device kernel launch.
[ROCm/clr commit: 8eaea4d114 ]
2019-12-04 11:50:51 +05:30
Maneesh Gupta
9e9d49c891
Revert "[packaging] Conform to debian package naming rules ( #1647 )" ( #1688 )
...
This reverts commit 71be7baece .
[ROCm/clr commit: 38e971b645 ]
2019-12-01 14:47:43 +05:30
Evgeny Mankov
2c2a726726
Merge pull request #1702 from emankov/hipify
...
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
[ROCm/clr commit: da468e181f ]
2019-11-29 19:31:28 +03:00
Evgeny Mankov
5317d618d1
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
...
+ Update *.md docs and hipify-perl accordingly
[Reason]
Starting with CUDA 10.1 all error codes are merged between Driver and RT APIs
[ToDo]
Do the same merge in HIP API as there is no need in distinguishing return codes by API
[ROCm/clr commit: 636108aaa9 ]
2019-11-29 19:29:11 +03:00
Maneesh Gupta
97850ae9a8
Revert changes for atomic FADD support when address is in LDS ( #1701 )
...
This reverts PR #1591 and follow-on PR #1695
[ROCm/clr commit: 4c92bd50c4 ]
2019-11-29 11:58:12 +05:30
Alex Voicu
b26aa2378e
Uniform is_shared query. ( #1695 )
...
[ROCm/clr commit: 17a4780dc6 ]
2019-11-28 13:39:05 +05:30
Jeff Daily
22c07e2331
fix hipMempcy precondition, test for zero size first ( #1697 )
...
[ROCm/clr commit: 6df73e1f12 ]
2019-11-25 19:03:03 -08:00
Alex Voicu
82b55e6502
Unary operators were too restrictive in the type of their argument. ( #1683 )
...
[ROCm/clr commit: 306d50291e ]
2019-11-22 07:54:53 +05:30
Shyam Saladi
84953da127
[HIPIFY] fix typo ( #1681 )
...
[ROCm/clr commit: 562336e733 ]
2019-11-22 07:54:37 +05:30
ansurya
0ece8d0f9b
Fix rocBLAS compilation failure ( #1677 )
...
SWDEV-212749:
o Recent changes to “add support for extended launch” require hip_runtime.h to be include in hip_ext.h
o Order in which external applications include hip_hcc.h/hip_runtime.h causes compilation failure
[ROCm/clr commit: e5fc5aa41c ]
2019-11-22 07:54:17 +05:30
Alex Voicu
e1dca187c9
Use native support for atomic FADD when address is in LDS ( #1591 )
...
[ROCm/clr commit: 2ed3a0873c ]
2019-11-22 07:53:48 +05:30
satyanveshd
e2a0a547ff
fixed directed tests fail when hcc bumped to 3.0 ( #1678 )
...
Handled the HCC version check appropriately as few of the directed tests (SWDEV-212161) were failing when hcc was bumped to 3.0.
[ROCm/clr commit: d4dde7a27d ]
2019-11-20 21:37:52 +05:30
Alex Voicu
553a7c2ce3
General sync memcpy improvements. Add hipMemcpyWithStream ( #1673 )
...
* General sync memcpy improvements. Add `hipMemcpyWithStream`
* Update hip_memory.cpp
[ROCm/clr commit: 022ac3cb0a ]
2019-11-20 21:36:37 +05:30
Rahul Garg
81dd3751a6
Update error codes for hipGetDevice for doxygen and move up null check ( #1668 )
...
* [docs] Update error codes for hipGetDevice
* Move up out ptr check
[ROCm/clr commit: 13c2a31d7e ]
2019-11-20 21:35:27 +05:30
Paul Fultz II
29bb65f6e0
Fix helper header when using c++17 ( #1666 )
...
This will fix issue #1621 . It also adds tests for is_callable with c++11, c++14, and c++17.
The fallback implementation was completely broken so I rewrote it so it pass the tests as well. This should be used instead of PR #1631 .
[ROCm/clr commit: 57b1b03261 ]
2019-11-20 21:33:42 +05:30
Alex Voicu
5c27a4f570
Extend vector type capabilities and add tests to reflect it. ( #1656 )
...
[ROCm/clr commit: c383f20691 ]
2019-11-20 21:32:32 +05:30
Sarbojit2019
2c00d66995
Revert [HIP] Fixed hipStreamAddCallback ( #1674 )
...
This reverts commit 4d0b7e65bb .
Addresses SWDEV#212675.
[ROCm/clr commit: 8a3c36a424 ]
2019-11-20 11:55:46 +05:30
Jeff Daily
58d1ae32ee
hipStreamSynchronize can skip marker if stream is empty ( #1667 )
...
[ROCm/clr commit: dfaaf04611 ]
2019-11-19 09:42:43 -08:00
Rahul Garg
6813f0436c
[dtest] Rename hipExtLaunchMultiKernelMultiDevice dtest ( #1671 )
...
[ROCm/clr commit: fa1335f419 ]
2019-11-19 09:41:13 -08:00
Sameer Sahasrabuddhe
82aa679a6d
hostcall: add tests that are only used by the VDI build ( #1664 )
...
[ROCm/clr commit: e158b0b255 ]
2019-11-18 19:52:40 -08:00
mhbliao
e9f9ecd519
Fix mathlib and app builds with hip-clang. ( #1665 )
...
[ROCm/clr commit: a45de95113 ]
2019-11-18 08:18:20 -08:00
Rahul Garg
f524f75b69
Fix gcc build on NVCC path ( #1661 )
...
* Fix gcc build on NVCC path
* Fix CI build errors
* [dtest] Fix texture and surface obj2D tests
[ROCm/clr commit: ff31f734fe ]
2019-11-18 12:19:22 +05:30
Nick Curtis
5875d98e6f
fix complex conjugate for double-complex ( #1659 )
...
The sign in the y component returned from hipConj incorrect for double-complex. Fix to match as in hipConjf above.
[ROCm/clr commit: cae9b13020 ]
2019-11-18 12:19:12 +05:30
Sarbojit2019
7d5bc56ab7
Added null check in hipEventSynchronize ( #1627 )
...
* Added missing null check in hipEventSynchronize
* Minor correction in the Event API description
[ROCm/clr commit: 7985ad218f ]
2019-11-18 12:18:55 +05:30
jiabaxie
de260a6d60
Updated hipEnvVarDriver to work with Windows ( #1614 )
...
* Updated hipEnvVarDriver to work with Windows
* Cleaned up a bit of code
* Fixed a part where putenv was used for both win and linux
* Defines moved to test_common.h and cleaned up code
* Cleaned up some macro defines and used const char instead
* Got rid of some excess commenting
* directory paths are unconditional
* Cleaned some duplicate code, and variables are now declared and defined together
[ROCm/clr commit: 7bbd420068 ]
2019-11-18 12:18:29 +05:30
kjayapra-amd
fffdd3da8d
[dtests] Fixing compilation error in saxpy.cpp ( #1608 )
...
[ROCm/clr commit: 99da905509 ]
2019-11-18 12:18:11 +05:30
Alex Voicu
f48b38b0ed
Add support for extended launch syntax. ( #1530 )
...
* Add support for extended launch syntax.
* Add unit test.
* Fix typo
* hipExtLaunchKernelGGL lives in hip_ext.h
Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff
* [dtest] remove redundant include from hipModuleGetGlobal dtest
[ROCm/clr commit: 355d0bdf95 ]
2019-11-16 22:24:07 -08:00
Maneesh Gupta
57225de63f
[ci] Fix failures on nvcc path ( #1654 )
...
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78
[ROCm/clr commit: 4d0a234183 ]
2019-11-15 07:58:40 +05:30
Maneesh Gupta
71be7baece
[packaging] Conform to debian package naming rules ( #1647 )
...
* [packaging] Conform to debian package naming rules
* Add hip_hcc metapackage for backward compatibility
[ROCm/clr commit: 5a03482690 ]
2019-11-14 11:40:34 +05:30
Evgeny Mankov
5cd24367d5
Merge pull request #1651 from emankov/hipify
...
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
[ROCm/clr commit: cf79be0f64 ]
2019-11-12 16:06:25 +03:00
Evgeny Mankov
ed11ec3fa9
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
...
[ROCm/clr commit: bb6f1648ec ]
2019-11-12 16:04:28 +03:00
Maneesh Gupta
ca9eb34000
Bump version to 3.0 ( #1648 )
...
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab
[ROCm/clr commit: 0a2682625f ]
2019-11-10 20:23:58 -08:00
Rahul Garg
c03f67dfbf
Merge pull request #1643 from mhbliao/hliao/master/remove.redundant.decl
...
Remove redundant declarations.
[ROCm/clr commit: 4c73950e31 ]
2019-11-07 11:16:40 -08:00
Michael LIAO
a10e227b56
Remove redundant declarations.
...
- The revised `hip/hip_ext.h` have that declarations.
[ROCm/clr commit: d28ad401c9 ]
2019-11-07 10:11:22 -05:00
Evgeny Mankov
cdff675646
Merge pull request #1642 from emankov/hipify
...
[HIPIFY] Clang style formatting
[ROCm/clr commit: 806ae1f43e ]
2019-11-07 12:49:52 +03:00
Evgeny Mankov
62b86d1c76
[HIPIFY] Clang style formatting
...
[ROCm/clr commit: 7f143585f6 ]
2019-11-07 12:48:05 +03:00
Evgeny Mankov
5561c2390d
Merge pull request #1641 from emankov/hipify
...
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
[ROCm/clr commit: 367261ea0e ]
2019-11-07 11:35:15 +03:00
Evgeny Mankov
db5f0b0237
[HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well
...
[ROCm/clr commit: 4d2424ad36 ]
2019-11-07 11:32:53 +03:00
Evgeny Mankov
be483c4b3e
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
...
[Reason] It turned out that it is not so: 2 jobs are always
[ROCm/clr commit: 21c3d25403 ]
2019-11-07 11:30:40 +03:00