Rahul Garg
fa87abd58a
Fix hipMemcpyWithStream ( #1707 )
...
* Fix hipMemcpyWithStream
* Add a dtest for the same.
[ROCm/hip commit: 892d2a562e ]
2019-12-04 11:56:23 +05:30
Aryan Salmanpour
abe7531676
[hip] refactoring hipExtLaunchMultiKernelMultiDevice API ( #1698 )
...
[Background] it was found that if lazy linking used for a library that calls hipExtLaunchMultiKernelMultiDevice API then this API can get the wrong program_state object for looking up device kernels leading to a "No device code available" error in this API.
To fix this issue, the API was refactored to be inline and get and pass the correct program_state to an internal hip API to request a multi-device kernel launch.
[ROCm/hip commit: 68cc787781 ]
2019-12-04 11:50:51 +05:30
Maneesh Gupta
ee44fffd35
Revert "[packaging] Conform to debian package naming rules ( #1647 )" ( #1688 )
...
This reverts commit 8016fa32df .
[ROCm/hip commit: 4585b7f6f3 ]
2019-12-01 14:47:43 +05:30
Evgeny Mankov
389263471e
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
...
+ Update *.md docs and hipify-perl accordingly
[Reason]
Starting with CUDA 10.1 all error codes are merged between Driver and RT APIs
[ToDo]
Do the same merge in HIP API as there is no need in distinguishing return codes by API
[ROCm/hip commit: d33dd71ed3 ]
2019-11-29 19:29:11 +03:00
Maneesh Gupta
4e564f783e
Revert changes for atomic FADD support when address is in LDS ( #1701 )
...
This reverts PR #1591 and follow-on PR #1695
[ROCm/hip commit: 32442c6506 ]
2019-11-29 11:58:12 +05:30
Alex Voicu
dc9bc8ee86
Uniform is_shared query. ( #1695 )
...
[ROCm/hip commit: b6514fffb9 ]
2019-11-28 13:39:05 +05:30
Jeff Daily
f1544b621d
fix hipMempcy precondition, test for zero size first ( #1697 )
...
[ROCm/hip commit: 299bbe2325 ]
2019-11-25 19:03:03 -08:00
Alex Voicu
f220a7592e
Unary operators were too restrictive in the type of their argument. ( #1683 )
...
[ROCm/hip commit: aaf31b6b96 ]
2019-11-22 07:54:53 +05:30
Shyam Saladi
c8c265891a
[HIPIFY] fix typo ( #1681 )
...
[ROCm/hip commit: 70f457d7ea ]
2019-11-22 07:54:37 +05:30
ansurya
92837d4b4d
Fix rocBLAS compilation failure ( #1677 )
...
SWDEV-212749:
o Recent changes to “add support for extended launch” require hip_runtime.h to be include in hip_ext.h
o Order in which external applications include hip_hcc.h/hip_runtime.h causes compilation failure
[ROCm/hip commit: e60dec51da ]
2019-11-22 07:54:17 +05:30
Alex Voicu
fd08d068c7
Use native support for atomic FADD when address is in LDS ( #1591 )
...
[ROCm/hip commit: d597e7ca20 ]
2019-11-22 07:53:48 +05:30
satyanveshd
3848cd97ea
fixed directed tests fail when hcc bumped to 3.0 ( #1678 )
...
Handled the HCC version check appropriately as few of the directed tests (SWDEV-212161) were failing when hcc was bumped to 3.0.
[ROCm/hip commit: 6b06911ef1 ]
2019-11-20 21:37:52 +05:30
Alex Voicu
1baa84eae4
General sync memcpy improvements. Add hipMemcpyWithStream ( #1673 )
...
* General sync memcpy improvements. Add `hipMemcpyWithStream`
* Update hip_memory.cpp
[ROCm/hip commit: 5a1f823739 ]
2019-11-20 21:36:37 +05:30
Rahul Garg
7e75cf12df
Update error codes for hipGetDevice for doxygen and move up null check ( #1668 )
...
* [docs] Update error codes for hipGetDevice
* Move up out ptr check
[ROCm/hip commit: b3161e9fa0 ]
2019-11-20 21:35:27 +05:30
Paul Fultz II
27112bbdf0
Fix helper header when using c++17 ( #1666 )
...
This will fix issue #1621 . It also adds tests for is_callable with c++11, c++14, and c++17.
The fallback implementation was completely broken so I rewrote it so it pass the tests as well. This should be used instead of PR #1631 .
[ROCm/hip commit: 8519a1411c ]
2019-11-20 21:33:42 +05:30
Alex Voicu
4ec70a7e03
Extend vector type capabilities and add tests to reflect it. ( #1656 )
...
[ROCm/hip commit: b5b3d1bbaa ]
2019-11-20 21:32:32 +05:30
Sarbojit2019
6679cd3998
Revert [HIP] Fixed hipStreamAddCallback ( #1674 )
...
This reverts commit fa1e44aa0e .
Addresses SWDEV#212675.
[ROCm/hip commit: 153a959280 ]
2019-11-20 11:55:46 +05:30
Jeff Daily
492248ef12
hipStreamSynchronize can skip marker if stream is empty ( #1667 )
...
[ROCm/hip commit: 3a7eb694f5 ]
2019-11-19 09:42:43 -08:00
Rahul Garg
f55b1cd6a4
[dtest] Rename hipExtLaunchMultiKernelMultiDevice dtest ( #1671 )
...
[ROCm/hip commit: 188752b2cb ]
2019-11-19 09:41:13 -08:00
Sameer Sahasrabuddhe
cfaf04a467
hostcall: add tests that are only used by the VDI build ( #1664 )
...
[ROCm/hip commit: 363caf29c6 ]
2019-11-18 19:52:40 -08:00
mhbliao
97646b52f1
Fix mathlib and app builds with hip-clang. ( #1665 )
...
[ROCm/hip commit: ebe0c56f4f ]
2019-11-18 08:18:20 -08:00
Rahul Garg
ecdba45d29
Fix gcc build on NVCC path ( #1661 )
...
* Fix gcc build on NVCC path
* Fix CI build errors
* [dtest] Fix texture and surface obj2D tests
[ROCm/hip commit: e39d7497ec ]
2019-11-18 12:19:22 +05:30
Nick Curtis
05ef9bd386
fix complex conjugate for double-complex ( #1659 )
...
The sign in the y component returned from hipConj incorrect for double-complex. Fix to match as in hipConjf above.
[ROCm/hip commit: 3f2316086f ]
2019-11-18 12:19:12 +05:30
Sarbojit2019
a4f2c7717f
Added null check in hipEventSynchronize ( #1627 )
...
* Added missing null check in hipEventSynchronize
* Minor correction in the Event API description
[ROCm/hip commit: b865a50e44 ]
2019-11-18 12:18:55 +05:30
jiabaxie
cd3bf4e205
Updated hipEnvVarDriver to work with Windows ( #1614 )
...
* Updated hipEnvVarDriver to work with Windows
* Cleaned up a bit of code
* Fixed a part where putenv was used for both win and linux
* Defines moved to test_common.h and cleaned up code
* Cleaned up some macro defines and used const char instead
* Got rid of some excess commenting
* directory paths are unconditional
* Cleaned some duplicate code, and variables are now declared and defined together
[ROCm/hip commit: e73927caee ]
2019-11-18 12:18:29 +05:30
kjayapra-amd
fac2e14dab
[dtests] Fixing compilation error in saxpy.cpp ( #1608 )
...
[ROCm/hip commit: 0a68be8b5b ]
2019-11-18 12:18:11 +05:30
Alex Voicu
3ea404a8ba
Add support for extended launch syntax. ( #1530 )
...
* Add support for extended launch syntax.
* Add unit test.
* Fix typo
* hipExtLaunchKernelGGL lives in hip_ext.h
Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff
* [dtest] remove redundant include from hipModuleGetGlobal dtest
[ROCm/hip commit: 69e74c3e96 ]
2019-11-16 22:24:07 -08:00
Maneesh Gupta
5536940e12
[ci] Fix failures on nvcc path ( #1654 )
...
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78
[ROCm/hip commit: f9fec8fdca ]
2019-11-15 07:58:40 +05:30
Maneesh Gupta
8016fa32df
[packaging] Conform to debian package naming rules ( #1647 )
...
* [packaging] Conform to debian package naming rules
* Add hip_hcc metapackage for backward compatibility
[ROCm/hip commit: 2ca1a82036 ]
2019-11-14 11:40:34 +05:30
Evgeny Mankov
541e2a4935
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
...
[ROCm/hip commit: 16c8ad0f99 ]
2019-11-12 16:04:28 +03:00
Maneesh Gupta
8723218421
Bump version to 3.0 ( #1648 )
...
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab
[ROCm/hip commit: dae8630592 ]
2019-11-10 20:23:58 -08:00
Michael LIAO
6318875ebc
Remove redundant declarations.
...
- The revised `hip/hip_ext.h` have that declarations.
[ROCm/hip commit: d6ff22510e ]
2019-11-07 10:11:22 -05:00
Evgeny Mankov
9e34880005
[HIPIFY] Clang style formatting
...
[ROCm/hip commit: 93bc9c9195 ]
2019-11-07 12:48:05 +03:00
Evgeny Mankov
0ae791586b
[HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well
...
[ROCm/hip commit: 0c07b6b421 ]
2019-11-07 11:32:53 +03:00
Evgeny Mankov
860e89df5e
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
...
[Reason] It turned out that it is not so: 2 jobs are always
[ROCm/hip commit: c2c144ac04 ]
2019-11-07 11:30:40 +03:00
Evgeny Mankov
58fc750bf0
[HIPIFY] Reorder options propagation to clang
...
[Reason] Leave overriding opportunity for -D, -std=c++XX, etc.
[ROCm/hip commit: 16a27213e4 ]
2019-11-07 11:21:06 +03:00
Jeff Daily
d4fe8ff822
General hipMemset improvements ( #1495 )
...
* hipMemset et al can use HSA API directly for synchronous cases
* lock and flush stream in hipMemset, hold lock until complete
* move hipMemset async check to front of conditional
* use hsa_amd_memory_fill for additional sync memset cases
code cleanup/review for all memset calls
* Fix inversion of execution mutating value.
* ihipMemsetSync fall back to kernel if HSA memset fails
* Never fallback, never surrender.
* Allow NULL stream.
* Optimise memset kernel. Remove deadwood.
* Update hip_memory.cpp
* Clean up stream logic in sync memset
* Revert "Clean up stream logic in sync memset"
This reverts commit 6117dedf673367f44cc704192573a117a3d92477.
[ROCm/hip commit: e31e0ca12e ]
2019-11-07 13:19:54 +05:30
Alex Voicu
68b4fbd043
Remove native vector support from the GCC case, since it never worked ( #1637 )
...
[ROCm/hip commit: 5530c15cc3 ]
2019-11-07 13:19:14 +05:30
Sarbojit2019
fa1e44aa0e
[HIP] Fixed hipStreamAddCallback [SWDEV#165185] ( #1425 )
...
Fixed hipStreamAddCallback() as requested in SWDEV#165185
Added unit test to test the behavior
[ROCm/hip commit: 45613311d7 ]
2019-11-07 13:18:12 +05:30
ansurya
92dcba11ac
Fixed texture 2D mapping for pitched arrays & 3D Texture read ( #1415 )
...
Texture 2D image mapping for pitched arrays:
github issue: Texture Object's Buffer seems to be Misaligned #886
JIRA ticket: SWDEV-199313
SWDEV-151670 : Fixed issue with 3D texture with 4 components
SWDEV-151671 : Issue with 2D layered texture with 4 components
[ROCm/hip commit: e07926ce0f ]
2019-11-07 13:17:46 +05:30
Rahul Garg
6968362d99
Rename hip/hip_hcc.h to hip/hip_ext.h ( #1341 )
...
* Rename hip/hip_hcc.h to hip/hip_ext.h
* Deprecate hip_hcc.h
[ROCm/hip commit: 579a4f36fa ]
2019-11-07 13:17:10 +05:30
Rahul Garg
9248fcd9bc
Merge pull request #1630 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
__half2 should walk like CUDA and talk like CUDA
[ROCm/hip commit: 2fa45c215a ]
2019-11-06 10:25:03 -08:00
Rahul Garg
a48954ef98
Merge pull request #1620 from jeffdaily/hipEventRecord_patch
...
fix race condition in hipEventRecord
[ROCm/hip commit: 85446dbd6a ]
2019-11-06 10:23:24 -08:00
Evgeny Mankov
282c76c26f
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
...
[ROCm/hip commit: 9ca4e96641 ]
2019-11-06 19:25:42 +03:00
Evgeny Mankov
7b216ca51c
[HIPIFY][doc] Update README.md
...
+ Supported versions, testing
[ROCm/hip commit: 96483d0de5 ]
2019-11-06 19:18:13 +03:00
Jeff Daily
f9ad564380
hipEventRecord only needs one lock; remove locked_eventIsReady
...
[ROCm/hip commit: 85080905c0 ]
2019-11-06 15:56:32 +00:00
Jeff Daily
8f033af434
remove ihipEvent_t::refreshEventStatus(), new hipEventElapsedTime
...
This fixes possible races in hipEventElapsedTime.
[ROCm/hip commit: 7986090d9c ]
2019-11-06 15:46:46 +00:00
Evgeny Mankov
1b6e4b495a
[HIPIFY][format] Apply clang style formatting in LLVMCompat
...
[ROCm/hip commit: 3b76fd0b1c ]
2019-11-06 14:43:22 +03:00
Evgeny Mankov
1776ff55d3
[HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7
...
[ROCm/hip commit: c87ab05fc6 ]
2019-11-06 14:30:39 +03:00
Alex Voicu
f8506bb669
Remove leftover noise.
...
[ROCm/hip commit: b9faa9f8ae ]
2019-11-06 02:46:21 +02:00