Граф коммитов

3750 Коммитов

Автор SHA1 Сообщение Дата
Rahul Garg fa87abd58a Fix hipMemcpyWithStream (#1707)
* Fix hipMemcpyWithStream
* Add a dtest for the same.


[ROCm/hip commit: 892d2a562e]
2019-12-04 11:56:23 +05:30
Aryan Salmanpour abe7531676 [hip] refactoring hipExtLaunchMultiKernelMultiDevice API (#1698)
[Background] it was found that if lazy linking used for a library that calls hipExtLaunchMultiKernelMultiDevice API then this API can get the wrong program_state object for looking up device kernels leading to a "No device code available" error in this API.

To fix this issue, the API was refactored to be inline and get and pass the correct program_state to an internal hip API to request a multi-device kernel launch.

[ROCm/hip commit: 68cc787781]
2019-12-04 11:50:51 +05:30
Maneesh Gupta ee44fffd35 Revert "[packaging] Conform to debian package naming rules (#1647)" (#1688)
This reverts commit 8016fa32df.

[ROCm/hip commit: 4585b7f6f3]
2019-12-01 14:47:43 +05:30
Evgeny Mankov 389263471e [HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
+ Update *.md docs and hipify-perl accordingly

[Reason]
Starting with CUDA 10.1 all error codes are merged between Driver and RT APIs

[ToDo]
Do the same merge in HIP API as there is no need in distinguishing return codes by API


[ROCm/hip commit: d33dd71ed3]
2019-11-29 19:29:11 +03:00
Maneesh Gupta 4e564f783e Revert changes for atomic FADD support when address is in LDS (#1701)
This reverts PR #1591 and follow-on PR #1695

[ROCm/hip commit: 32442c6506]
2019-11-29 11:58:12 +05:30
Alex Voicu dc9bc8ee86 Uniform is_shared query. (#1695)
[ROCm/hip commit: b6514fffb9]
2019-11-28 13:39:05 +05:30
Jeff Daily f1544b621d fix hipMempcy precondition, test for zero size first (#1697)
[ROCm/hip commit: 299bbe2325]
2019-11-25 19:03:03 -08:00
Alex Voicu f220a7592e Unary operators were too restrictive in the type of their argument. (#1683)
[ROCm/hip commit: aaf31b6b96]
2019-11-22 07:54:53 +05:30
Shyam Saladi c8c265891a [HIPIFY] fix typo (#1681)
[ROCm/hip commit: 70f457d7ea]
2019-11-22 07:54:37 +05:30
ansurya 92837d4b4d Fix rocBLAS compilation failure (#1677)
SWDEV-212749:
o Recent changes to “add support for extended launch” require hip_runtime.h to be include in hip_ext.h
o Order in which external applications include hip_hcc.h/hip_runtime.h causes compilation failure

[ROCm/hip commit: e60dec51da]
2019-11-22 07:54:17 +05:30
Alex Voicu fd08d068c7 Use native support for atomic FADD when address is in LDS (#1591)
[ROCm/hip commit: d597e7ca20]
2019-11-22 07:53:48 +05:30
satyanveshd 3848cd97ea fixed directed tests fail when hcc bumped to 3.0 (#1678)
Handled the HCC version check appropriately as few of the directed tests (SWDEV-212161) were failing when hcc was bumped to 3.0.

[ROCm/hip commit: 6b06911ef1]
2019-11-20 21:37:52 +05:30
Alex Voicu 1baa84eae4 General sync memcpy improvements. Add hipMemcpyWithStream (#1673)
* General sync memcpy improvements. Add `hipMemcpyWithStream`

* Update hip_memory.cpp


[ROCm/hip commit: 5a1f823739]
2019-11-20 21:36:37 +05:30
Rahul Garg 7e75cf12df Update error codes for hipGetDevice for doxygen and move up null check (#1668)
* [docs] Update error codes for hipGetDevice

* Move up out ptr check


[ROCm/hip commit: b3161e9fa0]
2019-11-20 21:35:27 +05:30
Paul Fultz II 27112bbdf0 Fix helper header when using c++17 (#1666)
This will fix issue #1621. It also adds tests for is_callable with c++11, c++14, and c++17.

The fallback implementation was completely broken so I rewrote it so it pass the tests as well. This should be used instead of PR #1631.

[ROCm/hip commit: 8519a1411c]
2019-11-20 21:33:42 +05:30
Alex Voicu 4ec70a7e03 Extend vector type capabilities and add tests to reflect it. (#1656)
[ROCm/hip commit: b5b3d1bbaa]
2019-11-20 21:32:32 +05:30
Sarbojit2019 6679cd3998 Revert [HIP] Fixed hipStreamAddCallback (#1674)
This reverts commit fa1e44aa0e.
Addresses SWDEV#212675.

[ROCm/hip commit: 153a959280]
2019-11-20 11:55:46 +05:30
Jeff Daily 492248ef12 hipStreamSynchronize can skip marker if stream is empty (#1667)
[ROCm/hip commit: 3a7eb694f5]
2019-11-19 09:42:43 -08:00
Rahul Garg f55b1cd6a4 [dtest] Rename hipExtLaunchMultiKernelMultiDevice dtest (#1671)
[ROCm/hip commit: 188752b2cb]
2019-11-19 09:41:13 -08:00
Sameer Sahasrabuddhe cfaf04a467 hostcall: add tests that are only used by the VDI build (#1664)
[ROCm/hip commit: 363caf29c6]
2019-11-18 19:52:40 -08:00
mhbliao 97646b52f1 Fix mathlib and app builds with hip-clang. (#1665)
[ROCm/hip commit: ebe0c56f4f]
2019-11-18 08:18:20 -08:00
Rahul Garg ecdba45d29 Fix gcc build on NVCC path (#1661)
* Fix gcc build on NVCC path

* Fix CI build errors

* [dtest] Fix texture and surface obj2D tests


[ROCm/hip commit: e39d7497ec]
2019-11-18 12:19:22 +05:30
Nick Curtis 05ef9bd386 fix complex conjugate for double-complex (#1659)
The sign in the y component returned from hipConj incorrect for double-complex. Fix to match as in hipConjf above.

[ROCm/hip commit: 3f2316086f]
2019-11-18 12:19:12 +05:30
Sarbojit2019 a4f2c7717f Added null check in hipEventSynchronize (#1627)
* Added missing null check in hipEventSynchronize
* Minor correction in the Event API description


[ROCm/hip commit: b865a50e44]
2019-11-18 12:18:55 +05:30
jiabaxie cd3bf4e205 Updated hipEnvVarDriver to work with Windows (#1614)
* Updated hipEnvVarDriver to work with Windows

* Cleaned up a bit of code

* Fixed a part where putenv was used for both win and linux

* Defines moved to test_common.h and cleaned up code

* Cleaned up some macro defines and used const char instead

* Got rid of some excess commenting

* directory paths are unconditional

* Cleaned some duplicate code, and variables are now declared and defined together


[ROCm/hip commit: e73927caee]
2019-11-18 12:18:29 +05:30
kjayapra-amd fac2e14dab [dtests] Fixing compilation error in saxpy.cpp (#1608)
[ROCm/hip commit: 0a68be8b5b]
2019-11-18 12:18:11 +05:30
Alex Voicu 3ea404a8ba Add support for extended launch syntax. (#1530)
* Add support for extended launch syntax.

* Add unit test.

* Fix typo

* hipExtLaunchKernelGGL lives in hip_ext.h

Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff

* [dtest] remove redundant include from hipModuleGetGlobal dtest


[ROCm/hip commit: 69e74c3e96]
2019-11-16 22:24:07 -08:00
Maneesh Gupta 5536940e12 [ci] Fix failures on nvcc path (#1654)
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78

[ROCm/hip commit: f9fec8fdca]
2019-11-15 07:58:40 +05:30
Maneesh Gupta 8016fa32df [packaging] Conform to debian package naming rules (#1647)
* [packaging] Conform to debian package naming rules

* Add hip_hcc metapackage for backward compatibility


[ROCm/hip commit: 2ca1a82036]
2019-11-14 11:40:34 +05:30
Evgeny Mankov 541e2a4935 [HIPIFY][doc] A few words about LLVM's FileCheck in README.md
[ROCm/hip commit: 16c8ad0f99]
2019-11-12 16:04:28 +03:00
Maneesh Gupta 8723218421 Bump version to 3.0 (#1648)
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab

[ROCm/hip commit: dae8630592]
2019-11-10 20:23:58 -08:00
Michael LIAO 6318875ebc Remove redundant declarations.
- The revised `hip/hip_ext.h` have that declarations.


[ROCm/hip commit: d6ff22510e]
2019-11-07 10:11:22 -05:00
Evgeny Mankov 9e34880005 [HIPIFY] Clang style formatting
[ROCm/hip commit: 93bc9c9195]
2019-11-07 12:48:05 +03:00
Evgeny Mankov 0ae791586b [HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well
[ROCm/hip commit: 0c07b6b421]
2019-11-07 11:32:53 +03:00
Evgeny Mankov 860e89df5e [HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
[Reason] It turned out that it is not so: 2 jobs are always


[ROCm/hip commit: c2c144ac04]
2019-11-07 11:30:40 +03:00
Evgeny Mankov 58fc750bf0 [HIPIFY] Reorder options propagation to clang
[Reason] Leave overriding opportunity for -D, -std=c++XX, etc.


[ROCm/hip commit: 16a27213e4]
2019-11-07 11:21:06 +03:00
Jeff Daily d4fe8ff822 General hipMemset improvements (#1495)
* hipMemset et al can use HSA API directly for synchronous cases

* lock and flush stream in hipMemset, hold lock until complete

* move hipMemset async check to front of conditional

* use hsa_amd_memory_fill for additional sync memset cases

code cleanup/review for all memset calls

* Fix inversion of execution mutating value.

* ihipMemsetSync fall back to kernel if HSA memset fails

* Never fallback, never surrender.

* Allow NULL stream.

* Optimise memset kernel. Remove deadwood.

* Update hip_memory.cpp

* Clean up stream logic in sync memset

* Revert "Clean up stream logic in sync memset"

This reverts commit 6117dedf673367f44cc704192573a117a3d92477.


[ROCm/hip commit: e31e0ca12e]
2019-11-07 13:19:54 +05:30
Alex Voicu 68b4fbd043 Remove native vector support from the GCC case, since it never worked (#1637)
[ROCm/hip commit: 5530c15cc3]
2019-11-07 13:19:14 +05:30
Sarbojit2019 fa1e44aa0e [HIP] Fixed hipStreamAddCallback [SWDEV#165185] (#1425)
Fixed hipStreamAddCallback() as requested in SWDEV#165185
Added unit test to test the behavior


[ROCm/hip commit: 45613311d7]
2019-11-07 13:18:12 +05:30
ansurya 92dcba11ac Fixed texture 2D mapping for pitched arrays & 3D Texture read (#1415)
Texture 2D image mapping for pitched arrays:
github issue: Texture Object's Buffer seems to be Misaligned #886
JIRA ticket: SWDEV-199313

SWDEV-151670 : Fixed issue with 3D texture with 4 components
SWDEV-151671 : Issue with 2D layered texture with 4 components

[ROCm/hip commit: e07926ce0f]
2019-11-07 13:17:46 +05:30
Rahul Garg 6968362d99 Rename hip/hip_hcc.h to hip/hip_ext.h (#1341)
* Rename hip/hip_hcc.h to hip/hip_ext.h

* Deprecate hip_hcc.h


[ROCm/hip commit: 579a4f36fa]
2019-11-07 13:17:10 +05:30
Rahul Garg 9248fcd9bc Merge pull request #1630 from ROCm-Developer-Tools/hotfix_volatile_accessors
__half2 should walk like CUDA and talk like CUDA

[ROCm/hip commit: 2fa45c215a]
2019-11-06 10:25:03 -08:00
Rahul Garg a48954ef98 Merge pull request #1620 from jeffdaily/hipEventRecord_patch
fix race condition in hipEventRecord

[ROCm/hip commit: 85446dbd6a]
2019-11-06 10:23:24 -08:00
Evgeny Mankov 282c76c26f [HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
[ROCm/hip commit: 9ca4e96641]
2019-11-06 19:25:42 +03:00
Evgeny Mankov 7b216ca51c [HIPIFY][doc] Update README.md
+ Supported versions, testing


[ROCm/hip commit: 96483d0de5]
2019-11-06 19:18:13 +03:00
Jeff Daily f9ad564380 hipEventRecord only needs one lock; remove locked_eventIsReady
[ROCm/hip commit: 85080905c0]
2019-11-06 15:56:32 +00:00
Jeff Daily 8f033af434 remove ihipEvent_t::refreshEventStatus(), new hipEventElapsedTime
This fixes possible races in hipEventElapsedTime.


[ROCm/hip commit: 7986090d9c]
2019-11-06 15:46:46 +00:00
Evgeny Mankov 1b6e4b495a [HIPIFY][format] Apply clang style formatting in LLVMCompat
[ROCm/hip commit: 3b76fd0b1c]
2019-11-06 14:43:22 +03:00
Evgeny Mankov 1776ff55d3 [HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7
[ROCm/hip commit: c87ab05fc6]
2019-11-06 14:30:39 +03:00
Alex Voicu f8506bb669 Remove leftover noise.
[ROCm/hip commit: b9faa9f8ae]
2019-11-06 02:46:21 +02:00