Evgeny Mankov
1e9ac70f3b
Merge pull request #1702 from emankov/hipify
...
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
2019-11-29 19:31:28 +03:00
Evgeny Mankov
d33dd71ed3
[HIPIFY] Sync cudaError/CUresult between CUDA Driver API/CUDA RT API/HIP
...
+ Update *.md docs and hipify-perl accordingly
[Reason]
Starting with CUDA 10.1 all error codes are merged between Driver and RT APIs
[ToDo]
Do the same merge in the HIP API, as there is no need to distinguish return codes by API
2019-11-29 19:29:11 +03:00
Maneesh Gupta
32442c6506
Revert changes for atomic FADD support when address is in LDS ( #1701 )
...
This reverts PR #1591 and follow-on PR #1695
2019-11-29 11:58:12 +05:30
Alex Voicu
b6514fffb9
Uniform is_shared query. ( #1695 )
2019-11-28 13:39:05 +05:30
Jeff Daily
299bbe2325
fix hipMemcpy precondition, test for zero size first ( #1697 )
2019-11-25 19:03:03 -08:00
Alex Voicu
aaf31b6b96
Unary operators were too restrictive in the type of their argument. ( #1683 )
2019-11-22 07:54:53 +05:30
Shyam Saladi
70f457d7ea
[HIPIFY] fix typo ( #1681 )
2019-11-22 07:54:37 +05:30
ansurya
e60dec51da
Fix rocBLAS compilation failure ( #1677 )
...
SWDEV-212749:
o Recent changes to “add support for extended launch” require hip_runtime.h to be included in hip_ext.h
o Order in which external applications include hip_hcc.h/hip_runtime.h causes compilation failure
2019-11-22 07:54:17 +05:30
Alex Voicu
d597e7ca20
Use native support for atomic FADD when address is in LDS ( #1591 )
2019-11-22 07:53:48 +05:30
satyanveshd
6b06911ef1
Fixed directed test failures when hcc is bumped to 3.0 ( #1678 )
...
Handled the HCC version check appropriately, as a few of the directed tests (SWDEV-212161) were failing when hcc was bumped to 3.0.
2019-11-20 21:37:52 +05:30
Alex Voicu
5a1f823739
General sync memcpy improvements. Add hipMemcpyWithStream ( #1673 )
...
* General sync memcpy improvements. Add `hipMemcpyWithStream`
* Update hip_memory.cpp
2019-11-20 21:36:37 +05:30
Rahul Garg
b3161e9fa0
Update error codes for hipGetDevice for doxygen and move up null check ( #1668 )
...
* [docs] Update error codes for hipGetDevice
* Move up out ptr check
2019-11-20 21:35:27 +05:30
Paul Fultz II
8519a1411c
Fix helper header when using c++17 ( #1666 )
...
This will fix issue #1621 . It also adds tests for is_callable with c++11, c++14, and c++17.
The fallback implementation was completely broken, so I rewrote it so that it passes the tests as well. This should be used instead of PR #1631 .
2019-11-20 21:33:42 +05:30
Alex Voicu
b5b3d1bbaa
Extend vector type capabilities and add tests to reflect it. ( #1656 )
2019-11-20 21:32:32 +05:30
Sarbojit2019
153a959280
Revert [HIP] Fixed hipStreamAddCallback ( #1674 )
...
This reverts commit 45613311d7 .
Addresses SWDEV#212675.
2019-11-20 11:55:46 +05:30
Jeff Daily
3a7eb694f5
hipStreamSynchronize can skip marker if stream is empty ( #1667 )
2019-11-19 09:42:43 -08:00
Rahul Garg
188752b2cb
[dtest] Rename hipExtLaunchMultiKernelMultiDevice dtest ( #1671 )
2019-11-19 09:41:13 -08:00
Sameer Sahasrabuddhe
363caf29c6
hostcall: add tests that are only used by the VDI build ( #1664 )
2019-11-18 19:52:40 -08:00
mhbliao
ebe0c56f4f
Fix mathlib and app builds with hip-clang. ( #1665 )
2019-11-18 08:18:20 -08:00
Rahul Garg
e39d7497ec
Fix gcc build on NVCC path ( #1661 )
...
* Fix gcc build on NVCC path
* Fix CI build errors
* [dtest] Fix texture and surface obj2D tests
2019-11-18 12:19:22 +05:30
Nick Curtis
3f2316086f
fix complex conjugate for double-complex ( #1659 )
...
The sign of the y component returned from hipConj was incorrect for double-complex. Fixed to match hipConjf above.
2019-11-18 12:19:12 +05:30
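For the hipConj fix above ( #1659 ), a minimal sketch of what conjugation should do to the y (imaginary) component may help. It does not reproduce the HIP header: `Complex64` and `conj64` are hypothetical names used only for illustration, standing in for a double-precision complex type with `x` and `y` members.

```cpp
// Hypothetical stand-in for a double-precision complex value; Complex64 and
// conj64 are illustrative names and are not part of the HIP headers.
struct Complex64 {
    double x;  // real part
    double y;  // imaginary part
};

// Conjugation keeps the real part unchanged and negates the imaginary part.
constexpr Complex64 conj64(Complex64 z) {
    return Complex64{z.x, -z.y};
}

// Compile-time checks: only the sign of y flips.
static_assert(conj64(Complex64{1.5, -2.0}).x == 1.5, "real part is preserved");
static_assert(conj64(Complex64{1.5, -2.0}).y == 2.0, "imaginary part is negated");
```

The same rule applies to the single-precision case; only the component type differs.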
Sarbojit2019
b865a50e44
Added null check in hipEventSynchronize ( #1627 )
...
* Added missing null check in hipEventSynchronize
* Minor correction in the Event API description
2019-11-18 12:18:55 +05:30
jiabaxie
e73927caee
Updated hipEnvVarDriver to work with Windows ( #1614 )
...
* Updated hipEnvVarDriver to work with Windows
* Cleaned up a bit of code
* Fixed a part where putenv was used for both win and linux
* Defines moved to test_common.h and cleaned up code
* Cleaned up some macro defines and used const char instead
* Got rid of some excess commenting
* directory paths are unconditional
* Cleaned some duplicate code, and variables are now declared and defined together
2019-11-18 12:18:29 +05:30
kjayapra-amd
0a68be8b5b
[dtests] Fixing compilation error in saxpy.cpp ( #1608 )
2019-11-18 12:18:11 +05:30
Alex Voicu
69e74c3e96
Add support for extended launch syntax. ( #1530 )
...
* Add support for extended launch syntax.
* Add unit test.
* Fix typo
* hipExtLaunchKernelGGL lives in hip_ext.h
Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff
* [dtest] remove redundant include from hipModuleGetGlobal dtest
2019-11-16 22:24:07 -08:00
Maneesh Gupta
f9fec8fdca
[ci] Fix failures on nvcc path ( #1654 )
...
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78
2019-11-15 07:58:40 +05:30
Maneesh Gupta
2ca1a82036
[packaging] Conform to debian package naming rules ( #1647 )
...
* [packaging] Conform to debian package naming rules
* Add hip_hcc metapackage for backward compatibility
2019-11-14 11:40:34 +05:30
Evgeny Mankov
0a74d771b4
Merge pull request #1651 from emankov/hipify
...
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
2019-11-12 16:06:25 +03:00
Evgeny Mankov
16c8ad0f99
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
2019-11-12 16:04:28 +03:00
Maneesh Gupta
dae8630592
Bump version to 3.0 ( #1648 )
...
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab
2019-11-10 20:23:58 -08:00
Rahul Garg
d80917d147
Merge pull request #1643 from mhbliao/hliao/master/remove.redundant.decl
...
Remove redundant declarations.
2019-11-07 11:16:40 -08:00
Michael LIAO
d6ff22510e
Remove redundant declarations.
...
- The revised `hip/hip_ext.h` has those declarations.
2019-11-07 10:11:22 -05:00
Evgeny Mankov
3be0865cad
Merge pull request #1642 from emankov/hipify
...
[HIPIFY] Clang style formatting
2019-11-07 12:49:52 +03:00
Evgeny Mankov
93bc9c9195
[HIPIFY] Clang style formatting
2019-11-07 12:48:05 +03:00
Evgeny Mankov
67f787e74a
Merge pull request #1641 from emankov/hipify
...
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
2019-11-07 11:35:15 +03:00
Evgeny Mankov
0c07b6b421
[HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well
2019-11-07 11:32:53 +03:00
Evgeny Mankov
c2c144ac04
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
...
[Reason] It turned out that it is not so: 2 jobs are always
2019-11-07 11:30:40 +03:00
Evgeny Mankov
8f33397d61
Merge pull request #1640 from emankov/hipify
...
[HIPIFY] Reorder options propagation to clang
2019-11-07 11:23:38 +03:00
Evgeny Mankov
16a27213e4
[HIPIFY] Reorder options propagation to clang
...
[Reason] Leave overriding opportunity for -D, -std=c++XX, etc.
2019-11-07 11:21:06 +03:00
Jeff Daily
e31e0ca12e
General hipMemset improvements ( #1495 )
...
* hipMemset et al can use HSA API directly for synchronous cases
* lock and flush stream in hipMemset, hold lock until complete
* move hipMemset async check to front of conditional
* use hsa_amd_memory_fill for additional sync memset cases
code cleanup/review for all memset calls
* Fix inversion of execution mutating value.
* ihipMemsetSync fall back to kernel if HSA memset fails
* Never fallback, never surrender.
* Allow NULL stream.
* Optimise memset kernel. Remove deadwood.
* Update hip_memory.cpp
* Clean up stream logic in sync memset
* Revert "Clean up stream logic in sync memset"
This reverts commit 6117dedf673367f44cc704192573a117a3d92477.
2019-11-07 13:19:54 +05:30
Alex Voicu
5530c15cc3
Remove native vector support from the GCC case, since it never worked ( #1637 )
2019-11-07 13:19:14 +05:30
Sarbojit2019
45613311d7
[HIP] Fixed hipStreamAddCallback [SWDEV#165185] ( #1425 )
...
Fixed hipStreamAddCallback() as requested in SWDEV#165185
Added unit test to test the behavior
2019-11-07 13:18:12 +05:30
ansurya
e07926ce0f
Fixed texture 2D mapping for pitched arrays & 3D Texture read ( #1415 )
...
Texture 2D image mapping for pitched arrays:
github issue: Texture Object's Buffer seems to be Misaligned #886
JIRA ticket: SWDEV-199313
SWDEV-151670 : Fixed issue with 3D texture with 4 components
SWDEV-151671 : Issue with 2D layered texture with 4 components
2019-11-07 13:17:46 +05:30
Rahul Garg
579a4f36fa
Rename hip/hip_hcc.h to hip/hip_ext.h ( #1341 )
...
* Rename hip/hip_hcc.h to hip/hip_ext.h
* Deprecate hip_hcc.h
2019-11-07 13:17:10 +05:30
Rahul Garg
2fa45c215a
Merge pull request #1630 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
__half2 should walk like CUDA and talk like CUDA
2019-11-06 10:25:03 -08:00
Rahul Garg
85446dbd6a
Merge pull request #1620 from jeffdaily/hipEventRecord_patch
...
fix race condition in hipEventRecord
2019-11-06 10:23:24 -08:00
Evgeny Mankov
30c138eda2
Merge pull request #1636 from emankov/hipify-clang
...
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
2019-11-06 19:27:40 +03:00
Evgeny Mankov
9ca4e96641
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
2019-11-06 19:25:42 +03:00
Evgeny Mankov
7875bb789b
Merge pull request #1635 from emankov/hipify-clang
...
[HIPIFY][doc] Update README.md
2019-11-06 19:20:02 +03:00
Evgeny Mankov
96483d0de5
[HIPIFY][doc] Update README.md
...
+ Supported versions, testing
2019-11-06 19:18:13 +03:00