Rahul Garg
ebdd811632
Revert - Changes related to hipMemcpyWithStream ( #1718 )
...
Reverting #1673 , #1697 and #1707 .
Support for hipMemcpyWithStream and memcpy optimizations, will be brought in again once issues seen with these are resolved independently.
[ROCm/hip commit: e53fc316f1 ]
2019-12-06 09:51:53 +05:30
Rahul Garg
fa87abd58a
Fix hipMemcpyWithStream ( #1707 )
...
* Fix hipMemcpyWithStream
* Add a dtest for the same.
[ROCm/hip commit: 892d2a562e ]
2019-12-04 11:56:23 +05:30
Maneesh Gupta
4e564f783e
Revert changes for atomic FADD support when address is in LDS ( #1701 )
...
This reverts PR #1591 and follow-on PR #1695
[ROCm/hip commit: 32442c6506 ]
2019-11-29 11:58:12 +05:30
Alex Voicu
fd08d068c7
Use native support for atomic FADD when address is in LDS ( #1591 )
...
[ROCm/hip commit: d597e7ca20 ]
2019-11-22 07:53:48 +05:30
Paul Fultz II
27112bbdf0
Fix helper header when using c++17 ( #1666 )
...
This will fix issue #1621 . It also adds tests for is_callable with c++11, c++14, and c++17.
The fallback implementation was completely broken so I rewrote it so it pass the tests as well. This should be used instead of PR #1631 .
[ROCm/hip commit: 8519a1411c ]
2019-11-20 21:33:42 +05:30
Alex Voicu
4ec70a7e03
Extend vector type capabilities and add tests to reflect it. ( #1656 )
...
[ROCm/hip commit: b5b3d1bbaa ]
2019-11-20 21:32:32 +05:30
Sarbojit2019
6679cd3998
Revert [HIP] Fixed hipStreamAddCallback ( #1674 )
...
This reverts commit fa1e44aa0e .
Addresses SWDEV#212675.
[ROCm/hip commit: 153a959280 ]
2019-11-20 11:55:46 +05:30
Rahul Garg
f55b1cd6a4
[dtest] Rename hipExtLaunchMultiKernelMultiDevice dtest ( #1671 )
...
[ROCm/hip commit: 188752b2cb ]
2019-11-19 09:41:13 -08:00
Sameer Sahasrabuddhe
cfaf04a467
hostcall: add tests that are only used by the VDI build ( #1664 )
...
[ROCm/hip commit: 363caf29c6 ]
2019-11-18 19:52:40 -08:00
Rahul Garg
ecdba45d29
Fix gcc build on NVCC path ( #1661 )
...
* Fix gcc build on NVCC path
* Fix CI build errors
* [dtest] Fix texture and surface obj2D tests
[ROCm/hip commit: e39d7497ec ]
2019-11-18 12:19:22 +05:30
jiabaxie
cd3bf4e205
Updated hipEnvVarDriver to work with Windows ( #1614 )
...
* Updated hipEnvVarDriver to work with Windows
* Cleaned up a bit of code
* Fixed a part where putenv was used for both win and linux
* Defines moved to test_common.h and cleaned up code
* Cleaned up some macro defines and used const char instead
* Got rid of some excess commenting
* directory paths are unconditional
* Cleaned some duplicate code, and variables are now declared and defined together
[ROCm/hip commit: e73927caee ]
2019-11-18 12:18:29 +05:30
kjayapra-amd
fac2e14dab
[dtests] Fixing compilation error in saxpy.cpp ( #1608 )
...
[ROCm/hip commit: 0a68be8b5b ]
2019-11-18 12:18:11 +05:30
Alex Voicu
3ea404a8ba
Add support for extended launch syntax. ( #1530 )
...
* Add support for extended launch syntax.
* Add unit test.
* Fix typo
* hipExtLaunchKernelGGL lives in hip_ext.h
Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff
* [dtest] remove redundant include from hipModuleGetGlobal dtest
[ROCm/hip commit: 69e74c3e96 ]
2019-11-16 22:24:07 -08:00
Maneesh Gupta
5536940e12
[ci] Fix failures on nvcc path ( #1654 )
...
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78
[ROCm/hip commit: f9fec8fdca ]
2019-11-15 07:58:40 +05:30
Sarbojit2019
fa1e44aa0e
[HIP] Fixed hipStreamAddCallback [SWDEV#165185] ( #1425 )
...
Fixed hipStreamAddCallback() as requested in SWDEV#165185
Added unit test to test the behavior
[ROCm/hip commit: 45613311d7 ]
2019-11-07 13:18:12 +05:30
ansurya
92dcba11ac
Fixed texture 2D mapping for pitched arrays & 3D Texture read ( #1415 )
...
Texture 2D image mapping for pitched arrays:
github issue: Texture Object's Buffer seems to be Misaligned #886
JIRA ticket: SWDEV-199313
SWDEV-151670 : Fixed issue with 3D texture with 4 components
SWDEV-151671 : Issue with 2D layered texture with 4 components
[ROCm/hip commit: e07926ce0f ]
2019-11-07 13:17:46 +05:30
Rahul Garg
6968362d99
Rename hip/hip_hcc.h to hip/hip_ext.h ( #1341 )
...
* Rename hip/hip_hcc.h to hip/hip_ext.h
* Deprecate hip_hcc.h
[ROCm/hip commit: 579a4f36fa ]
2019-11-07 13:17:10 +05:30
Evgeny Mankov
b4a30a0c7a
Merge pull request #1624 from emankov/hipify
...
[HIPIFY][#1409 ] Fix for kernel launch macro expansion
[ROCm/hip commit: a601670cb7 ]
2019-11-05 14:04:51 +03:00
Evgeny Mankov
5a9c8168c1
[HIPIFY][ #1409 ] Fix for kernel launch macro expansion
...
+ Add a corresponding test kernel_launch_01.cu
+ Add isBefore() check to avoid crash on Replacement with negative length
TODO:
+ Compatibility with former LLVM versions
+ More complicated kernel launch tests
[ROCm/hip commit: 976f8e8bf6 ]
2019-11-05 14:00:13 +03:00
Aryan Salmanpour
2ad99a6326
[hip][tests] Add two more workgroup sizes for testing hipLaunchCooperativeKernel ( #1613 )
...
[ROCm/hip commit: cf92fae9e6 ]
2019-11-05 12:32:46 +05:30
Rahul Garg
25a5ca94de
Merge pull request #1582 from amd-lthakur/hipExtMLK
...
Adding a directed test case for hipExtModuleLaunchKernel() api.
[ROCm/hip commit: 4739e68bbe ]
2019-10-31 17:13:26 -07:00
Rahul Garg
b68c8d2f60
Formatting changes
...
[ROCm/hip commit: 55f2a38120 ]
2019-10-30 18:12:51 -07:00
Rahul Garg
8429e15052
Formatting changes ,variable name and check update
...
[ROCm/hip commit: 4ab71216b4 ]
2019-10-30 18:09:21 -07:00
Rahul Garg
7e742b1216
Merge pull request #1515 from ansurya/tex_unbind_issue_fix
...
Fix undefined ref to hipUnbindTexture for texture types
[ROCm/hip commit: ba8105e0cd ]
2019-10-30 17:54:15 -07:00
Rahul Garg
70449cfa92
Revert "Fix occupany APIs ( #1560 )"
...
This reverts commit 4f23f9cb18 .
[ROCm/hip commit: e4a1e44162 ]
2019-10-29 11:41:08 -07:00
Evgeny Mankov
17fd872099
[HIPIFY][tests] Rename the ambiguous call as well
...
[ROCm/hip commit: 70c5072302 ]
2019-10-25 16:07:31 +03:00
Evgeny Mankov
536376b341
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
...
[ROCm/hip commit: 0410d5dcd2 ]
2019-10-25 16:04:20 +03:00
Anusha Godavarthy Surya
5c77b7d19a
Merge branch 'master' into tex_unbind_issue_fix
...
[ROCm/hip commit: 03623cc3f1 ]
2019-10-25 15:54:25 +05:30
amd-lthakur
5e11495936
Excluded the test case for nvcc platform
...
[ROCm/hip commit: 4239c94fe5 ]
2019-10-25 15:52:11 +05:30
Anusha Godavarthy Surya
196bdea9c0
merge from master
...
[ROCm/hip commit: 5f47e99ffe ]
2019-10-25 15:52:09 +05:30
Rahul Garg
849ae2bff0
[dtest] Fix hipMemset2D test ( #1579 )
...
Reverts changes made in #1399 . This is a RT api test. For testing hipMemAllocPitch , a new test should be written and that should use correct memset API.
[ROCm/hip commit: 12e1a86ec1 ]
2019-10-25 15:44:05 +05:30
Anusha Godavarthy Surya
3007505d30
Merge branch 'master' into tex_unbind_issue_fix
...
[ROCm/hip commit: 259d8b4cdf ]
2019-10-25 15:36:55 +05:30
amd-lthakur
158cab3bb7
Refactored the file as suggested
...
[ROCm/hip commit: 564418c308 ]
2019-10-25 10:44:38 +05:30
amd-lthakur
629a933b63
Update matmul.cpp
...
[ROCm/hip commit: 318df5c36b ]
2019-10-25 09:22:07 +05:30
amd-lthakur
4b771db194
Update hipExtModuleLaunchKernel.cpp
...
[ROCm/hip commit: cd25149225 ]
2019-10-25 09:19:49 +05:30
satyanveshd
4f23f9cb18
Fix occupany APIs ( #1560 )
...
Addresses SWDEV-205006
[ROCm/hip commit: af351d7e1b ]
2019-10-24 17:44:47 +05:30
amd-lthakur
297a20eac7
Adding a directed test case for hipExtModuleLaunchKernel() api.
...
[ROCm/hip commit: 8b496e4715 ]
2019-10-24 15:06:28 +05:30
Rahul Garg
66f0280f0b
Merge pull request #1559 from vsytch/win10_aligned_alloc
...
Fixes for hipMemcpy_simple on Windows
[ROCm/hip commit: fe5f7d4245 ]
2019-10-23 13:10:59 -07:00
Evgeny Mankov
d39793f0f7
[HIPIFY] Disable delayed template parsing
...
By implicit unconditional passing -fno-delayed-template-parsing option (which appeared in LLVM 3.8.0, thus doesn't need compatibility wrapping) to hipify-clang.
[Reason] To parse uncalled template functions otherwise they are not parsed without calling, thus not hipified.
Affects cub_03.cu test, which has uncalled global template function.
[ROCm/hip commit: b6e6f12b54 ]
2019-10-22 19:07:37 +03:00
Evgeny Mankov
9822351686
[HIPIFY][ #1569 ] Fix
...
[ROCm/hip commit: 6f88c81a78 ]
2019-10-22 11:08:37 +03:00
Evgeny Mankov
a9b05f246b
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA major.minor version
...
[Reason] To support maximum CUDA features in offline tests
+ Add defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 600 restriction for atomicAdd on doubles in atomics.cu.
So if LLVM < 7 and --cuda-gpu-arch doesn't work, __CUDA_ARCH__ is unset too (350 by default in clang);
if LLVM >= 7 --cuda-gpu-arch is used and __CUDA_ARCH__ is set based on it.
[ROCm/hip commit: 39e7d213cf ]
2019-10-21 17:50:00 +03:00
Evgeny Mankov
f345ec8c58
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA version
...
[Reason] To support maximum CUDA features in offline tests
+ Add CUDA_VERSION >= 800 restriction for atomics.cu
[TODO] Find a way to use or exclude atomicAdd for doubles if LLVM < 7, because
LLVM 6.0.1 and older do not use --cuda-gpu-arch in clang's Driver code at all (option is only declared)
[ROCm/hip commit: 14b4df126c ]
2019-10-21 15:51:25 +03:00
Evgeny Mankov
57b2e130e0
[HIPIFY][tests] Set -I for CUDA path instead of --cuda-path for LLVM < 4
...
[ROCm/hip commit: 6cfea9b600 ]
2019-10-20 20:08:56 +03:00
Evgeny Mankov
9ec744aaae
[HIPIFY][tests] Exclude all CUB tests if CUDA_CUB_ROOT_DIR is not set
...
[ROCm/hip commit: ccb075b1db ]
2019-10-20 20:03:18 +03:00
Vladislav Sytchenko
05cf0313ce
Remove extra #endif.
...
[ROCm/hip commit: 664b115c44 ]
2019-10-18 16:40:29 -04:00
Evgeny Mankov
376037abfc
[HIPIFY][tests] Test clean-up
...
[ROCm/hip commit: 82adc93e69 ]
2019-10-18 18:55:52 +03:00
Evgeny Mankov
55e2ef4c7a
[HIPIFY][CUB][ #1460 ] Add "using namespace cub" translation support
...
+ Add cub_03.cu
[ROCm/hip commit: 98874c0e7f ]
2019-10-18 18:51:40 +03:00
Vladislav Sytchenko
fd17d3b737
_aligned_malloc() on Windows first takes size, then alignment, which is the opposite of how the similar function behaves on Linux. Memory allocated by it also has to be freed using _aligned_free(), unlike Linux where we can use regular free().
...
Edit aligned_alloc() macro and add a aligned_free() one to align with the above behaviour.
[ROCm/hip commit: 8f0a226660 ]
2019-10-17 18:58:32 -04:00
Rahul Garg
f82e9cd090
Merge pull request #1544 from vsytch/master
...
QoL changes to the hipMemset family
[ROCm/hip commit: e1aac060da ]
2019-10-16 18:54:20 -07:00
Evgeny Mankov
60c5f0a6c7
[HIPIFY][CUB][ #1460 ] Add cub:: namespace support in TemplateInstantiation of cudaLaunchKernel
...
+ Update cub_02.cu test accordingly
[ROCm/hip commit: edfd05a86d ]
2019-10-16 19:02:13 +03:00