Rahul Garg
b68c8d2f60
Formatting changes
...
[ROCm/hip commit: 55f2a38120 ]
2019-10-30 18:12:51 -07:00
Rahul Garg
8429e15052
Formatting changes ,variable name and check update
...
[ROCm/hip commit: 4ab71216b4 ]
2019-10-30 18:09:21 -07:00
amd-lthakur
5e11495936
Excluded the test case for nvcc platform
...
[ROCm/hip commit: 4239c94fe5 ]
2019-10-25 15:52:11 +05:30
amd-lthakur
158cab3bb7
Refactored the file as suggested
...
[ROCm/hip commit: 564418c308 ]
2019-10-25 10:44:38 +05:30
amd-lthakur
629a933b63
Update matmul.cpp
...
[ROCm/hip commit: 318df5c36b ]
2019-10-25 09:22:07 +05:30
amd-lthakur
4b771db194
Update hipExtModuleLaunchKernel.cpp
...
[ROCm/hip commit: cd25149225 ]
2019-10-25 09:19:49 +05:30
amd-lthakur
297a20eac7
Adding a directed test case for hipExtModuleLaunchKernel() api.
...
[ROCm/hip commit: 8b496e4715 ]
2019-10-24 15:06:28 +05:30
Rahul Garg
66f0280f0b
Merge pull request #1559 from vsytch/win10_aligned_alloc
...
Fixes for hipMemcpy_simple on Windows
[ROCm/hip commit: fe5f7d4245 ]
2019-10-23 13:10:59 -07:00
Evgeny Mankov
0cc85a5619
Merge pull request #1578 from emankov/doc
...
[HIPIFY][cmake][#1571 ] Take into account building hipify-clang as a part of building HIP while installing
[ROCm/hip commit: 29e04f99b5 ]
2019-10-23 21:23:05 +03:00
Evgeny Mankov
6a0ce151e5
[HIPIFY][cmake][ #1571 ] Take into account building hipify-clang as a part of building HIP while installing
...
[Algorithm]
[Release]
If CMAKE_INSTALL_PREFIX is set by the user:
If BIN_INSTALL_DIR is set by HIP, use it as CMAKE_INSTALL_PREFIX, otherwise CMAKE_INSTALL_PREFIX is used unchanged.
If the user does not set CMAKE_INSTALL_PREFIX (CMAKE_INSTALL_PREFIX_INITIALIZED_TO_DEFAULT):
If BIN_INSTALL_DIR is set by HIP, use it as CMAKE_INSTALL_PREFIX, otherwise use PROJECT_BINARY_DIR/bin for installation.
[Debug]
If CMAKE_INSTALL_PREFIX is set by the user:
CMAKE_INSTALL_PREFIX is used unchanged.
If the user does not set CMAKE_INSTALL_PREFIX (CMAKE_INSTALL_PREFIX_INITIALIZED_TO_DEFAULT):
use CMAKE_CURRENT_SOURCE_DIR/bin for installation.
Standalone build left unchanged: CMAKE_INSTALL_PREFIX is used if set.
[ROCm/hip commit: 75d70a6714 ]
2019-10-23 18:54:45 +03:00
Evgeny Mankov
2b201de7a9
Merge pull request #1574 from emankov/hipify-clang
...
[HIPIFY] Disable delayed template parsing
[ROCm/hip commit: cc9efa707c ]
2019-10-22 19:09:13 +03:00
Evgeny Mankov
d39793f0f7
[HIPIFY] Disable delayed template parsing
...
By implicit unconditional passing -fno-delayed-template-parsing option (which appeared in LLVM 3.8.0, thus doesn't need compatibility wrapping) to hipify-clang.
[Reason] To parse uncalled template functions otherwise they are not parsed without calling, thus not hipified.
Affects cub_03.cu test, which has uncalled global template function.
[ROCm/hip commit: b6e6f12b54 ]
2019-10-22 19:07:37 +03:00
Evgeny Mankov
cf4cd41e2b
Merge pull request #1570 from emankov/doc
...
[HIPIFY][#1569 ] Fix
[ROCm/hip commit: 76c8406449 ]
2019-10-22 11:13:47 +03:00
Evgeny Mankov
9822351686
[HIPIFY][ #1569 ] Fix
...
[ROCm/hip commit: 6f88c81a78 ]
2019-10-22 11:08:37 +03:00
Evgeny Mankov
9f98afc49a
Merge pull request #1568 from emankov/hipify-clang
...
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA major…
[ROCm/hip commit: 239fb0a098 ]
2019-10-21 17:52:02 +03:00
Evgeny Mankov
a9b05f246b
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA major.minor version
...
[Reason] To support maximum CUDA features in offline tests
+ Add defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 600 restriction for atomicAdd on doubles in atomics.cu.
So if LLVM < 7 and --cuda-gpu-arch doesn't work, __CUDA_ARCH__ is unset too (350 by default in clang);
if LLVM >= 7 --cuda-gpu-arch is used and __CUDA_ARCH__ is set based on it.
[ROCm/hip commit: 39e7d213cf ]
2019-10-21 17:50:00 +03:00
Evgeny Mankov
97b65347e8
Merge pull request #1567 from emankov/hipify-clang
...
[HIPIFY][perl] Support of 'using namespace cub'
[ROCm/hip commit: 2ddde17039 ]
2019-10-21 17:16:32 +03:00
Evgeny Mankov
f697f14117
[HIPIFY][perl] Support of 'using namespace cub'
...
[ROCm/hip commit: b08f29a6fa ]
2019-10-21 17:15:05 +03:00
Evgeny Mankov
fbfb6d19d7
Merge pull request #1566 from emankov/hipify-clang
...
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA version
[ROCm/hip commit: 1caeb5613d ]
2019-10-21 15:54:34 +03:00
Evgeny Mankov
f345ec8c58
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA version
...
[Reason] To support maximum CUDA features in offline tests
+ Add CUDA_VERSION >= 800 restriction for atomics.cu
[TODO] Find a way to use or exclude atomicAdd for doubles if LLVM < 7, because
LLVM 6.0.1 and older do not use --cuda-gpu-arch in clang's Driver code at all (option is only declared)
[ROCm/hip commit: 14b4df126c ]
2019-10-21 15:51:25 +03:00
Evgeny Mankov
80f0470720
Merge pull request #1565 from emankov/hipify-clang
...
[HIPIFY][tests] Set -I for CUDA path instead of --cuda-path for LLVM < 4
[ROCm/hip commit: abb34bab8e ]
2019-10-20 20:10:25 +03:00
Evgeny Mankov
57b2e130e0
[HIPIFY][tests] Set -I for CUDA path instead of --cuda-path for LLVM < 4
...
[ROCm/hip commit: 6cfea9b600 ]
2019-10-20 20:08:56 +03:00
Evgeny Mankov
b8c859c2ab
Merge pull request #1564 from emankov/hipify-clang
...
[HIPIFY][tests] Exclude all CUB tests if CUDA_CUB_ROOT_DIR is not set
[ROCm/hip commit: 7fb633bcc7 ]
2019-10-20 20:04:18 +03:00
Evgeny Mankov
9ec744aaae
[HIPIFY][tests] Exclude all CUB tests if CUDA_CUB_ROOT_DIR is not set
...
[ROCm/hip commit: ccb075b1db ]
2019-10-20 20:03:18 +03:00
Vladislav Sytchenko
05cf0313ce
Remove extra #endif.
...
[ROCm/hip commit: 664b115c44 ]
2019-10-18 16:40:29 -04:00
Evgeny Mankov
05d8e6b7ec
Merge pull request #1562 from emankov/doc
...
[HIPIFY][CUB][#1460 ] Add "using namespace cub" translation support
[ROCm/hip commit: 3baf7f8d93 ]
2019-10-18 18:56:34 +03:00
Evgeny Mankov
376037abfc
[HIPIFY][tests] Test clean-up
...
[ROCm/hip commit: 82adc93e69 ]
2019-10-18 18:55:52 +03:00
Evgeny Mankov
55e2ef4c7a
[HIPIFY][CUB][ #1460 ] Add "using namespace cub" translation support
...
+ Add cub_03.cu
[ROCm/hip commit: 98874c0e7f ]
2019-10-18 18:51:40 +03:00
Evgeny Mankov
5c300e3320
Merge pull request #1558 from aaronenyeshi/fix-hipify-cmake-version
...
[HIPIFY][cmake] Make CMakeLists use default 3.5.1 for Ubuntu 16.04
[ROCm/hip commit: f0ed210b19 ]
2019-10-18 06:39:35 +03:00
Rahul Garg
80ee1cc5a5
Merge pull request #1550 from yxsamliu/new-launch
...
Add -fhip-new-launch-api to hipcc for HIP/VDI
[ROCm/hip commit: 1fd16d7601 ]
2019-10-17 19:07:32 -07:00
Vladislav Sytchenko
fd17d3b737
_aligned_malloc() on Windows first takes size, then alignment, which is the opposite of how the similar function behaves on Linux. Memory allocated by it also has to be freed using _aligned_free(), unlike Linux where we can use regular free().
...
Edit aligned_alloc() macro and add a aligned_free() one to align with the above behaviour.
[ROCm/hip commit: 8f0a226660 ]
2019-10-17 18:58:32 -04:00
Aaron Enye Shi
c3b3de7f95
[HIPIFY][cmake] Make CMakeLists use default 3.5.1 for Ubuntu 16.04
...
[ROCm/hip commit: 31e57f8b64 ]
2019-10-17 21:21:24 +00:00
Evgeny Mankov
bd1e152f75
Merge pull request #1557 from emankov/hipify-clang
...
[HIPIFY][doc] Update README.md
[ROCm/hip commit: 10cc2f4ab3 ]
2019-10-17 22:28:16 +03:00
Evgeny Mankov
222fb46a92
[HIPIFY][doc] Update README.md
...
+ Versions, testing
[ROCm/hip commit: 7ecbd71004 ]
2019-10-17 22:26:48 +03:00
Rahul Garg
d3a40b7b2e
Revert "hipcc defaults to code object v3 ( #1298 )"
...
This reverts commit 7dbc574a4b .
[ROCm/hip commit: 5f37f3174a ]
2019-10-17 13:27:28 -04:00
Evgeny Mankov
5ef2f98fab
Merge pull request #1554 from emankov/clang
...
[HIPIFY][cmake] Add install rule for clang-resource-headers
[ROCm/hip commit: d8f512dcae ]
2019-10-17 16:50:25 +03:00
Evgeny Mankov
4e3443c69e
[HIPIFY][cmake] Add install rule for clang-resource-headers
...
+ Fix: set destination for all installing files to ${CMAKE_INSTALL_PREFIX}
[ROCm/hip commit: f19e7c29df ]
2019-10-17 15:05:55 +03:00
Rahul Garg
f82e9cd090
Merge pull request #1544 from vsytch/master
...
QoL changes to the hipMemset family
[ROCm/hip commit: e1aac060da ]
2019-10-16 18:54:20 -07:00
Evgeny Mankov
ac54ee3df6
Merge pull request #1551 from emankov/clang
...
[HIPIFY][CUB][#1460 ] Add cub:: namespace support in TemplateInstantiation of cudaLaunchKernel
[ROCm/hip commit: 84a73406e9 ]
2019-10-16 19:05:18 +03:00
Evgeny Mankov
60c5f0a6c7
[HIPIFY][CUB][ #1460 ] Add cub:: namespace support in TemplateInstantiation of cudaLaunchKernel
...
+ Update cub_02.cu test accordingly
[ROCm/hip commit: edfd05a86d ]
2019-10-16 19:02:13 +03:00
Vladislav Sytchenko
779d723ac2
hipMemset2D and hipMemset3D tests should be passing by default.
...
[ROCm/hip commit: c747b77ac1 ]
2019-10-16 11:02:38 -04:00
Evgeny Mankov
7c4b4f11d4
Merge pull request #1548 from emankov/clang
...
[HIPIFY] Refactor a couple of matcher functions
[ROCm/hip commit: e805e7d8cb ]
2019-10-16 13:45:59 +03:00
Evgeny Mankov
187eeb86bc
[HIPIFY] Refactor a couple of matcher functions
...
+ Separate out GetSubstrLocation function for finding substr SourceLocation in a given SourceRange
[ROCm/hip commit: 809a67a4f6 ]
2019-10-16 13:43:56 +03:00
Evgeny Mankov
190ffe1acc
Merge pull request #1547 from emankov/clang
...
[HIPIFY][CUB][#1460 ] Implement cubFunctionTemplateDecl matcher
[ROCm/hip commit: a80bad474b ]
2019-10-16 13:09:49 +03:00
Evgeny Mankov
dcbe9432bd
[HIPIFY][CUB][ #1460 ] Implement cubFunctionTemplateDecl matcher
...
+ Add cub_02.cu test
+ Partial fixes #1460
[ROCm/hip commit: 6960574850 ]
2019-10-16 13:08:11 +03:00
kjayapra-amd
c6fd681dd5
Use the correct return type in runTest in 11_texture_driver sample. ( #1546 )
...
Fixes SWDEV-203394.
Currently in runTest() returns true, even if the texture reference copy does not happen. Using the existing testResult Flag to return from runTest().
[ROCm/hip commit: fbb98f4950 ]
2019-10-16 10:52:15 +05:30
vsytch
eaa616bf6d
Update hipMathFunctions, hipTestHalf and hipTestNativeHalf tests to support Navi10 and Navi14. ( #1545 )
...
[ROCm/hip commit: 2fb734ef5f ]
2019-10-16 10:51:48 +05:30
kpyzhov
cba73a44ef
[hipcc] Temporary add -D_OPENMP to clang options to workaround cmake issue ( #1540 )
...
* Temporary add -D_OPENMP to clang options in hipcc to allow using CMake OpenMP detection with hip-clang (until updated CMake version is available).
[ROCm/hip commit: 596bf4e326 ]
2019-10-16 10:51:28 +05:30
Nick Curtis
d2e9718d23
Guard against division by zero for no VGPR usage (e.g., in an empty kernel) ( #1528 )
...
* guard against division by zero for no VGPR usage (e.g., in an empty kernel)
* fix bracket format
* clean up parenthesis
[ROCm/hip commit: 73ca2b0083 ]
2019-10-16 10:49:56 +05:30
Jatin Chaudhary
22978cb1a3
Adding code object manager to rtc ( #1526 )
...
Adding Code Object Manager file to rtc to resolve address of Bundled_code_object in libhiprtc.so
[ROCm/hip commit: c47c2c4f9a ]
2019-10-16 10:49:16 +05:30