Rahul Garg
b68c8d2f60
Formatting changes
...
[ROCm/hip commit: 55f2a38120 ]
2019-10-30 18:12:51 -07:00
Rahul Garg
8429e15052
Formatting changes, variable name and check update
...
[ROCm/hip commit: 4ab71216b4 ]
2019-10-30 18:09:21 -07:00
amd-lthakur
5e11495936
Excluded the test case for nvcc platform
...
[ROCm/hip commit: 4239c94fe5 ]
2019-10-25 15:52:11 +05:30
amd-lthakur
158cab3bb7
Refactored the file as suggested
...
[ROCm/hip commit: 564418c308 ]
2019-10-25 10:44:38 +05:30
amd-lthakur
629a933b63
Update matmul.cpp
...
[ROCm/hip commit: 318df5c36b ]
2019-10-25 09:22:07 +05:30
amd-lthakur
4b771db194
Update hipExtModuleLaunchKernel.cpp
...
[ROCm/hip commit: cd25149225 ]
2019-10-25 09:19:49 +05:30
amd-lthakur
297a20eac7
Adding a directed test case for the hipExtModuleLaunchKernel() API.
...
[ROCm/hip commit: 8b496e4715 ]
2019-10-24 15:06:28 +05:30
Rahul Garg
66f0280f0b
Merge pull request #1559 from vsytch/win10_aligned_alloc
...
Fixes for hipMemcpy_simple on Windows
[ROCm/hip commit: fe5f7d4245 ]
2019-10-23 13:10:59 -07:00
Evgeny Mankov
6a0ce151e5
[HIPIFY][cmake][ #1571 ] Take into account building hipify-clang as a part of building HIP while installing
...
[Algorithm]
[Release]
  If CMAKE_INSTALL_PREFIX is set by the user:
    If BIN_INSTALL_DIR is set by HIP, use it as CMAKE_INSTALL_PREFIX; otherwise CMAKE_INSTALL_PREFIX is used unchanged.
  If the user does not set CMAKE_INSTALL_PREFIX (CMAKE_INSTALL_PREFIX_INITIALIZED_TO_DEFAULT):
    If BIN_INSTALL_DIR is set by HIP, use it as CMAKE_INSTALL_PREFIX; otherwise use PROJECT_BINARY_DIR/bin for installation.
[Debug]
  If CMAKE_INSTALL_PREFIX is set by the user:
    CMAKE_INSTALL_PREFIX is used unchanged.
  If the user does not set CMAKE_INSTALL_PREFIX (CMAKE_INSTALL_PREFIX_INITIALIZED_TO_DEFAULT):
    Use CMAKE_CURRENT_SOURCE_DIR/bin for installation.
Standalone build left unchanged: CMAKE_INSTALL_PREFIX is used if set.
[ROCm/hip commit: 75d70a6714 ]
2019-10-23 18:54:45 +03:00
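The install-prefix rules from commit 75d70a6714 above can be read as a small decision function. The following is a minimal C++ sketch of that decision table only, not the actual CMake code; the struct, its field names, and `resolveInstallPrefix` are hypothetical, an empty `binInstallDir` stands for "not set by HIP", and the standalone-build case is not modeled.

```cpp
#include <cassert>
#include <string>

enum class BuildType { Release, Debug };

// Hypothetical container for the values the commit message refers to.
struct InstallInputs {
    BuildType buildType;
    bool prefixSetByUser;            // false == CMAKE_INSTALL_PREFIX_INITIALIZED_TO_DEFAULT
    std::string cmakeInstallPrefix;  // CMAKE_INSTALL_PREFIX
    std::string binInstallDir;       // BIN_INSTALL_DIR, empty if HIP did not set it
    std::string projectBinaryDir;    // PROJECT_BINARY_DIR
    std::string currentSourceDir;    // CMAKE_CURRENT_SOURCE_DIR
};

// Returns the directory the rules in the commit message would select
// when hipify-clang is built as part of HIP.
std::string resolveInstallPrefix(const InstallInputs& in) {
    // A user-provided prefix is expected to be non-empty.
    assert(!in.prefixSetByUser || !in.cmakeInstallPrefix.empty());

    if (in.buildType == BuildType::Release) {
        // In Release, BIN_INSTALL_DIR wins whenever HIP has set it.
        if (!in.binInstallDir.empty()) {
            return in.binInstallDir;
        }
        return in.prefixSetByUser ? in.cmakeInstallPrefix
                                  : in.projectBinaryDir + "/bin";
    }
    // In Debug, BIN_INSTALL_DIR is not consulted.
    return in.prefixSetByUser ? in.cmakeInstallPrefix
                              : in.currentSourceDir + "/bin";
}
```

Writing the rules this way makes the asymmetry visible: only the Release branch looks at BIN_INSTALL_DIR.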
Evgeny Mankov
d39793f0f7
[HIPIFY] Disable delayed template parsing
...
Done by implicitly and unconditionally passing the -fno-delayed-template-parsing option (which appeared in LLVM 3.8.0, thus doesn't need compatibility wrapping) to hipify-clang.
[Reason] To parse uncalled template functions; otherwise they are not parsed unless called, and thus are not hipified.
Affects the cub_03.cu test, which has an uncalled global template function.
[ROCm/hip commit: b6e6f12b54 ]
2019-10-22 19:07:37 +03:00
Evgeny Mankov
9822351686
[HIPIFY][ #1569 ] Fix
...
[ROCm/hip commit: 6f88c81a78 ]
2019-10-22 11:08:37 +03:00
Evgeny Mankov
a9b05f246b
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA major.minor version
...
[Reason] To support maximum CUDA features in offline tests
+ Add a defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 600 restriction for atomicAdd on doubles in atomics.cu.
If LLVM < 7, --cuda-gpu-arch doesn't work, so __CUDA_ARCH__ is not set from it either (350 by default in clang);
if LLVM >= 7, --cuda-gpu-arch is used and __CUDA_ARCH__ is set based on it.
[ROCm/hip commit: 39e7d213cf ]
2019-10-21 17:50:00 +03:00
Evgeny Mankov
f697f14117
[HIPIFY][perl] Support of 'using namespace cub'
...
[ROCm/hip commit: b08f29a6fa ]
2019-10-21 17:15:05 +03:00
Evgeny Mankov
f345ec8c58
[HIPIFY][tests] Set max clang's CudaArch for corresponding CUDA version
...
[Reason] To support maximum CUDA features in offline tests
+ Add CUDA_VERSION >= 800 restriction for atomics.cu
[TODO] Find a way to use or exclude atomicAdd for doubles if LLVM < 7, because
LLVM 6.0.1 and older do not use --cuda-gpu-arch in clang's Driver code at all (option is only declared)
[ROCm/hip commit: 14b4df126c ]
2019-10-21 15:51:25 +03:00
Evgeny Mankov
57b2e130e0
[HIPIFY][tests] Set -I for CUDA path instead of --cuda-path for LLVM < 4
...
[ROCm/hip commit: 6cfea9b600 ]
2019-10-20 20:08:56 +03:00
Evgeny Mankov
9ec744aaae
[HIPIFY][tests] Exclude all CUB tests if CUDA_CUB_ROOT_DIR is not set
...
[ROCm/hip commit: ccb075b1db ]
2019-10-20 20:03:18 +03:00
Vladislav Sytchenko
05cf0313ce
Remove extra #endif.
...
[ROCm/hip commit: 664b115c44 ]
2019-10-18 16:40:29 -04:00
Evgeny Mankov
376037abfc
[HIPIFY][tests] Test clean-up
...
[ROCm/hip commit: 82adc93e69 ]
2019-10-18 18:55:52 +03:00
Evgeny Mankov
55e2ef4c7a
[HIPIFY][CUB][ #1460 ] Add "using namespace cub" translation support
...
+ Add cub_03.cu
[ROCm/hip commit: 98874c0e7f ]
2019-10-18 18:51:40 +03:00
Evgeny Mankov
5c300e3320
Merge pull request #1558 from aaronenyeshi/fix-hipify-cmake-version
...
[HIPIFY][cmake] Make CMakeLists use default 3.5.1 for Ubuntu 16.04
[ROCm/hip commit: f0ed210b19 ]
2019-10-18 06:39:35 +03:00
Rahul Garg
80ee1cc5a5
Merge pull request #1550 from yxsamliu/new-launch
...
Add -fhip-new-launch-api to hipcc for HIP/VDI
[ROCm/hip commit: 1fd16d7601 ]
2019-10-17 19:07:32 -07:00
Vladislav Sytchenko
fd17d3b737
_aligned_malloc() on Windows first takes size, then alignment, which is the opposite of how the similar function behaves on Linux. Memory allocated by it also has to be freed using _aligned_free(), unlike Linux where we can use regular free().
...
Edit the aligned_alloc() macro and add an aligned_free() one to align with the above behaviour.
[ROCm/hip commit: 8f0a226660 ]
2019-10-17 18:58:32 -04:00
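The argument-order and deallocation differences described in commit 8f0a226660 above can be sketched as a pair of wrappers. This is an illustrative sketch, not the macros from the commit; the names `portable_aligned_alloc` and `portable_aligned_free` are hypothetical, and the non-Windows branch assumes a C library that provides C11 `aligned_alloc`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdlib.h>

#if defined(_WIN32)
#include <malloc.h>

// Windows: _aligned_malloc takes (size, alignment), and the memory
// must be released with _aligned_free, not free().
inline void* portable_aligned_alloc(std::size_t alignment, std::size_t size) {
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return _aligned_malloc(size, alignment);
}

inline void portable_aligned_free(void* p) { _aligned_free(p); }

#else

// Linux: aligned_alloc takes (alignment, size), and plain free() releases it.
// C11 expects size to be a multiple of alignment.
inline void* portable_aligned_alloc(std::size_t alignment, std::size_t size) {
    assert(alignment != 0 && (alignment & (alignment - 1)) == 0);
    return aligned_alloc(alignment, size);
}

inline void portable_aligned_free(void* p) { free(p); }

#endif
```

Callers always pass the alignment first and always release through the matching free wrapper, so the platform difference stays in one place.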
Aaron Enye Shi
c3b3de7f95
[HIPIFY][cmake] Make CMakeLists use default 3.5.1 for Ubuntu 16.04
...
[ROCm/hip commit: 31e57f8b64 ]
2019-10-17 21:21:24 +00:00
Evgeny Mankov
222fb46a92
[HIPIFY][doc] Update README.md
...
+ Versions, testing
[ROCm/hip commit: 7ecbd71004 ]
2019-10-17 22:26:48 +03:00
Rahul Garg
d3a40b7b2e
Revert "hipcc defaults to code object v3 ( #1298 )"
...
This reverts commit 7dbc574a4b .
[ROCm/hip commit: 5f37f3174a ]
2019-10-17 13:27:28 -04:00
Evgeny Mankov
4e3443c69e
[HIPIFY][cmake] Add install rule for clang-resource-headers
...
+ Fix: set destination for all installing files to ${CMAKE_INSTALL_PREFIX}
[ROCm/hip commit: f19e7c29df ]
2019-10-17 15:05:55 +03:00
Rahul Garg
f82e9cd090
Merge pull request #1544 from vsytch/master
...
QoL changes to the hipMemset family
[ROCm/hip commit: e1aac060da ]
2019-10-16 18:54:20 -07:00
Evgeny Mankov
60c5f0a6c7
[HIPIFY][CUB][ #1460 ] Add cub:: namespace support in TemplateInstantiation of cudaLaunchKernel
...
+ Update cub_02.cu test accordingly
[ROCm/hip commit: edfd05a86d ]
2019-10-16 19:02:13 +03:00
Vladislav Sytchenko
779d723ac2
hipMemset2D and hipMemset3D tests should be passing by default.
...
[ROCm/hip commit: c747b77ac1 ]
2019-10-16 11:02:38 -04:00
Evgeny Mankov
187eeb86bc
[HIPIFY] Refactor a couple of matcher functions
...
+ Separate out GetSubstrLocation function for finding substr SourceLocation in a given SourceRange
[ROCm/hip commit: 809a67a4f6 ]
2019-10-16 13:43:56 +03:00
Evgeny Mankov
dcbe9432bd
[HIPIFY][CUB][ #1460 ] Implement cubFunctionTemplateDecl matcher
...
+ Add cub_02.cu test
+ Partial fixes #1460
[ROCm/hip commit: 6960574850 ]
2019-10-16 13:08:11 +03:00
kjayapra-amd
c6fd681dd5
Use the correct return type in runTest in 11_texture_driver sample. ( #1546 )
...
Fixes SWDEV-203394.
Currently runTest() returns true even if the texture reference copy does not happen. Use the existing testResult flag as the return value of runTest().
[ROCm/hip commit: fbb98f4950 ]
2019-10-16 10:52:15 +05:30
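The pattern fixed in commit fbb98f4950 above, returning the accumulated flag instead of an unconditional `true`, might look roughly like the sketch below. It is a host-only stand-in: the signature, the vector arguments, and the tolerance are assumptions for illustration, and the real sample works with a texture reference rather than two host buffers.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the sample's runTest(): compares the data read
// back from the device against the expected values.
bool runTest(const std::vector<float>& expected, const std::vector<float>& actual) {
    // An empty reference would make the check vacuous.
    assert(!expected.empty());

    bool testResult = true;
    if (expected.size() != actual.size()) {
        testResult = false;
    } else {
        for (std::size_t i = 0; i < expected.size(); ++i) {
            if (std::fabs(expected[i] - actual[i]) > 1e-6f) {
                testResult = false;  // the copy did not produce the expected data
                break;
            }
        }
    }
    // Return the flag rather than a hard-coded `true`, so a failed copy
    // is reported to the caller.
    return testResult;
}
```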
vsytch
eaa616bf6d
Update hipMathFunctions, hipTestHalf and hipTestNativeHalf tests to support Navi10 and Navi14. ( #1545 )
...
[ROCm/hip commit: 2fb734ef5f ]
2019-10-16 10:51:48 +05:30
kpyzhov
cba73a44ef
[hipcc] Temporarily add -D_OPENMP to clang options to work around cmake issue ( #1540 )
...
* Temporarily add -D_OPENMP to clang options in hipcc to allow using CMake OpenMP detection with hip-clang (until an updated CMake version is available).
[ROCm/hip commit: 596bf4e326 ]
2019-10-16 10:51:28 +05:30
Nick Curtis
d2e9718d23
Guard against division by zero for no VGPR usage (e.g., in an empty kernel) ( #1528 )
...
* guard against division by zero for no VGPR usage (e.g., in an empty kernel)
* fix bracket format
* clean up parenthesis
[ROCm/hip commit: 73ca2b0083 ]
2019-10-16 10:49:56 +05:30
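The guard described in commit 73ca2b0083 above can be illustrated with a small occupancy-style helper. This is a sketch of the idea only; the function name, its parameters, and the choice to fall back to the maximum wave count are assumptions, not the code changed in the commit.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical helper: how many wavefronts fit on a SIMD when each
// wavefront uses vgprsPerWave vector registers.
inline std::uint32_t wavesLimitedByVgprs(std::uint32_t vgprsPerSimd,
                                         std::uint32_t vgprsPerWave,
                                         std::uint32_t maxWavesPerSimd) {
    assert(maxWavesPerSimd > 0);
    // An empty kernel can report zero VGPRs; dividing by that would be
    // undefined, so treat it as "registers are not the limiting factor".
    if (vgprsPerWave == 0) {
        return maxWavesPerSimd;
    }
    return std::min(maxWavesPerSimd, vgprsPerSimd / vgprsPerWave);
}
```

With illustrative numbers, `wavesLimitedByVgprs(256, 64, 10)` gives 4, while a kernel reporting zero VGPRs gets the full 10 instead of triggering a division by zero.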
Jatin Chaudhary
22978cb1a3
Adding code object manager to rtc ( #1526 )
...
Adding Code Object Manager file to rtc to resolve address of Bundled_code_object in libhiprtc.so
[ROCm/hip commit: c47c2c4f9a ]
2019-10-16 10:49:16 +05:30
Xiaozhu Meng
8d8bc72f6a
Fix struct declaration for C ( #1524 )
...
This change is necessary for HPCToolkit to use Roctracer to produce a code-centric profiling view.
[ROCm/hip commit: e9adb8e932 ]
2019-10-16 10:48:55 +05:30
Yaxun (Sam) Liu
73763f8748
Add -fhip-new-launch-api to hipcc for HIP/VDI
...
[ROCm/hip commit: 4d5cb8351c ]
2019-10-15 21:47:33 -04:00
Vladislav Sytchenko
948dd510eb
In the hipMemset2D and hipMemset3D tests, synchronize with the default stream after performing an async memset.
...
[ROCm/hip commit: f5af263aba ]
2019-10-15 17:15:49 -04:00
Vladislav Sytchenko
6180a9edb3
Update indentation in the hipMemset3D test. Replace all tabs with four spaces.
...
[ROCm/hip commit: 346bfa90d6 ]
2019-10-15 15:29:14 -04:00
Vladislav Sytchenko
5a29d10dbb
Add async subtest to hipMemset3D
...
[ROCm/hip commit: 00425bdf3d ]
2019-10-15 14:24:04 -04:00
Vladislav Sytchenko
28089f61b6
hipMemset2D test should pass only if both async and sync subtests pass.
...
[ROCm/hip commit: 3db2ecc52b ]
2019-10-15 14:20:14 -04:00
Vladislav Sytchenko
370d039d24
Update the declarations of hipMemsetD8, hipMemsetD8Async, hipMemsetD16, hipMemsetD16Async. These functions are type aware and take in as their third argument the number of elements in the buffer, not the buffer size. Change the name of this argument from sizeBytes to count to align with the above description.
...
[ROCm/hip commit: 0b52c1d9d8 ]
2019-10-15 14:18:42 -04:00
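The distinction drawn in commit 0b52c1d9d8 above, an element count rather than a byte size, can be shown with a host-side analogue. The sketch below does not call HIP; `fillElements` and `bytesCovered` are hypothetical helpers that only mimic the "type-aware, count of elements" convention.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Host-side analogue of a type-aware memset: `count` is the number of
// elements of type T to write, not the size of the buffer in bytes.
template <typename T>
void fillElements(T* dst, T value, std::size_t count) {
    assert(dst != nullptr || count == 0);
    for (std::size_t i = 0; i < count; ++i) {
        dst[i] = value;
    }
}

// Number of bytes touched when `count` elements of type T are written.
template <typename T>
constexpr std::size_t bytesCovered(std::size_t count) {
    return count * sizeof(T);
}
```

For a 16-bit element type, a count of 4 covers 8 bytes; passing the byte size as the count would write twice as many elements as intended.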
Evgeny Mankov
46918133df
Merge pull request #1541 from emankov/doc
...
[HIPIFY][cmake] Make CMakeLists.txt compatible with default cmake 3.5.2 for Ubuntu 16.04
[ROCm/hip commit: dd665f446c ]
2019-10-15 17:11:39 +03:00
Evgeny Mankov
44ca50dfd0
[HIPIFY][tests] Exclude tests for the libs, which are not defined in cmake command line
...
+ Affects cuDNN and CUB tests, whose library paths are defined by CUDA_DNN_ROOT_DIR and CUDA_CUB_ROOT_DIR
+ Warn about excluding and why, for instance:
"WARN: cuDNN tests are excluded due to unset CUDA_DNN_ROOT_DIR"
[ROCm/hip commit: 3444834e3a ]
2019-10-15 14:20:23 +03:00
Evgeny Mankov
e49e3bfffa
[HIPIFY][cmake] Make CMakeLists.txt compatible with default cmake 3.5.2 for Ubuntu 16.04
...
+ Update README.md accordingly
[ROCm/hip commit: d57c9a844b ]
2019-10-15 11:26:03 +03:00
Evgeny Mankov
8456642b3d
[HIPIFY] HipifyAction code clean-up
...
[ROCm/hip commit: c0f62157f9 ]
2019-10-14 18:08:29 +03:00
Evgeny Mankov
b3f8782fb9
[HIPIFY][build fix] Add missing new source file
...
[ROCm/hip commit: 2994fb3429 ]
2019-10-14 16:32:57 +03:00
Evgeny Mankov
1554bb5e4e
Merge pull request #1534 from emankov/hipify-clang
...
[HIPIFY][CUB] Initial support (hipify-clang only)
[ROCm/hip commit: 4ffcb0b26f ]
2019-10-14 15:57:09 +03:00
Evgeny Mankov
d174cad0aa
[HIPIFY][CUB] Add missing unit test
...
[ROCm/hip commit: 92fb5a3a94 ]
2019-10-14 12:03:20 +03:00