Evgeny Mankov
388181ada7
Merge pull request #1595 from emankov/cmake
...
[HIP][cmake] Simplify `UNIX` related code (the beginning)
[ROCm/clr commit: 7b1ca3dd91 ]
2019-11-05 20:30:55 +03:00
Evgeny Mankov
ecad81b794
Merge pull request #1624 from emankov/hipify
...
[HIPIFY][#1409 ] Fix for kernel launch macro expansion
[ROCm/clr commit: cf5d1caab3 ]
2019-11-05 14:04:51 +03:00
Evgeny Mankov
3889be3031
[HIPIFY][ #1409 ] Fix for kernel launch macro expansion
...
+ Add a corresponding test kernel_launch_01.cu
+ Add isBefore() check to avoid crash on Replacement with negative length
TODO:
+ Compatibility with former LLVM versions
+ More complicated kernel launch tests
[ROCm/clr commit: a1c380f38d ]
2019-11-05 14:00:13 +03:00
Rahul Garg
92393d6ecf
Deprecate HIP Markers ( #1622 )
...
* Deprecate HIP markers
* Deprecate profiler start/stop
[ROCm/clr commit: 8b3fce8069 ]
2019-11-05 12:32:59 +05:30
Aryan Salmanpour
92a06b531d
[hip][tests] Add two more workgroup sizes for testing hipLaunchCooperativeKernel ( #1613 )
...
[ROCm/clr commit: 7a3d592ed0 ]
2019-11-05 12:32:46 +05:30
Rahul Garg
9d078409e3
Merge pull request #1618 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
Separate volatile for clarity. Handle assignment.
[ROCm/clr commit: 3b8a5cff2f ]
2019-11-02 13:50:59 -07:00
Alex Voicu
1a3d02fb08
Separate volatile for clarity. Handle assignment.
...
[ROCm/clr commit: ed0d6ec51e ]
2019-11-02 22:02:08 +02:00
Evgeny Mankov
8e369f9c7d
Merge pull request #1617 from emankov/hipify-clang
...
[HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
[ROCm/clr commit: 758ab23b8f ]
2019-11-02 14:21:21 +03:00
Evgeny Mankov
bc9690328f
[HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
...
+ Pass it to clang if specified
[ROCm/clr commit: 7142b884ab ]
2019-11-02 14:19:31 +03:00
Rahul Garg
91832b3b13
Merge pull request #1616 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
Accessors should work even when oddly volatile.
[ROCm/clr commit: dc702f5ffe ]
2019-11-01 13:45:54 -07:00
Alex Voicu
5c2503f0bd
Accessors should work even when oddly volatile.
...
[ROCm/clr commit: 2d76dde05b ]
2019-11-01 22:18:01 +02:00
Evgeny Mankov
3b84d8e638
Merge pull request #1615 from emankov/hipify
...
[HIPIFY][CUB][#1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
[ROCm/clr commit: 48bb6df7e2 ]
2019-11-01 14:35:55 +03:00
Evgeny Mankov
43d286e9ce
[HIPIFY][CUB][ #1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
...
[ROCm/clr commit: c5a2a2daf2 ]
2019-11-01 14:34:18 +03:00
Rahul Garg
2cf205bc7b
Merge pull request #1582 from amd-lthakur/hipExtMLK
...
Adding a directed test case for hipExtModuleLaunchKernel() api.
[ROCm/clr commit: 1bec1445bb ]
2019-10-31 17:13:26 -07:00
Rahul Garg
e61ff0408c
Merge pull request #1598 from lmoriche/master
...
Fix a code object memory corruption
[ROCm/clr commit: 2199f8a6c6 ]
2019-10-31 17:12:24 -07:00
Rahul Garg
138b0327da
Add stream
...
[ROCm/clr commit: f556e15361 ]
2019-10-31 12:15:56 -04:00
Rahul Garg
440a172f0e
Fix HIP init calls in hipMemcpy2DFromArray
...
[ROCm/clr commit: 0718ba0f00 ]
2019-10-31 12:15:56 -04:00
Evgeny Mankov
f472257a40
Merge pull request #1612 from emankov/hipify
...
[HIPIFY][cmake][#1572 ] Fix: Do not override CMAKE_INSTALL_PREFIX
[ROCm/clr commit: 8b99b0ffd8 ]
2019-10-31 16:58:36 +03:00
Evgeny Mankov
0cfdeda490
[HIPIFY][cmake][ #1572 ] Fix: Do not override CMAKE_INSTALL_PREFIX
...
Affects building with HIP, standalone building is not changed
[ROCm/clr commit: e79fd55d01 ]
2019-10-31 16:55:06 +03:00
Rahul Garg
31fc374120
Formatting changes
...
[ROCm/clr commit: 07f4431de8 ]
2019-10-30 18:12:51 -07:00
Rahul Garg
5bcc1f66c5
Formatting changes ,variable name and check update
...
[ROCm/clr commit: cd1435cbc7 ]
2019-10-30 18:09:21 -07:00
Rahul Garg
04785f2d54
Merge pull request #1515 from ansurya/tex_unbind_issue_fix
...
Fix undefined ref to hipUnbindTexture for texture types
[ROCm/clr commit: aeb7cebbad ]
2019-10-30 17:54:15 -07:00
Laurent Morichetti
f1f649e57c
Addressed review comments
...
Change comment "must exceed" to "must be no shorter than"
move the std::string instead of creating a copy
[ROCm/clr commit: 3243f06eef ]
2019-10-30 13:14:41 -07:00
Evgeny Mankov
5403e7edcc
Merge pull request #1593 from emankov/doc
...
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
[ROCm/clr commit: 961bc5737e ]
2019-10-30 22:10:05 +03:00
Rahul Garg
434231ac69
Merge pull request #1607 from mhbliao/hliao/master/missing.api.hip.clang
...
[HIP] Correct headers and add missing function templates for hip-clang.
[ROCm/clr commit: b94f5bd667 ]
2019-10-30 07:48:57 -07:00
Michael LIAO
64f2d5e861
[HIP] Correct headers and add missing function templates for hip-clang.
...
- Fix 2 runtime API prototypes
`hipOccupancyMaxActiveBlocksPerMultiprocessor` and
`hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags`
- Add missing function templates of them in hip-clang.
[ROCm/clr commit: 61bc68a5f4 ]
2019-10-29 22:00:11 -04:00
Rahul Garg
42ab7b830e
Merge pull request #1602 from ROCm-Developer-Tools/revert-1560-satyanveshd/hipoccupy
...
Revert "Cooperative groups match with cuda SWDEV-205006"
[ROCm/clr commit: 9840cdac99 ]
2019-10-29 16:54:36 -07:00
Evgeny Mankov
615b9b373b
Merge pull request #1604 from emankov/hipify
...
[HIPIFY][#1603 ] Fix
[ROCm/clr commit: daab61e8e8 ]
2019-10-29 22:12:39 +03:00
Evgeny Mankov
dc0186720c
[HIPIFY][ #1603 ] Fix
...
[ROCm/clr commit: 050fdad7b7 ]
2019-10-29 22:10:36 +03:00
Rahul Garg
72c686ed67
Revert "Fix occupany APIs ( #1560 )"
...
This reverts commit ad1e409a24 .
[ROCm/clr commit: 27221bc823 ]
2019-10-29 11:41:08 -07:00
Evgeny Mankov
1a9af0ea4e
Merge pull request #1601 from emankov/hipify
...
[HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
[ROCm/clr commit: 8a7e6fb747 ]
2019-10-29 20:55:29 +03:00
Evgeny Mankov
d8e846fc91
[HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
...
[Reason] It doesn't work with LLVM 9 and higher; Windows is fine
[ROCm/clr commit: dd2243f2fa ]
2019-10-29 20:53:54 +03:00
Evgeny Mankov
0989a885f9
Merge pull request #1600 from emankov/hipify
...
[HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
[ROCm/clr commit: 99c4a40da1 ]
2019-10-29 19:47:15 +03:00
Evgeny Mankov
21d798394a
[HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
...
* LLVM < 9 continues using --cuda-host-only
[ROCm/clr commit: 411b18a124 ]
2019-10-29 19:42:53 +03:00
Evgeny Mankov
532e8138d5
Merge pull request #1599 from emankov/hipify
...
[HIPIFY] cudaMemcpy2DFromArray(Async) support
[ROCm/clr commit: 50df94be1e ]
2019-10-29 19:14:00 +03:00
Evgeny Mankov
bb75fa46f0
[HIPIFY] cudaMemcpy2DFromArray(Async) support
...
[ROCm/clr commit: 5dd00bdf52 ]
2019-10-29 19:12:42 +03:00
Laurent Morichetti
0113961674
Fix a code object memory corruption
...
The lifetime of the buffer given to
hsa_code_object_reader_create_from_memory must exceed that of the
code object reader. We need to create a copy of the code object
binary memory (file) that is kept allocated until the code object
reader is destroyed.
[ROCm/clr commit: 66c91b42e6 ]
2019-10-29 08:23:57 -07:00
Evgeny Mankov
28a5dd488b
Merge pull request #1594 from emankov/HIP
...
[HIP][doc] Fix typo: AMD-clang -> HIP-clang
[ROCm/clr commit: 3921ea9057 ]
2019-10-28 23:22:57 +03:00
Evgeny Mankov
a4c7894255
[HIP][cmake] Simplify UNIX related code (the beginning)
...
[REASONS]
1. Make OS-dependent code more clear and readable
2. To ease Windows support
[ROCm/clr commit: 4c5c6b4910 ]
2019-10-28 23:22:27 +03:00
Evgeny Mankov
4e02b285d6
[HIP][doc] NVIDIA-nvcc -> HIP-nvcc
...
[ROCm/clr commit: 3df22b2fde ]
2019-10-28 22:46:33 +03:00
Evgeny Mankov
935dd4ce94
[HIP][doc] AMD-hcc -> HIP-hcc
...
[ROCm/clr commit: d312bce79d ]
2019-10-28 21:41:12 +03:00
Evgeny Mankov
20b127bf45
[HIP][doc] Fix typo: AMD-clang -> HIP-clang
...
HIP-clang is already used below instead of AMD-clang
[ROCm/clr commit: 6284b041e5 ]
2019-10-28 21:19:21 +03:00
Evgeny Mankov
0737167ee4
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
...
[REASON]
Those vars (may) used by cmake in subdirectories (#1571 )
[ROCm/clr commit: 8100e084b8 ]
2019-10-28 21:07:00 +03:00
Evgeny Mankov
1ef94a4f94
Merge pull request #1590 from emankov/doc
...
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
[ROCm/clr commit: 7f367ff933 ]
2019-10-25 16:08:22 +03:00
Evgeny Mankov
bcc9d88b20
[HIPIFY][tests] Rename the ambiguous call as well
...
[ROCm/clr commit: f68bee02f5 ]
2019-10-25 16:07:31 +03:00
Evgeny Mankov
91732f98c0
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
...
[ROCm/clr commit: 9529e1d91d ]
2019-10-25 16:04:20 +03:00
Anusha Godavarthy Surya
dfa019bdf6
Merge branch 'master' into tex_unbind_issue_fix
...
[ROCm/clr commit: 9332a39838 ]
2019-10-25 15:54:25 +05:30
amd-lthakur
6bb2cba965
Excluded the test case for nvcc platform
...
[ROCm/clr commit: 626cd5d07a ]
2019-10-25 15:52:11 +05:30
Anusha Godavarthy Surya
0140ea8e1a
merge from master
...
[ROCm/clr commit: ae838f8cee ]
2019-10-25 15:52:09 +05:30
Alex Voicu
f22391c362
Add missing operators, fix GCC compilation. ( #1589 )
...
[ROCm/clr commit: 40522e2b6a ]
2019-10-25 15:44:24 +05:30