Alex Voicu
ee5097f2c2
Accessors should work even when oddly volatile.
2019-11-01 22:18:01 +02:00
Evgeny Mankov
a3f6e0eda4
Merge pull request #1615 from emankov/hipify
...
[HIPIFY][CUB][#1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
2019-11-01 14:35:55 +03:00
Evgeny Mankov
c48fdefee8
[HIPIFY][CUB][ #1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
2019-11-01 14:34:18 +03:00
Rahul Garg
4739e68bbe
Merge pull request #1582 from amd-lthakur/hipExtMLK
...
Adding a directed test case for hipExtModuleLaunchKernel() api.
2019-10-31 17:13:26 -07:00
Rahul Garg
782cf1c007
Merge pull request #1598 from lmoriche/master
...
Fix a code object memory corruption
2019-10-31 17:12:24 -07:00
Rahul Garg
85d70086cb
Add stream
2019-10-31 12:15:56 -04:00
Rahul Garg
efe6fa86dc
Fix HIP init calls in hipMemcpy2DFromArray
2019-10-31 12:15:56 -04:00
Evgeny Mankov
6986818172
Merge pull request #1612 from emankov/hipify
...
[HIPIFY][cmake][#1572 ] Fix: Do not override CMAKE_INSTALL_PREFIX
2019-10-31 16:58:36 +03:00
Evgeny Mankov
f563772a25
[HIPIFY][cmake][ #1572 ] Fix: Do not override CMAKE_INSTALL_PREFIX
...
Affects building with HIP, standalone building is not changed
2019-10-31 16:55:06 +03:00
Rahul Garg
55f2a38120
Formatting changes
2019-10-30 18:12:51 -07:00
Rahul Garg
4ab71216b4
Formatting changes ,variable name and check update
2019-10-30 18:09:21 -07:00
Rahul Garg
ba8105e0cd
Merge pull request #1515 from ansurya/tex_unbind_issue_fix
...
Fix undefined ref to hipUnbindTexture for texture types
2019-10-30 17:54:15 -07:00
Laurent Morichetti
91748f4e6c
Addressed review comments
...
Change comment "must exceed" to "must be no shorter than"
move the std::string instead of creating a copy
2019-10-30 13:14:41 -07:00
Evgeny Mankov
77962371e7
Merge pull request #1593 from emankov/doc
...
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
2019-10-30 22:10:05 +03:00
Rahul Garg
eb7813c0fb
Merge pull request #1607 from mhbliao/hliao/master/missing.api.hip.clang
...
[HIP] Correct headers and add missing function templates for hip-clang.
2019-10-30 07:48:57 -07:00
Michael LIAO
5c8a7521f4
[HIP] Correct headers and add missing function templates for hip-clang.
...
- Fix 2 runtime API prototypes
`hipOccupancyMaxActiveBlocksPerMultiprocessor` and
`hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags`
- Add missing function templates of them in hip-clang.
2019-10-29 22:00:11 -04:00
Rahul Garg
4d04baf0cd
Merge pull request #1602 from ROCm-Developer-Tools/revert-1560-satyanveshd/hipoccupy
...
Revert "Cooperative groups match with cuda SWDEV-205006"
2019-10-29 16:54:36 -07:00
Evgeny Mankov
7d0a10cf00
Merge pull request #1604 from emankov/hipify
...
[HIPIFY][#1603 ] Fix
2019-10-29 22:12:39 +03:00
Evgeny Mankov
389b5ec957
[HIPIFY][ #1603 ] Fix
2019-10-29 22:10:36 +03:00
Rahul Garg
e4a1e44162
Revert "Fix occupany APIs ( #1560 )"
...
This reverts commit af351d7e1b .
2019-10-29 11:41:08 -07:00
Evgeny Mankov
265d34e160
Merge pull request #1601 from emankov/hipify
...
[HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
2019-10-29 20:55:29 +03:00
Evgeny Mankov
85087644da
[HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
...
[Reason] It doesn't work with LLVM 9 and higher; Windows is fine
2019-10-29 20:53:54 +03:00
Evgeny Mankov
764a1f0023
Merge pull request #1600 from emankov/hipify
...
[HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
2019-10-29 19:47:15 +03:00
Evgeny Mankov
3f2eefa82a
[HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
...
* LLVM < 9 continues using --cuda-host-only
2019-10-29 19:42:53 +03:00
Evgeny Mankov
bce616e037
Merge pull request #1599 from emankov/hipify
...
[HIPIFY] cudaMemcpy2DFromArray(Async) support
2019-10-29 19:14:00 +03:00
Evgeny Mankov
315a10a59d
[HIPIFY] cudaMemcpy2DFromArray(Async) support
2019-10-29 19:12:42 +03:00
Laurent Morichetti
7473140a76
Fix a code object memory corruption
...
The lifetime of the buffer given to
hsa_code_object_reader_create_from_memory must exceed that of the
code object reader. We need to create a copy of the code object
binary memory (file) that is kept allocated until the code object
reader is destroyed.
2019-10-29 08:23:57 -07:00
Evgeny Mankov
542fa41a9a
Merge pull request #1594 from emankov/HIP
...
[HIP][doc] Fix typo: AMD-clang -> HIP-clang
2019-10-28 23:22:57 +03:00
Evgeny Mankov
3a4165779a
[HIP][doc] NVIDIA-nvcc -> HIP-nvcc
2019-10-28 22:46:33 +03:00
Evgeny Mankov
46b164c17a
[HIP][doc] AMD-hcc -> HIP-hcc
2019-10-28 21:41:12 +03:00
Evgeny Mankov
06d9e426e0
[HIP][doc] Fix typo: AMD-clang -> HIP-clang
...
HIP-clang is already used below instead of AMD-clang
2019-10-28 21:19:21 +03:00
Evgeny Mankov
b089d905c6
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
...
[REASON]
Those vars (may) used by cmake in subdirectories (#1571 )
2019-10-28 21:07:00 +03:00
Evgeny Mankov
954d1847b2
Merge pull request #1590 from emankov/doc
...
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
2019-10-25 16:08:22 +03:00
Evgeny Mankov
70c5072302
[HIPIFY][tests] Rename the ambiguous call as well
2019-10-25 16:07:31 +03:00
Evgeny Mankov
0410d5dcd2
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
2019-10-25 16:04:20 +03:00
Anusha Godavarthy Surya
03623cc3f1
Merge branch 'master' into tex_unbind_issue_fix
2019-10-25 15:54:25 +05:30
amd-lthakur
4239c94fe5
Excluded the test case for nvcc platform
2019-10-25 15:52:11 +05:30
Anusha Godavarthy Surya
5f47e99ffe
merge from master
2019-10-25 15:52:09 +05:30
Alex Voicu
dabd939048
Add missing operators, fix GCC compilation. ( #1589 )
2019-10-25 15:44:24 +05:30
Alex Voicu
a855a13c22
Fix deadlock, remove old __sync_* use. ( #1584 )
...
This fixes a deadlock introduced by the switch to TTAS loops, and is therefore mildly urgent (to prevent the CI from hoovering in the broken code).
2019-10-25 15:44:17 +05:30
Rahul Garg
12e1a86ec1
[dtest] Fix hipMemset2D test ( #1579 )
...
Reverts changes made in #1399 . This is a RT api test. For testing hipMemAllocPitch , a new test should be written and that should use correct memset API.
2019-10-25 15:44:05 +05:30
Rahul Garg
356765a223
Add hipMemcpy2DfromArray ( #1510 )
...
Adds hipMemcpy2DFromArray and hipMemcpy2DFromArrayAsync equivalent to cudaMemcpy2DFromArray and cudaMemcpy2DFromArrayAsync.
2019-10-25 15:43:33 +05:30
Anusha Godavarthy Surya
259d8b4cdf
Merge branch 'master' into tex_unbind_issue_fix
2019-10-25 15:36:55 +05:30
Anusha Godavarthy Surya
ce04bdaa1a
Fixed CI build failure
2019-10-25 12:21:41 +05:30
amd-lthakur
564418c308
Refactored the file as suggested
2019-10-25 10:44:38 +05:30
amd-lthakur
318df5c36b
Update matmul.cpp
2019-10-25 09:22:07 +05:30
amd-lthakur
cd25149225
Update hipExtModuleLaunchKernel.cpp
2019-10-25 09:19:49 +05:30
Rahul Garg
70f2cd1317
Update profiling doc ( #1576 )
2019-10-24 17:51:55 +05:30
Jatin Chaudhary
770d3412f8
Adding New Analyze Target Merging with cppcheck ( #1583 )
2019-10-24 17:46:06 +05:30
Rahul Garg
04e10814d8
Add HIP checks in texture driver sample ( #1581 )
2019-10-24 17:45:51 +05:30