Rahul Garg
a48954ef98
Merge pull request #1620 from jeffdaily/hipEventRecord_patch
...
fix race condition in hipEventRecord
[ROCm/hip commit: 85446dbd6a ]
2019-11-06 10:23:24 -08:00
Evgeny Mankov
282c76c26f
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
...
[ROCm/hip commit: 9ca4e96641 ]
2019-11-06 19:25:42 +03:00
Evgeny Mankov
7b216ca51c
[HIPIFY][doc] Update README.md
...
+ Supported versions, testing
[ROCm/hip commit: 96483d0de5 ]
2019-11-06 19:18:13 +03:00
Jeff Daily
f9ad564380
hipEventRecord only needs one lock; remove locked_eventIsReady
...
[ROCm/hip commit: 85080905c0 ]
2019-11-06 15:56:32 +00:00
Jeff Daily
8f033af434
remove ihipEvent_t::refreshEventStatus(), new hipEventElapsedTime
...
This fixes possible races in hipEventElapsedTime.
[ROCm/hip commit: 7986090d9c ]
2019-11-06 15:46:46 +00:00
Evgeny Mankov
1b6e4b495a
[HIPIFY][format] Apply clang style formatting in LLVMCompat
...
[ROCm/hip commit: 3b76fd0b1c ]
2019-11-06 14:43:22 +03:00
Evgeny Mankov
1776ff55d3
[HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7
...
[ROCm/hip commit: c87ab05fc6 ]
2019-11-06 14:30:39 +03:00
Rahul Garg
93bd983bbf
Merge pull request #1626 from mhbliao/hliao/master/fix.windows.build
...
Use portable macro for deprecation message.
[ROCm/hip commit: 32a67c9f14 ]
2019-11-05 10:08:43 -08:00
Evgeny Mankov
15406422fe
Merge pull request #1595 from emankov/cmake
...
[HIP][cmake] Simplify `UNIX` related code (the beginning)
[ROCm/hip commit: 56fc294968 ]
2019-11-05 20:30:55 +03:00
Michael LIAO
e250f76183
Use portable macro for deprecation message.
...
[ROCm/hip commit: a7f311cc14 ]
2019-11-05 11:51:00 -05:00
Evgeny Mankov
b4a30a0c7a
Merge pull request #1624 from emankov/hipify
...
[HIPIFY][#1409 ] Fix for kernel launch macro expansion
[ROCm/hip commit: a601670cb7 ]
2019-11-05 14:04:51 +03:00
Evgeny Mankov
5a9c8168c1
[HIPIFY][ #1409 ] Fix for kernel launch macro expansion
...
+ Add a corresponding test kernel_launch_01.cu
+ Add isBefore() check to avoid crash on Replacement with negative length
TODO:
+ Compatibility with former LLVM versions
+ More complicated kernel launch tests
[ROCm/hip commit: 976f8e8bf6 ]
2019-11-05 14:00:13 +03:00
Rahul Garg
1b8973d6be
Deprecate HIP Markers ( #1622 )
...
* Deprecate HIP markers
* Deprecate profiler start/stop
[ROCm/hip commit: 54fab7c35c ]
2019-11-05 12:32:59 +05:30
Aryan Salmanpour
2ad99a6326
[hip][tests] Add two more workgroup sizes for testing hipLaunchCooperativeKernel ( #1613 )
...
[ROCm/hip commit: cf92fae9e6 ]
2019-11-05 12:32:46 +05:30
Jeff Daily
cdf59d1de0
fix race condition in hipEventRecord
...
[ROCm/hip commit: 6fcff0118e ]
2019-11-04 17:45:11 +00:00
Alex Voicu
e87f9b1944
Separate volatile for clarity. Handle assignment.
...
[ROCm/hip commit: 99b9d5449f ]
2019-11-02 22:02:08 +02:00
Evgeny Mankov
92cd5e4fc1
[HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
...
+ Pass it to clang if specified
[ROCm/hip commit: 02589264a0 ]
2019-11-02 14:19:31 +03:00
Alex Voicu
b30a42f678
Accessors should work even when oddly volatile.
...
[ROCm/hip commit: ee5097f2c2 ]
2019-11-01 22:18:01 +02:00
Evgeny Mankov
0e30eb9a31
Merge pull request #1615 from emankov/hipify
...
[HIPIFY][CUB][#1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
[ROCm/hip commit: a3f6e0eda4 ]
2019-11-01 14:35:55 +03:00
Evgeny Mankov
07cca5b6bb
[HIPIFY][CUB][ #1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
...
[ROCm/hip commit: c48fdefee8 ]
2019-11-01 14:34:18 +03:00
Rahul Garg
25a5ca94de
Merge pull request #1582 from amd-lthakur/hipExtMLK
...
Adding a directed test case for hipExtModuleLaunchKernel() api.
[ROCm/hip commit: 4739e68bbe ]
2019-10-31 17:13:26 -07:00
Rahul Garg
3e3cebe614
Merge pull request #1598 from lmoriche/master
...
Fix a code object memory corruption
[ROCm/hip commit: 782cf1c007 ]
2019-10-31 17:12:24 -07:00
Rahul Garg
75cf902cdb
Add stream
...
[ROCm/hip commit: 85d70086cb ]
2019-10-31 12:15:56 -04:00
Rahul Garg
73ca647852
Fix HIP init calls in hipMemcpy2DFromArray
...
[ROCm/hip commit: efe6fa86dc ]
2019-10-31 12:15:56 -04:00
Evgeny Mankov
0feee792b8
[HIPIFY][cmake][ #1572 ] Fix: Do not override CMAKE_INSTALL_PREFIX
...
Affects building with HIP, standalone building is not changed
[ROCm/hip commit: f563772a25 ]
2019-10-31 16:55:06 +03:00
Rahul Garg
b68c8d2f60
Formatting changes
...
[ROCm/hip commit: 55f2a38120 ]
2019-10-30 18:12:51 -07:00
Rahul Garg
8429e15052
Formatting changes ,variable name and check update
...
[ROCm/hip commit: 4ab71216b4 ]
2019-10-30 18:09:21 -07:00
Rahul Garg
7e742b1216
Merge pull request #1515 from ansurya/tex_unbind_issue_fix
...
Fix undefined ref to hipUnbindTexture for texture types
[ROCm/hip commit: ba8105e0cd ]
2019-10-30 17:54:15 -07:00
Laurent Morichetti
1056ca35dc
Addressed review comments
...
Change comment "must exceed" to "must be no shorter than"
move the std::string instead of creating a copy
[ROCm/hip commit: 91748f4e6c ]
2019-10-30 13:14:41 -07:00
Evgeny Mankov
cee5e37f57
Merge pull request #1593 from emankov/doc
...
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
[ROCm/hip commit: 77962371e7 ]
2019-10-30 22:10:05 +03:00
Michael LIAO
2bff0748cd
[HIP] Correct headers and add missing function templates for hip-clang.
...
- Fix 2 runtime API prototypes
`hipOccupancyMaxActiveBlocksPerMultiprocessor` and
`hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags`
- Add missing function templates of them in hip-clang.
[ROCm/hip commit: 5c8a7521f4 ]
2019-10-29 22:00:11 -04:00
Rahul Garg
2102b59eb6
Merge pull request #1602 from ROCm-Developer-Tools/revert-1560-satyanveshd/hipoccupy
...
Revert "Cooperative groups match with cuda SWDEV-205006"
[ROCm/hip commit: 4d04baf0cd ]
2019-10-29 16:54:36 -07:00
Evgeny Mankov
1ca3744948
[HIPIFY][ #1603 ] Fix
...
[ROCm/hip commit: 389b5ec957 ]
2019-10-29 22:10:36 +03:00
Rahul Garg
70449cfa92
Revert "Fix occupany APIs ( #1560 )"
...
This reverts commit 4f23f9cb18 .
[ROCm/hip commit: e4a1e44162 ]
2019-10-29 11:41:08 -07:00
Evgeny Mankov
bce3beed0c
[HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
...
[Reason] It doesn't work with LLVM 9 and higher; Windows is fine
[ROCm/hip commit: 85087644da ]
2019-10-29 20:53:54 +03:00
Evgeny Mankov
0fd46e00cb
[HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
...
* LLVM < 9 continues using --cuda-host-only
[ROCm/hip commit: 3f2eefa82a ]
2019-10-29 19:42:53 +03:00
Evgeny Mankov
933568681c
[HIPIFY] cudaMemcpy2DFromArray(Async) support
...
[ROCm/hip commit: 315a10a59d ]
2019-10-29 19:12:42 +03:00
Laurent Morichetti
86dd262e9b
Fix a code object memory corruption
...
The lifetime of the buffer given to
hsa_code_object_reader_create_from_memory must exceed that of the
code object reader. We need to create a copy of the code object
binary memory (file) that is kept allocated until the code object
reader is destroyed.
[ROCm/hip commit: 7473140a76 ]
2019-10-29 08:23:57 -07:00
Evgeny Mankov
b7d8b4d148
[HIP][cmake] Simplify UNIX related code (the beginning)
...
[REASONS]
1. Make OS-dependent code more clear and readable
2. To ease Windows support
[ROCm/hip commit: d58b70d8a0 ]
2019-10-28 23:22:27 +03:00
Evgeny Mankov
fa39151e3b
[HIP][doc] NVIDIA-nvcc -> HIP-nvcc
...
[ROCm/hip commit: 3a4165779a ]
2019-10-28 22:46:33 +03:00
Evgeny Mankov
d75d979d31
[HIP][doc] AMD-hcc -> HIP-hcc
...
[ROCm/hip commit: 46b164c17a ]
2019-10-28 21:41:12 +03:00
Evgeny Mankov
995348aecf
[HIP][doc] Fix typo: AMD-clang -> HIP-clang
...
HIP-clang is already used below instead of AMD-clang
[ROCm/hip commit: 06d9e426e0 ]
2019-10-28 21:19:21 +03:00
Evgeny Mankov
a2c162f85e
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
...
[REASON]
Those vars (may) used by cmake in subdirectories (#1571 )
[ROCm/hip commit: b089d905c6 ]
2019-10-28 21:07:00 +03:00
Evgeny Mankov
17fd872099
[HIPIFY][tests] Rename the ambiguous call as well
...
[ROCm/hip commit: 70c5072302 ]
2019-10-25 16:07:31 +03:00
Evgeny Mankov
536376b341
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
...
[ROCm/hip commit: 0410d5dcd2 ]
2019-10-25 16:04:20 +03:00
Anusha Godavarthy Surya
5c77b7d19a
Merge branch 'master' into tex_unbind_issue_fix
...
[ROCm/hip commit: 03623cc3f1 ]
2019-10-25 15:54:25 +05:30
amd-lthakur
5e11495936
Excluded the test case for nvcc platform
...
[ROCm/hip commit: 4239c94fe5 ]
2019-10-25 15:52:11 +05:30
Anusha Godavarthy Surya
196bdea9c0
merge from master
...
[ROCm/hip commit: 5f47e99ffe ]
2019-10-25 15:52:09 +05:30
Alex Voicu
8460793117
Add missing operators, fix GCC compilation. ( #1589 )
...
[ROCm/hip commit: dabd939048 ]
2019-10-25 15:44:24 +05:30
Alex Voicu
2e9868d597
Fix deadlock, remove old __sync_* use. ( #1584 )
...
This fixes a deadlock introduced by the switch to TTAS loops, and is therefore mildly urgent (to prevent the CI from hoovering in the broken code).
[ROCm/hip commit: a855a13c22 ]
2019-10-25 15:44:17 +05:30