kjayapra-amd
fffdd3da8d
[dtests] Fixing compilation error in saxpy.cpp ( #1608 )
...
[ROCm/clr commit: 99da905509 ]
2019-11-18 12:18:11 +05:30
Alex Voicu
f48b38b0ed
Add support for extended launch syntax. ( #1530 )
...
* Add support for extended launch syntax.
* Add unit test.
* Fix typo
* hipExtLaunchKernelGGL lives in hip_ext.h
Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff
* [dtest] remove redundant include from hipModuleGetGlobal dtest
[ROCm/clr commit: 355d0bdf95 ]
2019-11-16 22:24:07 -08:00
Maneesh Gupta
57225de63f
[ci] Fix failures on nvcc path ( #1654 )
...
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78
[ROCm/clr commit: 4d0a234183 ]
2019-11-15 07:58:40 +05:30
Maneesh Gupta
71be7baece
[packaging] Conform to debian package naming rules ( #1647 )
...
* [packaging] Conform to debian package naming rules
* Add hip_hcc metapackage for backward compatibility
[ROCm/clr commit: 5a03482690 ]
2019-11-14 11:40:34 +05:30
Evgeny Mankov
5cd24367d5
Merge pull request #1651 from emankov/hipify
...
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
[ROCm/clr commit: cf79be0f64 ]
2019-11-12 16:06:25 +03:00
Evgeny Mankov
ed11ec3fa9
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
...
[ROCm/clr commit: bb6f1648ec ]
2019-11-12 16:04:28 +03:00
Maneesh Gupta
ca9eb34000
Bump version to 3.0 ( #1648 )
...
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab
[ROCm/clr commit: 0a2682625f ]
2019-11-10 20:23:58 -08:00
Rahul Garg
c03f67dfbf
Merge pull request #1643 from mhbliao/hliao/master/remove.redundant.decl
...
Remove redundant declarations.
[ROCm/clr commit: 4c73950e31 ]
2019-11-07 11:16:40 -08:00
Michael LIAO
a10e227b56
Remove redundant declarations.
...
- The revised `hip/hip_ext.h` have that declarations.
[ROCm/clr commit: d28ad401c9 ]
2019-11-07 10:11:22 -05:00
Evgeny Mankov
cdff675646
Merge pull request #1642 from emankov/hipify
...
[HIPIFY] Clang style formatting
[ROCm/clr commit: 806ae1f43e ]
2019-11-07 12:49:52 +03:00
Evgeny Mankov
62b86d1c76
[HIPIFY] Clang style formatting
...
[ROCm/clr commit: 7f143585f6 ]
2019-11-07 12:48:05 +03:00
Evgeny Mankov
5561c2390d
Merge pull request #1641 from emankov/hipify
...
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
[ROCm/clr commit: 367261ea0e ]
2019-11-07 11:35:15 +03:00
Evgeny Mankov
db5f0b0237
[HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well
...
[ROCm/clr commit: 4d2424ad36 ]
2019-11-07 11:32:53 +03:00
Evgeny Mankov
be483c4b3e
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
...
[Reason] It turned out that it is not so: 2 jobs are always
[ROCm/clr commit: 21c3d25403 ]
2019-11-07 11:30:40 +03:00
Evgeny Mankov
e6de34770a
Merge pull request #1640 from emankov/hipify
...
[HIPIFY] Reorder options propagation to clang
[ROCm/clr commit: 6f4ada4196 ]
2019-11-07 11:23:38 +03:00
Evgeny Mankov
ba52f11231
[HIPIFY] Reorder options propagation to clang
...
[Reason] Leave overriding opportunity for -D, -std=c++XX, etc.
[ROCm/clr commit: f18e63b8a9 ]
2019-11-07 11:21:06 +03:00
Jeff Daily
bc5596cc74
General hipMemset improvements ( #1495 )
...
* hipMemset et al can use HSA API directly for synchronous cases
* lock and flush stream in hipMemset, hold lock until complete
* move hipMemset async check to front of conditional
* use hsa_amd_memory_fill for additional sync memset cases
code cleanup/review for all memset calls
* Fix inversion of execution mutating value.
* ihipMemsetSync fall back to kernel if HSA memset fails
* Never fallback, never surrender.
* Allow NULL stream.
* Optimise memset kernel. Remove deadwood.
* Update hip_memory.cpp
* Clean up stream logic in sync memset
* Revert "Clean up stream logic in sync memset"
This reverts commit 6117dedf673367f44cc704192573a117a3d92477.
[ROCm/clr commit: 854147e911 ]
2019-11-07 13:19:54 +05:30
Alex Voicu
44d2a3bb73
Remove native vector support from the GCC case, since it never worked ( #1637 )
...
[ROCm/clr commit: d38cc8efba ]
2019-11-07 13:19:14 +05:30
Sarbojit2019
4d0b7e65bb
[HIP] Fixed hipStreamAddCallback [SWDEV#165185] ( #1425 )
...
Fixed hipStreamAddCallback() as requested in SWDEV#165185
Added unit test to test the behavior
[ROCm/clr commit: 97fca3439d ]
2019-11-07 13:18:12 +05:30
ansurya
f5a92e5cb6
Fixed texture 2D mapping for pitched arrays & 3D Texture read ( #1415 )
...
Texture 2D image mapping for pitched arrays:
github issue: Texture Object's Buffer seems to be Misaligned #886
JIRA ticket: SWDEV-199313
SWDEV-151670 : Fixed issue with 3D texture with 4 components
SWDEV-151671 : Issue with 2D layered texture with 4 components
[ROCm/clr commit: dc8f556460 ]
2019-11-07 13:17:46 +05:30
Rahul Garg
cf648f7dbe
Rename hip/hip_hcc.h to hip/hip_ext.h ( #1341 )
...
* Rename hip/hip_hcc.h to hip/hip_ext.h
* Deprecate hip_hcc.h
[ROCm/clr commit: dfee3ae279 ]
2019-11-07 13:17:10 +05:30
Rahul Garg
7cde192635
Merge pull request #1630 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
__half2 should walk like CUDA and talk like CUDA
[ROCm/clr commit: d8e9a09064 ]
2019-11-06 10:25:03 -08:00
Rahul Garg
15c457396d
Merge pull request #1620 from jeffdaily/hipEventRecord_patch
...
fix race condition in hipEventRecord
[ROCm/clr commit: 5af7ca4445 ]
2019-11-06 10:23:24 -08:00
Evgeny Mankov
ffff62ed03
Merge pull request #1636 from emankov/hipify-clang
...
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
[ROCm/clr commit: c0f9c854a6 ]
2019-11-06 19:27:40 +03:00
Evgeny Mankov
c3a76d58da
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
...
[ROCm/clr commit: 97a0670d29 ]
2019-11-06 19:25:42 +03:00
Evgeny Mankov
ef205f5e80
Merge pull request #1635 from emankov/hipify-clang
...
[HIPIFY][doc] Update README.md
[ROCm/clr commit: d9016ab9b7 ]
2019-11-06 19:20:02 +03:00
Evgeny Mankov
d5b2d0fd63
[HIPIFY][doc] Update README.md
...
+ Supported versions, testing
[ROCm/clr commit: 3f737625ee ]
2019-11-06 19:18:13 +03:00
Jeff Daily
39a98df790
hipEventRecord only needs one lock; remove locked_eventIsReady
...
[ROCm/clr commit: b2af1820f5 ]
2019-11-06 15:56:32 +00:00
Jeff Daily
539d0c2ae3
remove ihipEvent_t::refreshEventStatus(), new hipEventElapsedTime
...
This fixes possible races in hipEventElapsedTime.
[ROCm/clr commit: 444122dfb9 ]
2019-11-06 15:46:46 +00:00
Evgeny Mankov
60cb8e4b56
Merge pull request #1634 from emankov/hipify-clang
...
[HIPIFY][format] Apply clang style formatting in LLVMCompat
[ROCm/clr commit: 2777b24abb ]
2019-11-06 14:46:45 +03:00
Evgeny Mankov
4ef69e128b
[HIPIFY][format] Apply clang style formatting in LLVMCompat
...
[ROCm/clr commit: 4cad5fab78 ]
2019-11-06 14:43:22 +03:00
Evgeny Mankov
564feedbd5
Merge pull request #1633 from emankov/hipify-clang
...
[HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7
[ROCm/clr commit: d0c5ab0e00 ]
2019-11-06 14:32:47 +03:00
Evgeny Mankov
bb93c0b039
[HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7
...
[ROCm/clr commit: 3dcae8760d ]
2019-11-06 14:30:39 +03:00
Alex Voicu
7efa901dfa
Remove leftover noise.
...
[ROCm/clr commit: 1df423165b ]
2019-11-06 02:46:21 +02:00
Alex Voicu
3ce73d1ef6
__half2 should walk like CUDA and talk like CUDA
...
[ROCm/clr commit: 55fd1363e2 ]
2019-11-06 02:43:04 +02:00
Rahul Garg
b854e0a392
Merge pull request #1626 from mhbliao/hliao/master/fix.windows.build
...
Use portable macro for deprecation message.
[ROCm/clr commit: 6833886449 ]
2019-11-05 10:08:43 -08:00
Evgeny Mankov
388181ada7
Merge pull request #1595 from emankov/cmake
...
[HIP][cmake] Simplify `UNIX` related code (the beginning)
[ROCm/clr commit: 7b1ca3dd91 ]
2019-11-05 20:30:55 +03:00
Michael LIAO
a1b2dcf87d
Use portable macro for deprecation message.
...
[ROCm/clr commit: 7ca43b98d1 ]
2019-11-05 11:51:00 -05:00
Evgeny Mankov
ecad81b794
Merge pull request #1624 from emankov/hipify
...
[HIPIFY][#1409 ] Fix for kernel launch macro expansion
[ROCm/clr commit: cf5d1caab3 ]
2019-11-05 14:04:51 +03:00
Evgeny Mankov
3889be3031
[HIPIFY][ #1409 ] Fix for kernel launch macro expansion
...
+ Add a corresponding test kernel_launch_01.cu
+ Add isBefore() check to avoid crash on Replacement with negative length
TODO:
+ Compatibility with former LLVM versions
+ More complicated kernel launch tests
[ROCm/clr commit: a1c380f38d ]
2019-11-05 14:00:13 +03:00
Rahul Garg
92393d6ecf
Deprecate HIP Markers ( #1622 )
...
* Deprecate HIP markers
* Deprecate profiler start/stop
[ROCm/clr commit: 8b3fce8069 ]
2019-11-05 12:32:59 +05:30
Aryan Salmanpour
92a06b531d
[hip][tests] Add two more workgroup sizes for testing hipLaunchCooperativeKernel ( #1613 )
...
[ROCm/clr commit: 7a3d592ed0 ]
2019-11-05 12:32:46 +05:30
Jeff Daily
f7e255c3c4
fix race condition in hipEventRecord
...
[ROCm/clr commit: 6c3479a155 ]
2019-11-04 17:45:11 +00:00
Rahul Garg
9d078409e3
Merge pull request #1618 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
Separate volatile for clarity. Handle assignment.
[ROCm/clr commit: 3b8a5cff2f ]
2019-11-02 13:50:59 -07:00
Alex Voicu
1a3d02fb08
Separate volatile for clarity. Handle assignment.
...
[ROCm/clr commit: ed0d6ec51e ]
2019-11-02 22:02:08 +02:00
Evgeny Mankov
8e369f9c7d
Merge pull request #1617 from emankov/hipify-clang
...
[HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
[ROCm/clr commit: 758ab23b8f ]
2019-11-02 14:21:21 +03:00
Evgeny Mankov
bc9690328f
[HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
...
+ Pass it to clang if specified
[ROCm/clr commit: 7142b884ab ]
2019-11-02 14:19:31 +03:00
Rahul Garg
91832b3b13
Merge pull request #1616 from ROCm-Developer-Tools/hotfix_volatile_accessors
...
Accessors should work even when oddly volatile.
[ROCm/clr commit: dc702f5ffe ]
2019-11-01 13:45:54 -07:00
Alex Voicu
5c2503f0bd
Accessors should work even when oddly volatile.
...
[ROCm/clr commit: 2d76dde05b ]
2019-11-01 22:18:01 +02:00
Evgeny Mankov
3b84d8e638
Merge pull request #1615 from emankov/hipify
...
[HIPIFY][CUB][#1460 ][perl] Add "cub::" namespace prefix support in hipify-perl as well
[ROCm/clr commit: 48bb6df7e2 ]
2019-11-01 14:35:55 +03:00