Граф коммитов

4189 Коммитов

Автор SHA1 Сообщение Дата
jiabaxie e73927caee Updated hipEnvVarDriver to work with Windows (#1614)
* Updated hipEnvVarDriver to work with Windows

* Cleaned up a bit of code

* Fixed a part where putenv was used for both win and linux

* Defines moved to test_common.h and cleaned up code

* Cleaned up some macro defines and used const char instead

* Got rid of some excess commenting

* directory paths are unconditional

* Cleaned some duplicate code, and variables are now declared and defined together
2019-11-18 12:18:29 +05:30
kjayapra-amd 0a68be8b5b [dtests] Fixing compilation error in saxpy.cpp (#1608) 2019-11-18 12:18:11 +05:30
Alex Voicu 69e74c3e96 Add support for extended launch syntax. (#1530)
* Add support for extended launch syntax.

* Add unit test.

* Fix typo

* hipExtLaunchKernelGGL lives in hip_ext.h

Change-Id: Ice32dab0d43475fda65c6a910c11416871a8f2ff

* [dtest] remove redundant include from hipModuleGetGlobal dtest
2019-11-16 22:24:07 -08:00
Maneesh Gupta f9fec8fdca [ci] Fix failures on nvcc path (#1654)
Change-Id: I97f6b51314e14965618ab510bf57a2b034183e78
2019-11-15 07:58:40 +05:30
Maneesh Gupta 2ca1a82036 [packaging] Conform to debian package naming rules (#1647)
* [packaging] Conform to debian package naming rules

* Add hip_hcc metapackage for backward compatibility
2019-11-14 11:40:34 +05:30
Evgeny Mankov 0a74d771b4 Merge pull request #1651 from emankov/hipify
[HIPIFY][doc] A few words about LLVM's FileCheck in README.md
2019-11-12 16:06:25 +03:00
Evgeny Mankov 16c8ad0f99 [HIPIFY][doc] A few words about LLVM's FileCheck in README.md 2019-11-12 16:04:28 +03:00
Maneesh Gupta dae8630592 Bump version to 3.0 (#1648)
Change-Id: Id1e6eb222080a339af938212e74a52c39c305eab
2019-11-10 20:23:58 -08:00
Rahul Garg d80917d147 Merge pull request #1643 from mhbliao/hliao/master/remove.redundant.decl
Remove redundant declarations.
2019-11-07 11:16:40 -08:00
Michael LIAO d6ff22510e Remove redundant declarations.
- The revised `hip/hip_ext.h` have that declarations.
2019-11-07 10:11:22 -05:00
Evgeny Mankov 3be0865cad Merge pull request #1642 from emankov/hipify
[HIPIFY] Clang style formatting
2019-11-07 12:49:52 +03:00
Evgeny Mankov 93bc9c9195 [HIPIFY] Clang style formatting 2019-11-07 12:48:05 +03:00
Evgeny Mankov 67f787e74a Merge pull request #1641 from emankov/hipify
[HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
2019-11-07 11:35:15 +03:00
Evgeny Mankov 0c07b6b421 [HIPIFY][fix] Delete canCompileHostAndDeviceInOneJob declaration as well 2019-11-07 11:32:53 +03:00
Evgeny Mankov c2c144ac04 [HIPIFY][fix] Abandon canCompileHostAndDeviceInOneJob check for --cuda-compile-host-device option
[Reason] It turned out that it is not so: 2 jobs are always
2019-11-07 11:30:40 +03:00
Evgeny Mankov 8f33397d61 Merge pull request #1640 from emankov/hipify
[HIPIFY] Reorder options propagation to clang
2019-11-07 11:23:38 +03:00
Evgeny Mankov 16a27213e4 [HIPIFY] Reorder options propagation to clang
[Reason] Leave overriding opportunity for -D, -std=c++XX, etc.
2019-11-07 11:21:06 +03:00
Jeff Daily e31e0ca12e General hipMemset improvements (#1495)
* hipMemset et al can use HSA API directly for synchronous cases

* lock and flush stream in hipMemset, hold lock until complete

* move hipMemset async check to front of conditional

* use hsa_amd_memory_fill for additional sync memset cases

code cleanup/review for all memset calls

* Fix inversion of execution mutating value.

* ihipMemsetSync fall back to kernel if HSA memset fails

* Never fallback, never surrender.

* Allow NULL stream.

* Optimise memset kernel. Remove deadwood.

* Update hip_memory.cpp

* Clean up stream logic in sync memset

* Revert "Clean up stream logic in sync memset"

This reverts commit 6117dedf673367f44cc704192573a117a3d92477.
2019-11-07 13:19:54 +05:30
Alex Voicu 5530c15cc3 Remove native vector support from the GCC case, since it never worked (#1637) 2019-11-07 13:19:14 +05:30
Sarbojit2019 45613311d7 [HIP] Fixed hipStreamAddCallback [SWDEV#165185] (#1425)
Fixed hipStreamAddCallback() as requested in SWDEV#165185
Added unit test to test the behavior
2019-11-07 13:18:12 +05:30
ansurya e07926ce0f Fixed texture 2D mapping for pitched arrays & 3D Texture read (#1415)
Texture 2D image mapping for pitched arrays:
github issue: Texture Object's Buffer seems to be Misaligned #886
JIRA ticket: SWDEV-199313

SWDEV-151670 : Fixed issue with 3D texture with 4 components
SWDEV-151671 : Issue with 2D layered texture with 4 components
2019-11-07 13:17:46 +05:30
Rahul Garg 579a4f36fa Rename hip/hip_hcc.h to hip/hip_ext.h (#1341)
* Rename hip/hip_hcc.h to hip/hip_ext.h

* Deprecate hip_hcc.h
2019-11-07 13:17:10 +05:30
Rahul Garg 2fa45c215a Merge pull request #1630 from ROCm-Developer-Tools/hotfix_volatile_accessors
__half2 should walk like CUDA and talk like CUDA
2019-11-06 10:25:03 -08:00
Rahul Garg 85446dbd6a Merge pull request #1620 from jeffdaily/hipEventRecord_patch
fix race condition in hipEventRecord
2019-11-06 10:23:24 -08:00
Evgeny Mankov 30c138eda2 Merge pull request #1636 from emankov/hipify-clang
[HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10
2019-11-06 19:27:40 +03:00
Evgeny Mankov 9ca4e96641 [HIPIFY][Win][fix] canCompileHostAndDeviceInOneJob is true only for LLVM >= 10 2019-11-06 19:25:42 +03:00
Evgeny Mankov 7875bb789b Merge pull request #1635 from emankov/hipify-clang
[HIPIFY][doc] Update README.md
2019-11-06 19:20:02 +03:00
Evgeny Mankov 96483d0de5 [HIPIFY][doc] Update README.md
+ Supported versions, testing
2019-11-06 19:18:13 +03:00
Jeff Daily 85080905c0 hipEventRecord only needs one lock; remove locked_eventIsReady 2019-11-06 15:56:32 +00:00
Jeff Daily 7986090d9c remove ihipEvent_t::refreshEventStatus(), new hipEventElapsedTime
This fixes possible races in hipEventElapsedTime.
2019-11-06 15:46:46 +00:00
Evgeny Mankov 514180b83c Merge pull request #1634 from emankov/hipify-clang
[HIPIFY][format] Apply clang style formatting in LLVMCompat
2019-11-06 14:46:45 +03:00
Evgeny Mankov 3b76fd0b1c [HIPIFY][format] Apply clang style formatting in LLVMCompat 2019-11-06 14:43:22 +03:00
Evgeny Mankov 89e4330890 Merge pull request #1633 from emankov/hipify-clang
[HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7
2019-11-06 14:32:47 +03:00
Evgeny Mankov c87ab05fc6 [HIPIFY] Add a compatibility for CharSourceRange if LLVM < 7 2019-11-06 14:30:39 +03:00
Alex Voicu b9faa9f8ae Remove leftover noise. 2019-11-06 02:46:21 +02:00
Alex Voicu e5bd00d06b __half2 should walk like CUDA and talk like CUDA 2019-11-06 02:43:04 +02:00
Rahul Garg 32a67c9f14 Merge pull request #1626 from mhbliao/hliao/master/fix.windows.build
Use portable macro for deprecation message.
2019-11-05 10:08:43 -08:00
Evgeny Mankov 56fc294968 Merge pull request #1595 from emankov/cmake
[HIP][cmake] Simplify `UNIX` related code (the beginning)
2019-11-05 20:30:55 +03:00
Michael LIAO a7f311cc14 Use portable macro for deprecation message. 2019-11-05 11:51:00 -05:00
Evgeny Mankov a601670cb7 Merge pull request #1624 from emankov/hipify
[HIPIFY][#1409] Fix for kernel launch macro expansion
2019-11-05 14:04:51 +03:00
Evgeny Mankov 976f8e8bf6 [HIPIFY][#1409] Fix for kernel launch macro expansion
+ Add a corresponding test kernel_launch_01.cu
+ Add isBefore() check to avoid crash on Replacement with negative length

TODO:
+ Compatibility with former LLVM versions
+ More complicated kernel launch tests
2019-11-05 14:00:13 +03:00
Rahul Garg 54fab7c35c Deprecate HIP Markers (#1622)
* Deprecate HIP markers

* Deprecate profiler start/stop
2019-11-05 12:32:59 +05:30
Aryan Salmanpour cf92fae9e6 [hip][tests] Add two more workgroup sizes for testing hipLaunchCooperativeKernel (#1613) 2019-11-05 12:32:46 +05:30
Jeff Daily 6fcff0118e fix race condition in hipEventRecord 2019-11-04 17:45:11 +00:00
Rahul Garg 815a601dcb Merge pull request #1618 from ROCm-Developer-Tools/hotfix_volatile_accessors
Separate volatile for clarity. Handle assignment.
2019-11-02 13:50:59 -07:00
Alex Voicu 99b9d5449f Separate volatile for clarity. Handle assignment. 2019-11-02 22:02:08 +02:00
Evgeny Mankov 728bd683b1 Merge pull request #1617 from emankov/hipify-clang
[HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
2019-11-02 14:21:21 +03:00
Evgeny Mankov 02589264a0 [HIPIFY] Introduce --cuda-gpu-arch as hipify-clang's option
+ Pass it to clang if specified
2019-11-02 14:19:31 +03:00
Rahul Garg 562da597a0 Merge pull request #1616 from ROCm-Developer-Tools/hotfix_volatile_accessors
Accessors should work even when oddly volatile.
2019-11-01 13:45:54 -07:00
Alex Voicu ee5097f2c2 Accessors should work even when oddly volatile. 2019-11-01 22:18:01 +02:00