コミットグラフ

3199 コミット

作成者 SHA1 メッセージ 日付
Maneesh Gupta 4fdba912db Merge pull request #853 from ROCm-Developer-Tools/hip-module-hash-fix
Introduce hash key to HIP module implementation

[ROCm/clr commit: d3a5502f17]
2019-01-09 16:28:50 +05:30
Wen-Heng (Jack) Chung 2f64c76789 Introduce hash key to HIP module implementation
A hash calculated via FNV-1a algorithm is introduced in ihipModule_t, the
internal of hipModule_t. The hash is used by HIP module APIs such as

- read_agent_global_from_module

to determine whether the agent-scope globals for a module have been iterated.

This commit fixes one issue that applications which load / unload modules
frequently would occasionally fail. After deep investigation of the issue it
turns out the old implementation in read_agent_global_from_module uses
hipModule_t as the key, which is not robust enough, as hipModule_t instances
are allocated dynamically so there are cases that one memory address may be
used by multiple hipModule_t instances. The real solution is to introduce a
uniquely identifiable hash for the code object associated with the HIP module.
And that's the rationale behind this commit.


[ROCm/clr commit: 6e68d44220]
2019-01-08 17:33:40 +00:00
Maneesh Gupta d41c852742 Merge pull request #845 from mangupta/cleanup-install-markdown
Update INSTALL.md

[ROCm/clr commit: cf7dc26851]
2019-01-07 10:04:36 +05:30
Maneesh Gupta 7b45721b1b Merge pull request #701 from ssahasra/stream-tests
introduce tests for asynchronous streams

[ROCm/clr commit: 03e1058666]
2019-01-07 10:04:29 +05:30
Maneesh Gupta e7195f7dc2 Update hipAsynchronousStreams.cpp
This test needs to be excluded from nvcc platform

[ROCm/clr commit: 79251fc4e2]
2019-01-07 09:26:21 +05:30
Evgeny Mankov 6cd0f04a63 Merge pull request #852 from emankov/master
[HIPIFY][fix][#851] make_absolute only in case of relative path

[ROCm/clr commit: 225a29f7bd]
2019-01-06 18:25:05 +03:00
emankov 5f15b656e0 [HIPIFY][fix][#851] make_absolute only in case of relative path
Otherwise return the input path.


[ROCm/clr commit: 61593291fd]
2019-01-06 17:58:37 +03:00
Evgeny Mankov 8141e3e86b Merge pull request #850 from emankov/master
[HIPIFY][fix][#849] Create temporary file based on source file name only

[ROCm/clr commit: c82dadcd0e]
2019-01-05 15:16:14 +03:00
emankov 32711768a4 [HIPIFY][fix][#849] Create temporary file based on source file name only
[ROCm/clr commit: 50ba722a2d]
2019-01-05 14:52:15 +03:00
Evgeny Mankov b5fc0474b6 Merge pull request #848 from emankov/master
[HIPIFY][cmake] Change min supported version of cmake to 3.12.3

[ROCm/clr commit: e9f18f2bb4]
2019-01-04 19:12:14 +03:00
emankov beab4005b1 [HIPIFY][cmake] Change min supported version of cmake to 3.12.3
[Reason]
CUDA 10 is supported by cmake since 3.12.3.


[ROCm/clr commit: 728fb14ab9]
2019-01-04 19:10:11 +03:00
Evgeny Mankov 9204bc2f6c Merge pull request #847 from emankov/master
[HIPIFY][cmake][win] check for Visual Studio version

[ROCm/clr commit: 51d74f447f]
2019-01-04 18:57:39 +03:00
emankov bfd355c8eb [HIPIFY][cmake][win] check for Visual Studio version
hipify-clang could be built by Visual Studio 14 2015 or higher.

[Reason]
hipify-clang contains c++14 features which are not supported even by the latest VS 2013 and the Platform Toolset "Visual C++ Compiler Nov 2013 CTP".


[ROCm/clr commit: 207bbad778]
2019-01-04 18:52:28 +03:00
Sameer Sahasrabuddhe 0a99432a01 introduce test for asynchronous streams
This test is identical to the HCC tests under
tests/Unit/OCKL/AsyncStreams, but only for synchronous operation.


[ROCm/clr commit: 39574dba4c]
2019-01-04 05:51:50 -05:00
Maneesh Gupta 9c8a2c36fa Update INSTALL.md
- Improved install instructions for AMD-hcc and NVIDIA-nvcc.
- Removed redundant section related to HCC-Native-GCN-ISA.

Change-Id: I33a9e71d95df51dbdcade4a48834840146bdbe6f


[ROCm/clr commit: 92a556c20a]
2019-01-04 11:04:47 +05:30
Evgeny Mankov c111c40f4d Merge pull request #844 from emankov/master
[HIPIFY][tests] CUDA 7.0 is also supported

[ROCm/clr commit: 9145eb9e95]
2019-01-04 01:34:14 +03:00
emankov 0207ebf990 [HIPIFY][tests] CUDA 7.0 is also supported
+ Exclude 1 test for CUDA 7.0
+ Update README.md


[ROCm/clr commit: 9808b22864]
2019-01-04 01:32:30 +03:00
Evgeny Mankov a274c5ed81 Merge pull request #843 from emankov/master
[HIPIFY][tests] Exclude CUDA 8.0 (9.0) tests if CUDA < 8.0 (9.0)

[ROCm/clr commit: 944c3220d1]
2019-01-03 18:08:48 +03:00
emankov e8d2dab718 [HIPIFY][tests] Exclude CUDA 8.0 (9.0) tests if CUDA < 8.0 (9.0)
[ROCm/clr commit: 852bc8d545]
2019-01-03 18:06:49 +03:00
Evgeny Mankov a30093486c Merge pull request #842 from emankov/master
[HIPIFY][tests] Reduce cudaRegister.cu test

[ROCm/clr commit: 1cb0dcad94]
2019-01-03 17:19:11 +03:00
emankov 18ce5086c5 [HIPIFY][tests] Reduce cudaRegister.cu test
[ROCm/clr commit: 410f8f0b92]
2019-01-03 17:17:21 +03:00
Maneesh Gupta 97cab36288 Merge pull request #841 from ROCm-Developer-Tools/rebase-pr827-to-master
Fix issues with prior iteration of 731

[ROCm/clr commit: 0e32ecf980]
2019-01-01 20:22:57 +05:30
Alex Voicu cfed8191f1 Hook into the creaky lazy-reinit machinery. Try to minimise race-risk.
[ROCm/clr commit: 3470692998]
2019-01-01 11:01:16 +05:30
Alex Voicu 6e61270569 More blobs, more problems.
[ROCm/clr commit: b2f76123b1]
2019-01-01 11:01:07 +05:30
Alex Voicu eebc544c62 Start re-working 731 for 2.0.
[ROCm/clr commit: 3e8f198b58]
2019-01-01 11:00:57 +05:30
Maneesh Gupta 5b8b5b842b Merge pull request #818 from yxsamliu/fix-hip-config
Add default arguments for hipConfigureCall

[ROCm/clr commit: aa932d542a]
2019-01-01 10:25:25 +05:30
Yaxun Sam Liu b2d9783c2f Use __dparm for default parameter
[ROCm/clr commit: 57d1f6e6cb]
2018-12-31 13:07:19 -05:00
Maneesh Gupta 389ef21eca Merge pull request #821 from gargrahul/remove_redundancy_memset3d_test
[dtest] Remove redundant arguments from hipMemset3D test

[ROCm/clr commit: 7647bdb664]
2018-12-31 15:32:19 +05:30
Evgeny Mankov 971ebe710e Merge pull request #840 from emankov/master
[HIPIFY][fix] Change system includes order

[ROCm/clr commit: 02de25e60b]
2018-12-29 22:12:06 +03:00
emankov a8a2395057 [HIPIFY][fix] Change system includes order
'include/cuda_wrappers' should go first.


[ROCm/clr commit: 158b358c5b]
2018-12-29 22:10:00 +03:00
Evgeny Mankov 0cfdec8ab5 Merge pull request #839 from emankov/master
[HIPIFY] Add system includes for packaged hipify-clang to use it with…

[ROCm/clr commit: e93a97b14e]
2018-12-29 22:06:29 +03:00
emankov 4c5a332edc [HIPIFY] Add system includes for packaged hipify-clang to use it without the necessity of installing clang
Header files in those dirs are an integral part of clang, which includes wrapper headers for CUDA.
There is no need in checking those dirs.

[Reasons]
1. clang will not take into account empty or irrelevant include dirs;
2. Packages for different clang versions will have different header files in the above dirs.


[ROCm/clr commit: e9589e3330]
2018-12-29 21:53:18 +03:00
Evgeny Mankov 2ee0f87e03 Merge pull request #838 from emankov/master
[HIPIFY] Get rid of setting '--cuda-gpu-arch='

[ROCm/clr commit: 9718a42ebc]
2018-12-29 20:44:42 +03:00
emankov 170add36d8 [HIPIFY] Get rid of setting '--cuda-gpu-arch='
[Reasons]
+ We don't compile kernel code at least for now as HIP kernel syntax is almost equal CUDA's;
+ clang always includes PTX in its binaries, so e.g. a binary compiled with --cuda-gpu-arch= would be forwards-compatible with e.g. sm_35 GPUs.


[ROCm/clr commit: 6f6228cf36]
2018-12-29 19:48:28 +03:00
Evgeny Mankov 8e174dd2ed Merge pull request #837 from emankov/master
[HIPIFY] Starts implicitly setting '-x cuda' by hipify-clang itself

[ROCm/clr commit: d715658379]
2018-12-29 18:20:21 +03:00
emankov 843f936503 [HIPIFY] Starts implicitly setting '-x cuda' by hipify-clang itself
+ No need in setting '-x cuda' for the user anymore;
+ Testing and Readme updated accordingly.


[ROCm/clr commit: cea1e5a1a6]
2018-12-29 17:04:59 +03:00
Evgeny Mankov c0d92fae61 Merge pull request #836 from emankov/master
[HIPIFY][win] Set -std=c++14 if MSVC

[ROCm/clr commit: 2a460586d1]
2018-12-29 16:56:56 +03:00
emankov 1e3c49e8f2 [HIPIFY][win] Set -std=c++14 if MSVC
+ Due to latest Windows SDK implementation based on c++14 features, '-std+c++14' is a must;
+ Doesn't affect older versions of MSVC 2017, 2015 (the latest supported is MSVC 2015);
+ On linux still '-std=c++11' is a must;
+ Exclude '-std=' from lit config.


[ROCm/clr commit: b24e89b40f]
2018-12-29 15:51:06 +03:00
Evgeny Mankov 35bc69677a Merge pull request #834 from emankov/master
[HIPIFY] LLVM compatibility

[ROCm/clr commit: 323e22e5fc]
2018-12-28 15:13:36 +03:00
Maneesh Gupta a439be5ae6 Merge pull request #835 from mangupta/ci-remove-roc-1.9.x
[ci] Remove ROCm 1.9.x from test infrastructure

[ROCm/clr commit: c08ec456c8]
2018-12-28 17:07:19 +05:30
Maneesh Gupta 78235295f6 [ci] Remove ROCm 1.9.x from test infrastructure
Change-Id: I58cce7c60f6def22f337bbe15fab2620419e202b


[ROCm/clr commit: 61e0546120]
2018-12-28 11:23:52 +05:30
emankov 725f821629 [HIPIFY] LLVM compatibility
+ sys::fs::make_absolute fro LLVM < 5.0
+ sys::fs::real_path for LLVM >= 5.0


[ROCm/clr commit: 723c58afa4]
2018-12-28 01:34:35 +03:00
Evgeny Mankov cc8e3644f2 Merge pull request #833 from emankov/master
[HIPIFY] Use sys::fs::createTemporaryFile without file opening

[ROCm/clr commit: e068944154]
2018-12-27 20:16:09 +03:00
emankov 1ff4bf47c6 [HIPIFY] Use sys::fs::createTemporaryFile without file creation
[Reasons]
+ File is opened further by clang
+ There is a regression with LLVM 5.x, when system error occurs: temporary file couldn't be renamed - permission denied.


[ROCm/clr commit: 8f415def6f]
2018-12-27 17:00:18 +03:00
Maneesh Gupta 0a9a2d030b Merge pull request #817 from aaronenyeshi/fix-rn-intrinsics
Add *_rn functions back into HIP intrinsics

[ROCm/clr commit: 274db66ff2]
2018-12-27 16:07:27 +05:30
Evgeny Mankov 6a90c8f2c3 Merge pull request #831 from emankov/master
[HIPIFY] Introduce option -o-dir

[ROCm/clr commit: 8ffd7e5fa4]
2018-12-27 13:36:12 +03:00
emankov c65d6a9e7a [HIPIFY] Introduce option -o-dir
Option -o-dir for output directory:
  + if not specified source file(s) directory is used;
  + creates the directory if the directory doesn't exist (only one level in a tree);
  + if -o and -o-dir both are specified the hipified file is placed to "-o-dir" + "-o";
  + reports an error in case of a wrong directory specified, in case of necessity of creating a tree of subfolders, or in case of a filename specified.


[ROCm/clr commit: c65338fbf7]
2018-12-26 16:30:41 +03:00
Evgeny Mankov de7f819b49 Merge pull request #829 from emankov/master
[HIPIFY][#812] Finishing with temps

[ROCm/clr commit: 3cf4df4571]
2018-12-25 18:39:12 +03:00
emankov 0056b8ad20 [HIPIFY][#812] Finishing with temps
1. Option -temp-dir for temporary directory:
  + if not specified system temp is used;
  + creates the directory if the directory doesn't exist (only one level in a tree);
  + reports an error in case of a wrong directory specified, in case of necessity of creating a tree of subfolders, or in case of a filename specified.
2. Option -save-temps for preserving temporary files:
  + if specified temporary files are not being deleted from system temps and user temps as well.
3. Work with files in terms of calculated absolute paths by collapsing all '.' and '..' patterns, resolving symlinks and expanding '~' expression to the user's home directory:
  + to produce correct include paths;
  + to avoid possible errors on file routines.


[ROCm/clr commit: 7cac7dbfea]
2018-12-22 00:47:19 +03:00
Evgeny Mankov 5d3f13140a Merge pull request #825 from emankov/master
[HIPIFY][fix][#824] Fix the bug with -inplace option

[ROCm/clr commit: 33d4018c67]
2018-12-21 18:37:15 +03:00