Evgeny Mankov
da100b21bc
Merge pull request #1421 from emankov/hipify-clang
...
[HIPIFY] Scripting related refactoring
[ROCm/clr commit: 8d4c80fc9c ]
2019-09-15 21:17:49 +03:00
Evgeny Mankov
9c4ec3eca2
[HIPIFY] Scripting related refactoring
...
+ Move generating hipify-perl and hipify-python into separate namespaces and files
+ Move getAbsoluteFilePath and getAbsoluteDirectoryPath to StringUtils
[ROCm/clr commit: db2fee682b ]
2019-09-15 21:13:56 +03:00
Evgeny Mankov
f5417d5f6d
Merge pull request #1420 from emankov/hipify-clang
...
[HIPIFY][perl] Sync with hipify-clang
[ROCm/clr commit: 32c6bae2f4 ]
2019-09-15 15:26:54 +03:00
Evgeny Mankov
7e629aa218
[HIPIFY][perl] Sync with hipify-clang
...
[ROCm/clr commit: 422403a3ff ]
2019-09-15 15:25:40 +03:00
Evgeny Mankov
96c7aef471
Merge pull request #1419 from emankov/hipify-clang
...
[HIPIFY][perl] Generate sub warnUnsupportedDeviceFunctions
[ROCm/clr commit: a5309b7bc0 ]
2019-09-15 15:20:13 +03:00
Evgeny Mankov
a58504a20f
[HIPIFY][perl] Generate sub warnUnsupportedDeviceFunctions
...
[ROCm/clr commit: b69f7afa67 ]
2019-09-15 15:18:58 +03:00
Evgeny Mankov
c395967d40
Merge pull request #1417 from emankov/master
...
[HIPIFY][doc] Update README.md
[ROCm/clr commit: d05caef113 ]
2019-09-13 18:58:22 +03:00
Evgeny Mankov
4ba645fc09
Merge pull request #1416 from gargrahul/HACK_hipFreeForManangedMalloc
...
[HACK] Temporary fix in hipFree for hipManagedMalloc use
[ROCm/clr commit: 8e2f68c2a0 ]
2019-09-13 18:57:21 +03:00
Evgeny Mankov
267555caff
[HIPIFY][doc] Update README.md
...
+ Adding -DLLVM_TARGETS_TO_BUILD="NVPTX" to LLVM's cmake command line for minimizing LLVM's build (tested)
+ Test logs update
+ Versions update
[ROCm/clr commit: 0416eaf12e ]
2019-09-13 18:56:48 +03:00
Rahul Garg
f79ffd587e
Fix typo and colon in comment
...
[ROCm/clr commit: d9179cbe70 ]
2019-09-13 03:09:01 +05:30
Rahul Garg
b7a06efb6c
[HACK] Temporary fix for hipFree for hipManagedMalloc
...
[ROCm/clr commit: 2e9b8e9d0d ]
2019-09-13 02:10:21 +05:30
Evgeny Mankov
6238b80b9d
Merge pull request #1414 from emankov/master
...
[HIPIFY] Add unsupported fp16 functions
[ROCm/clr commit: b0156df459 ]
2019-09-12 18:53:08 +03:00
Evgeny Mankov
ab1bbd7eca
[HIPIFY] Add unsupported fp16 functions
...
[ROCm/clr commit: 4b3889b47e ]
2019-09-12 18:51:25 +03:00
Evgeny Mankov
3cb5065590
Merge pull request #1413 from emankov/master
...
[HIPIFY] Add more unsupported device functions
[ROCm/clr commit: d8fe39ffd2 ]
2019-09-12 17:55:43 +03:00
Evgeny Mankov
76cdd624ef
[HIPIFY] Add more unsupported device functions
...
[ROCm/clr commit: ece8a71bad ]
2019-09-12 17:53:25 +03:00
Evgeny Mankov
ae66e0b3a1
Merge pull request #1410 from emankov/master
...
[HIPIFY][#1404 ] Fix: add cooperative_groups.h -> hip/hip_cooperative_groups.h
[ROCm/clr commit: 6a75382e86 ]
2019-09-11 18:13:05 +03:00
Evgeny Mankov
54ba179911
[HIPIFY][ #1404 ] Fix: add cooperative_groups.h -> hip/hip_cooperative_groups.h
...
[ROCm/clr commit: 905c6383db ]
2019-09-11 18:07:09 +03:00
Evgeny Mankov
38c128b941
Merge pull request #1406 from emankov/master
...
[HIPIFY][#1400 ] Fix Template Instantiation kernel launch (clang & perl)
[ROCm/clr commit: ede7e2a094 ]
2019-09-11 15:49:46 +03:00
Evgeny Mankov
4cfb3aa7b7
[HIPIFY][ #1400 ] Fix Template Instantiation kernel launch (clang & perl)
...
+ Enclose template instantiation kernel names in parentheses and leave regular kernel names unchanged (hipify-perl doesn't handle cases with macros).
+ Fix corresponding tests.
PS. hipify-perl cannot correctly handle the following cases because it does not expand macros, so hipify-clang should be used instead:
#define KERNEL_NAME_MACRO axpy<float>
#define KERNEL_CALL_MACRO axpy<float><<<1, 2>>>
#define KERNEL_ARG_LIST_MACRO a, x, y
// CUDA:
KERNEL_NAME_MACRO<<<1, 2>>>(KERNEL_ARG_LIST_MACRO);
KERNEL_CALL_MACRO(KERNEL_ARG_LIST_MACRO);
// hipify-perl:
hipLaunchKernelGGL(KERNEL_NAME_MACRO, dim3(1), dim3(2), 0, 0, KERNEL_ARG_LIST_MACRO);
KERNEL_CALL_MACRO(KERNEL_ARG_LIST_MACRO);
// hipify-clang:
hipLaunchKernelGGL((KERNEL_NAME_MACRO), dim3(1), dim3(2), 0, 0, KERNEL_ARG_LIST_MACRO);
hipLaunchKernelGGL((axpy<float>), dim3(1), dim3(2), 0, 0, KERNEL_ARG_LIST_MACRO);
[ROCm/clr commit: 4e3845be83 ]
2019-09-10 15:59:06 +03:00
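The commit message above does not say why the template instantiation is wrapped in parentheses. One plausible reason, assuming hipLaunchKernelGGL is expanded by the preprocessor as a macro, is that a comma inside a template argument list would otherwise be treated as a macro argument separator. The sketch below illustrates that effect in plain C++ only: LAUNCH_SKETCH and axpy2 are hypothetical names invented for this example and are not HIP's or hipify's definitions.

```cpp
// Hypothetical stand-in for a launch macro (NOT HIP's real definition):
// it simply forwards its arguments to a callable.
#define LAUNCH_SKETCH(kernel, grid, block, ...) kernel(grid, block, __VA_ARGS__)

// Hypothetical "kernel" with two template parameters, so that its
// instantiation name contains a comma: axpy2<float, int>.
template <typename T, typename U>
constexpr int axpy2(int grid, int block, T a, U x) {
    return grid * block + static_cast<int>(a) + static_cast<int>(x);
}

constexpr int launch_with_parentheses() {
    // The parentheses keep "axpy2<float, int>" together as ONE macro argument.
    // Without them the preprocessor would split at the comma and pass
    // "axpy2<float" and "int>" as two separate arguments, which does not compile.
    return LAUNCH_SKETCH((axpy2<float, int>), 1, 2, 3.0f, 4);
}

// 1 * 2 + 3 + 4 == 9, checked at compile time.
static_assert(launch_with_parentheses() == 9, "parenthesized launch should evaluate");
```

With a single template argument such as axpy<float> there is no comma to split on, so the parentheses are not strictly required in that case; applying them to every template instantiation is simply the uniform choice.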
Aditya Lad
4225ba4c8a
Replacement of RENAME command in CMakeLists.txt ( #1402 )
...
[ROCm/clr commit: 4a53718e10 ]
2019-09-10 08:35:16 +00:00
Evgeny Mankov
452b53fa21
Merge pull request #1398 from emankov/master
...
[HIPIFY] Add device functions support
[ROCm/clr commit: 861dada8ea ]
2019-09-07 19:00:00 +03:00
Evgeny Mankov
755d1a9ea6
[HIPIFY] Add device functions support
...
+ Add a corresponding matcher cudaDeviceFuncCall that matches only __device__ or __global__ functions, not __host__ functions.
+ Add a corresponding device functions mapping:
only unsupported functions are listed, because supported ones are identical to their CUDA counterparts and need no transformation;
FindAndReplace for device functions is kept separate from host API calls.
+ Add a test to distinguish device functions from user-defined ones.
[ROCm/clr commit: aec10d6ee1 ]
2019-09-06 18:34:12 +03:00
Jeff Daily
42c49b6ba9
fix bug where HIP_DB=1 seg faults at startup ( #1388 )
...
[ROCm/clr commit: 3d7c146296 ]
2019-09-05 10:04:19 +00:00
mhbliao
b55c3dba08
[hip] Stop using noduplicate and replace it with convergent. ( #1390 )
...
[ROCm/clr commit: d511c3e363 ]
2019-09-05 10:03:43 +00:00
Paul Fultz II
a61aaacbca
Use hcc when building with hcc compiler ( #1225 )
...
* Use hcc when building with hcc compiler
* Fix misnamed variable
* Don't set HIP_RUNTIME
[ROCm/clr commit: d359cd4449 ]
2019-09-05 10:03:04 +00:00
Evgeny Mankov
d99539e82d
Merge pull request #1395 from emankov/master
...
[HIPIFY][cmake][#1394 ] Fix
[ROCm/clr commit: 93bf378a56 ]
2019-09-05 12:12:08 +03:00
Evgeny Mankov
b6c6ecab67
[HIPIFY][cmake][ #1394 ] Fix
...
[#1394 ] cmake's VERSION_GREATER_EQUAL was introduced in CMake 3.7
[ROCm/clr commit: a847247d9a ]
2019-09-05 12:08:56 +03:00
Evgeny Mankov
17b2943d8a
Merge pull request #1393 from emankov/master
...
[HIPIFY][perl] Code cleanup (preparation for generating)
[ROCm/clr commit: 4dc98fb754 ]
2019-09-05 11:54:10 +03:00
Evgeny Mankov
8dfefa1f9a
[HIPIFY][perl] Code cleanup (preparation for generating)
...
[ROCm/clr commit: f114a79462 ]
2019-09-05 11:52:59 +03:00
Evgeny Mankov
af968ced14
Merge pull request #1391 from emankov/master
...
[HIPIFY][perl] Sync hipify-perl with hipify-clang
[ROCm/clr commit: 2563e782c5 ]
2019-09-05 09:07:32 +03:00
Evgeny Mankov
a9d5b1e29e
[HIPIFY][perl] Sync hipify-perl with hipify-clang
...
Sync by means of semi-automatic generation: hipify-clang -perl
[ROCm/clr commit: 0ca6525d90 ]
2019-09-04 17:07:45 +03:00
Evgeny Mankov
076988c31b
Merge pull request #1389 from emankov/master
...
[HIPIFY][perl] Code cleanup and formatting
[ROCm/clr commit: 9df5971e0d ]
2019-09-04 16:39:12 +03:00
Evgeny Mankov
73e71db792
[HIPIFY][perl] Code cleanup and formatting
...
[ROCm/clr commit: 7ab9afe513 ]
2019-09-04 16:37:26 +03:00
Evgeny Mankov
4da6af254e
Merge pull request #1387 from emankov/master
...
[HIPIFY][perl][#259 ] Fix
[ROCm/clr commit: 9af6190ec9 ]
2019-09-04 16:11:16 +03:00
Evgeny Mankov
790ab87461
[HIPIFY][perl][ #259 ] Fix
...
empty<<<1, 2>>> ( ); >> hipLaunchKernelGGL(empty, dim3(1), dim3(2), 0, 0);
empty<<<1, 2, 0>>>(); >>
empty<<<1, 2, 0, 0>>>(); >>
instead of erroneous: >> hipLaunchKernelGGL((empty), dim3(1), dim3(2), 0, 0, );
[ROCm/clr commit: e26ec02a99 ]
2019-09-03 16:44:20 +03:00
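The fix above concerns kernels launched with an empty argument list, where the generated call must not end in a dangling ", )". The following is a minimal sketch of that idea only, not hipify-perl's actual implementation (which is a Perl script): make_launch_sketch is a hypothetical helper that appends the argument list, with its separating comma, only when the list is non-blank. The fixed "0, 0" for shared memory and stream mirrors the output shown in the commit message.

```cpp
#include <string>

// Hypothetical helper (an assumption for illustration, not hipify code):
// builds the text of a hipLaunchKernelGGL call from already-parsed pieces.
std::string make_launch_sketch(const std::string& kernel,
                               const std::string& grid,
                               const std::string& block,
                               const std::string& args) {
    std::string out = "hipLaunchKernelGGL(" + kernel +
                      ", dim3(" + grid + "), dim3(" + block + "), 0, 0";

    // Trim surrounding whitespace so that "( )" counts as an empty list.
    const std::string::size_type first = args.find_first_not_of(" \t");
    if (first != std::string::npos) {
        const std::string::size_type last = args.find_last_not_of(" \t");
        // Emit the comma only when there is at least one real argument.
        out += ", " + args.substr(first, last - first + 1);
    }
    return out + ");";
}
```

Calling make_launch_sketch("empty", "1", "2", " ") yields the comma-free form from the commit message, while a non-empty list such as "a, x, y" is appended after a single separating comma.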
Yaxun (Sam) Liu
732c9829c1
Do not include cuda wrappers for OMP for hip-clang ( #1382 )
...
[ROCm/clr commit: e72346dc24 ]
2019-09-03 05:13:59 +00:00
Rahul Garg
33af62d66e
Revert "Using HSA API for hipMemsetAsync ( #1346 )" ( #1381 )
...
This reverts commit 4313f00563.
[ROCm/clr commit: d433f6fb58 ]
2019-09-03 05:13:46 +00:00
Nicholas Malaya
85c3f035e4
Fix Broken Link in hip_porting_guide ( #1376 )
...
The link to the math library equivalents between CUDA and HIP is broken. These equivalents are a key resource when converting to AMD hardware. This fix corrects the broken link and moves the library equivalents under the "Porting a New Cuda Project" header.
[ROCm/clr commit: 796b8aafd7 ]
2019-09-03 05:13:35 +00:00
Sarbojit2019
e0f1719c1b
Removed hipLaunchKernel macro that was missed in merge ( #1374 )
...
[ROCm/clr commit: 4d270775c8 ]
2019-09-03 05:13:07 +00:00
Evgeny Mankov
1352d53c1a
Merge pull request #1385 from emankov/master
...
[HIPIFY][doc] Update README.md (testing, versions)
[ROCm/clr commit: efd2da477b ]
2019-09-02 18:20:44 +03:00
Evgeny Mankov
88a6eadbb6
[HIPIFY][doc] Update README.md (testing, versions)
...
[ROCm/clr commit: d9b5e2f74a ]
2019-09-02 18:18:43 +03:00
Evgeny Mankov
cedd77510f
Merge pull request #1384 from emankov/master
...
[HIPIFY][tests] Add occupancy test
[ROCm/clr commit: 321c0f2a9e ]
2019-09-02 17:56:16 +03:00
Evgeny Mankov
12efe906e7
[HIPIFY][tests] Add occupancy test
...
[ROCm/clr commit: 608331250e ]
2019-09-02 17:54:06 +03:00
Evgeny Mankov
a6948d3c9d
Merge pull request #1383 from emankov/hipify-clang
...
[HIPIFY][fix] Fix for upcoming LLVM 10.0
[ROCm/clr commit: 3001fb4c09 ]
2019-09-01 18:16:24 +03:00
Evgeny Mankov
46373a3631
[HIPIFY][fix] Fix for upcoming LLVM 10.0
...
[ROCm/clr commit: 8eb7d53cec ]
2019-09-01 18:15:03 +03:00
Rahul Garg
41e5c9fad7
Merge pull request #1378 from ssahasra/no-as
...
remove obsolete test for OCKL Asynchronous Streams
[ROCm/clr commit: 30b7557d55 ]
2019-08-30 19:05:37 -07:00
Sameer Sahasrabuddhe
929b6d5f34
remove obsolete test for OCKL Asynchronous Streams
...
The implementation for OCKL AS was recently removed from the device
library since that feature is now superseded by hostcall.
[ROCm/clr commit: 68c133bd78 ]
2019-08-30 20:41:29 +05:30
Evgeny Mankov
1a420bb40a
Merge pull request #1375 from emankov/hipify-clang
...
[HIPIFY][DNN] cuDNN 7.6.3 support
[ROCm/clr commit: a5cc936a2d ]
2019-08-30 17:26:29 +03:00
Evgeny Mankov
56b05d61e1
[HIPIFY][DNN] cuDNN 7.6.3 support
...
[ROCm/clr commit: a0a98790f0 ]
2019-08-29 19:55:38 +03:00
Evgeny Mankov
2b87d21227
Merge pull request #1372 from emankov/master
...
[HIPIFY][#207 ][fix] Translate all preprocessor's conditional blocks
[ROCm/clr commit: 451721a821 ]
2019-08-29 10:30:05 +03:00