Evgeny Mankov
4ba645fc09
Merge pull request #1416 from gargrahul/HACK_hipFreeForManangedMalloc
...
[HACK] Temporary fix in hipFree for hipManagedMalloc use
[ROCm/clr commit: 8e2f68c2a0 ]
2019-09-13 18:57:21 +03:00
Rahul Garg
f79ffd587e
Fix typo and colon in comment
...
[ROCm/clr commit: d9179cbe70 ]
2019-09-13 03:09:01 +05:30
Rahul Garg
b7a06efb6c
[HACK] Temporary fix for hipFree for hipManagedMalloc
...
[ROCm/clr commit: 2e9b8e9d0d ]
2019-09-13 02:10:21 +05:30
Evgeny Mankov
6238b80b9d
Merge pull request #1414 from emankov/master
...
[HIPIFY] Add unsupported fp16 functions
[ROCm/clr commit: b0156df459 ]
2019-09-12 18:53:08 +03:00
Evgeny Mankov
ab1bbd7eca
[HIPIFY] Add unsupported fp16 functions
...
[ROCm/clr commit: 4b3889b47e ]
2019-09-12 18:51:25 +03:00
Evgeny Mankov
3cb5065590
Merge pull request #1413 from emankov/master
...
[HIPIFY] Add more unsupported device functions
[ROCm/clr commit: d8fe39ffd2 ]
2019-09-12 17:55:43 +03:00
Evgeny Mankov
76cdd624ef
[HIPIFY] Add more unsupported device functions
...
[ROCm/clr commit: ece8a71bad ]
2019-09-12 17:53:25 +03:00
Evgeny Mankov
ae66e0b3a1
Merge pull request #1410 from emankov/master
...
[HIPIFY][#1404 ] Fix: add cooperative_groups.h -> hip/hip_cooperative_groups.h
[ROCm/clr commit: 6a75382e86 ]
2019-09-11 18:13:05 +03:00
Evgeny Mankov
54ba179911
[HIPIFY][ #1404 ] Fix: add cooperative_groups.h -> hip/hip_cooperative_groups.h
...
[ROCm/clr commit: 905c6383db ]
2019-09-11 18:07:09 +03:00
Evgeny Mankov
38c128b941
Merge pull request #1406 from emankov/master
...
[HIPIFY][#1400 ] Fix Template Instantiation kernel launch (clang & perl)
[ROCm/clr commit: ede7e2a094 ]
2019-09-11 15:49:46 +03:00
Evgeny Mankov
4cfb3aa7b7
[HIPIFY][ #1400 ] Fix Template Instantiation kernel launch (clang & perl)
...
+ Enclose template instantiation kernel calls into round brackets, leave regular kernel names unchanged (hipify-perl doesn't handle cases with macros).
+ Fix corresponding tests.
PS. hipify-perl couldn't handle correctly the following cases due to macros expansion disability, thus hipify-clang should be used instead:
#define KERNEL_NAME_MACRO axpy<float>
#define KERNEL_CALL_MACRO axpy<float><<<1, 2>>>
#define KERNEL_ARG_LIST_MACRO a, x, y
// CUDA:
KERNEL_NAME_MACRO<<<1, 2>>>(KERNEL_ARG_LIST_MACRO);
KERNEL_CALL_MACRO(KERNEL_ARG_LIST_MACRO);
// hipify-perl:
hipLaunchKernelGGL(KERNEL_NAME_MACRO, dim3(1), dim3(2), 0, 0, KERNEL_ARG_LIST_MACRO);
KERNEL_CALL_MACRO(KERNEL_ARG_LIST_MACRO);
// hipify-clang:
hipLaunchKernelGGL((KERNEL_NAME_MACRO), dim3(1), dim3(2), 0, 0, KERNEL_ARG_LIST_MACRO);
hipLaunchKernelGGL((axpy<float>), dim3(1), dim3(2), 0, 0, KERNEL_ARG_LIST_MACRO);
[ROCm/clr commit: 4e3845be83 ]
2019-09-10 15:59:06 +03:00
Aditya Lad
4225ba4c8a
Replacment of RENAME command in CMakeLists.txt ( #1402 )
...
[ROCm/clr commit: 4a53718e10 ]
2019-09-10 08:35:16 +00:00
Evgeny Mankov
452b53fa21
Merge pull request #1398 from emankov/master
...
[HIPIFY] Add device functions support
[ROCm/clr commit: 861dada8ea ]
2019-09-07 19:00:00 +03:00
Evgeny Mankov
755d1a9ea6
[HIPIFY] Add device functions support
...
+ Add a corresponding matcher cudaDeviceFuncCall to match only (__device__ or __global__) and not __host__ functions.
+ Add a corresponding device functions mapping:
only unsupported are listed, cause supported are exactly the same as of CUDA and do not need transformation;
make FindAndReplace for device functions separated from host API calls.
+ Add a test to distinguish device functions and user-defined.
[ROCm/clr commit: aec10d6ee1 ]
2019-09-06 18:34:12 +03:00
Jeff Daily
42c49b6ba9
fix bug where HIP_DB=1 seg faults at startup ( #1388 )
...
[ROCm/clr commit: 3d7c146296 ]
2019-09-05 10:04:19 +00:00
mhbliao
b55c3dba08
[hip] Stop using noduplicate and replace it with convergent. ( #1390 )
...
[ROCm/clr commit: d511c3e363 ]
2019-09-05 10:03:43 +00:00
Paul Fultz II
a61aaacbca
Use hcc when building with hcc compiler ( #1225 )
...
* Use hcc when building with hcc compiler
* Fix misnamed variable
* Dont set HIP_RUNTIME
[ROCm/clr commit: d359cd4449 ]
2019-09-05 10:03:04 +00:00
Evgeny Mankov
d99539e82d
Merge pull request #1395 from emankov/master
...
[HIPIFY][cmake][#1394 ] Fix
[ROCm/clr commit: 93bf378a56 ]
2019-09-05 12:12:08 +03:00
Evgeny Mankov
b6c6ecab67
[HIPIFY][cmake][ #1394 ] Fix
...
[#1394 ] cmake's VERSION_GREATER_EQUAL introduced in 3.7
[ROCm/clr commit: a847247d9a ]
2019-09-05 12:08:56 +03:00
Evgeny Mankov
17b2943d8a
Merge pull request #1393 from emankov/master
...
[HIPIFY][perl] Code cleanup (preparation for generating)
[ROCm/clr commit: 4dc98fb754 ]
2019-09-05 11:54:10 +03:00
Evgeny Mankov
8dfefa1f9a
[HIPIFY][perl] Code cleanup (preparation for generating)
...
[ROCm/clr commit: f114a79462 ]
2019-09-05 11:52:59 +03:00
Evgeny Mankov
af968ced14
Merge pull request #1391 from emankov/master
...
[HIPIFY][perl] Sync hipify-perl with hipify-clang
[ROCm/clr commit: 2563e782c5 ]
2019-09-05 09:07:32 +03:00
Evgeny Mankov
a9d5b1e29e
[HIPIFY][perl] Sync hipify-perl with hipify-clang
...
Sync by means of semi-automatic generation: hipify-clang -perl
[ROCm/clr commit: 0ca6525d90 ]
2019-09-04 17:07:45 +03:00
Evgeny Mankov
076988c31b
Merge pull request #1389 from emankov/master
...
[HIPIFY][perl] Code cleanup and formatting
[ROCm/clr commit: 9df5971e0d ]
2019-09-04 16:39:12 +03:00
Evgeny Mankov
73e71db792
[HIPIFY][perl] Code cleanup and formatting
...
[ROCm/clr commit: 7ab9afe513 ]
2019-09-04 16:37:26 +03:00
Evgeny Mankov
4da6af254e
Merge pull request #1387 from emankov/master
...
[HIPIFY][perl][#259 ] Fix
[ROCm/clr commit: 9af6190ec9 ]
2019-09-04 16:11:16 +03:00
Evgeny Mankov
790ab87461
[HIPIFY][perl][ #259 ] Fix
...
empty<<<1, 2>>> ( ); >> hipLaunchKernelGGL(empty, dim3(1), dim3(2), 0, 0);
empty<<<1, 2, 0>>>(); >>
empty<<<1, 2, 0, 0>>>(); >>
instead of erroneous: >> hipLaunchKernelGGL((empty), dim3(1), dim3(2), 0, 0, );
[ROCm/clr commit: e26ec02a99 ]
2019-09-03 16:44:20 +03:00
Yaxun (Sam) Liu
732c9829c1
Do not include cuda wappers for OMP for hip-clang ( #1382 )
...
[ROCm/clr commit: e72346dc24 ]
2019-09-03 05:13:59 +00:00
Rahul Garg
33af62d66e
Revert "Using HSA API for hipMemsetAsync ( #1346 )" ( #1381 )
...
This reverts commit 4313f00563 .
[ROCm/clr commit: d433f6fb58 ]
2019-09-03 05:13:46 +00:00
Nicholas Malaya
85c3f035e4
Fix Broken Link in hip_porting_guide ( #1376 )
...
The math library equivalents between CUDA-HIP are broken. This is a key feature for converting to AMD hardware. This fix corrects the broken link and moves the library equivalents to sit under the "Porting a New Cuda Project" header.
[ROCm/clr commit: 796b8aafd7 ]
2019-09-03 05:13:35 +00:00
Sarbojit2019
e0f1719c1b
Removed hipLaunchKernel macro got missed in Merge ( #1374 )
...
[ROCm/clr commit: 4d270775c8 ]
2019-09-03 05:13:07 +00:00
Evgeny Mankov
1352d53c1a
Merge pull request #1385 from emankov/master
...
[HIPIFY][doc] Update README.md (testing, versions)
[ROCm/clr commit: efd2da477b ]
2019-09-02 18:20:44 +03:00
Evgeny Mankov
88a6eadbb6
[HIPIFY][doc] Update README.md (testing, versions)
...
[ROCm/clr commit: d9b5e2f74a ]
2019-09-02 18:18:43 +03:00
Evgeny Mankov
cedd77510f
Merge pull request #1384 from emankov/master
...
[HIPIFY][tests] Add occupancy test
[ROCm/clr commit: 321c0f2a9e ]
2019-09-02 17:56:16 +03:00
Evgeny Mankov
12efe906e7
[HIPIFY][tests] Add occupancy test
...
[ROCm/clr commit: 608331250e ]
2019-09-02 17:54:06 +03:00
Evgeny Mankov
a6948d3c9d
Merge pull request #1383 from emankov/hipify-clang
...
[HIPIFY][fix] Fix for upcoming LLVM 10.0
[ROCm/clr commit: 3001fb4c09 ]
2019-09-01 18:16:24 +03:00
Evgeny Mankov
46373a3631
[HIPIFY][fix] Fix for upcoming LLVM 10.0
...
[ROCm/clr commit: 8eb7d53cec ]
2019-09-01 18:15:03 +03:00
Rahul Garg
41e5c9fad7
Merge pull request #1378 from ssahasra/no-as
...
remove obsolete test for OCKL Asynchronous Streams
[ROCm/clr commit: 30b7557d55 ]
2019-08-30 19:05:37 -07:00
Sameer Sahasrabuddhe
929b6d5f34
remove obsolete test for OCKL Asynchronous Streams
...
The implementation for OCKL AS was recently removed from the device
library since that feature is now superseded by hostcall.
[ROCm/clr commit: 68c133bd78 ]
2019-08-30 20:41:29 +05:30
Evgeny Mankov
1a420bb40a
Merge pull request #1375 from emankov/hipify-clang
...
[HIPIFY][DNN] cuDNN 7.6.3 support
[ROCm/clr commit: a5cc936a2d ]
2019-08-30 17:26:29 +03:00
Evgeny Mankov
56b05d61e1
[HIPIFY][DNN] cuDNN 7.6.3 support
...
[ROCm/clr commit: a0a98790f0 ]
2019-08-29 19:55:38 +03:00
Evgeny Mankov
2b87d21227
Merge pull request #1372 from emankov/master
...
[HIPIFY][#207 ][fix] Translate all preprocessor's conditional blocks
[ROCm/clr commit: 451721a821 ]
2019-08-29 10:30:05 +03:00
Yaxun (Sam) Liu
e5a61a51de
Fix sample 11_texture_driver for hip-clang ( #1370 )
...
[ROCm/clr commit: 793b18c0b1 ]
2019-08-29 01:03:20 +00:00
Yaxun (Sam) Liu
0a81b64a19
Do not change opt level for -g for hip-clang ( #1369 )
...
Currently hipcc uses -O3 for hip-clang by default but uses -O0 if -g is used. This
causes surprise for users since -g should not affect default opt level.
[ROCm/clr commit: 19a9b15306 ]
2019-08-29 01:03:11 +00:00
Sarbojit2019
15b80a759d
Updated hipErrorString and CUDAErrorTohipError ( #1365 )
...
[ROCm/clr commit: b5ff9e71ed ]
2019-08-29 01:02:59 +00:00
Sarbojit2019
4d23e35532
[HIP] Reclaiming hipLaunchKernel API ( #1353 )
...
* [HIP] Reclaiming hipLaunchKernel API
* Reclaiming hipLaunchKernel : Incorporated review comments
* Incorporated review comments
* Removed hipLaunchKernel Macro from nvcc path
[ROCm/clr commit: 1ae43cbeba ]
2019-08-29 01:02:41 +00:00
satyanveshd
0e4701d59f
[sample] add new cookbook sample - occupancy ( #1352 )
...
* occupancy.cpp with Makefile
* occupancy sample changes according tothe comments
* Changes according to the review comments
* Occupancy Sample Changes
* Changes according to review comments
[ROCm/clr commit: d2df21e58c ]
2019-08-29 01:01:49 +00:00
mshivama
757aa67a43
Device side support for Cooperative Group feature ( #1202 )
...
* first cut of the header implementation of cooperative group feature
* add diclarations for device library functions
* fixed various compile time issues in the CG headers
* enabled copy construction and copy assignment
* fixed a minor bug related to conditional compilation macro
* fixed few more CG constructor issues and added a unit testcase
* fixed typo
* extended unit testcase
* compute size of partitioned CG from mask
* bit of code refactoring
* removed boilerplate code
* fixed few of the review comments by Brian
* Changes to the sigantures of few grid and multi-grid related OCKL functions
* changes to declarations of OCKL functions related to CG feature
* removed all the block level support as it is not planned for 2.9
* Have taken care of review comments by Brian
* Have taken care of review comments by Brian
* removed unused functions which were initially intended to use in block level cg support
[ROCm/clr commit: f74903851e ]
2019-08-29 01:01:25 +00:00
Evgeny Mankov
65ba03dd4c
[HIPIFY][ #207 ][fix] Translate all preprocessor's conditional blocks
...
+ Start to translate preprocessor's false conditional blocks too:
based on clang's https://reviews.llvm.org/D66597 ;
available only starting from LLVM 10.0 or trunk.
+ Option -skip-excluded-preprocessor-conditional-blocks for skipping excluded conditional blocks:
the default behavior for hipify-clang built with LLVM < 10.0;
false by default for hipify-clang built with LLVM 10 or trunk.
+ Add 4 preprocessor unit tests, 2 of which are LLVM 10.0 only
+ Update couple of existing tests by setting -skip-excluded-preprocessor-conditional-blocks option:
update lit testing accordingly
[ROCm/clr commit: 32f22a0117 ]
2019-08-28 21:17:35 +03:00
Evgeny Mankov
bd631e815b
Merge pull request #1371 from emankov/master
...
[HIPIFY][cmake] Minor refactoring
[ROCm/clr commit: 7b9f2a8471 ]
2019-08-28 20:42:41 +03:00