Evgeny Mankov
cabe0bb799
[HIPIFY][python] Initial support of hipify-python generation from hipify-clang
...
+ Only a generation of transformation map of CUDA entities is implemented.
+ 2 hipify-clang options are added: -python, -o-python-map-dir.
+ Explicitly set -roc option for cuda_to_hip_mappings.py generation.
+ Generated file already might be used by pytorch team.
[ROCm/hip commit: 6b370e7743 ]
2019-05-08 19:08:55 +03:00
Evgeny Mankov
aea6e34875
[HIPIFY][perl] Support of hipify-perl generation from hipify-clang: next steps
...
+ Generate transformation map sorted by entity type.
+ Add a generation of supported header files.
[ROCm/hip commit: 9ddc316fa7 ]
2019-05-08 15:25:06 +03:00
Maneesh Gupta
8af2fbb2fa
Merge pull request #1088 from ROCm-Developer-Tools/mangupta-patch-1
...
[ci] Enable tests on ROCm 2.4
[ROCm/hip commit: 450c7ab295 ]
2019-05-08 12:44:02 +05:30
Maneesh Gupta
816e86fc24
[ci] Enable tests on ROCm 2.4
...
[ROCm/hip commit: 07bd3ecfad ]
2019-05-08 12:07:33 +05:30
Evgeny Mankov
ef268f7268
Merge pull request #1085 from emankov/master
...
[HIPIFY][perl] Initial support of hipify-perl generation from hipify-clang
[ROCm/hip commit: 26d4677091 ]
2019-05-07 17:30:39 +03:00
Evgeny Mankov
b66ecc6596
[HIPIFY][perl] Initial support of hipify-perl generation from hipify-clang
...
+ Only a generation of transformation map of CUDA entities supported by HIP is implemented.
+ 3 hipify-clang options are added: -perl, -o-perl-map, -o-perl-map-dir.
+ OptionsParser mode is changed from OneOrMore to Optional to support hipify-perl generation without actual hipification.
+ Add explicit control of source files specification absence in case of no perl generation.
[ROCm/hip commit: 5a3d33a338 ]
2019-05-07 17:27:34 +03:00
Maneesh Gupta
049c0d84bb
Merge pull request #1082 from gargrahul/fix_hipmemcpy_symbol_nvcc
...
Fix symbol address issue on NVCC path
[ROCm/hip commit: 7264f6b64e ]
2019-05-07 16:17:01 +05:30
Maneesh Gupta
36d37296d5
Merge pull request #1081 from mangupta/swdev-181624
...
Implement hipExtGetLinkTypeAndHopCount for ROCm devices
[ROCm/hip commit: d5abe65668 ]
2019-05-07 16:15:41 +05:30
Maneesh Gupta
5f54b7f9a6
Merge pull request #1075 from mhbliao/hliao/master/test_fix2
...
[test] Add device variant of `std::declval`.
[ROCm/hip commit: f931152280 ]
2019-05-07 16:15:01 +05:30
Maneesh Gupta
13d2c76d3d
Merge pull request #1074 from mhbliao/hliao/master/test_fix
...
[test] Use explicit cast for address space cast.
[ROCm/hip commit: 1d4941e487 ]
2019-05-07 16:09:15 +05:30
Maneesh Gupta
fb8d225e25
Merge pull request #1073 from kpyzhov/multi-thread-device-test
...
hipMultiThreadDevice test: Reduced maximum number of created HIP stre…
[ROCm/hip commit: 98ab402fcb ]
2019-05-07 16:08:37 +05:30
Maneesh Gupta
d6e2d71d99
Merge pull request #1072 from kpyzhov/master
...
Refined hipSetDevice test.
[ROCm/hip commit: fea21dc6d5 ]
2019-05-07 16:07:36 +05:30
Maneesh Gupta
b6393eb059
Merge pull request #1069 from mhbliao/hliao/master/test_cleanup
...
[test] Remove unused common routines.
[ROCm/hip commit: 730763c817 ]
2019-05-07 16:02:57 +05:30
Maneesh Gupta
0527ac9038
Merge pull request #1068 from mhbliao/hliao/master/dev_vec_func
...
[devfunc] Add necessary `__device__` and `__host__` attributes.
[ROCm/hip commit: d82d6b499e ]
2019-05-07 16:01:48 +05:30
Yaxun (Sam) Liu
6f69a5d16e
Add documentation for supported clang options ( #1065 )
...
* Add documentation for supported clang options
* Fix typo
[ROCm/hip commit: 669d177079 ]
2019-05-07 15:59:40 +05:30
wkwchau
236834c679
Return hipErrorInsufficientDriver status when CPU device not found ( #1064 )
...
* Return hipErrorInsufficientDriver status when CPU device not found - no exception thrown
* Return hipErrorInsufficientDriver status when CPU device not found
[ROCm/hip commit: 29b3b46b42 ]
2019-05-07 15:58:25 +05:30
Maneesh Gupta
1690053091
Merge pull request #1061 from mhbliao/hliao/master/hipcc
...
[hip] Repace `--rpath` with `--rpath-link`
[ROCm/hip commit: 7eff09edad ]
2019-05-07 15:57:57 +05:30
Maneesh Gupta
892d8f8feb
Merge pull request #1054 from ssahasra/dry
...
minor cleanup: eliminate repetition
[ROCm/hip commit: 927fd0a4bc ]
2019-05-07 15:57:46 +05:30
Rahul Garg
d700f67b79
Fix symbol address issue on NVCC path
...
[ROCm/hip commit: e906bd69ce ]
2019-05-07 03:59:43 +05:30
Maneesh Gupta
f7eaff474b
Implement hipExtGetLinkTypeAndHopCount for ROCm devices
...
Change-Id: Ie5bb4f640ac6d189c7fceeab22627a7494fd10bd
[ROCm/hip commit: 035ef04e1f ]
2019-05-06 15:54:31 +05:30
Michael LIAO
d71327882c
[test] Add device variant of std::declval.
...
- Current clang disallows any invocation of wrong-side functions even
under context with type-inspection only. Work around that by adding a
variant of `std::decl` with `__device__` attribute.
[ROCm/hip commit: 4fbfb646f6 ]
2019-05-03 15:58:31 -04:00
Michael LIAO
cb55dc384a
[test] Use explicit cast for address space cast.
...
[ROCm/hip commit: 2746d95906 ]
2019-05-03 14:56:00 -04:00
Maneesh Gupta
f49a4c6d89
Merge pull request #1062 from mhbliao/hliao/master/icmp
...
[hip] Re-implement ballot using AMDGCN builtins
[ROCm/hip commit: 117bdd8774 ]
2019-05-03 17:48:19 +05:30
Maneesh Gupta
f42dc08384
Merge pull request #1058 from mhbliao/hliao/master/devfunc
...
[Device Function] Fix implementation
[ROCm/hip commit: 37d01a7da9 ]
2019-05-03 17:47:51 +05:30
Konstantin Pyzhov
ae391b3211
hipMultiThreadDevice test: Reduced maximum number of created HIP streams on Windows.
...
[ROCm/hip commit: e29c9d50a6 ]
2019-05-03 05:43:30 -04:00
Konstantin Pyzhov
45eb04ea5c
Refined hipSetDevice test.
...
[ROCm/hip commit: 069fb7e486 ]
2019-05-03 05:39:45 -04:00
Michael LIAO
b7ae0b6442
[test] Remove unused common routines.
...
- Remove used common routines, which variadic functions, not supported
feature, are used.
[ROCm/hip commit: 16a27e06ee ]
2019-05-02 12:34:07 -04:00
Michael LIAO
2c2b897902
[devfunc] Add necessary __device__ and __host__ attributes.
...
- Minor clean up to keep consistent function declaration.
[ROCm/hip commit: 44692978fe ]
2019-05-01 22:26:35 -04:00
Evgeny Mankov
12d4111e16
Merge pull request #1063 from emankov/master
...
[HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
[ROCm/hip commit: 4e09081554 ]
2019-04-30 17:40:05 +03:00
emankov
1f57ee6125
[HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
...
[ROCm/hip commit: e3082f5142 ]
2019-04-30 17:37:34 +03:00
Michael LIAO
e637e72364
[Device Function] Fix implementation of __bitinsert_u64
...
- It's a common mistake by assuming 1 << shamt would be promoted to
64-bit, if shamt is a 64-bit integer. That's not the case. Replace
that left shift to a 64-bit one to ensure it won't fall into undefined
behavior.
- Fix the host-side implementation as well for device function testing.
[ROCm/hip commit: 9bd2d5746d ]
2019-04-30 08:59:13 -04:00
Michael LIAO
cc4de2bc28
[devfunc] Re-implement ballot using AMDGCN builtins
...
- As the signature of `amdgcn.icmp` is changed for next-gen chip, using
clang builtins is portable way to hide that details.
[ROCm/hip commit: a64637da2c ]
2019-04-29 17:21:25 -04:00
Michael LIAO
448be14118
[hipcc] Repace --rpath with --rpath-link
...
- With `--rpath`, app's RUNPATH is set to the temporary library path
during build. Instead, `--rpath-link` should be used to leave RUNPATH
untouched. User should set LD_LIBRARY_PATH or system ld.so.conf to
proper setting.
[ROCm/hip commit: 7f8b80103d ]
2019-04-29 16:01:16 -04:00
Evgeny Mankov
bdb3fd30d5
Merge pull request #1060 from emankov/master
...
[HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
[ROCm/hip commit: 1639629f0a ]
2019-04-29 15:42:37 +03:00
Evgeny Mankov
2b16e01c66
[HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
...
+ tested with CUDA 9.0, 9,2, 10.0 and 10.1
[ROCm/hip commit: c0705f892b ]
2019-04-29 15:41:08 +03:00
Aaron Enye Shi
62e96cb4cf
Revert "Use COMgr to read Kernel Args Metadata ( #1006 )"
...
This reverts commit 882006555b .
[ROCm/hip commit: a3d118eaa8 ]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi
c797e6e27b
Revert "Add COMGR relative path for build machines"
...
This reverts commit 01a11c94f8 .
[ROCm/hip commit: 48701ad4ba ]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi
3585159140
Revert "Add dependency on amd_comgr in hip-config-*.cmake.in"
...
This reverts commit b9526364bb .
[ROCm/hip commit: 59a5965fe1 ]
2019-04-26 16:04:56 -04:00
Maneesh Gupta
b9526364bb
Add dependency on amd_comgr in hip-config-*.cmake.in
...
Change-Id: Iac1d851a8cfb99224e9c5926780273d9b9b08426
[ROCm/hip commit: ef99ffd9f4 ]
2019-04-25 15:26:33 -04:00
Sameer Sahasrabuddhe
c3db7b929a
minor cleanup: eliminate repetition
...
[ROCm/hip commit: abb9375707 ]
2019-04-25 20:41:16 +05:30
Evgeny Mankov
b841e50d9d
Merge pull request #1053 from emankov/master
...
[HIPIFY][perl][fix][258] Memory fence device functions are supported now
[ROCm/hip commit: c72ed8ac6d ]
2019-04-25 13:28:59 +03:00
Evgeny Mankov
d8f4eff1ef
[HIPIFY][perl][fix][258] Memory fence device functions are supported now
...
[ROCm/hip commit: abd1c53cf8 ]
2019-04-25 13:27:30 +03:00
Evgeny Mankov
0af20d6a32
Merge pull request #1051 from emankov/master
...
[HIPIFY][DNN] cudnnSetFilter4dDescriptor support
[ROCm/hip commit: 525d4158f8 ]
2019-04-25 12:20:09 +03:00
Evgeny Mankov
86d68901da
[HIPIFY][DNN] cudnnSetFilter4dDescriptor support
...
[ROCm/hip commit: 3fee0f3765 ]
2019-04-25 12:18:51 +03:00
Evgeny Mankov
940fab8caa
Merge pull request #1049 from emankov/master
...
[HIPIFY][fix][#204 ] Suppress warning message: #pragma once in main file
[ROCm/hip commit: a673df6388 ]
2019-04-24 20:37:28 +03:00
Evgeny Mankov
d51679ed8e
[HIPIFY][fix][ #204 ] Suppress warning message: #pragma once in main file
...
[ROCm/hip commit: 6d3c443234 ]
2019-04-24 20:35:52 +03:00
Evgeny Mankov
1f3622cfd9
Merge pull request #1048 from emankov/master
...
[HIPIFY][doc] Update README.md
[ROCm/hip commit: e67bde9108 ]
2019-04-24 18:04:14 +03:00
Evgeny Mankov
43839c780e
[HIPIFY][doc] Update README.md
...
+ A few words about clang patches to work with CUDA 9.2 - 10.0 on Windows;
+ Fix cuDNN versions with correct values.
[ROCm/hip commit: 4651dce3f0 ]
2019-04-24 17:40:35 +03:00
Maneesh Gupta
c36aa3a056
Merge pull request #1043 from mhbliao/hliao/master/fp16
...
[hip] Fix including of hip_fp16.h
[ROCm/hip commit: ffe9f86fe8 ]
2019-04-24 16:50:46 +05:30
Maneesh Gupta
7938012378
Merge pull request #1042 from mhbliao/hliao/master/ldg
...
[hip] Fix use of `__HIP_CLANG_ONLY__` in `hip_ldg.h`.
[ROCm/hip commit: de6c680767 ]
2019-04-24 16:50:37 +05:30