Michael LIAO
cb55dc384a
[test] Use explicit cast for address space cast.
...
[ROCm/hip commit: 2746d95906 ]
2019-05-03 14:56:00 -04:00
Maneesh Gupta
f49a4c6d89
Merge pull request #1062 from mhbliao/hliao/master/icmp
...
[hip] Re-implement ballot using AMDGCN builtins
[ROCm/hip commit: 117bdd8774 ]
2019-05-03 17:48:19 +05:30
Maneesh Gupta
f42dc08384
Merge pull request #1058 from mhbliao/hliao/master/devfunc
...
[Device Function] Fix implementation
[ROCm/hip commit: 37d01a7da9 ]
2019-05-03 17:47:51 +05:30
Evgeny Mankov
12d4111e16
Merge pull request #1063 from emankov/master
...
[HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
[ROCm/hip commit: 4e09081554 ]
2019-04-30 17:40:05 +03:00
emankov
1f57ee6125
[HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
...
[ROCm/hip commit: e3082f5142 ]
2019-04-30 17:37:34 +03:00
Michael LIAO
e637e72364
[Device Function] Fix implementation of __bitinsert_u64
...
- It's a common mistake by assuming 1 << shamt would be promoted to
64-bit, if shamt is a 64-bit integer. That's not the case. Replace
that left shift to a 64-bit one to ensure it won't fall into undefined
behavior.
- Fix the host-side implementation as well for device function testing.
[ROCm/hip commit: 9bd2d5746d ]
2019-04-30 08:59:13 -04:00
Michael LIAO
cc4de2bc28
[devfunc] Re-implement ballot using AMDGCN builtins
...
- As the signature of `amdgcn.icmp` is changed for next-gen chip, using
clang builtins is portable way to hide that details.
[ROCm/hip commit: a64637da2c ]
2019-04-29 17:21:25 -04:00
Evgeny Mankov
bdb3fd30d5
Merge pull request #1060 from emankov/master
...
[HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
[ROCm/hip commit: 1639629f0a ]
2019-04-29 15:42:37 +03:00
Evgeny Mankov
2b16e01c66
[HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
...
+ tested with CUDA 9.0, 9,2, 10.0 and 10.1
[ROCm/hip commit: c0705f892b ]
2019-04-29 15:41:08 +03:00
Aaron Enye Shi
62e96cb4cf
Revert "Use COMgr to read Kernel Args Metadata ( #1006 )"
...
This reverts commit 882006555b .
[ROCm/hip commit: a3d118eaa8 ]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi
c797e6e27b
Revert "Add COMGR relative path for build machines"
...
This reverts commit 01a11c94f8 .
[ROCm/hip commit: 48701ad4ba ]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi
3585159140
Revert "Add dependency on amd_comgr in hip-config-*.cmake.in"
...
This reverts commit b9526364bb .
[ROCm/hip commit: 59a5965fe1 ]
2019-04-26 16:04:56 -04:00
Maneesh Gupta
b9526364bb
Add dependency on amd_comgr in hip-config-*.cmake.in
...
Change-Id: Iac1d851a8cfb99224e9c5926780273d9b9b08426
[ROCm/hip commit: ef99ffd9f4 ]
2019-04-25 15:26:33 -04:00
Evgeny Mankov
b841e50d9d
Merge pull request #1053 from emankov/master
...
[HIPIFY][perl][fix][258] Memory fence device functions are supported now
[ROCm/hip commit: c72ed8ac6d ]
2019-04-25 13:28:59 +03:00
Evgeny Mankov
d8f4eff1ef
[HIPIFY][perl][fix][258] Memory fence device functions are supported now
...
[ROCm/hip commit: abd1c53cf8 ]
2019-04-25 13:27:30 +03:00
Evgeny Mankov
0af20d6a32
Merge pull request #1051 from emankov/master
...
[HIPIFY][DNN] cudnnSetFilter4dDescriptor support
[ROCm/hip commit: 525d4158f8 ]
2019-04-25 12:20:09 +03:00
Evgeny Mankov
86d68901da
[HIPIFY][DNN] cudnnSetFilter4dDescriptor support
...
[ROCm/hip commit: 3fee0f3765 ]
2019-04-25 12:18:51 +03:00
Evgeny Mankov
940fab8caa
Merge pull request #1049 from emankov/master
...
[HIPIFY][fix][#204 ] Suppress warning message: #pragma once in main file
[ROCm/hip commit: a673df6388 ]
2019-04-24 20:37:28 +03:00
Evgeny Mankov
d51679ed8e
[HIPIFY][fix][ #204 ] Suppress warning message: #pragma once in main file
...
[ROCm/hip commit: 6d3c443234 ]
2019-04-24 20:35:52 +03:00
Evgeny Mankov
1f3622cfd9
Merge pull request #1048 from emankov/master
...
[HIPIFY][doc] Update README.md
[ROCm/hip commit: e67bde9108 ]
2019-04-24 18:04:14 +03:00
Evgeny Mankov
43839c780e
[HIPIFY][doc] Update README.md
...
+ A few words about clang patches to work with CUDA 9.2 - 10.0 on Windows;
+ Fix cuDNN versions with correct values.
[ROCm/hip commit: 4651dce3f0 ]
2019-04-24 17:40:35 +03:00
Maneesh Gupta
c36aa3a056
Merge pull request #1043 from mhbliao/hliao/master/fp16
...
[hip] Fix including of hip_fp16.h
[ROCm/hip commit: ffe9f86fe8 ]
2019-04-24 16:50:46 +05:30
Maneesh Gupta
7938012378
Merge pull request #1042 from mhbliao/hliao/master/ldg
...
[hip] Fix use of `__HIP_CLANG_ONLY__` in `hip_ldg.h`.
[ROCm/hip commit: de6c680767 ]
2019-04-24 16:50:37 +05:30
Maneesh Gupta
7b5ed45968
Merge pull request #1040 from eshcherb/roctracer-hip-frontend-190422
...
hip_prof_api.h include under __cplusplus
[ROCm/hip commit: e489f7579a ]
2019-04-24 16:50:27 +05:30
Maneesh Gupta
5235da375e
Merge pull request #1039 from gargrahul/fix_ptrgetattr_nvcc
...
Fix hipPointerGetAttributes for NVCC
[ROCm/hip commit: 2975221560 ]
2019-04-24 16:50:18 +05:30
Rahul Garg
c01236f679
Add hipMallocManaged default functional support ( #1036 )
...
* Add hipMallocManaged default functional support
* Fix build error
* Add dtest
[ROCm/hip commit: 2bc2c46d4d ]
2019-04-24 16:50:03 +05:30
Maneesh Gupta
4d9977c10b
Merge pull request #1034 from kpyzhov/master
...
Minor fixes for 64-bit device functions.
[ROCm/hip commit: a016777acb ]
2019-04-24 16:49:36 +05:30
Maneesh Gupta
72c7388422
Merge pull request #1031 from yxsamliu/fix-init
...
Fix missing arg in HIP_INIT_API
[ROCm/hip commit: 8dc8c58ddd ]
2019-04-24 16:49:23 +05:30
Maneesh Gupta
b17642bb10
Merge pull request #1028 from gargrahul/fix_d2d_async_test
...
[dtest] Fix D2DAsync test
[ROCm/hip commit: eed5928007 ]
2019-04-24 16:49:13 +05:30
Aaron Enye Shi
01a11c94f8
Add COMGR relative path for build machines
...
[ROCm/hip commit: 920fe246d7 ]
2019-04-23 17:16:26 -04:00
Evgeny Mankov
1db546d108
Merge pull request #1045 from emankov/master
...
[HIPIFY][doc] Provide patches for clang's bug 38811
[ROCm/hip commit: 6dc1165259 ]
2019-04-23 21:15:33 +03:00
Evgeny Mankov
3fb77dbef5
[HIPIFY][doc] Provide patches for clang's bug 38811
...
+ Update Readme.md accordingly
[ROCm/hip commit: 57931b3056 ]
2019-04-23 21:13:00 +03:00
Evgeny Mankov
488ae670c7
Merge pull request #1044 from emankov/master
...
[HIPIFY][hipify-perl] Formatting
[ROCm/hip commit: 3402f9110b ]
2019-04-23 18:30:38 +03:00
Evgeny Mankov
a2b6b1e7e9
[HIPIFY][hipify-perl] Formatting
...
[ROCm/hip commit: defc6f8155 ]
2019-04-23 17:55:47 +03:00
Michael LIAO
6eec9d4ac8
[hip] Fix including of hip_fp16.h
...
- Separate the definition of `__HCC_OR_HIP_CLANG__`, `__HCC_ONLY__`, and
`__HIP_CLANG_ONLY__` into hip_common.h so that it could be included in
hip_fp16.h, which may be included separately in app.
[ROCm/hip commit: dc0d7bd5ce ]
2019-04-23 09:16:00 -04:00
Michael LIAO
27aed8c172
[hip] Fix use of __HIP_CLANG_ONLY__ in hip_ldg.h.
...
- Check its value instead of whether it's defined or not.
[ROCm/hip commit: 6fb07acc8c ]
2019-04-22 23:22:32 -04:00
Evgeny
17474d2b58
hip_prof_api.h include under __cplusplus
...
[ROCm/hip commit: af3f3ccb2b ]
2019-04-22 21:14:18 -05:00
Rahul Garg
be49602b29
Fix hipPointerGetAttributes for NVCC
...
[ROCm/hip commit: 69a3d6b72a ]
2019-04-23 03:22:25 +05:30
Konstantin Pyzhov
2e0c67bbd3
Fix for __popcll() device function implementation.
...
[ROCm/hip commit: beadaab661 ]
2019-04-19 08:53:22 -04:00
Yaxun (Sam) Liu
d4bce6c019
Fix missing arg in HIP_INIT_API
...
[ROCm/hip commit: bb5c620b13 ]
2019-04-18 16:18:31 -04:00
Konstantin Pyzhov
53b232fb2a
Fix for __ffsll() device functions.
...
[ROCm/hip commit: b7bd29924a ]
2019-04-18 13:07:24 -04:00
David Salinas
d4c72ad357
Revert "append the ELF flags for sram-ecc and xnack to the target triple per code object"
...
This reverts commit ee887dd6a2 .
[ROCm/hip commit: 5843530a06 ]
2019-04-18 11:49:40 -04:00
Rahul Garg
d0951a9c20
Fix D2DAsync test
...
[ROCm/hip commit: e5e7651a4a ]
2019-04-18 07:35:06 +05:30
Evgeny Mankov
a8763347a5
Merge pull request #1025 from emankov/master
...
[HIPIFY][SPARSE] cuSPARSE 10.1 support
[ROCm/hip commit: d95448dd26 ]
2019-04-16 15:01:19 +03:00
Evgeny Mankov
43d7714fe4
[HIPIFY][SPARSE] cuSPARSE 10.1 support
...
[ROCm/hip commit: e1c87d8cae ]
2019-04-16 14:59:44 +03:00
Evgeny Mankov
8da1b26ffd
Merge pull request #1024 from emankov/master
...
[HIPIFY][BLAS] cuBLAS 10.1 support
[ROCm/hip commit: 85de95386e ]
2019-04-16 12:54:18 +03:00
Evgeny Mankov
fec9f62e0f
[HIPIFY][BLAS] cuBLAS 10.1 support
...
[ROCm/hip commit: 032c3bf5b8 ]
2019-04-16 12:52:58 +03:00
Evgeny Mankov
6268243c74
Merge pull request #1023 from emankov/master
...
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
[ROCm/hip commit: ea389cd7b8 ]
2019-04-16 11:03:22 +03:00
Evgeny Mankov
a7e36937ca
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
...
[ROCm/hip commit: 1b36987c5f ]
2019-04-16 11:01:01 +03:00
Maneesh Gupta
f529a6af13
Merge pull request #995 from david-salinas/add_sram-ecc_and_xnack_flags_to_triple
...
Append the ELF flags for sram-ecc and xnack to the target triple per code object
[ROCm/hip commit: 8309632e2d ]
2019-04-16 09:10:04 +05:30