Commit Graph

3513 Commits

Author SHA1 Message Date
Rahul Garg 620a07102d Maintain HIP_VISIBLE_DEVICES for kernel launch 2019-05-07 05:09:02 +05:30
Maneesh Gupta 117bdd8774 Merge pull request #1062 from mhbliao/hliao/master/icmp
[hip] Re-implement ballot using AMDGCN builtins
2019-05-03 17:48:19 +05:30
Maneesh Gupta 37d01a7da9 Merge pull request #1058 from mhbliao/hliao/master/devfunc
[Device Function] Fix implementation
2019-05-03 17:47:51 +05:30
Evgeny Mankov 4e09081554 Merge pull request #1063 from emankov/master
[HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
2019-04-30 17:40:05 +03:00
emankov e3082f5142 [HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example 2019-04-30 17:37:34 +03:00
Michael LIAO 9bd2d5746d [Device Function] Fix implementation of __bitinsert_u64
- A common mistake is to assume that 1 << shamt is promoted to 64-bit
  when shamt is a 64-bit integer. It is not. Replace that left shift
  with a 64-bit one so that it cannot fall into undefined behavior.
- Fix the host-side implementation as well for device function testing.
2019-04-30 08:59:13 -04:00
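
The shift pitfall described in the commit message above can be sketched as follows. This is a minimal illustration, not the HIP implementation: `low_mask` and `insert_bits` are hypothetical helpers, and the sketch assumes `width < 64` and `offset + width <= 64`. In C++ the result type of a shift is the promoted type of the left operand, so `1 << shamt` is computed as `int` no matter how wide `shamt` is.

```cpp
#include <cstdint>

// Hypothetical helper (not from HIP): mask with the low `width` bits set.
// Assumes width < 64. Writing `(1 << width) - 1` instead would shift an
// `int`, which is undefined behavior once width reaches the width of int.
constexpr std::uint64_t low_mask(unsigned width) {
    return (std::uint64_t{1} << width) - 1;  // the shift is done in 64 bits
}

// Hypothetical bit-insert: replace `width` bits of `dst`, starting at bit
// `offset`, with the low `width` bits of `src`.
// Assumes offset + width <= 64.
constexpr std::uint64_t insert_bits(std::uint64_t dst, std::uint64_t src,
                                    unsigned offset, unsigned width) {
    return (dst & ~(low_mask(width) << offset)) |
           ((src << offset) & (low_mask(width) << offset));
}

// Fields that lie above bit 31 only work because the mask is 64 bits wide.
static_assert(insert_bits(0, 0xFF, 40, 8) == 0x0000FF0000000000ULL,
              "byte inserted at bit 40");
static_assert(insert_bits(~0ULL, 0, 32, 4) == 0xFFFFFFF0FFFFFFFFULL,
              "nibble cleared at bit 32");
```

Because the helpers are `constexpr`, a shift that would be undefined is rejected at compile time when it is evaluated in a `static_assert`, which makes this kind of mistake easier to catch in a host-side test.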
Michael LIAO a64637da2c [devfunc] Re-implement ballot using AMDGCN builtins
- As the signature of `amdgcn.icmp` is changed for the next-gen chip,
  using clang builtins is a portable way to hide that detail.
2019-04-29 17:21:25 -04:00
Evgeny Mankov 1639629f0a Merge pull request #1060 from emankov/master
[HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
2019-04-29 15:42:37 +03:00
Evgeny Mankov c0705f892b [HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
+ tested with CUDA 9.0, 9.2, 10.0 and 10.1
2019-04-29 15:41:08 +03:00
Aaron Enye Shi a3d118eaa8 Revert "Use COMgr to read Kernel Args Metadata (#1006)"
This reverts commit 8a548bf40b.
2019-04-26 16:04:56 -04:00
Aaron Enye Shi 48701ad4ba Revert "Add COMGR relative path for build machines"
This reverts commit 920fe246d7.
2019-04-26 16:04:56 -04:00
Aaron Enye Shi 59a5965fe1 Revert "Add dependency on amd_comgr in hip-config-*.cmake.in"
This reverts commit ef99ffd9f4.
2019-04-26 16:04:56 -04:00
Maneesh Gupta ef99ffd9f4 Add dependency on amd_comgr in hip-config-*.cmake.in
Change-Id: Iac1d851a8cfb99224e9c5926780273d9b9b08426
2019-04-25 15:26:33 -04:00
Evgeny Mankov c72ed8ac6d Merge pull request #1053 from emankov/master
[HIPIFY][perl][fix][258] Memory fence device functions are supported now
2019-04-25 13:28:59 +03:00
Evgeny Mankov abd1c53cf8 [HIPIFY][perl][fix][258] Memory fence device functions are supported now 2019-04-25 13:27:30 +03:00
Evgeny Mankov 525d4158f8 Merge pull request #1051 from emankov/master
[HIPIFY][DNN] cudnnSetFilter4dDescriptor support
2019-04-25 12:20:09 +03:00
Evgeny Mankov 3fee0f3765 [HIPIFY][DNN] cudnnSetFilter4dDescriptor support 2019-04-25 12:18:51 +03:00
Evgeny Mankov a673df6388 Merge pull request #1049 from emankov/master
[HIPIFY][fix][#204] Suppress warning message: #pragma once in main file
2019-04-24 20:37:28 +03:00
Evgeny Mankov 6d3c443234 [HIPIFY][fix][#204] Suppress warning message: #pragma once in main file 2019-04-24 20:35:52 +03:00
Evgeny Mankov e67bde9108 Merge pull request #1048 from emankov/master
[HIPIFY][doc] Update README.md
2019-04-24 18:04:14 +03:00
Evgeny Mankov 4651dce3f0 [HIPIFY][doc] Update README.md
+ A few words about clang patches to work with CUDA 9.2 - 10.0 on Windows;
+ Fix cuDNN versions with correct values.
2019-04-24 17:40:35 +03:00
Maneesh Gupta ffe9f86fe8 Merge pull request #1043 from mhbliao/hliao/master/fp16
[hip] Fix including of hip_fp16.h
2019-04-24 16:50:46 +05:30
Maneesh Gupta de6c680767 Merge pull request #1042 from mhbliao/hliao/master/ldg
[hip] Fix use of `__HIP_CLANG_ONLY__` in `hip_ldg.h`.
2019-04-24 16:50:37 +05:30
Maneesh Gupta e489f7579a Merge pull request #1040 from eshcherb/roctracer-hip-frontend-190422
hip_prof_api.h include under __cplusplus
2019-04-24 16:50:27 +05:30
Maneesh Gupta 2975221560 Merge pull request #1039 from gargrahul/fix_ptrgetattr_nvcc
Fix hipPointerGetAttributes for NVCC
2019-04-24 16:50:18 +05:30
Rahul Garg 2bc2c46d4d Add hipMallocManaged default functional support (#1036)
* Add hipMallocManaged default functional support

* Fix build error

* Add dtest
2019-04-24 16:50:03 +05:30
Maneesh Gupta a016777acb Merge pull request #1034 from kpyzhov/master
Minor fixes for 64-bit device functions.
2019-04-24 16:49:36 +05:30
Maneesh Gupta 8dc8c58ddd Merge pull request #1031 from yxsamliu/fix-init
Fix missing arg in HIP_INIT_API
2019-04-24 16:49:23 +05:30
Maneesh Gupta eed5928007 Merge pull request #1028 from gargrahul/fix_d2d_async_test
[dtest] Fix D2DAsync test
2019-04-24 16:49:13 +05:30
Aaron Enye Shi 920fe246d7 Add COMGR relative path for build machines 2019-04-23 17:16:26 -04:00
Evgeny Mankov 6dc1165259 Merge pull request #1045 from emankov/master
[HIPIFY][doc] Provide patches for clang's bug 38811
2019-04-23 21:15:33 +03:00
Evgeny Mankov 57931b3056 [HIPIFY][doc] Provide patches for clang's bug 38811
+ Update Readme.md accordingly
2019-04-23 21:13:00 +03:00
Evgeny Mankov 3402f9110b Merge pull request #1044 from emankov/master
[HIPIFY][hipify-perl] Formatting
2019-04-23 18:30:38 +03:00
Evgeny Mankov defc6f8155 [HIPIFY][hipify-perl] Formatting 2019-04-23 17:55:47 +03:00
Michael LIAO dc0d7bd5ce [hip] Fix including of hip_fp16.h
- Separate the definitions of `__HCC_OR_HIP_CLANG__`, `__HCC_ONLY__`, and
  `__HIP_CLANG_ONLY__` into hip_common.h so that it can be included in
  hip_fp16.h, which may be included separately by an application.
2019-04-23 09:16:00 -04:00
Michael LIAO 6fb07acc8c [hip] Fix use of __HIP_CLANG_ONLY__ in hip_ldg.h.
- Check its value instead of whether it's defined or not.
2019-04-22 23:22:32 -04:00
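
The difference between checking a macro's value and checking whether it is defined can be sketched as below. `SKETCH_FEATURE_ONLY` is a hypothetical stand-in, and the sketch assumes a flag that is always defined and carries either 0 or 1; whether `__HIP_CLANG_ONLY__` behaves exactly this way is an assumption drawn from the commit message, not something verified here.

```cpp
// Hypothetical macro for illustration only. It is defined, but its value
// is 0, meaning "feature off".
#define SKETCH_FEATURE_ONLY 0

// #ifdef asks only whether the name exists, so this branch is taken even
// though the value is 0.
#ifdef SKETCH_FEATURE_ONLY
constexpr bool kTakenByIfdef = true;
#else
constexpr bool kTakenByIfdef = false;
#endif

// #if evaluates the value, so this branch is skipped when the value is 0.
#if SKETCH_FEATURE_ONLY
constexpr bool kTakenByIf = true;
#else
constexpr bool kTakenByIf = false;
#endif

static_assert(kTakenByIfdef, "#ifdef sees a macro defined as 0");
static_assert(!kTakenByIf, "#if treats a macro defined as 0 as false");
```

With a flag of this kind, an `#ifdef` guard would enable the guarded code on every compiler, which is the sort of error a value check avoids.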
Evgeny af3f3ccb2b hip_prof_api.h include under __cplusplus 2019-04-22 21:14:18 -05:00
Rahul Garg 69a3d6b72a Fix hipPointerGetAttributes for NVCC 2019-04-23 03:22:25 +05:30
Konstantin Pyzhov beadaab661 Fix for __popcll() device function implementation. 2019-04-19 08:53:22 -04:00
Yaxun (Sam) Liu bb5c620b13 Fix missing arg in HIP_INIT_API 2019-04-18 16:18:31 -04:00
Konstantin Pyzhov b7bd29924a Fix for __ffsll() device functions. 2019-04-18 13:07:24 -04:00
David Salinas 5843530a06 Revert "append the ELF flags for sram-ecc and xnack to the target triple per code object"
This reverts commit c61f265657.
2019-04-18 11:49:40 -04:00
Rahul Garg e5e7651a4a Fix D2DAsync test 2019-04-18 07:35:06 +05:30
Evgeny Mankov d95448dd26 Merge pull request #1025 from emankov/master
[HIPIFY][SPARSE] cuSPARSE 10.1 support
2019-04-16 15:01:19 +03:00
Evgeny Mankov e1c87d8cae [HIPIFY][SPARSE] cuSPARSE 10.1 support 2019-04-16 14:59:44 +03:00
Evgeny Mankov 85de95386e Merge pull request #1024 from emankov/master
[HIPIFY][BLAS] cuBLAS 10.1 support
2019-04-16 12:54:18 +03:00
Evgeny Mankov 032c3bf5b8 [HIPIFY][BLAS] cuBLAS 10.1 support 2019-04-16 12:52:58 +03:00
Evgeny Mankov ea389cd7b8 Merge pull request #1023 from emankov/master
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
2019-04-16 11:03:22 +03:00
Evgeny Mankov 1b36987c5f [HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support 2019-04-16 11:01:01 +03:00
Maneesh Gupta 8309632e2d Merge pull request #995 from david-salinas/add_sram-ecc_and_xnack_flags_to_triple
Append the ELF flags for sram-ecc and xnack to the target triple per code object
2019-04-16 09:10:04 +05:30