Commit Graph

3192 Commits

Author SHA1 Message Date
Maneesh Gupta f7eaff474b Implement hipExtGetLinkTypeAndHopCount for ROCm devices
Change-Id: Ie5bb4f640ac6d189c7fceeab22627a7494fd10bd


[ROCm/hip commit: 035ef04e1f]
2019-05-06 15:54:31 +05:30
Maneesh Gupta f49a4c6d89 Merge pull request #1062 from mhbliao/hliao/master/icmp
[hip] Re-implement ballot using AMDGCN builtins

[ROCm/hip commit: 117bdd8774]
2019-05-03 17:48:19 +05:30
Maneesh Gupta f42dc08384 Merge pull request #1058 from mhbliao/hliao/master/devfunc
[Device Function] Fix implementation

[ROCm/hip commit: 37d01a7da9]
2019-05-03 17:47:51 +05:30
emankov 1f57ee6125 [HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
[ROCm/hip commit: e3082f5142]
2019-04-30 17:37:34 +03:00
Michael LIAO e637e72364 [Device Function] Fix implementation of __bitinsert_u64
- It's a common mistake to assume that `1 << shamt` is promoted to
  64-bit if shamt is a 64-bit integer. That's not the case. Replace
  that left shift with a 64-bit one to ensure it won't fall into undefined
  behavior.
- Fix the host-side implementation as well for device function testing.


[ROCm/hip commit: 9bd2d5746d]
2019-04-30 08:59:13 -04:00
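
The shift-width pitfall described in the `__bitinsert_u64` commit above can be illustrated with a minimal C sketch. The function name `bitinsert_u64_sketch` and its argument order are hypothetical and are not taken from the HIP sources; the sketch only assumes a 32-bit `int`, which is the usual case.

```c
#include <stdint.h>

/* Hypothetical helper, not the HIP implementation: insert the low `width`
 * bits of `src1` into `src0`, starting at bit position `offset`.
 * Both `offset` and `width` are reduced modulo 64, so a full 64-bit
 * insert is out of scope for this sketch. */
uint64_t bitinsert_u64_sketch(uint64_t src0, uint64_t src1,
                              uint64_t offset, uint64_t width)
{
    offset &= 63;
    width &= 63;

    /* UINT64_C(1) makes the left operand 64-bit. With a plain `1`, the
     * shift would be performed on an int regardless of the type of
     * `width`, and shifting an int by 32 or more is undefined behavior. */
    uint64_t mask = (UINT64_C(1) << width) - 1;

    /* Clear the target field in src0, then merge in the masked src1 bits. */
    return (src0 & ~(mask << offset)) | ((src1 & mask) << offset);
}
```

For example, calling `bitinsert_u64_sketch(0, 0xFF, 40, 8)` places eight set bits at positions 40 to 47, a result that cannot be produced by a 32-bit shift.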
Michael LIAO cc4de2bc28 [devfunc] Re-implement ballot using AMDGCN builtins
- As the signature of `amdgcn.icmp` is changed for the next-gen chip, using
  clang builtins is a portable way to hide those details.


[ROCm/hip commit: a64637da2c]
2019-04-29 17:21:25 -04:00
Evgeny Mankov 2b16e01c66 [HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
+ tested with CUDA 9.0, 9.2, 10.0 and 10.1


[ROCm/hip commit: c0705f892b]
2019-04-29 15:41:08 +03:00
Aaron Enye Shi 62e96cb4cf Revert "Use COMgr to read Kernel Args Metadata (#1006)"
This reverts commit 882006555b.


[ROCm/hip commit: a3d118eaa8]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi c797e6e27b Revert "Add COMGR relative path for build machines"
This reverts commit 01a11c94f8.


[ROCm/hip commit: 48701ad4ba]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi 3585159140 Revert "Add dependency on amd_comgr in hip-config-*.cmake.in"
This reverts commit b9526364bb.


[ROCm/hip commit: 59a5965fe1]
2019-04-26 16:04:56 -04:00
Maneesh Gupta b9526364bb Add dependency on amd_comgr in hip-config-*.cmake.in
Change-Id: Iac1d851a8cfb99224e9c5926780273d9b9b08426


[ROCm/hip commit: ef99ffd9f4]
2019-04-25 15:26:33 -04:00
Evgeny Mankov d8f4eff1ef [HIPIFY][perl][fix][258] Memory fence device functions are supported now
[ROCm/hip commit: abd1c53cf8]
2019-04-25 13:27:30 +03:00
Evgeny Mankov 86d68901da [HIPIFY][DNN] cudnnSetFilter4dDescriptor support
[ROCm/hip commit: 3fee0f3765]
2019-04-25 12:18:51 +03:00
Evgeny Mankov d51679ed8e [HIPIFY][fix][#204] Suppress warning message: #pragma once in main file
[ROCm/hip commit: 6d3c443234]
2019-04-24 20:35:52 +03:00
Evgeny Mankov 43839c780e [HIPIFY][doc] Update README.md
+ A few words about clang patches to work with CUDA 9.2 - 10.0 on Windows;
+ Fix cuDNN versions with correct values.


[ROCm/hip commit: 4651dce3f0]
2019-04-24 17:40:35 +03:00
Maneesh Gupta c36aa3a056 Merge pull request #1043 from mhbliao/hliao/master/fp16
[hip] Fix including of hip_fp16.h

[ROCm/hip commit: ffe9f86fe8]
2019-04-24 16:50:46 +05:30
Maneesh Gupta 7938012378 Merge pull request #1042 from mhbliao/hliao/master/ldg
[hip] Fix use of `__HIP_CLANG_ONLY__` in `hip_ldg.h`.

[ROCm/hip commit: de6c680767]
2019-04-24 16:50:37 +05:30
Maneesh Gupta 7b5ed45968 Merge pull request #1040 from eshcherb/roctracer-hip-frontend-190422
hip_prof_api.h include under __cplusplus

[ROCm/hip commit: e489f7579a]
2019-04-24 16:50:27 +05:30
Maneesh Gupta 5235da375e Merge pull request #1039 from gargrahul/fix_ptrgetattr_nvcc
Fix hipPointerGetAttributes for NVCC

[ROCm/hip commit: 2975221560]
2019-04-24 16:50:18 +05:30
Rahul Garg c01236f679 Add hipMallocManaged default functional support (#1036)
* Add hipMallocManaged default functional support

* Fix build error

* Add dtest


[ROCm/hip commit: 2bc2c46d4d]
2019-04-24 16:50:03 +05:30
Maneesh Gupta 4d9977c10b Merge pull request #1034 from kpyzhov/master
Minor fixes for 64-bit device functions.

[ROCm/hip commit: a016777acb]
2019-04-24 16:49:36 +05:30
Maneesh Gupta 72c7388422 Merge pull request #1031 from yxsamliu/fix-init
Fix missing arg in HIP_INIT_API

[ROCm/hip commit: 8dc8c58ddd]
2019-04-24 16:49:23 +05:30
Maneesh Gupta b17642bb10 Merge pull request #1028 from gargrahul/fix_d2d_async_test
[dtest] Fix D2DAsync test

[ROCm/hip commit: eed5928007]
2019-04-24 16:49:13 +05:30
Aaron Enye Shi 01a11c94f8 Add COMGR relative path for build machines
[ROCm/hip commit: 920fe246d7]
2019-04-23 17:16:26 -04:00
Evgeny Mankov 3fb77dbef5 [HIPIFY][doc] Provide patches for clang's bug 38811
+ Update Readme.md accordingly


[ROCm/hip commit: 57931b3056]
2019-04-23 21:13:00 +03:00
Evgeny Mankov a2b6b1e7e9 [HIPIFY][hipify-perl] Formatting
[ROCm/hip commit: defc6f8155]
2019-04-23 17:55:47 +03:00
Michael LIAO 6eec9d4ac8 [hip] Fix including of hip_fp16.h
- Separate the definition of `__HCC_OR_HIP_CLANG__`, `__HCC_ONLY__`, and
  `__HIP_CLANG_ONLY__` into hip_common.h so that it could be included in
  hip_fp16.h, which may be included separately in app.


[ROCm/hip commit: dc0d7bd5ce]
2019-04-23 09:16:00 -04:00
Michael LIAO 27aed8c172 [hip] Fix use of __HIP_CLANG_ONLY__ in hip_ldg.h.
- Check its value instead of whether it's defined or not.


[ROCm/hip commit: 6fb07acc8c]
2019-04-22 23:22:32 -04:00
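
The difference between testing whether a macro is defined and testing its value, which the `hip_ldg.h` commit above addresses, can be sketched as follows. `DEMO_CLANG_ONLY` is a hypothetical stand-in; the sketch assumes, without confirmation from the log, that the real macro is always defined and carries a value of 0 or 1.

```c
/* Hypothetical stand-in for a feature macro that is always defined,
 * with value 0 (feature off) or 1 (feature on). */
#define DEMO_CLANG_ONLY 0

/* #ifdef only asks "is the name defined?", so it takes the first
 * branch even though the value is 0. */
int seen_by_ifdef(void)
{
#ifdef DEMO_CLANG_ONLY
    return 1;
#else
    return 0;
#endif
}

/* #if evaluates the macro's value, so a value of 0 selects the
 * second branch. */
int seen_by_if(void)
{
#if DEMO_CLANG_ONLY
    return 1;
#else
    return 0;
#endif
}
```

With the macro defined as 0, `seen_by_ifdef()` reports the feature as enabled while `seen_by_if()` reports it as disabled, which is why a value check is the safer guard for macros of this kind.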
Evgeny 17474d2b58 hip_prof_api.h include under __cplusplus
[ROCm/hip commit: af3f3ccb2b]
2019-04-22 21:14:18 -05:00
Rahul Garg be49602b29 Fix hipPointerGetAttributes for NVCC
[ROCm/hip commit: 69a3d6b72a]
2019-04-23 03:22:25 +05:30
Konstantin Pyzhov 2e0c67bbd3 Fix for __popcll() device function implementation.
[ROCm/hip commit: beadaab661]
2019-04-19 08:53:22 -04:00
Yaxun (Sam) Liu d4bce6c019 Fix missing arg in HIP_INIT_API
[ROCm/hip commit: bb5c620b13]
2019-04-18 16:18:31 -04:00
Konstantin Pyzhov 53b232fb2a Fix for __ffsll() device functions.
[ROCm/hip commit: b7bd29924a]
2019-04-18 13:07:24 -04:00
David Salinas d4c72ad357 Revert "append the ELF flags for sram-ecc and xnack to the target triple per code object"
This reverts commit ee887dd6a2.


[ROCm/hip commit: 5843530a06]
2019-04-18 11:49:40 -04:00
Rahul Garg d0951a9c20 Fix D2DAsync test
[ROCm/hip commit: e5e7651a4a]
2019-04-18 07:35:06 +05:30
Evgeny Mankov 43d7714fe4 [HIPIFY][SPARSE] cuSPARSE 10.1 support
[ROCm/hip commit: e1c87d8cae]
2019-04-16 14:59:44 +03:00
Evgeny Mankov fec9f62e0f [HIPIFY][BLAS] cuBLAS 10.1 support
[ROCm/hip commit: 032c3bf5b8]
2019-04-16 12:52:58 +03:00
Evgeny Mankov 6268243c74 Merge pull request #1023 from emankov/master
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support

[ROCm/hip commit: ea389cd7b8]
2019-04-16 11:03:22 +03:00
Evgeny Mankov a7e36937ca [HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
[ROCm/hip commit: 1b36987c5f]
2019-04-16 11:01:01 +03:00
Maneesh Gupta f529a6af13 Merge pull request #995 from david-salinas/add_sram-ecc_and_xnack_flags_to_triple
Append the ELF flags for sram-ecc and xnack to the target triple per code object

[ROCm/hip commit: 8309632e2d]
2019-04-16 09:10:04 +05:30
Maneesh Gupta 8b7cd90f0c Merge pull request #1019 from scchan/lazy_binding
minor workaround for lazy binding

[ROCm/hip commit: d789aef46e]
2019-04-16 08:36:10 +05:30
Jeff Daily a0172ca884 In hipFree, synchronize owner of memory (#1018)
* In hipFree, if memory is associated with a device, synchronize that device's streams.

This changes the behavior from synchronizing the currently set TLS device.

* All devices sync in hipFree for _appId=-1 case.

* Revert "All devices sync in hipFree for _appId=-1 case."

This reverts commit 1efb34d6a8426661e45bc5f763422a1147aeac10.

* add HIP_SYNC_FREE env var


[ROCm/hip commit: 2b3037a6ea]
2019-04-16 08:35:55 +05:30
Mr-LiuSw ba23b5c5db add little changes in hip_runtime_api.h to work with c language (#1017)
* Update hip_runtime_api.h

When I try to use mpicc or gcc to compile C code that calls some HIP runtime APIs, the following error occurs:
> /path/to/hcc_detail/hip_runtime_api.h:2268:33: error: unknown type name ‘hipFuncAttributes’;
> hipFuncGetAttributes(hipFuncAttributes* attr, const void* func);

Adding `struct` before the type of the first parameter of hipFuncGetAttributes gets rid of this problem.


[ROCm/hip commit: dc07b1b06c]
2019-04-16 08:35:36 +05:30
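
The C-language issue reported in the `hip_runtime_api.h` commit above comes from how C treats struct tags: without a typedef, a struct type must be written with the `struct` keyword. A minimal sketch follows; `demo_func_attributes` and its single field are hypothetical and do not reflect the real `hipFuncAttributes` layout.

```c
/* Hypothetical struct declared with a tag only, no typedef. */
struct demo_func_attributes {
    int max_threads_per_block;
};

/* In C, the parameter type must be spelled `struct demo_func_attributes`.
 * Writing `demo_func_attributes *attr` would be rejected with
 * "unknown type name", whereas C++ accepts the bare tag name. */
int demo_func_get_attributes(struct demo_func_attributes *attr)
{
    attr->max_threads_per_block = 256; /* placeholder value for the sketch */
    return 0;
}
```

A header that spells the parameter with `struct` compiles under both a C and a C++ compiler, so the change does not affect existing C++ callers.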
Aaron Enye Shi 882006555b Use COMgr to read Kernel Args Metadata (#1006)
* Add CMAKE dep to amd_comgr

* Use COMGR for read_kernarg_metadata in COV2

* Do not assume kernargs exist

* Add proper metadata destroy cleanup

* Use a process function for easier destroy

* Remove old read_kernarg_metadata

* Clean up HCC, prints, names

* Use COMGR in CMAKE by default

* Move metadata lookup for keyword values into helper

* Remove C string usage for lookup_keyword_value

* Guard COMGR for non-NVCC path

* Add hip_hcc dependency on comgr package

* Add lifetime to metadata nodes

* Find COMGR config file for amd_comgr target

* Move set_active data earlier


[ROCm/hip commit: 8a548bf40b]
2019-04-16 08:34:39 +05:30
Evgeny Mankov e73bfab77c [HIPIFY] cuDNN 7.5.0.56 support
[ROCm/hip commit: d3a810ab11]
2019-04-15 15:46:46 +03:00
Maneesh Gupta ab22256587 [ci] Enable tests on ROCm 2.3
Change-Id: Id344ef600b0868f36f2e7ac08d5664234d88835b


[ROCm/hip commit: 5a0e14fbb1]
2019-04-15 12:38:01 +05:30
Yaxun (Sam) Liu 9899d0523e hip-clang: Add __align__
CUDA has __align__. Define an equivalent for hip-clang.


[ROCm/hip commit: b9f8f977fb]
2019-04-10 14:17:18 -04:00
Evgeny Mankov 9087c06f8f [HIPIFY] CUDA 10.1 Runtime API support
[ROCm/hip commit: d2d2889bfe]
2019-04-10 18:41:36 +03:00
Evgeny Mankov ac64c184fb [HIPIFY] CUDA 10.1 Driver API support
[ROCm/hip commit: b0879cd959]
2019-04-10 15:03:34 +03:00
Maneesh Gupta ae6c61575e Merge pull request #1013 from yxsamliu/config
Fix hip-config.cmake for hip-clang

[ROCm/hip commit: fa6ba987a5]
2019-04-10 07:53:22 +00:00