Граф коммитов

3503 Коммитов

Автор SHA1 Сообщение Дата
Aaron Enye Shi acfa46bbbc Revert "Add COMGR relative path for build machines"
This reverts commit 6b3095f7cb.
2019-04-26 16:04:56 -04:00
Aaron Enye Shi 2378c7a20b Revert "Add dependency on amd_comgr in hip-config-*.cmake.in"
This reverts commit b8fe5ba572.
2019-04-26 16:04:56 -04:00
Maneesh Gupta b8fe5ba572 Add dependency on amd_comgr in hip-config-*.cmake.in
Change-Id: Iac1d851a8cfb99224e9c5926780273d9b9b08426
2019-04-25 15:26:33 -04:00
Evgeny Mankov 30b49bfaff Merge pull request #1053 from emankov/master
[HIPIFY][perl][fix][258] Memory fence device functions are supported now
2019-04-25 13:28:59 +03:00
Evgeny Mankov f0c2fdc6d7 [HIPIFY][perl][fix][258] Memory fence device functions are supported now 2019-04-25 13:27:30 +03:00
Evgeny Mankov 9f2d7c4872 Merge pull request #1051 from emankov/master
[HIPIFY][DNN] cudnnSetFilter4dDescriptor support
2019-04-25 12:20:09 +03:00
Evgeny Mankov 72a809caf6 [HIPIFY][DNN] cudnnSetFilter4dDescriptor support 2019-04-25 12:18:51 +03:00
Evgeny Mankov 86aa658d0e Merge pull request #1049 from emankov/master
[HIPIFY][fix][#204] Suppress warning message: #pragma once in main file
2019-04-24 20:37:28 +03:00
Evgeny Mankov df9418c3cd [HIPIFY][fix][#204] Suppress warning message: #pragma once in main file 2019-04-24 20:35:52 +03:00
Evgeny Mankov 45c7535b1f Merge pull request #1048 from emankov/master
[HIPIFY][doc] Update README.md
2019-04-24 18:04:14 +03:00
Evgeny Mankov 1049031d98 [HIPIFY][doc] Update README.md
+ A few words about clang patches to work with CUDA 9.2 - 10.0 on Windows;
+ Fix cuDNN versions with correct values.
2019-04-24 17:40:35 +03:00
Maneesh Gupta 7f81c72f1c Merge pull request #1043 from mhbliao/hliao/master/fp16
[hip] Fix including of hip_fp16.h
2019-04-24 16:50:46 +05:30
Maneesh Gupta 63ab2ea945 Merge pull request #1042 from mhbliao/hliao/master/ldg
[hip] Fix use of `__HIP_CLANG_ONLY__` in `hip_ldg.h`.
2019-04-24 16:50:37 +05:30
Maneesh Gupta 54cdeabe6e Merge pull request #1040 from eshcherb/roctracer-hip-frontend-190422
hip_prof_api.h include under __cplusplus
2019-04-24 16:50:27 +05:30
Maneesh Gupta 7edb43bc83 Merge pull request #1039 from gargrahul/fix_ptrgetattr_nvcc
Fix hipPointerGetAttributes for NVCC
2019-04-24 16:50:18 +05:30
Rahul Garg 94769fc8dd Add hipMallocManaged default functional support (#1036)
* Add hipMallocManaged default functional support

* Fix build error

* Add dtest
2019-04-24 16:50:03 +05:30
Maneesh Gupta 61861faddc Merge pull request #1034 from kpyzhov/master
Minor fixes for 64-bit device functions.
2019-04-24 16:49:36 +05:30
Maneesh Gupta 3ac21336eb Merge pull request #1031 from yxsamliu/fix-init
Fix missing arg in HIP_INIT_API
2019-04-24 16:49:23 +05:30
Maneesh Gupta 3ba7afcfc1 Merge pull request #1028 from gargrahul/fix_d2d_async_test
[dtest] Fix D2DAsync test
2019-04-24 16:49:13 +05:30
Aaron Enye Shi 6b3095f7cb Add COMGR relative path for build machines 2019-04-23 17:16:26 -04:00
Evgeny Mankov 716ed30878 Merge pull request #1045 from emankov/master
[HIPIFY][doc] Provide patches for clang's bug 38811
2019-04-23 21:15:33 +03:00
Evgeny Mankov 87fa81f7be [HIPIFY][doc] Provide patches for clang's bug 38811
+ Update Readme.md accordingly
2019-04-23 21:13:00 +03:00
Evgeny Mankov d8a51ee26c Merge pull request #1044 from emankov/master
[HIPIFY][hipify-perl] Formatting
2019-04-23 18:30:38 +03:00
Evgeny Mankov 65dd1d4c7d [HIPIFY][hipify-perl] Formatting 2019-04-23 17:55:47 +03:00
Michael LIAO d086dbd0e5 [hip] Fix including of hip_fp16.h
- Separate the definition of `__HCC_OR_HIP_CLANG__`, `__HCC_ONLY__`, and
  `__HIP_CLANG_ONLY__` into hip_common.h so that it could be included in
  hip_fp16.h, which may be included separately in app.
2019-04-23 09:16:00 -04:00
Michael LIAO ca6a5c07eb [hip] Fix use of __HIP_CLANG_ONLY__ in hip_ldg.h.
- Check its value instead of whether it's defined or not.
2019-04-22 23:22:32 -04:00
Evgeny 165c42483b hip_prof_api.h include under __cplusplus 2019-04-22 21:14:18 -05:00
Rahul Garg c0e0f0b7fd Fix hipPointerGetAttributes for NVCC 2019-04-23 03:22:25 +05:30
Konstantin Pyzhov f6fbf8751d Fix for __popcll() device function implementation. 2019-04-19 08:53:22 -04:00
Yaxun (Sam) Liu 710e633bdd Fix missing arg in HIP_INIT_API 2019-04-18 16:18:31 -04:00
Konstantin Pyzhov 5664ed3206 Fix for __ffsll() device functions. 2019-04-18 13:07:24 -04:00
David Salinas 1237a0b691 Revert "append the ELF flags for sram-ecc and xnack to the target triple per code object"
This reverts commit 4d0dc45078.
2019-04-18 11:49:40 -04:00
Rahul Garg f1dc017167 Fix D2DAsync test 2019-04-18 07:35:06 +05:30
Evgeny Mankov 9514678aeb Merge pull request #1025 from emankov/master
[HIPIFY][SPARSE] cuSPARSE 10.1 support
2019-04-16 15:01:19 +03:00
Evgeny Mankov 95aca4f9a9 [HIPIFY][SPARSE] cuSPARSE 10.1 support 2019-04-16 14:59:44 +03:00
Evgeny Mankov 542d0c3d7a Merge pull request #1024 from emankov/master
[HIPIFY][BLAS] cuBLAS 10.1 support
2019-04-16 12:54:18 +03:00
Evgeny Mankov bbcacd0146 [HIPIFY][BLAS] cuBLAS 10.1 support 2019-04-16 12:52:58 +03:00
Evgeny Mankov f5f1636181 Merge pull request #1023 from emankov/master
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
2019-04-16 11:03:22 +03:00
Evgeny Mankov 5fa84735a6 [HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support 2019-04-16 11:01:01 +03:00
Maneesh Gupta 715a500b97 Merge pull request #995 from david-salinas/add_sram-ecc_and_xnack_flags_to_triple
Append the ELF flags for sram-ecc and xnack to the target triple per code object
2019-04-16 09:10:04 +05:30
Maneesh Gupta 22660bed74 Merge pull request #1019 from scchan/lazy_binding
minor workaround for lazy binding
2019-04-16 08:36:10 +05:30
Jeff Daily cf8fb43e6b In hipFree, synchronize owner of memory (#1018)
* In hipFree, if memory is associated with a device, synchronize that device's streams.

This changes the behavior from synchronizing the currently set TLS device.

* All devices sync in hipFree for _appId=-1 case.

* Revert "All devices sync in hipFree for _appId=-1 case."

This reverts commit 1efb34d6a8426661e45bc5f763422a1147aeac10.

* add HIP_SYNC_FREE env var
2019-04-16 08:35:55 +05:30
Mr-LiuSw 64bdf82265 add little changes in hip_runtime_api.h to work with c language (#1017)
* Update hip_runtime_api.h

when i try to use mpicc or gcc to compile a c language code which call some hip runtime api , error occured as
> /path/to/hcc_detail/hip_runtime_api.h:2268:33: error: unknown type name ‘hipFuncAttributes’; 
> hipFuncGetAttributes(hipFuncAttributes* attr, const void* func);
 
add ' struct ' for the first parameter of hipFuncGetAttributes will get ride of this problem.
2019-04-16 08:35:36 +05:30
Aaron Enye Shi 2c80975e9c Use COMgr to read Kernel Args Metadata (#1006)
* Add CMAKE dep to amd_comgr

* Use COMGR for read_kernarg_metadata in COV2

* Do not assume kernargs exist

* Add proper metadata destroy cleanup

* Use a process function for easier destroy

* Remove old read_kernarg_metadata

* Clean up HCC, prints, names

* Use COMGR in CMAKE by default

* Move metadata lookup for keyword values into helper

* Remove C string usage for lookup_keyword_value

* Guard COMGR for non-NVCC path

* Add hip_hcc dependency on comgr package

* Add lifetime to metadata nodes

* Find COMGR config file for amd_comgr target

* Move set_active data earlier
2019-04-16 08:34:39 +05:30
Evgeny Mankov 3bc6f1e7b5 Merge pull request #1022 from emankov/master
[HIPIFY] cuDNN 7.5.0.56 support
2019-04-15 15:48:38 +03:00
Evgeny Mankov 64f0f29111 [HIPIFY] cuDNN 7.5.0.56 support 2019-04-15 15:46:46 +03:00
Maneesh Gupta b700c19985 Merge pull request #1021 from mangupta/ci_update_builds
[ci] Enable tests on ROCm 2.3
2019-04-15 13:27:49 +05:30
Maneesh Gupta 72e17e3c92 [ci] Enable tests on ROCm 2.3
Change-Id: Id344ef600b0868f36f2e7ac08d5664234d88835b
2019-04-15 12:38:01 +05:30
Maneesh Gupta 464e098796 Merge pull request #1016 from yxsamliu/align
hip-clang: Add __align__
2019-04-11 08:54:23 +05:30
Yaxun (Sam) Liu e200ece4da hip-clang: Add __align__
CUDA has __align__. Define eqivalent for hip-clang.
2019-04-10 14:17:18 -04:00