Aaron Enye Shi
6b3095f7cb
Add COMGR relative path for build machines
2019-04-23 17:16:26 -04:00
Evgeny Mankov
716ed30878
Merge pull request #1045 from emankov/master
...
[HIPIFY][doc] Provide patches for clang's bug 38811
2019-04-23 21:15:33 +03:00
Evgeny Mankov
87fa81f7be
[HIPIFY][doc] Provide patches for clang's bug 38811
...
+ Update Readme.md accordingly
2019-04-23 21:13:00 +03:00
Evgeny Mankov
d8a51ee26c
Merge pull request #1044 from emankov/master
...
[HIPIFY][hipify-perl] Formatting
2019-04-23 18:30:38 +03:00
Evgeny Mankov
65dd1d4c7d
[HIPIFY][hipify-perl] Formatting
2019-04-23 17:55:47 +03:00
David Salinas
1237a0b691
Revert "append the ELF flags for sram-ecc and xnack to the target triple per code object"
...
This reverts commit 4d0dc45078 .
2019-04-18 11:49:40 -04:00
Evgeny Mankov
9514678aeb
Merge pull request #1025 from emankov/master
...
[HIPIFY][SPARSE] cuSPARSE 10.1 support
2019-04-16 15:01:19 +03:00
Evgeny Mankov
95aca4f9a9
[HIPIFY][SPARSE] cuSPARSE 10.1 support
2019-04-16 14:59:44 +03:00
Evgeny Mankov
542d0c3d7a
Merge pull request #1024 from emankov/master
...
[HIPIFY][BLAS] cuBLAS 10.1 support
2019-04-16 12:54:18 +03:00
Evgeny Mankov
bbcacd0146
[HIPIFY][BLAS] cuBLAS 10.1 support
2019-04-16 12:52:58 +03:00
Evgeny Mankov
f5f1636181
Merge pull request #1023 from emankov/master
...
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
2019-04-16 11:03:22 +03:00
Evgeny Mankov
5fa84735a6
[HIPIFY][cuDNN] Add partial cudnnRNNBiasMode_t support
2019-04-16 11:01:01 +03:00
Maneesh Gupta
715a500b97
Merge pull request #995 from david-salinas/add_sram-ecc_and_xnack_flags_to_triple
...
Append the ELF flags for sram-ecc and xnack to the target triple per code object
2019-04-16 09:10:04 +05:30
Maneesh Gupta
22660bed74
Merge pull request #1019 from scchan/lazy_binding
...
minor workaround for lazy binding
2019-04-16 08:36:10 +05:30
Jeff Daily
cf8fb43e6b
In hipFree, synchronize owner of memory ( #1018 )
...
* In hipFree, if memory is associated with a device, synchronize that device's streams.
This changes the behavior from synchronizing the currently set TLS device.
* All devices sync in hipFree for _appId=-1 case.
* Revert "All devices sync in hipFree for _appId=-1 case."
This reverts commit 1efb34d6a8426661e45bc5f763422a1147aeac10.
* add HIP_SYNC_FREE env var
2019-04-16 08:35:55 +05:30
Mr-LiuSw
64bdf82265
add little changes in hip_runtime_api.h to work with c language ( #1017 )
...
* Update hip_runtime_api.h
when i try to use mpicc or gcc to compile a c language code which call some hip runtime api , error occured as
> /path/to/hcc_detail/hip_runtime_api.h:2268:33: error: unknown type name ‘hipFuncAttributes’;
> hipFuncGetAttributes(hipFuncAttributes* attr, const void* func);
add ' struct ' for the first parameter of hipFuncGetAttributes will get ride of this problem.
2019-04-16 08:35:36 +05:30
Aaron Enye Shi
2c80975e9c
Use COMgr to read Kernel Args Metadata ( #1006 )
...
* Add CMAKE dep to amd_comgr
* Use COMGR for read_kernarg_metadata in COV2
* Do not assume kernargs exist
* Add proper metadata destroy cleanup
* Use a process function for easier destroy
* Remove old read_kernarg_metadata
* Clean up HCC, prints, names
* Use COMGR in CMAKE by default
* Move metadata lookup for keyword values into helper
* Remove C string usage for lookup_keyword_value
* Guard COMGR for non-NVCC path
* Add hip_hcc dependency on comgr package
* Add lifetime to metadata nodes
* Find COMGR config file for amd_comgr target
* Move set_active data earlier
2019-04-16 08:34:39 +05:30
Evgeny Mankov
3bc6f1e7b5
Merge pull request #1022 from emankov/master
...
[HIPIFY] cuDNN 7.5.0.56 support
2019-04-15 15:48:38 +03:00
Evgeny Mankov
64f0f29111
[HIPIFY] cuDNN 7.5.0.56 support
2019-04-15 15:46:46 +03:00
Maneesh Gupta
b700c19985
Merge pull request #1021 from mangupta/ci_update_builds
...
[ci] Enable tests on ROCm 2.3
2019-04-15 13:27:49 +05:30
Maneesh Gupta
72e17e3c92
[ci] Enable tests on ROCm 2.3
...
Change-Id: Id344ef600b0868f36f2e7ac08d5664234d88835b
2019-04-15 12:38:01 +05:30
Maneesh Gupta
464e098796
Merge pull request #1016 from yxsamliu/align
...
hip-clang: Add __align__
2019-04-11 08:54:23 +05:30
Yaxun (Sam) Liu
e200ece4da
hip-clang: Add __align__
...
CUDA has __align__. Define eqivalent for hip-clang.
2019-04-10 14:17:18 -04:00
Evgeny Mankov
131e74c64c
Merge pull request #1015 from emankov/master
...
[HIPIFY] CUDA 10.1 Runtime API support
2019-04-10 18:44:21 +03:00
Evgeny Mankov
b11bf48270
[HIPIFY] CUDA 10.1 Runtime API support
2019-04-10 18:41:36 +03:00
Evgeny Mankov
2603fb409b
Merge pull request #1014 from emankov/master
...
[HIPIFY] CUDA 10.1 Driver API support
2019-04-10 15:05:28 +03:00
Evgeny Mankov
9a660c0d48
[HIPIFY] CUDA 10.1 Driver API support
2019-04-10 15:03:34 +03:00
Maneesh Gupta
75691ff3e4
Merge pull request #1013 from yxsamliu/config
...
Fix hip-config.cmake for hip-clang
2019-04-10 07:53:22 +00:00
Maneesh Gupta
9a1f5c99fc
Merge pull request #1012 from yxsamliu/header
...
hip-clang: add include path for hsa headers
2019-04-10 07:53:13 +00:00
Yaxun (Sam) Liu
419eeb4fe0
hip-clang: fix hipcc options
...
Use -O3 for default optimization level.
Fix --hip-device-lib-path for warnings about unused option.
2019-04-09 22:31:13 -04:00
Yaxun (Sam) Liu
a781d2e017
Fix hip-config.cmake for hip-clang
...
Define HIP_COMPILER and HIP_RUNTIME in hip-config.cmake to
facilitate other packages identify hip-clang and HIP/VDI
runtime.
2019-04-09 16:41:01 -04:00
Yaxun (Sam) Liu
98642129cc
hip-clang: add include path for hsa headers
2019-04-09 15:01:50 -04:00
Maneesh Gupta
aa8f17193e
Merge pull request #1010 from ROCm-Developer-Tools/mangupta-patch-1
...
[ci] Disable hipLaunchParm.tst for now
2019-04-09 03:19:56 +00:00
Evgeny Mankov
18b3cbbb4c
Merge pull request #1011 from emankov/master
...
[HIPIFY][cmake][doc] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
2019-04-08 20:10:32 +03:00
Evgeny Mankov
79e6bd8426
[HIPIFY][cmake][doc] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
...
+ Fix the error in cmake: clang 8.0.0 supports CUDA 10.0, not 10.1; only trunk clang 9.0.0 supports CUDA 10.1.
+ Update Readme:
1. clang's bug 36384 is gone, so latest stable release for Linux is clang 8.0.0, which supports CUDA up to 10.0;
2. update dependencies, testing and building notes.
2019-04-08 20:05:21 +03:00
Maneesh Gupta
27013c9af6
[ci] Disable hipLaunchParm.tst for now
...
Due to a HCC bug in rocm-head, the test fails to compile. Disabling the test until the issue is resolved.
Change-Id: Ib4e7baf0b9c2cb5f5dbe38e6dd2bab894d28886a
2019-04-08 16:14:25 +05:30
Maneesh Gupta
6b47979b8a
Merge pull request #1009 from yxsamliu/fix-mgpu
...
Fix regression on multi-gpu due to PR#997
2019-04-08 08:20:17 +00:00
Yaxun (Sam) Liu
5c67ee11f4
Fix regression on multi-gpu due to PR#997
2019-04-05 22:54:41 -04:00
Evgeny Mankov
3352d7b174
Merge pull request #1003 from acowley/patch-1
...
hipify: translate cudaErrorInvalidSymbol
2019-04-05 20:21:27 +03:00
Evgeny Mankov
90225d2a39
Merge pull request #1007 from emankov/master
...
[HIP][doc] Fix typos and minor text inaccuracies in HIP FAQ
2019-04-05 20:18:01 +03:00
David Salinas
4d0dc45078
append the ELF flags for sram-ecc and xnack to the target triple per code object
2019-04-05 13:17:11 -04:00
Evgeny Mankov
4990a1003b
[HIPIFY][doc] Fix typos and minor text inaccuracies in HIP FAQ
2019-04-05 19:19:38 +03:00
Siu Chi Chan
b5045af7e9
minor workaround for lazy binding
2019-04-02 17:28:06 -04:00
Anthony Cowley
f15dd58a91
hipify: translate cudaErrorInvalidSymbol
2019-04-02 08:54:37 -04:00
Maneesh Gupta
2d9e5615b7
Merge pull request #998 from yxsamliu/doc
...
hip-clang: update installation guide.
2019-04-02 05:08:18 +00:00
Maneesh Gupta
a6e7dfefe3
Merge pull request #997 from yxsamliu/mgpu
...
hip-clang: fix kernel not found on multi-gpu
2019-04-02 05:07:31 +00:00
Evgeny Mankov
887c096ed7
Merge pull request #1001 from emankov/master
...
[HIPIFY][tests] Fix typo in test for CUDA 10.x
2019-04-01 19:53:42 +03:00
Evgeny Mankov
5b59f87305
[HIPIFY][tests] Fix typo in test for CUDA 10.x
2019-04-01 19:52:08 +03:00
Evgeny Mankov
16128da4a3
Merge pull request #1000 from emankov/master
...
[HIPIFY][cmake] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
2019-04-01 19:49:46 +03:00
Evgeny Mankov
799a6f5512
[HIPIFY][cmake] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
2019-04-01 19:44:52 +03:00