Commit Graph

3215 Commits

Author SHA1 Message Date
Evgeny Mankov 2cdf3d7e73 Merge pull request #1085 from emankov/master
[HIPIFY][perl] Initial support of hipify-perl generation from hipify-clang

[ROCm/clr commit: df8909d73d]
2019-05-07 17:30:39 +03:00
Evgeny Mankov 413e0f97fb [HIPIFY][perl] Initial support of hipify-perl generation from hipify-clang
+ Only generation of the transformation map of CUDA entities supported by HIP is implemented.
+ 3 hipify-clang options are added: -perl, -o-perl-map, -o-perl-map-dir.
+ OptionsParser mode is changed from OneOrMore to Optional to support hipify-perl generation without actual hipification.
+ Add an explicit check for missing source file specifications when perl generation is not requested.


[ROCm/clr commit: 849155d865]
2019-05-07 17:27:34 +03:00
Maneesh Gupta 301a9292ff Merge pull request #1082 from gargrahul/fix_hipmemcpy_symbol_nvcc
Fix symbol address issue on NVCC path

[ROCm/clr commit: c6cf2a9e26]
2019-05-07 16:17:01 +05:30
Maneesh Gupta 30c7ed3e28 Merge pull request #1081 from mangupta/swdev-181624
Implement hipExtGetLinkTypeAndHopCount for ROCm devices

[ROCm/clr commit: c6c5e4cee8]
2019-05-07 16:15:41 +05:30
Maneesh Gupta 532725c9c8 Merge pull request #1075 from mhbliao/hliao/master/test_fix2
[test] Add device variant of `std::declval`.

[ROCm/clr commit: 51e158c633]
2019-05-07 16:15:01 +05:30
Maneesh Gupta dce65678d7 Merge pull request #1074 from mhbliao/hliao/master/test_fix
[test] Use explicit cast for address space cast.

[ROCm/clr commit: 7f759750d1]
2019-05-07 16:09:15 +05:30
Maneesh Gupta fb08e0f25e Merge pull request #1073 from kpyzhov/multi-thread-device-test
hipMultiThreadDevice test: Reduced maximum number of created HIP stre…

[ROCm/clr commit: d71afeccc8]
2019-05-07 16:08:37 +05:30
Maneesh Gupta e514de5b33 Merge pull request #1072 from kpyzhov/master
Refined hipSetDevice test.

[ROCm/clr commit: 8f352427f4]
2019-05-07 16:07:36 +05:30
Maneesh Gupta 4a4745e466 Merge pull request #1069 from mhbliao/hliao/master/test_cleanup
[test] Remove unused common routines.

[ROCm/clr commit: 0fffbbe67a]
2019-05-07 16:02:57 +05:30
Maneesh Gupta 46d0385435 Merge pull request #1068 from mhbliao/hliao/master/dev_vec_func
[devfunc] Add necessary `__device__` and `__host__` attributes.

[ROCm/clr commit: 11972049c6]
2019-05-07 16:01:48 +05:30
Yaxun (Sam) Liu 01ef00b568 Add documentation for supported clang options (#1065)
* Add documentation for supported clang options

* Fix typo


[ROCm/clr commit: 0b43b24d3f]
2019-05-07 15:59:40 +05:30
wkwchau 7eaaf6f1ae Return hipErrorInsufficientDriver status when CPU device not found (#1064)
* Return hipErrorInsufficientDriver status when CPU device not found - no exception thrown

* Return hipErrorInsufficientDriver status when CPU device not found


[ROCm/clr commit: ebf986dcee]
2019-05-07 15:58:25 +05:30
Maneesh Gupta c6e14467f7 Merge pull request #1061 from mhbliao/hliao/master/hipcc
[hip] Replace `--rpath` with `--rpath-link`

[ROCm/clr commit: 46ac83a429]
2019-05-07 15:57:57 +05:30
Maneesh Gupta 0364d5c710 Merge pull request #1054 from ssahasra/dry
minor cleanup: eliminate repetition

[ROCm/clr commit: 1a1feb600f]
2019-05-07 15:57:46 +05:30
Rahul Garg bf3bafb9f5 Fix symbol address issue on NVCC path
[ROCm/clr commit: 6cbc70d238]
2019-05-07 03:59:43 +05:30
Maneesh Gupta f657eba4a5 Implement hipExtGetLinkTypeAndHopCount for ROCm devices
Change-Id: Ie5bb4f640ac6d189c7fceeab22627a7494fd10bd


[ROCm/clr commit: 2f43f110d9]
2019-05-06 15:54:31 +05:30
Michael LIAO f644d4daaa [test] Add device variant of std::declval.
- Current clang disallows any invocation of wrong-side functions, even
  in contexts that only inspect types. Work around that by adding a
  variant of `std::declval` with the `__device__` attribute.
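
A minimal sketch of what such a variant could look like, for illustration only. The names `device_declval` and `Accumulator` are hypothetical and do not come from the commit, and the fallback that defines `__device__` as empty is an assumption made so the snippet also compiles with a plain host C++ compiler.

```cpp
#include <type_traits>

// Assumption: outside a HIP/CUDA compiler the attribute is defined as empty,
// so the sketch still compiles as ordinary C++.
#if !defined(__HIPCC__) && !defined(__CUDACC__)
#define __device__
#endif

// Hypothetical device-side counterpart of std::declval: declared only,
// never defined, and usable only in unevaluated contexts such as decltype.
template <typename T>
__device__ typename std::add_rvalue_reference<T>::type device_declval() noexcept;

// Hypothetical type with a device-side member function (declared only).
struct Accumulator {
    __device__ float sum() const;
};

// Inspect the return type of a device function without calling it.
using SumResult = decltype(device_declval<const Accumulator&>().sum());
static_assert(std::is_same<SumResult, float>::value,
              "sum() is expected to return float");
```

Because the helper carries the same attribute as the function being inspected, the type query no longer mixes host-side and device-side declarations.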


[ROCm/clr commit: 32f69c8bc4]
2019-05-03 15:58:31 -04:00
Michael LIAO fa74e75fc1 [test] Use explicit cast for address space cast.
[ROCm/clr commit: a27877794f]
2019-05-03 14:56:00 -04:00
Maneesh Gupta 13b13c3493 Merge pull request #1062 from mhbliao/hliao/master/icmp
[hip] Re-implement ballot using AMDGCN builtins

[ROCm/clr commit: 2eafa5dcf9]
2019-05-03 17:48:19 +05:30
Maneesh Gupta c73c864fdf Merge pull request #1058 from mhbliao/hliao/master/devfunc
[Device Function] Fix implementation

[ROCm/clr commit: ad070d4da5]
2019-05-03 17:47:51 +05:30
Konstantin Pyzhov 1be1dd207a hipMultiThreadDevice test: Reduced maximum number of created HIP streams on Windows.
[ROCm/clr commit: e04e408a37]
2019-05-03 05:43:30 -04:00
Konstantin Pyzhov b2e3880696 Refined hipSetDevice test.
[ROCm/clr commit: 29fd3812e5]
2019-05-03 05:39:45 -04:00
Michael LIAO 9823184397 [test] Remove unused common routines.
- Remove unused common routines that rely on variadic functions, which
  are not a supported feature.


[ROCm/clr commit: aad19f6171]
2019-05-02 12:34:07 -04:00
Michael LIAO 08fa23f774 [devfunc] Add necessary __device__ and __host__ attributes.
- Minor cleanup to keep function declarations consistent.


[ROCm/clr commit: a9f90713f3]
2019-05-01 22:26:35 -04:00
emankov c27f0b952e [HIPIFY][tests] Add cuSPARSE CSR-BCSR-SPMV-conversions example
[ROCm/clr commit: d5c3e5ea71]
2019-04-30 17:37:34 +03:00
Michael LIAO eb43303d0b [Device Function] Fix implementation of __bitinsert_u64
- It is a common mistake to assume that `1 << shamt` is promoted to
  64-bit when `shamt` is a 64-bit integer. That is not the case. Replace
  that left shift with a 64-bit one so that it cannot fall into
  undefined behavior.
- Fix the host-side implementation as well for device function testing.
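
The promotion rule described above can be checked at compile time. The following is a sketch under stated assumptions, not the actual `__bitinsert_u64` code: `low_mask64` and `bit_insert_u64` are hypothetical helpers, and the parameter order is chosen only for illustration. It assumes `int` is narrower than 64 bits, `width < 64`, and `offset + width <= 64`.

```cpp
#include <cstdint>
#include <type_traits>
#include <utility>

// The result type of a shift is the promoted type of the LEFT operand only.
// With an int literal on the left, the shift stays int even if the shift
// amount is 64-bit, so shifting by 32 or more is undefined behavior.
static_assert(std::is_same<decltype(1 << std::declval<std::uint64_t>()), int>::value,
              "1 << shamt has type int");
static_assert(std::is_same<decltype(std::uint64_t{1} << std::declval<std::uint64_t>()),
                           std::uint64_t>::value,
              "a 64-bit left operand gives a 64-bit shift");

// Hypothetical helper: mask with the low `width` bits set (width in [0, 63]).
constexpr std::uint64_t low_mask64(std::uint64_t width) {
    return (std::uint64_t{1} << width) - 1;
}

// Hypothetical bit insert: replace `width` bits of `dst`, starting at bit
// `offset`, with the low `width` bits of `src`.
constexpr std::uint64_t bit_insert_u64(std::uint64_t dst, std::uint64_t src,
                                       std::uint64_t offset, std::uint64_t width) {
    return (dst & ~(low_mask64(width) << offset)) |
           ((src << offset) & (low_mask64(width) << offset));
}
```

For example, `bit_insert_u64(0, 0xFF, 40, 8)` places eight set bits at positions 40 through 47, which an `int` shift could not represent.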


[ROCm/clr commit: 2380eb8ecc]
2019-04-30 08:59:13 -04:00
Michael LIAO e8de293fc5 [devfunc] Re-implement ballot using AMDGCN builtins
- As the signature of `amdgcn.icmp` changes for the next-gen chip, using
  clang builtins is a portable way to hide that detail.


[ROCm/clr commit: a7a4d80f54]
2019-04-29 17:21:25 -04:00
Michael LIAO fb427976d7 [hipcc] Replace --rpath with --rpath-link
- With `--rpath`, the app's RUNPATH is set to the temporary library path
  used during the build. `--rpath-link` should be used instead, which
  leaves RUNPATH untouched. Users should set LD_LIBRARY_PATH or the
  system ld.so.conf appropriately.


[ROCm/clr commit: 92f90c3651]
2019-04-29 16:01:16 -04:00
Evgeny Mankov 7efe95bc42 [HIPIFY][doc] Update Readme.md: latest cuDNN 7.5.1.10 is supported
+ tested with CUDA 9.0, 9.2, 10.0 and 10.1


[ROCm/clr commit: 116daa3a1f]
2019-04-29 15:41:08 +03:00
Aaron Enye Shi f8d108a815 Revert "Use COMgr to read Kernel Args Metadata (#1006)"
This reverts commit 10048a5631.


[ROCm/clr commit: 235c6877c8]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi 2105ed24fc Revert "Add COMGR relative path for build machines"
This reverts commit 9cd137f2e4.


[ROCm/clr commit: acfa46bbbc]
2019-04-26 16:04:56 -04:00
Aaron Enye Shi 9fbc8c0b58 Revert "Add dependency on amd_comgr in hip-config-*.cmake.in"
This reverts commit a1aa1f6f10.


[ROCm/clr commit: 2378c7a20b]
2019-04-26 16:04:56 -04:00
Maneesh Gupta a1aa1f6f10 Add dependency on amd_comgr in hip-config-*.cmake.in
Change-Id: Iac1d851a8cfb99224e9c5926780273d9b9b08426


[ROCm/clr commit: b8fe5ba572]
2019-04-25 15:26:33 -04:00
Sameer Sahasrabuddhe 4f69390332 minor cleanup: eliminate repetition
[ROCm/clr commit: c74a97f756]
2019-04-25 20:41:16 +05:30
Evgeny Mankov 729de93385 [HIPIFY][perl][fix][258] Memory fence device functions are supported now
[ROCm/clr commit: f0c2fdc6d7]
2019-04-25 13:27:30 +03:00
Evgeny Mankov 3c3255fbe5 [HIPIFY][DNN] cudnnSetFilter4dDescriptor support
[ROCm/clr commit: 72a809caf6]
2019-04-25 12:18:51 +03:00
Evgeny Mankov 6c114ca626 [HIPIFY][fix][#204] Suppress warning message: #pragma once in main file
[ROCm/clr commit: df9418c3cd]
2019-04-24 20:35:52 +03:00
Evgeny Mankov ac39a25328 [HIPIFY][doc] Update README.md
+ A few words about clang patches to work with CUDA 9.2 - 10.0 on Windows;
+ Fix cuDNN versions with correct values.


[ROCm/clr commit: 1049031d98]
2019-04-24 17:40:35 +03:00
Maneesh Gupta dad6abcd7a Merge pull request #1043 from mhbliao/hliao/master/fp16
[hip] Fix including of hip_fp16.h

[ROCm/clr commit: 7f81c72f1c]
2019-04-24 16:50:46 +05:30
Maneesh Gupta f8f49d57dc Merge pull request #1042 from mhbliao/hliao/master/ldg
[hip] Fix use of `__HIP_CLANG_ONLY__` in `hip_ldg.h`.

[ROCm/clr commit: 63ab2ea945]
2019-04-24 16:50:37 +05:30
Maneesh Gupta b41d81d74e Merge pull request #1040 from eshcherb/roctracer-hip-frontend-190422
hip_prof_api.h include under __cplusplus

[ROCm/clr commit: 54cdeabe6e]
2019-04-24 16:50:27 +05:30
Maneesh Gupta a709855e9d Merge pull request #1039 from gargrahul/fix_ptrgetattr_nvcc
Fix hipPointerGetAttributes for NVCC

[ROCm/clr commit: 7edb43bc83]
2019-04-24 16:50:18 +05:30
Rahul Garg d69edbbb7f Add hipMallocManaged default functional support (#1036)
* Add hipMallocManaged default functional support

* Fix build error

* Add dtest


[ROCm/clr commit: 94769fc8dd]
2019-04-24 16:50:03 +05:30
Maneesh Gupta 9835229370 Merge pull request #1034 from kpyzhov/master
Minor fixes for 64-bit device functions.

[ROCm/clr commit: 61861faddc]
2019-04-24 16:49:36 +05:30
Maneesh Gupta a0fdc93902 Merge pull request #1031 from yxsamliu/fix-init
Fix missing arg in HIP_INIT_API

[ROCm/clr commit: 3ac21336eb]
2019-04-24 16:49:23 +05:30
Maneesh Gupta c2011c2a2d Merge pull request #1028 from gargrahul/fix_d2d_async_test
[dtest] Fix D2DAsync test

[ROCm/clr commit: 3ba7afcfc1]
2019-04-24 16:49:13 +05:30
Aaron Enye Shi 9cd137f2e4 Add COMGR relative path for build machines
[ROCm/clr commit: 6b3095f7cb]
2019-04-23 17:16:26 -04:00
Evgeny Mankov bc3583c9bd [HIPIFY][doc] Provide patches for clang's bug 38811
+ Update Readme.md accordingly


[ROCm/clr commit: 87fa81f7be]
2019-04-23 21:13:00 +03:00
Evgeny Mankov 176db946b2 [HIPIFY][hipify-perl] Formatting
[ROCm/clr commit: 65dd1d4c7d]
2019-04-23 17:55:47 +03:00
Michael LIAO 59e6127969 [hip] Fix including of hip_fp16.h
- Separate the definitions of `__HCC_OR_HIP_CLANG__`, `__HCC_ONLY__`, and
  `__HIP_CLANG_ONLY__` into hip_common.h so that they can be included in
  hip_fp16.h, which an app may include on its own.


[ROCm/clr commit: d086dbd0e5]
2019-04-23 09:16:00 -04:00