Aaron Enye Shi
decf9beec9
__forceinline__ needs inline and always_inline
...
Related to SWDEV-163084
2018-08-29 18:33:35 +00:00
Maneesh Gupta
190ead8337
Merge pull request #646 from mangupta/new_ci_flow
...
[ci] New CI flow
2018-08-28 13:14:57 +05:30
Maneesh Gupta
8c95267bf5
[ci] New CI flow
...
- Removed hcc integration logic for now
- No longer publish docker images to artifactory or docker hub
- Switch to new docker images and new test node
Change-Id: Idb06bf0a52658d1869367cb38bb03ef4f1649103
2018-08-28 12:36:54 +05:30
Maneesh Gupta
c9af0d6c52
Merge pull request #629 from mangupta/fix_swdev-160956
...
[hipcc] Remove libhsakmt from linker flags
2018-08-16 06:10:49 +05:30
Maneesh Gupta
2f5ee304f8
[hipcc] Remove libhsakmt from linker flags
...
Change-Id: I6dc769d09d8c7546fbc594c1eb65f9edf0f0ff58
2018-08-14 15:31:04 +05:30
Maneesh Gupta
e10fa70173
Merge pull request #621 from ROCm-Developer-Tools/disable_startup_loader
...
Disable startup loader by default and guard with env var
2018-08-10 10:18:25 +05:30
Maneesh Gupta
546378292d
Merge pull request #623 from ROCm-Developer-Tools/tf18
...
Fix declaration conflict when hip/math_functions.h is included first
2018-08-10 09:36:27 +05:30
Maneesh Gupta
695746421f
Merge pull request #616 from ROCm-Developer-Tools/hip-clang-static-lib
...
Let hipcc handle static library for hip-clang
2018-08-10 09:35:22 +05:30
Maneesh Gupta
0123fb816d
Merge pull request #615 from aaronenyeshi/enable-hip-compiler
...
Enable HIP_COMPILER to choose HIP-Clang lib links
2018-08-10 09:34:51 +05:30
Maneesh Gupta
a5f9ae90d2
Merge pull request #613 from ROCm-Developer-Tools/hcc-compat
...
Add HCC compatibility mode for hip-clang
2018-08-10 09:32:31 +05:30
Maneesh Gupta
497610f96c
Merge pull request #617 from gargrahul/genco_remove_adipose_extn
...
Remove adipose extension from genco output
2018-08-10 09:31:44 +05:30
Maneesh Gupta
79e9a2dfa4
Merge pull request #619 from ROCm-Developer-Tools/revert-pr457
...
Revert pr457
2018-08-10 09:25:57 +05:30
Evgeny Mankov
d1540de3b8
Merge pull request #624 from emankov/cuComplex
...
[HIPIFY][Complex] Add cuComplex support
2018-08-09 20:01:51 +03:00
sunway513
7c467220a8
resolve a segfault bug when env var not set; remove startup_kernel_loader class
2018-08-09 16:40:26 +00:00
Rahul Garg
b6c5d7d71b
Clean up module api samples
2018-08-08 22:28:13 +05:30
Evgeny Mankov
ce57b99f9d
[HIPIFY][Complex] Add cuComplex support
...
+ Add API_COMPLEX support (data types and functions)
+ Add cuComplex_API_supported_by_HIP.md
+ Add cuComplex_Julia.cu test
+ Update README.md
2018-08-08 18:34:57 +03:00
Yaxun Sam Liu
87de95975a
Fix __HIP_ARCH_* not defined after including math_functions.h
...
hcc_detail/math_functions.h used to include hcc_detail/hip_runtime.h.
Removing it has caused regression in TensorFlow 1.8.
Put it back for backward compatibiliity.
2018-08-08 08:55:28 -04:00
Yaxun Sam Liu
058eb7535b
Fix __HIP_DEVICE_COMPILE__ not defined when hip/math_functions.h is included
...
This fixes build failure in TensorFlow 1.8 for HCC
2018-08-07 17:14:39 -04:00
Yaxun Sam Liu
69bbf45b44
Fix declaration conflict when hip/math_functions.h is included first
...
This fixes build failure in TensorFlow 1.8 for HCC
2018-08-07 15:44:59 -04:00
sunway513
7d776ae665
Add more check to ensure the startup loader only be enabled with the env var set to 1
2018-08-04 01:52:27 +00:00
sunway513
9af862d364
Add startup loader under HIP_STARTUP_LOADER env var, disable by default
2018-08-04 01:48:06 +00:00
Wen-Heng (Jack) Chung
1fe7f1be91
Revert "HIP program state re-initialization logic"
...
This reverts commit 04640992dc .
2018-08-03 17:03:04 -05:00
Wen-Heng (Jack) Chung
f6178c2159
Revert "Improve performance of re-initialization logic"
...
This reverts commit b883ea759d .
2018-08-03 17:02:58 -05:00
Wen-Heng (Jack) Chung
329f3b7a5e
Revert "Keep the map which tracks GPU kernel symbols to grow monotonically"
...
This reverts commit 8f521edff1 .
2018-08-03 17:02:50 -05:00
Maneesh Gupta
ac18a3b701
Merge pull request #588 from Jorghi12/patch-10
...
Fixing a HIP bug that causes DEBUG symbols to be dropped.
2018-08-03 08:48:56 +05:30
Maneesh Gupta
b485957858
Merge pull request #603 from b-sumner/master
...
Drop use of NVCC_COMPAT since it is always set
2018-08-03 08:45:18 +05:30
Maneesh Gupta
ad44a77ed4
Merge pull request #601 from sarunyap/fix-shareWithAll-mem-dev-mapping
...
Remove device mapping from shareWithAll memory
2018-08-03 08:45:04 +05:30
Rahul Garg
e9d7584b02
Remove adipose extension from genco output
2018-08-02 15:16:36 +05:30
Maneesh Gupta
0354367a07
Merge pull request #610 from aaronenyeshi/enable-hcc-module-launch
...
Enable hipHccModuleLaunchKernel on HIP-Clang
2018-08-02 08:08:46 +05:30
Maneesh Gupta
4544e681ac
Merge pull request #600 from ROCm-Developer-Tools/enable-texture-functions
...
Enable HIP texture functions with HIP vector types
2018-08-02 08:01:00 +05:30
Maneesh Gupta
1692af19cc
Merge pull request #593 from mangupta/patch2-removerdynamic
...
[FindHIP] Remove rdynamic from CMAKE_SHARED_LIBRARY_LINK_HIP_FLAGS
2018-08-02 07:58:34 +05:30
Maneesh Gupta
2960739c43
Merge pull request #514 from luckynikki/GETandALGN
...
[GETandALGN]: Get channel and alignment details
2018-08-02 07:57:50 +05:30
Yaxun Sam Liu
2ba45a23b8
Let hipcc handle static library for hip-clang
2018-08-01 22:03:01 -04:00
Aaron Enye Shi
7560956233
Enable HIP_COMPILER to choose HIP-Clang lib links
2018-08-01 20:37:46 +00:00
Yaxun Sam Liu
9c28758fa0
Add hc_get_workitem_absolute_id for hip-clang HCC compatibility mode
2018-08-01 15:14:38 -04:00
Yaxun Sam Liu
b91691966f
Add HCC compatibility mode
2018-08-01 15:14:38 -04:00
Evgeny Mankov
a244ddd659
Merge pull request #612 from emankov/docs
...
[HIPIFY][docs] Formatting
2018-08-01 17:30:35 +03:00
Evgeny Mankov
55ee734b22
[HIPIFY][docs] Formatting
2018-08-01 17:29:00 +03:00
Evgeny Mankov
58d9df900c
Merge pull request #609 from emankov/master
...
[HIPIFY][FFT] Add cuFFT support
2018-08-01 17:11:19 +03:00
lthakur
b7a4b2c300
Added offset value verification.
...
Added offset value verification.
2018-08-01 10:41:30 +05:30
Aaron Enye Shi
0ad36dfa58
Enable hipHccModuleLaunchKernel on HIP-Clang
2018-07-31 20:38:20 +00:00
Evgeny Mankov
c57bfd50e3
[HIPIFY][FFT] Add cuFFT support
...
+ Add API_FFT support (data types and functions)
+ Add CUFFT_API_supported_by_HIP.md
+ Add simple_cufft.cu test
+ Update README.md
2018-07-31 22:25:48 +03:00
Aaron Enye Shi
e42af991d1
Protect macro TYPEDEF_VECTOR_VALUE_TYPE and prefix __hip_ for vector_value_types
2018-07-30 20:41:24 +00:00
Aaron Enye Shi
ff1e49d191
Ensure correct vector value type used in texture
...
Implement the vector value types in terms of SCALAR_TYPE##SIZE_vector_value_type for float, int, uint and sizes 2, 3, 4, 8, and 16.
2018-07-30 19:43:12 +00:00
Sarunya Pumma
84aadb9274
Remove device mapping from shareWithAll memory
...
When shareWithAll memory (e.g., host memory) is allocated, set appId
in hc::AmPointerInfo to -1 to indicate that this memory is not mapped
to any device. Peer checking in ihipStream_t::canSeeMemory is not
necessary if memory is shared with all devices. Thus, it is skipped.
Note that earlier host memory is always mapped to device 0 and HIP
always performs peer checking for all kinds of hipMemcpy. Since the
peer checking process requires context locking, hipMemcpy from/to host
memory always grabs device 0's context lock. Therefore, if there is
another thread holding the context lock of device 0 (e.g.,
hipDeviceSynchronize on device 0), hipMemcpy will have to wait for the
lock until it can actually perform memcpy. This can significantly
deteriorate execution performance.
Signed-off-by: Sarunya Pumma <sarunya.pumma@amd.com >
2018-07-28 23:15:16 -07:00
Evgeny Mankov
f2e1b9d220
Merge pull request #607 from emankov/hipBLAS
...
[HIPIFY][BLAS] cuBLAS support revise
2018-07-28 21:11:03 +03:00
Evgeny Mankov
e00cfefd4f
[HIPIFY][BLAS] cuBLAS support revise
...
+ doc for CUBLAS API supported by HIP
+ 3 tests: 0 & 1 based indexing, sgemm matmul
2018-07-28 21:00:44 +03:00
Brian Sumner
8d80abb30f
Drop use of NVCC_COMPAT since it is always set
2018-07-27 09:29:29 -07:00
Maneesh Gupta
17f612def1
Merge pull request #596 from mangupta/patch5-fixhiplaunchparm
...
[tests] Fixed hipLaunchParm test on nvcc path
2018-07-26 06:17:50 +05:30
Maneesh Gupta
86188f04c8
Merge pull request #595 from mangupta/patch4-fixhipStdComplex
...
[tests] Fixed build & disabled run of hipStdComplex on nvcc path
2018-07-26 06:17:04 +05:30