Michael LIAO
f282ea815a
[Device Function] Support immediate argument.
...
- `immarg`, immediate argument, is enabled on all AMDGPU intrinsics.
Revise device functions using these intrinsics with immediate
arguments.
[ROCm/hip commit: 950b6efe72 ]
2019-03-15 12:38:04 -04:00
Maneesh Gupta
7cc48a4836
Merge pull request #964 from eshcherb/roctracer-hip-frontend-190314
...
tracing callback layer update
[ROCm/hip commit: 38de9baf28 ]
2019-03-15 16:24:46 +05:30
Evgeny
36b5313d65
tracing callback layer update
...
[ROCm/hip commit: 31475c5ac8 ]
2019-03-14 22:43:52 -05:00
Maneesh Gupta
f0fd33a301
Merge pull request #963 from gargrahul/add_module_get_global_test
...
Test hipModuleGetGlobal
[ROCm/hip commit: e2977fffa3 ]
2019-03-15 06:17:50 +05:30
Maneesh Gupta
1d61f5ff58
Merge pull request #958 from aaronenyeshi/cxxabi-mismatch-workaround
...
CXX11 ABI Mismatch Workaround
[ROCm/hip commit: 419127172e ]
2019-03-15 06:15:46 +05:30
Rahul Garg
e02104a16a
Test hipModuleGetGlobal
...
[ROCm/hip commit: 34e675d0f6 ]
2019-03-15 04:08:03 +05:30
Siu Chi Chan
151b288155
remove visibility hidden attribute
...
[ROCm/hip commit: 10d3084e20 ]
2019-03-13 11:58:32 -04:00
Maneesh Gupta
2d20f354b0
Merge pull request #957 from eshcherb/roctracer-hip-frontend-190311
...
adding memset32d
[ROCm/hip commit: 72fdeb2d2f ]
2019-03-12 10:58:56 +05:30
Evgeny
75d89bf129
adding memset32d
...
[ROCm/hip commit: 4ffe413cfd ]
2019-03-11 21:28:27 -05:00
Siu Chi Chan
ce064f942a
minor cleanup
...
[ROCm/hip commit: cb9ea5cefc ]
2019-03-11 19:51:57 +00:00
Siu Chi Chan
ac10afb0f7
remove old style triple name
...
[ROCm/hip commit: d37f9e6b2d ]
2019-03-11 19:51:51 +00:00
Siu Chi Chan
fc9af438a4
move triple_to_hsa_isa into the header
...
[ROCm/hip commit: 824ee1aa72 ]
2019-03-11 19:51:44 +00:00
Siu Chi Chan
15061ddfcc
Fix memory leak introduced by previous change to Agent_global.
...
Make Agent_global manage the lifetime of the name string
[ROCm/hip commit: bf1d48bf78 ]
2019-03-11 19:51:32 +00:00
Aaron Enye Shi
1e07be3ab3
Fix Agent_global variables failing hipTestDeviceSymbol
...
Issue: Header uses std::vector<Agent_global> agent_globals which is created by hip_module.cpp
- Move iterator fails to copy Agent_global from library source into header version
- Due to different versions of std::string name in struct Agent_global
Fix: Change Agent_global to use char* name instead of std::string name
[ROCm/hip commit: 00d24d254d ]
2019-03-11 19:51:25 +00:00
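The two Agent_global commits above (switch the name to char*, then make the struct own that string) can be illustrated with a minimal sketch. This is not the actual HIP definition: `AgentGlobalSketch`, its `address` and `byte_cnt` fields, and the `copy_name` helper are hypothetical names chosen for illustration. The idea is that a plain C string has the same layout regardless of which std::string ABI each side was compiled with, while the copy/move members keep the buffer from leaking or being freed twice.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <utility>
#include <vector>

// Hypothetical stand-in for Agent_global; the real struct's fields are not shown in the log.
struct AgentGlobalSketch {
    char* name = nullptr;       // plain C string: layout does not depend on the std::string ABI
    void* address = nullptr;    // assumed field, for illustration only
    std::size_t byte_cnt = 0;   // assumed field, for illustration only

    AgentGlobalSketch() = default;
    AgentGlobalSketch(const char* n, void* addr, std::size_t bytes)
        : name(copy_name(n)), address(addr), byte_cnt(bytes) {}

    // Deep copy so each object owns its own buffer.
    AgentGlobalSketch(const AgentGlobalSketch& other)
        : name(copy_name(other.name)), address(other.address), byte_cnt(other.byte_cnt) {}

    // Move steals the buffer and leaves the source empty.
    AgentGlobalSketch(AgentGlobalSketch&& other) noexcept
        : name(other.name), address(other.address), byte_cnt(other.byte_cnt) {
        other.name = nullptr;
    }

    // Copy-and-swap covers both copy and move assignment.
    AgentGlobalSketch& operator=(AgentGlobalSketch other) noexcept {
        std::swap(name, other.name);
        std::swap(address, other.address);
        std::swap(byte_cnt, other.byte_cnt);
        return *this;
    }

    // The struct manages the lifetime of the name string.
    ~AgentGlobalSketch() { delete[] name; }

private:
    static char* copy_name(const char* src) {
        if (src == nullptr) return nullptr;
        const std::size_t len = std::strlen(src) + 1;
        char* dst = new char[len];
        std::memcpy(dst, src, len);
        return dst;
    }
};
```

With ownership handled inside the struct, a `std::vector<AgentGlobalSketch>` can copy or move its elements without the caller tracking who frees `name`.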
Aaron Enye Shi
4b87bd25e8
Fix hash_for undefined reference in hipTestConstant test
...
Issue: undefined symbols caused by a mismatch between user C++ environments
- The user binary expects the return value std::string& as modified by its own C++ environment
- This fails to match libhip_hcc.so, whose std::string& return value does not match the modified C++ environment
Fix: Change the return value to char* and create the new std::string key in the header from that char*
[ROCm/hip commit: 23e9968752 ]
2019-03-11 19:51:18 +00:00
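The fix described in the commit above, returning a C string from the library and building the std::string key on the header side, might look roughly like the sketch below. The names `key_for_sketch` and `lookup_sketch` and the table type are hypothetical, and `const char*` is used here as an assumption where the log says `char*`; the actual hash_for signature is not shown in the log.

```cpp
#include <cassert>
#include <cstring>
#include <string>
#include <unordered_map>

// Hypothetical library-side function: only a C string crosses the library boundary,
// so the exported symbol does not depend on the caller's std::string ABI.
inline const char* key_for_sketch(unsigned id) {
    static const char* const names[] = {"kernel_a", "kernel_b"};
    return names[id % 2u];
}

// Hypothetical header-side helper: compiled in the user's environment, so the
// std::string key is constructed with the user's own std::string implementation.
inline int& lookup_sketch(std::unordered_map<std::string, int>& table, unsigned id) {
    std::string key(key_for_sketch(id));
    return table[key];
}
```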
Maneesh Gupta
645c29159c
Merge pull request #951 from tycho/nvcc-hipDeviceSetCacheConfig
...
nvcc_detail/hip_runtime_api.h: add missing hipDeviceSetCacheConfig API
[ROCm/hip commit: 250bd58254 ]
2019-03-08 09:22:25 +05:30
Maneesh Gupta
b3f2a33a7f
Merge pull request #953 from yxsamliu/vdi
...
Fix HIP/VDI build failure due to dlopen change
[ROCm/hip commit: b5ca257cad ]
2019-03-08 06:02:47 +05:30
Yaxun Sam Liu
a431326408
Fix HIP/VDI build failure due to dlopen change
...
[ROCm/hip commit: 6e9e90addd ]
2019-03-07 14:45:45 -05:00
Evgeny Mankov
35a28363b1
Merge pull request #950 from emankov/master
...
[HIPIFY][tests] Update lit testing infrastructure
[ROCm/hip commit: 6d8171cbb7 ]
2019-03-07 15:19:15 +03:00
Steven Noonan
17e2bfe3f8
nvcc_detail/hip_runtime_api.h: add missing hipDeviceSetCacheConfig API
...
Signed-off-by: Steven Noonan <steven@uplinklabs.net>
[ROCm/hip commit: ee750d5ea4 ]
2019-03-06 11:21:36 -08:00
Evgeny Mankov
1397bf79d8
[HIPIFY][tests] Update lit testing infrastructure
...
+ Set -D__LP64__ in case of 64-bit hipify-clang binary
[partial workaround for clang's bug https://bugs.llvm.org/show_bug.cgi?id=38811]
C:/GIT/LLVM/trunk/llvm-64-release-vs2017/dist/lib/clang/9.0.0\include\__clang_cuda_device_functions.h(1609,45): error GEF7559A7: no matching function for call to 'roundf'
__DEVICE__ long lroundf(float __a) { return roundf(__a); }
#if defined(__LP64__)
__DEVICE__ long lround(double __a) { return llround(__a); }
__DEVICE__ long lroundf(float __a) { return llroundf(__a); } // ok: llroundf should be used when 64-bit
#else
__DEVICE__ long lround(double __a) { return round(__a); }
__DEVICE__ long lroundf(float __a) { return roundf(__a); } // error
#endif
+ Print more system info while testing in the following form:
========================================
CUDA 9.0 - will be used for testing
LLVM 9.0.0svn - will be used for testing
AMD64 - Platform architecture
Windows 10 - Platform OS
64 - hipify-clang binary bitness
32 - python 3.7.2 binary bitness
========================================
[ROCm/hip commit: e899ee0e06 ]
2019-03-06 19:26:05 +03:00
eshcherb
7f40a2cd9a
roctracer-proto header find_path (#884)
...
[ROCm/hip commit: 88b4621dbd ]
2019-03-06 17:36:34 +05:30
Maneesh Gupta
3f5e937afc
Merge pull request #949 from gargrahul/single_stream_concurrent_kernels
...
Add extension for kernel concurrency on same stream
[ROCm/hip commit: 352b17346c ]
2019-03-06 17:34:54 +05:30
Maneesh Gupta
df7d8c46ed
Merge pull request #932 from ROCm-Developer-Tools/feature_maybe_dlopen_test
...
Add one test case for complex dynamic loading behavior
[ROCm/hip commit: 0c4a40efcc ]
2019-03-06 17:32:23 +05:30
Alex Voicu
0c16497abd
dlopen() fixes (#929)
...
* Initial attempt to switch over to internally linked state.
* Add missing CMake update.
* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.
* Ensure global retrieval uses internally linked state.
* Hide HC in the implementation. Minimise ADL woes.
* Strange software exists, and must be catered to.
* Use a less spammy mechanism for ensuring internal linkage / non-export.
* Remove leftover internal detail.
[ROCm/hip commit: ea0fcf3e61 ]
2019-03-06 17:31:44 +05:30
Rahul Garg
5d6fd17fbe
Add extension for kernel concurrency on same stream
...
[ROCm/hip commit: 59081c69fc ]
2019-03-06 12:55:39 +05:30
Maneesh Gupta
2264e69da3
Merge pull request #936 from mangupta/swdev-174923
...
[hipconfig] Update HIP_PLATFORM detection logic
[ROCm/hip commit: 2d67bc5e9c ]
2019-03-06 06:08:11 +05:30
Evgeny Mankov
537e13c2ca
Merge pull request #948 from emankov/master
...
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
[ROCm/hip commit: 23de66bc7e ]
2019-03-05 18:18:39 +03:00
Evgeny Mankov
effdffc006
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
...
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))
based on:
[#933] https://github.com/ROCm-Developer-Tools/HIP/pull/933
[ROCm/hip commit: a87d798d90 ]
2019-03-05 18:13:18 +03:00
Maneesh Gupta
31457951ba
Merge pull request #933 from ROCm-Developer-Tools/fix_hipmemset
...
Add HIP memset APIs to cope with non-zero initial values of integer types
[ROCm/hip commit: 2ed31e9f6c ]
2019-03-05 14:31:38 +05:30
Maneesh Gupta
54c72c1079
Update hipMemset.cpp
...
Address build issues on nvcc path.
[ROCm/hip commit: 51f29b9cee ]
2019-03-05 12:11:11 +05:30
Maneesh Gupta
19b6b2b570
Update hip_runtime_api.h
...
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.
[ROCm/hip commit: 8db717c769 ]
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung
da589e38ed
Address code review comments to use hipDeviceptr_t
...
[ROCm/hip commit: 5cbd28f29b ]
2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung
ea689f63ca
Add implementation for NVCC path
...
[ROCm/hip commit: 0337b686ef ]
2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung
085c6c15b2
Add direct test for hipMemsetD32 and hipMemsetD32Async
...
[ROCm/hip commit: bfde8a7fab ]
2019-03-04 17:20:32 +00:00
Wen-Heng (Jack) Chung
0b7f38d100
Add hipMemsetD32 and hipMemsetD32Async
...
Add 2 extra memset functions which fill memory with integer-typed data
Also change the parameters of ihipMemset to better explain the semantics
[ROCm/hip commit: 7ebbbd3525 ]
2019-03-04 17:00:33 +00:00
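Why a separate 32-bit memset is needed for non-zero integer values can be shown with a host-only sketch. It does not call the HIP runtime: `fill_bytes` and `fill_d32` are hypothetical host analogues of a byte-wise memset and a D32-style memset. A byte-wise fill keeps only the low 8 bits of the value and repeats that byte, so it cannot produce an arbitrary 32-bit pattern. In CUDA's cuMemsetD32 the length argument is a count of 32-bit elements rather than bytes, and the sketch follows that convention.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Byte-wise fill, as a memset-style API does: only the low 8 bits of value are used.
inline void fill_bytes(void* dst, int value, std::size_t size_bytes) {
    std::memset(dst, value, size_bytes);
}

// 32-bit fill (host analogue of a D32-style memset): count is in elements, not bytes.
inline void fill_d32(std::uint32_t* dst, std::uint32_t value, std::size_t count) {
    for (std::size_t i = 0; i < count; ++i) {
        dst[i] = value;
    }
}
```

Filling four words with `0x12345678` through `fill_bytes` leaves every word as `0x78787878`, while `fill_d32` stores the full value in each element.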
Maneesh Gupta
a60c0222f0
Merge pull request #939 from gargrahul/update_hipmemset_test
...
[dtest] Update hipMemset test
[ROCm/hip commit: de4a9b8446 ]
2019-03-03 20:29:55 +05:30
Rahul Garg
7a9ebee08a
Fix review comments
...
[ROCm/hip commit: 131c0ea326 ]
2019-03-02 23:38:37 +05:30
Maneesh Gupta
91f6e6b1e8
Merge pull request #945 from wkwchau/hipMemset3D_fix
...
Fix hipMemset3D test
[ROCm/hip commit: d28c13ed69 ]
2019-03-01 21:18:12 +05:30
Maneesh Gupta
87b3859ec2
Merge pull request #942 from yxsamliu/v3
...
revert hipcc changes about code object v3
[ROCm/hip commit: b051d44d24 ]
2019-03-01 21:17:10 +05:30
Wilkin Chau
4a0d68ba3f
Fix hipMemset3D test
...
Calculate the allocated size based on the width, height and depth.
[ROCm/hip commit: 8d92d1ebd7 ]
2019-02-28 22:42:46 +00:00
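The size calculation mentioned in the commit above can be sketched as follows. The helper names are hypothetical and the test's actual code is not shown in the log; the second helper assumes rows may be padded to a pitch, as pitched 3D allocations commonly are.

```cpp
#include <cassert>
#include <cstddef>

// Bytes for a tightly packed volume of width x height x depth elements.
inline std::size_t packed_volume_bytes(std::size_t width, std::size_t height,
                                       std::size_t depth, std::size_t elem_size) {
    return width * height * depth * elem_size;
}

// Bytes when each row is padded to pitch_bytes (assumed pitched layout).
inline std::size_t pitched_volume_bytes(std::size_t pitch_bytes, std::size_t height,
                                        std::size_t depth) {
    return pitch_bytes * height * depth;
}
```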
Rahul Garg
4093c03dd7
Fix hipMemset test for HIP/NVCC
...
[ROCm/hip commit: 77c6d5451f ]
2019-03-01 03:46:57 +05:30
Yaxun Sam Liu
d9f59b18f8
Revert "hipcc should consume -mcode-object-v3 flag"
...
This reverts commit 1a8b50e63c.
[ROCm/hip commit: e8556388f9 ]
2019-02-28 11:21:47 -05:00
Yaxun Sam Liu
e748fd2ae0
Revert "Change code-object flag to only HIP-Clang"
...
This reverts commit 17a853c277.
[ROCm/hip commit: 1bd8d24278 ]
2019-02-28 11:20:04 -05:00
Yaxun Sam Liu
f46d2a3010
Revert "Consume the code obj args to prevent duplicates"
...
This reverts commit b19e4b5b63.
[ROCm/hip commit: 737972d31c ]
2019-02-28 11:19:35 -05:00
Maneesh Gupta
f3def1c107
Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
...
Fix hipBusBW sample for P2P bidirectional test
[ROCm/hip commit: b93666bdde ]
2019-02-28 07:14:38 +05:30
Maneesh Gupta
f0bb5690f1
Merge pull request #937 from yxsamliu/nan2
...
Fix nan for windows
[ROCm/hip commit: 07b0e9574c ]
2019-02-28 07:14:27 +05:30
Maneesh Gupta
2e700104de
Merge pull request #935 from gargrahul/fix_hipbusbw_beatsoverflow
...
Fix hipBusBW overflow with setting beats/iterations
[ROCm/hip commit: 2da14e97f2 ]
2019-02-28 07:14:16 +05:30
Maneesh Gupta
c3b91c0bcd
Merge pull request #934 from gargrahul/fix_forceinline_non_hcc
...
Fix forceinline for non HCC compilation
[ROCm/hip commit: 2619b11efc ]
2019-02-28 07:14:05 +05:30
Rahul Garg
fa0c9d8a73
Update hipMemset test
...
[ROCm/hip commit: 026dc9457e ]
2019-02-28 06:54:49 +05:30