Commit Graph

3392 Commits

Author SHA1 Message Date
Yaxun Sam Liu e9bc4034d8 Add declaration of symbol related API for VDI 2019-03-19 11:11:49 -04:00
Maneesh Gupta b1752fc9a6 Merge pull request #969 from nicholasmalaya/patch-1
Update hip_faq.md
2019-03-19 18:42:05 +05:30
Maneesh Gupta 4094824b9e Merge pull request #965 from mhbliao/hliao/master/immarg
[Device Function] Support immediate argument.
2019-03-19 18:41:31 +05:30
Maneesh Gupta 91ebfadabf Merge pull request #954 from mhbliao/master
[hip] Re-implement hipLaunchKernelGGL as macros.
2019-03-19 18:39:27 +05:30
Nicholas Malaya 0bddfc09e1 Update hip_faq.md
Make clearer what this list details. In particular, the list is intended to indicate which items in each CUDA release are supported and which are not.
2019-03-18 14:51:18 -05:00
Michael LIAO 950b6efe72 [Device Function] Support immediate argument.
- `immarg`, immediate argument, is enabled on all AMDGPU intrinsics.
  Revise device functions using these intrinsics with immediate
  arguments.
2019-03-15 12:38:04 -04:00
Maneesh Gupta 38de9baf28 Merge pull request #964 from eshcherb/roctracer-hip-frontend-190314
tracing callback layer update
2019-03-15 16:24:46 +05:30
Evgeny 31475c5ac8 tracing callback layer update 2019-03-14 22:43:52 -05:00
Maneesh Gupta e2977fffa3 Merge pull request #963 from gargrahul/add_module_get_global_test
Test hipModuleGetGlobal
2019-03-15 06:17:50 +05:30
Maneesh Gupta 419127172e Merge pull request #958 from aaronenyeshi/cxxabi-mismatch-workaround
CXX11 ABI Mismatch Workaround
2019-03-15 06:15:46 +05:30
Rahul Garg 34e675d0f6 Test hipModuleGetGlobal 2019-03-15 04:08:03 +05:30
Siu Chi Chan 10d3084e20 remove visibility hidden attribute 2019-03-13 11:58:32 -04:00
Maneesh Gupta 72fdeb2d2f Merge pull request #957 from eshcherb/roctracer-hip-frontend-190311
adding memset32d
2019-03-12 10:58:56 +05:30
Evgeny 4ffe413cfd adding memset32d 2019-03-11 21:28:27 -05:00
Siu Chi Chan cb9ea5cefc minor cleanup 2019-03-11 19:51:57 +00:00
Siu Chi Chan d37f9e6b2d remove old style triple name 2019-03-11 19:51:51 +00:00
Siu Chi Chan 824ee1aa72 move triple_to_hsa_isa into the header 2019-03-11 19:51:44 +00:00
Siu Chi Chan bf1d48bf78 Fix memory leak introduced by previous change to Agent_global.
Make Agent_global manage the lifetime of the name string
2019-03-11 19:51:32 +00:00
Aaron Enye Shi 00d24d254d Fix Agent_global variables failing hipTestDeviceSymbol
Issue: Header uses std::vector<Agent_global> agent_globals which is created by hip_module.cpp
  - Move iterator fails to copy Agent_global from library source into header version
  - Due to different versions of std::string name in struct Agent_global
Fix: Change Agent_global to use char* name instead of std::string name
2019-03-11 19:51:25 +00:00
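The two Agent_global commits above (switching `name` from `std::string` to `char*`, then fixing the resulting leak) can be illustrated with a minimal sketch. `AgentGlobalSketch`, its extra members, and `copy_name` are hypothetical names for illustration only; the real struct in the HIP sources may look different. The point is that a raw C string keeps the struct layout independent of which `std::string` ABI the user code was built with, while the struct itself still owns and frees the string.

```cpp
#include <cstddef>
#include <cstdlib>
#include <cstring>
#include <utility>

// Hypothetical stand-in for Agent_global; member names are assumptions.
struct AgentGlobalSketch {
    char* name = nullptr;      // plain C string: layout does not depend on the std::string ABI
    void* address = nullptr;
    unsigned int byte_cnt = 0;

    AgentGlobalSketch(const char* n, void* addr, unsigned int bytes)
        : name(copy_name(n)), address(addr), byte_cnt(bytes) {}

    // Deep copy, so two objects never free the same buffer.
    AgentGlobalSketch(const AgentGlobalSketch& other)
        : name(copy_name(other.name)), address(other.address), byte_cnt(other.byte_cnt) {}

    // Move steals the buffer and leaves the source empty.
    AgentGlobalSketch(AgentGlobalSketch&& other) noexcept
        : name(other.name), address(other.address), byte_cnt(other.byte_cnt) {
        other.name = nullptr;
    }

    // Copy-and-swap assignment covers both copy and move assignment.
    AgentGlobalSketch& operator=(AgentGlobalSketch other) noexcept {
        std::swap(name, other.name);
        std::swap(address, other.address);
        std::swap(byte_cnt, other.byte_cnt);
        return *this;
    }

    // The struct owns the string, so it releases it here (free(nullptr) is a no-op).
    ~AgentGlobalSketch() { std::free(name); }

private:
    static char* copy_name(const char* n) {
        if (n == nullptr) return nullptr;
        const std::size_t len = std::strlen(n) + 1;
        char* p = static_cast<char*>(std::malloc(len));
        if (p != nullptr) std::memcpy(p, n, len);
        return p;
    }
};
```

Stored in a `std::vector`, such objects can be copied or moved freely without leaking or double-freeing the name buffer.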
Aaron Enye Shi 23e9968752 Fix hash_for undefined reference in hipTestConstant test
Issue: mismatch undefined symbols in different user env
  - Binary expects modified return value std::string&
  - Fails to match libhip_hcc.so: return value is std::string& but doesn't match modified C++ env
Fix: Change return value to char*, create new key std::string in header from char*
2019-03-11 19:51:18 +00:00
Michael Liao 30185b4114 [hip] Re-implement hipLaunchKernelGGL as macros. 2019-03-08 09:10:51 -05:00
Maneesh Gupta 250bd58254 Merge pull request #951 from tycho/nvcc-hipDeviceSetCacheConfig
nvcc_detail/hip_runtime_api.h: add missing hipDeviceSetCacheConfig API
2019-03-08 09:22:25 +05:30
Maneesh Gupta b5ca257cad Merge pull request #953 from yxsamliu/vdi
Fix HIP/VDI build failure due to dlopen change
2019-03-08 06:02:47 +05:30
Yaxun Sam Liu 6e9e90addd Fix HIP/VDI build failure due to dlopen change 2019-03-07 14:45:45 -05:00
Evgeny Mankov 6d8171cbb7 Merge pull request #950 from emankov/master
[HIPIFY][tests] Update lit testing infrastructure
2019-03-07 15:19:15 +03:00
Steven Noonan ee750d5ea4 nvcc_detail/hip_runtime_api.h: add missing hipDeviceSetCacheConfig API
Signed-off-by: Steven Noonan <steven@uplinklabs.net>
2019-03-06 11:21:36 -08:00
Evgeny Mankov e899ee0e06 [HIPIFY][tests] Update lit testing infrastructure
+ Set -D__LP64__ in case of 64-bit hipify-clang binary
  [partial workaround for clang's bug https://bugs.llvm.org/show_bug.cgi?id=38811]

  C:/GIT/LLVM/trunk/llvm-64-release-vs2017/dist/lib/clang/9.0.0\include\__clang_cuda_device_functions.h(1609,45): error GEF7559A7: no matching function for call to 'roundf'
  __DEVICE__ long lroundf(float __a) { return roundf(__a); }

  #if defined(__LP64__)
  __DEVICE__ long lround(double __a) { return llround(__a); }
  __DEVICE__ long lroundf(float __a) { return llroundf(__a); } // ok: llroundf should be used when 64-bit
  #else
  __DEVICE__ long lround(double __a) { return round(__a); }
  __DEVICE__ long lroundf(float __a) { return roundf(__a); } // error
  #endif

+ Print more system info while testing in the following form:
  ========================================
  CUDA 9.0 - will be used for testing
  LLVM 9.0.0svn - will be used for testing
  AMD64 - Platform architecture
  Windows 10 - Platform OS
  64 - hipify-clang binary bitness
  32 - python 3.7.2 binary bitness
  ========================================
2019-03-06 19:26:05 +03:00
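The `__LP64__` workaround in the commit above relies on `long` being 64 bits wide on LP64 targets, so the `long long` rounding functions can be used without narrowing. The following host-side sketch mirrors that branching; `lroundf_sketch` is a hypothetical name, and this is an analogue for illustration, not the clang header code itself.

```cpp
#include <cmath>

// Hypothetical host-side analogue of the device wrapper quoted in the commit message.
inline long lroundf_sketch(float a) {
#if defined(__LP64__)
    // LP64: long and long long are both 64-bit, so the llround result fits in long.
    static_assert(sizeof(long) == sizeof(long long), "LP64 implies 64-bit long");
    return static_cast<long>(std::llround(a));
#else
    // Otherwise long may be 32-bit; round in floating point, then convert.
    return static_cast<long>(std::round(a));
#endif
}
```

Both branches round halfway cases away from zero, so the result is the same wherever the value fits in `long`.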
eshcherb 88b4621dbd roctracer-proto header find_path (#884) 2019-03-06 17:36:34 +05:30
Maneesh Gupta 352b17346c Merge pull request #949 from gargrahul/single_stream_concurrent_kernels
Add extension for kernel concurrency on same stream
2019-03-06 17:34:54 +05:30
Maneesh Gupta 0c4a40efcc Merge pull request #932 from ROCm-Developer-Tools/feature_maybe_dlopen_test
Add one test case for complex dynamic loading behavior
2019-03-06 17:32:23 +05:30
Alex Voicu ea0fcf3e61 dlopen() fixes (#929)
* Initial attempt to switch over to internally linked state.

* Add missing CMake update.

* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.

* Ensure global retrieval uses internally linked state.

* Hide HC in the implementation. Minimise ADL woes.

* Strange software exists, and must be catered to.

* Use a less spammy mechanism for ensuring internal linkage / non-export.

* Remove leftover internal detail.
2019-03-06 17:31:44 +05:30
Rahul Garg 59081c69fc Add extension for kernel concurrency on same stream 2019-03-06 12:55:39 +05:30
Maneesh Gupta 2d67bc5e9c Merge pull request #936 from mangupta/swdev-174923
[hipconfig] Update HIP_PLATFORM detection logic
2019-03-06 06:08:11 +05:30
Evgeny Mankov 23de66bc7e Merge pull request #948 from emankov/master
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
2019-03-05 18:18:39 +03:00
Evgeny Mankov a87d798d90 [HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))

based on:
[#933] https://github.com/ROCm-Developer-Tools/HIP/pull/933
2019-03-05 18:13:18 +03:00
Maneesh Gupta 2ed31e9f6c Merge pull request #933 from ROCm-Developer-Tools/fix_hipmemset
Add HIP memset APIs to cope with non-zero initial values of integer types
2019-03-05 14:31:38 +05:30
Maneesh Gupta 51f29b9cee Update hipMemset.cpp
Address build issues on nvcc path.
2019-03-05 12:11:11 +05:30
Maneesh Gupta 8db717c769 Update hip_runtime_api.h
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.
2019-03-05 12:10:01 +05:30
Wen-Heng (Jack) Chung 5cbd28f29b Address code review comments to use hipDeviceptr_t 2019-03-05 05:51:05 +00:00
Wen-Heng (Jack) Chung 0337b686ef Add implementation for NVCC path 2019-03-04 20:11:12 -08:00
Wen-Heng (Jack) Chung bfde8a7fab Add direct test for hipMemsetD32 and hipMemsetD32Async 2019-03-04 17:20:32 +00:00
Wen-Heng (Jack) Chung 7ebbbd3525 Add hipMemsetD32 and hipMemsetD32Async
Add 2 extra memset functions which fills memory with integer-typed data

Also change the parameters of ihipMemset to better explain the semantics
2019-03-04 17:00:33 +00:00
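Why a separate integer-typed memset is useful can be shown with a host-only sketch. A byte-wise fill repeats one byte across the buffer, so it cannot produce an arbitrary non-zero 32-bit value, while a word-wise fill can. `fill_bytes` and `fill_words` are hypothetical helpers that run on the CPU; they are not the HIP functions and make no claim about their signatures.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Byte-wise fill: every byte becomes the low 8 bits of `value`,
// which is how a plain memset behaves.
inline void fill_bytes(std::uint32_t* dst, int value, std::size_t count) {
    std::memset(dst, value, count * sizeof(std::uint32_t));
}

// Word-wise fill: every 32-bit element becomes `value`,
// the behavior an integer-typed memset is meant to provide.
inline void fill_words(std::uint32_t* dst, std::uint32_t value, std::size_t count) {
    for (std::size_t i = 0; i < count; ++i) {
        dst[i] = value;
    }
}
```

Filling with the value 1 makes the difference visible: the byte-wise version leaves each element as 0x01010101, whereas the word-wise version leaves each element as 1.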
Maneesh Gupta de4a9b8446 Merge pull request #939 from gargrahul/update_hipmemset_test
[dtest] Update hipMemset test
2019-03-03 20:29:55 +05:30
Rahul Garg 131c0ea326 Fix review comments 2019-03-02 23:38:37 +05:30
Maneesh Gupta d28c13ed69 Merge pull request #945 from wkwchau/hipMemset3D_fix
Fix hipMemset3D test
2019-03-01 21:18:12 +05:30
Maneesh Gupta b051d44d24 Merge pull request #942 from yxsamliu/v3
revert hipcc changes about code object v3
2019-03-01 21:17:10 +05:30
Wilkin Chau 8d92d1ebd7 Fix hipMemset3D test
Calculate the allocated size based on the width, height and depth.
2019-02-28 22:42:46 +00:00
Rahul Garg 77c6d5451f Fix hipMemset test for HIP/NVCC 2019-03-01 03:46:57 +05:30
Yaxun Sam Liu e8556388f9 Revert "hipcc should consume -mcode-object-v3 flag"
This reverts commit eb33507774.
2019-02-28 11:21:47 -05:00
Yaxun Sam Liu 1bd8d24278 Revert "Change code-object flag to only HIP-Clang"
This reverts commit 1c6365fbe7.
2019-02-28 11:20:04 -05:00