3404 Commits

Author SHA1 Message Date
Maneesh Gupta bfb73dd148 Merge pull request #982 from ROCm-Developer-Tools/hack_swdev-173477
HACK for SWDEV-173477

[ROCm/hip commit: a75f375788]
2019-03-22 09:14:38 +00:00
Maneesh Gupta 56dbf717bd Merge pull request #980 from ntrost57/master
fixed loss of accuracy in hipCfma()

[ROCm/hip commit: 5bf08930a9]
2019-03-22 09:14:09 +00:00
Wen-Heng (Jack) Chung 3eb009913c HACK for SWDEV-173477
For code objects with global symbols of length 0, ROCR runtime would
ignore them even though they exist in the symbol table. Therefore the
result from read_agent_globals() can't be trusted entirely.

As a workaround to tame applications which depend on the existence of
global symbols with length 0, always return hipSuccess here.

This behavior shall be reverted once ROCR runtime has been fixed to
address SWDEV-173477


[ROCm/hip commit: d941f19399]
2019-03-21 17:18:16 +00:00
Nico Trost 5802734f1b fixed loss of accuracy in hipCfma()
[ROCm/hip commit: 7bb5a14a4e]
2019-03-21 10:30:10 +01:00
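The log does not say how the accuracy of hipCfma() was restored. As a minimal host-side sketch of one common approach, a complex multiply-add p * q + r can chain fused multiply-adds so that each component is rounded twice rather than four times. The `Complex` struct and `complex_fma` name are hypothetical and are not HIP's own types or implementation.

```cpp
#include <cmath>

// Hypothetical stand-in for a single-precision complex value (not HIP's type).
struct Complex {
    float x;  // real part
    float y;  // imaginary part
};

// Computes p * q + r. Each std::fma performs a multiply and an add with a
// single rounding, so each component below is rounded twice in total.
inline Complex complex_fma(Complex p, Complex q, Complex r) {
    // real = p.x*q.x - p.y*q.y + r.x
    float real = std::fma(p.x, q.x, r.x);
    real = std::fma(-p.y, q.y, real);
    // imag = p.x*q.y + p.y*q.x + r.y
    float imag = std::fma(p.x, q.y, r.y);
    imag = std::fma(p.y, q.x, imag);
    return {real, imag};
}
```

For example, `complex_fma({1, 2}, {3, 4}, {5, 6})` evaluates (1 + 2i)(3 + 4i) + (5 + 6i).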
eshcherb 621ceaf8bc adding hip_prof_gen verbose log (#977)
* adding hip_prof_gen verbose log

* adding stderr fatal error

* adding no error exit by default

* adding hip_prof_str regeneration dependencies

* adding more informative messages

* fixing error message


[ROCm/hip commit: f167e4daf6]
2019-03-21 05:28:18 +00:00
Maneesh Gupta 1bc0e84e83 Merge pull request #972 from yxsamliu/global
Add declaration of symbol related API for VDI

[ROCm/hip commit: 6ade928535]
2019-03-20 05:12:21 +00:00
Maneesh Gupta 7892ca44b4 Merge pull request #973 from mhbliao/hliao/master/build
[Device Function] Fix typos.

[ROCm/hip commit: 87551e29f6]
2019-03-20 05:12:14 +00:00
Maneesh Gupta 5f3f8a2cde Merge pull request #974 from yxsamliu/name2
Change HIP dll name to amdhip64.dll on Windows

[ROCm/hip commit: cfb2711768]
2019-03-20 05:11:58 +00:00
eshcherb bbc63ad2ec adding prof primitives generator (#967)
* adding prof primitives generator

* minor change, renaming

* minor cosmetic changes, correcting comments and removing dead code

* minor changes and renaming

* minor change, fixing comments


[ROCm/hip commit: 673cdeea55]
2019-03-20 05:11:40 +00:00
Siu Chi Chan 99c4f43a6c reimplement HIP_INIT as hip_impl::hip_init(), add hip_init() to some of the inlined API (#966)
* reimplement HIP_INIT as a function, expose it as hip_impl::hip_init()
so that it could be called from hipLaunchKernelGGL and other inlined
HIP functions

* Don't call hip_init from ihipPreLaunchKernel


[ROCm/hip commit: 24d08beef8]
2019-03-20 05:11:15 +00:00
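The idea in the commit above, one-time initialization exposed as a function that inlined entry points call first, can be sketched roughly as follows. This is an illustration only: the `sketch` namespace, `runtime_init`, `init_count`, and `launch_something` are hypothetical names, and the real hip_impl::hip_init() may be implemented differently.

```cpp
namespace sketch {  // hypothetical namespace, not hip_impl

// Counts how often the one-time setup body has run (for demonstration only).
inline int& init_count() {
    static int count = 0;
    return count;
}

// One-time initialization exposed as an ordinary function, so that inline
// entry points can call it before doing any work.
inline void runtime_init() {
    // C++11 guarantees that a function-local static is initialized exactly
    // once, even if several threads reach this line concurrently.
    static const bool done = [] { ++init_count(); return true; }();
    (void)done;
}

// Hypothetical inline API entry point: initialize first, then proceed.
inline int launch_something() {
    runtime_init();
    return init_count();
}

}  // namespace sketch
```

Calling `sketch::launch_something()` repeatedly runs the setup body only on the first call.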
Yaxun Sam Liu 8d6e48820f Change HIP dll name to amdhip64.dll on Windows
[ROCm/hip commit: e2bf34cd5e]
2019-03-19 16:27:18 -04:00
Michael LIAO d0680ef1ee [Device Function] Fix typos.
[ROCm/hip commit: e51fa174c8]
2019-03-19 15:32:19 -04:00
Yaxun Sam Liu 4f094c4d21 Add declaration of symbol related API for VDI
[ROCm/hip commit: e9bc4034d8]
2019-03-19 11:11:49 -04:00
Maneesh Gupta 98dcdae4f0 Merge pull request #969 from nicholasmalaya/patch-1
Update hip_faq.md

[ROCm/hip commit: b1752fc9a6]
2019-03-19 18:42:05 +05:30
Maneesh Gupta 88761d52ce Merge pull request #965 from mhbliao/hliao/master/immarg
[Device Function] Support immediate argument.

[ROCm/hip commit: 4094824b9e]
2019-03-19 18:41:31 +05:30
Maneesh Gupta 173c910f5d Merge pull request #954 from mhbliao/master
[hip] Re-implement hipLaunchKernelGGL as macros.

[ROCm/hip commit: 91ebfadabf]
2019-03-19 18:39:27 +05:30
Nicholas Malaya 2569ed7c91 Update hip_faq.md
Clarify what this list details. In particular, the list is intended to indicate which items from each CUDA release are supported and which are not.

[ROCm/hip commit: 0bddfc09e1]
2019-03-18 14:51:18 -05:00
Michael LIAO f282ea815a [Device Function] Support immediate argument.
- `immarg`, immediate argument, is enabled on all AMDGPU intrinsics.
  Revise device functions using these intrinsics with immediate
  arguments.


[ROCm/hip commit: 950b6efe72]
2019-03-15 12:38:04 -04:00
Maneesh Gupta 7cc48a4836 Merge pull request #964 from eshcherb/roctracer-hip-frontend-190314
tracing callback layer update

[ROCm/hip commit: 38de9baf28]
2019-03-15 16:24:46 +05:30
Evgeny 36b5313d65 tracing callback layer update
[ROCm/hip commit: 31475c5ac8]
2019-03-14 22:43:52 -05:00
Maneesh Gupta f0fd33a301 Merge pull request #963 from gargrahul/add_module_get_global_test
Test hipModuleGetGlobal

[ROCm/hip commit: e2977fffa3]
2019-03-15 06:17:50 +05:30
Maneesh Gupta 1d61f5ff58 Merge pull request #958 from aaronenyeshi/cxxabi-mismatch-workaround
CXX11 ABI Mismatch Workaround

[ROCm/hip commit: 419127172e]
2019-03-15 06:15:46 +05:30
Rahul Garg e02104a16a Test hipModuleGetGlobal
[ROCm/hip commit: 34e675d0f6]
2019-03-15 04:08:03 +05:30
Siu Chi Chan 151b288155 remove visibility hidden attribute
[ROCm/hip commit: 10d3084e20]
2019-03-13 11:58:32 -04:00
Maneesh Gupta 2d20f354b0 Merge pull request #957 from eshcherb/roctracer-hip-frontend-190311
adding memset32d

[ROCm/hip commit: 72fdeb2d2f]
2019-03-12 10:58:56 +05:30
Evgeny 75d89bf129 adding memset32d
[ROCm/hip commit: 4ffe413cfd]
2019-03-11 21:28:27 -05:00
Siu Chi Chan ce064f942a minor cleanup
[ROCm/hip commit: cb9ea5cefc]
2019-03-11 19:51:57 +00:00
Siu Chi Chan ac10afb0f7 remove old style triple name
[ROCm/hip commit: d37f9e6b2d]
2019-03-11 19:51:51 +00:00
Siu Chi Chan fc9af438a4 move triple_to_hsa_isa into the header
[ROCm/hip commit: 824ee1aa72]
2019-03-11 19:51:44 +00:00
Siu Chi Chan 15061ddfcc Fix memory leak introduced by previous change to Agent_global.
Make Agent_global manage the lifetime of the name string


[ROCm/hip commit: bf1d48bf78]
2019-03-11 19:51:32 +00:00
Aaron Enye Shi 1e07be3ab3 Fix Agent_global variables failing hipTestDeviceSymbol
Issue: Header uses std::vector<Agent_global> agent_globals which is created by hip_module.cpp
  - Move iterator fails to copy Agent_global from library source into header version
  - Due to different versions of std::string name in struct Agent_global
Fix: Change Agent_global to use char* name instead of std::string name


[ROCm/hip commit: 00d24d254d]
2019-03-11 19:51:25 +00:00
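A rough sketch of the fix described above, assuming the mismatch comes from the library and the user code being built with different std::string layouts: a record that crosses the library/header boundary stores its name as a plain `char*` and owns that buffer itself, in the spirit of the lifetime-management commit listed just above. `GlobalRecord` and its fields are hypothetical and do not reproduce the actual Agent_global definition.

```cpp
#include <cstdlib>
#include <cstring>
#include <utility>

// Hypothetical record; field names are illustrative only.
struct GlobalRecord {
    char* name = nullptr;     // plain C string: layout does not depend on std::string
    void* address = nullptr;
    unsigned byte_count = 0;

    GlobalRecord() = default;
    GlobalRecord(const char* n, void* a, unsigned c)
        : name(duplicate(n)), address(a), byte_count(c) {}
    // Copying duplicates the buffer so each record owns its own name.
    GlobalRecord(const GlobalRecord& o)
        : name(duplicate(o.name)), address(o.address), byte_count(o.byte_count) {}
    // Moving steals the buffer and leaves the source empty.
    GlobalRecord(GlobalRecord&& o) noexcept
        : name(o.name), address(o.address), byte_count(o.byte_count) {
        o.name = nullptr;
    }
    // Copy-and-swap: the by-value parameter handles both copy and move assignment.
    GlobalRecord& operator=(GlobalRecord o) noexcept {
        std::swap(name, o.name);
        std::swap(address, o.address);
        std::swap(byte_count, o.byte_count);
        return *this;
    }
    ~GlobalRecord() { std::free(name); }  // free(nullptr) is a no-op

private:
    static char* duplicate(const char* s) {
        if (!s) return nullptr;
        char* copy = static_cast<char*>(std::malloc(std::strlen(s) + 1));
        if (copy) std::strcpy(copy, s);
        return copy;
    }
};
```

Because the struct contains only a pointer and trivially laid-out fields, a `std::vector<GlobalRecord>` built on one side of the boundary can be copied or moved on the other without depending on how std::string is represented there.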
Aaron Enye Shi 4b87bd25e8 Fix hash_for undefined reference in hipTestConstant test
Issue: mismatch undefined symbols in different user env
  - Binary expects modified return value std::string&
  - Fails to match libhip_hcc.so: return value is std::string& but doesn't match modified C++ env
Fix: Change return value to char*, create new key std::string in header from char*


[ROCm/hip commit: 23e9968752]
2019-03-11 19:51:18 +00:00
Michael Liao c0f6fa358f [hip] Re-implement hipLaunchKernelGGL as macros.
[ROCm/hip commit: 30185b4114]
2019-03-08 09:10:51 -05:00
Maneesh Gupta 645c29159c Merge pull request #951 from tycho/nvcc-hipDeviceSetCacheConfig
nvcc_detail/hip_runtime_api.h: add missing hipDeviceSetCacheConfig API

[ROCm/hip commit: 250bd58254]
2019-03-08 09:22:25 +05:30
Maneesh Gupta b3f2a33a7f Merge pull request #953 from yxsamliu/vdi
Fix HIP/VDI build failure due to dlopen change

[ROCm/hip commit: b5ca257cad]
2019-03-08 06:02:47 +05:30
Yaxun Sam Liu a431326408 Fix HIP/VDI build failure due to dlopen change
[ROCm/hip commit: 6e9e90addd]
2019-03-07 14:45:45 -05:00
Evgeny Mankov 35a28363b1 Merge pull request #950 from emankov/master
[HIPIFY][tests] Update lit testing infrastructure

[ROCm/hip commit: 6d8171cbb7]
2019-03-07 15:19:15 +03:00
Steven Noonan 17e2bfe3f8 nvcc_detail/hip_runtime_api.h: add missing hipDeviceSetCacheConfig API
Signed-off-by: Steven Noonan <steven@uplinklabs.net>


[ROCm/hip commit: ee750d5ea4]
2019-03-06 11:21:36 -08:00
Evgeny Mankov 1397bf79d8 [HIPIFY][tests] Update lit testing infrastructure
+ Set -D__LP64__ in case of 64-bit hipify-clang binary
  [partial workaround for clang's bug https://bugs.llvm.org/show_bug.cgi?id=38811]

  C:/GIT/LLVM/trunk/llvm-64-release-vs2017/dist/lib/clang/9.0.0\include\__clang_cuda_device_functions.h(1609,45): error GEF7559A7: no matching function for call to 'roundf'
  __DEVICE__ long lroundf(float __a) { return roundf(__a); }

  #if defined(__LP64__)
  __DEVICE__ long lround(double __a) { return llround(__a); }
  __DEVICE__ long lroundf(float __a) { return llroundf(__a); } // ok: llroundf should be used when 64-bit
  #else
  __DEVICE__ long lround(double __a) { return round(__a); }
  __DEVICE__ long lroundf(float __a) { return roundf(__a); } // error
  #endif

+ Print more system info while testing in the following form:
  ========================================
  CUDA 9.0 - will be used for testing
  LLVM 9.0.0svn - will be used for testing
  AMD64 - Platform architecture
  Windows 10 - Platform OS
  64 - hipify-clang binary bitness
  32 - python 3.7.2 binary bitness
  ========================================


[ROCm/hip commit: e899ee0e06]
2019-03-06 19:26:05 +03:00
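The reason `__LP64__` matters in the commit above is that `long` is 64 bits wide on LP64 targets, so a function returning `long` can forward to the `long long` rounding routine. A host-side analogue, using only the standard library, might look like the sketch below; `round_to_long` is a hypothetical name and this is not the clang CUDA header code.

```cpp
#include <cmath>

// Hypothetical host-side wrapper mirroring the LP64 branch selection.
inline long round_to_long(float a) {
#if defined(__LP64__)
    // On LP64 targets long and long long have the same width.
    static_assert(sizeof(long) == sizeof(long long), "LP64 implies 64-bit long");
    return static_cast<long>(std::llround(a));
#else
    // Elsewhere long may be 32 bits, so use the long-returning variant.
    return std::lround(a);
#endif
}
```

Both branches round halfway cases away from zero, so the choice only affects the integer type used for the result.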
eshcherb 7f40a2cd9a roctracer-proto header find_path (#884)
[ROCm/hip commit: 88b4621dbd]
2019-03-06 17:36:34 +05:30
Maneesh Gupta 3f5e937afc Merge pull request #949 from gargrahul/single_stream_concurrent_kernels
Add extension for kernel concurrency on same stream

[ROCm/hip commit: 352b17346c]
2019-03-06 17:34:54 +05:30
Maneesh Gupta df7d8c46ed Merge pull request #932 from ROCm-Developer-Tools/feature_maybe_dlopen_test
Add one test case for complex dynamic loading behavior

[ROCm/hip commit: 0c4a40efcc]
2019-03-06 17:32:23 +05:30
Alex Voicu 0c16497abd dlopen() fixes (#929)
* Initial attempt to switch over to internally linked state.

* Add missing CMake update.

* hipLaunchKernelGGLImpl must be inline as well. Ensure internal linkage.

* Ensure global retrieval uses internally linked state.

* Hide HC in the implementation. Minimise ADL woes.

* Strange software exists, and must be catered to.

* Use a less spammy mechanism for ensuring internal linkage / non-export.

* Remove leftover internal detail.


[ROCm/hip commit: ea0fcf3e61]
2019-03-06 17:31:44 +05:30
Rahul Garg 5d6fd17fbe Add extension for kernel concurrency on same stream
[ROCm/hip commit: 59081c69fc]
2019-03-06 12:55:39 +05:30
Maneesh Gupta 2264e69da3 Merge pull request #936 from mangupta/swdev-174923
[hipconfig] Update HIP_PLATFORM detection logic

[ROCm/hip commit: 2d67bc5e9c]
2019-03-06 06:08:11 +05:30
Evgeny Mankov 537e13c2ca Merge pull request #948 from emankov/master
[HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping

[ROCm/hip commit: 23de66bc7e]
2019-03-05 18:18:39 +03:00
Evgeny Mankov effdffc006 [HIPIFY] Change CUDA Driver's functions' cuMemsetD32(Async) mapping
cuMemsetD32(Async) -> hipMemsetD32(Async) (was hipMemset(Async))

based on:
[#933] https://github.com/ROCm-Developer-Tools/HIP/pull/933


[ROCm/hip commit: a87d798d90]
2019-03-05 18:13:18 +03:00
Maneesh Gupta 31457951ba Merge pull request #933 from ROCm-Developer-Tools/fix_hipmemset
Add HIP memset APIs to cope with non-zero initial values of integer types

[ROCm/hip commit: 2ed31e9f6c]
2019-03-05 14:31:38 +05:30
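The motivation for a 32-bit memset can be illustrated on the host. A byte-wise fill (the semantics of `std::memset`, and of byte-oriented device memsets) writes the same byte into every position, so it cannot produce an arbitrary non-zero integer value per element, whereas an element-wise fill can. The helper names below are hypothetical and the sketch does not call any HIP API.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Byte-wise fill: every byte of each 32-bit element becomes `value`.
inline std::vector<std::uint32_t> fill_bytes(std::size_t n, unsigned char value) {
    std::vector<std::uint32_t> buf(n);
    if (n != 0) {
        std::memset(buf.data(), value, n * sizeof(std::uint32_t));
    }
    return buf;
}

// Element-wise fill: each 32-bit element becomes `value`.
inline std::vector<std::uint32_t> fill_words(std::size_t n, std::uint32_t value) {
    return std::vector<std::uint32_t>(n, value);
}
```

With a fill value of 1, `fill_bytes` yields elements equal to 0x01010101, while `fill_words` yields elements equal to 1. The two agree only when all four bytes of the value are identical, for example 0.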
Maneesh Gupta 54c72c1079 Update hipMemset.cpp
Address build issues on nvcc path.

[ROCm/hip commit: 51f29b9cee]
2019-03-05 12:11:11 +05:30
Maneesh Gupta 19b6b2b570 Update hip_runtime_api.h
Use hipCUResultTohipError instead of hipCUDAErrorTohipError in hipMemsetD32 & hipMemsetD32Async.

[ROCm/hip commit: 8db717c769]
2019-03-05 12:10:01 +05:30