Граф коммитов

3433 Коммитов

Автор SHA1 Сообщение Дата
Anthony Cowley b00a98f27a hipify: translate cudaErrorInvalidSymbol
[ROCm/hip commit: e9b1881c60]
2019-04-02 08:54:37 -04:00
Maneesh Gupta 8a93d8e9aa Merge pull request #998 from yxsamliu/doc
hip-clang: update installation guide.

[ROCm/hip commit: c19874e0f1]
2019-04-02 05:08:18 +00:00
Maneesh Gupta 1fbff012b4 Merge pull request #997 from yxsamliu/mgpu
hip-clang: fix kernel not found on multi-gpu

[ROCm/hip commit: 5b1e63ee2d]
2019-04-02 05:07:31 +00:00
Evgeny Mankov 772b4978ed Merge pull request #1001 from emankov/master
[HIPIFY][tests] Fix typo in test for CUDA 10.x

[ROCm/hip commit: 9f40e0aa97]
2019-04-01 19:53:42 +03:00
Evgeny Mankov 7bbf825c74 [HIPIFY][tests] Fix typo in test for CUDA 10.x
[ROCm/hip commit: 8bca9f1d7a]
2019-04-01 19:52:08 +03:00
Evgeny Mankov 3b28e2f9b4 Merge pull request #1000 from emankov/master
[HIPIFY][cmake] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released

[ROCm/hip commit: a7d0e40058]
2019-04-01 19:49:46 +03:00
Evgeny Mankov 9e32584753 [HIPIFY][cmake] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
[ROCm/hip commit: 7c221450fa]
2019-04-01 19:44:52 +03:00
Yaxun Sam Liu 5072c98f32 hip-clang: fix kernel not found on multi-gpu
__hipRegisterFunction is called during by .init functions during program initialization.
It calls hipModuleGetFunction to locate kernel symbol in code objects. hipModuleGetFunction
assumes current device when locating kernel symbols. This works for HCC but not for hip-clang,
since hip-clang needs to locate kernel symbols for different devices without switching
between devices.

This patch introduces a new hsa agent parameter to ihipModuleGetFunction, which allows
__hipRegisterFunction to choose the correct hsa agent when locating kernel symbols. By
default it uses this_agent(), therefore this patch has no impact on HCC.


[ROCm/hip commit: 98b9e92908]
2019-03-31 10:08:20 -04:00
Yaxun (Sam) Liu 595d95cfe8 Update INSTALL.md
[ROCm/hip commit: ed2fbb27e4]
2019-03-30 08:29:08 -04:00
Yaxun Sam Liu c0682dda96 hip-clang: update installation guide.
[ROCm/hip commit: 409aa49218]
2019-03-30 08:24:49 -04:00
Wen-Heng (Jack) Chung 1cc94f9369 Make hipModuleGetGlobal be in HIP runtime so it can be discovered at runtime (#981)
* Make hipModuleGetGlobal be in HIP runtime so it can be discovered at runtime

In HIP PR #929, quite a few HIP public APIs were made as inline functions with
hidden visibility. It was necessary to support applications with shared
libraries with GPU kernels launched via hipLaunchKernelGGL(), after HIP runtime
is initialized.

In empirical tests, the implementation has been proved to be a bit too
excessive, especially for hipModuleGetGlobal(). The function is used by another
type of client applications which relies on the existence of this function
within HIP runtime so global symbols from HSA code objects loaded dynamically
at runtime can be retrieved programmtically.

This commit moves hipModuleGetGlobal() back to src/hip_module.cpp, and makes it
visible and not inline, to fulfill requirements for applications
aforementioned. It does not change the behavior of applications depending on
hipLaunchKernelGGL().

* Add HIP_INIT_API into the implementation of hipModuleGetGlobal

Address review comments.

* Fix failing HIP unit tests


[ROCm/hip commit: 4b7177ac42]
2019-03-29 03:45:04 +00:00
Maneesh Gupta f63c4be10e Merge pull request #992 from gargrahul/handle_d2d_memcpy2d
Handle D2D in memcpy2D

[ROCm/hip commit: d0e5fbeb72]
2019-03-28 04:41:36 +00:00
Maneesh Gupta 6effbfe2dd Merge pull request #991 from jeffdaily/per_agent_program_state
load program state once per agent

[ROCm/hip commit: 8af327e439]
2019-03-28 04:40:46 +00:00
Jeff Daily fb2beb0c88 improve program state commentary
Disambiguate calling many varibles "agent".
More detail in exception message.
Create and discard map placeholders; no need to call std::vector::clear() on map value.


[ROCm/hip commit: 7aada87cbd]
2019-03-27 21:40:27 +00:00
Rahul Garg 0eaa29ad06 Handle D2D in memcpy2D
[ROCm/hip commit: 0c55db8552]
2019-03-28 02:21:45 +05:30
Jeff Daily 21f5e142f5 load program state once per agent
[ROCm/hip commit: c9117de8eb]
2019-03-27 18:19:10 +00:00
Maneesh Gupta 87772a479e Merge pull request #987 from gargrahul/fix_hostmalloc_double_device_map
Avoid double mapping of devices to hostMalloc buffer

[ROCm/hip commit: 08e12e730e]
2019-03-27 05:23:47 +00:00
Maneesh Gupta 464c023bb5 Merge pull request #990 from mhbliao/hliao/master/sw
SWDEV-184380 Fix hcc compilation

[ROCm/hip commit: e66dfa6ea3]
2019-03-27 05:23:26 +00:00
Maneesh Gupta ed99892664 Merge pull request #986 from mhbliao/hliao/master/build
[hip] Fix typo in macro `hipLaunchKernel`

[ROCm/hip commit: 3535ced23f]
2019-03-27 05:22:44 +00:00
Michael LIAO a9a397bc4b SWDEV-184380 Fix hcc compilation
- `hcc` has no builtin. Need to invoke LLVM intrinsic directly.


[ROCm/hip commit: 939d153c1a]
2019-03-26 15:20:17 -04:00
Rahul Garg d98d5ca12a Let hipHostMalloc always share/map pinned host ptr
[ROCm/hip commit: f0af073793]
2019-03-26 10:19:13 +05:30
Rahul Garg c6ef785464 Avoid double mapping of devices to hostMalloc buffer
[ROCm/hip commit: 5e917d70f3]
2019-03-25 23:07:05 +05:30
Michael LIAO 94b7856a1a [hip] Fix typo in macro hipLaunchKernel
[ROCm/hip commit: b3497824ec]
2019-03-25 12:06:46 -04:00
Maneesh Gupta 82fd86e63f Merge pull request #970 from mangupta/swdev-172995
hipExtMallocWithFlags implementation

[ROCm/hip commit: 30b5c02ec4]
2019-03-25 07:46:53 +00:00
Maneesh Gupta 8c6b5bf266 Merge pull request #962 from gargrahul/add_2d_copy_fallback
Add 2D fallback to use copy kernel

[ROCm/hip commit: eb03d50de9]
2019-03-25 07:46:43 +00:00
Rahul Garg 63e926cec7 2D Fallback needs hcc workweek 19101 or higher
[ROCm/hip commit: 9bbfbceb64]
2019-03-25 12:07:28 +05:30
Maneesh Gupta 67819c0395 hipExtMallocWithFlags needs hcc workweek 19115 or higher
[ROCm/hip commit: cab119c8b2]
2019-03-25 11:41:20 +05:30
Maneesh Gupta bfb73dd148 Merge pull request #982 from ROCm-Developer-Tools/hack_swdev-173477
HACK for SWDEV-173477

[ROCm/hip commit: a75f375788]
2019-03-22 09:14:38 +00:00
Maneesh Gupta 56dbf717bd Merge pull request #980 from ntrost57/master
fixed loss of accuracy in hipCfma()

[ROCm/hip commit: 5bf08930a9]
2019-03-22 09:14:09 +00:00
Wen-Heng (Jack) Chung 3eb009913c HACK for SWDEV-173477
For code objects with global symbols of length 0, ROCR runtime would
ignore them even though they exist in the symbol table. Therefore the
result from read_agent_globals() can't be trusted entirely.

As a workaround to tame applications which depend on the existence of
global symbols with length 0, always return hipSuccess here.

This behavior shall be reverted once ROCR runtime has been fixed to
address SWDEV-173477


[ROCm/hip commit: d941f19399]
2019-03-21 17:18:16 +00:00
Nico Trost 5802734f1b fixed loss of accuracy in hipCfma()
[ROCm/hip commit: 7bb5a14a4e]
2019-03-21 10:30:10 +01:00
eshcherb 621ceaf8bc adding hip_prof_gen verbose log (#977)
* adding hip_prof_gen verbose log

* adding stderr fatal error

* adding no error exit by default

* adding hip_prof_str regeneration dependencies

* adding more informative messages

* fixing error mesage


[ROCm/hip commit: f167e4daf6]
2019-03-21 05:28:18 +00:00
Maneesh Gupta 1bc0e84e83 Merge pull request #972 from yxsamliu/global
Add declaration of symbol related API for VDI

[ROCm/hip commit: 6ade928535]
2019-03-20 05:12:21 +00:00
Maneesh Gupta 7892ca44b4 Merge pull request #973 from mhbliao/hliao/master/build
[Device Function] Fix typos.

[ROCm/hip commit: 87551e29f6]
2019-03-20 05:12:14 +00:00
Maneesh Gupta 5f3f8a2cde Merge pull request #974 from yxsamliu/name2
Change HIP dll name to amdhip64.dll on Windows

[ROCm/hip commit: cfb2711768]
2019-03-20 05:11:58 +00:00
eshcherb bbc63ad2ec adding prof primitives generator (#967)
* adding prof primitives generator

* minor change, renaming

* minor cosmetic changes, comments correcting and dead code removing

* minor changes and renaming

* minor chane, fixing comments


[ROCm/hip commit: 673cdeea55]
2019-03-20 05:11:40 +00:00
Siu Chi Chan 99c4f43a6c reimplement HIP_INIT as hip_impl::hip_init(), add hip_init() to some of the inlined API (#966)
* reimplement HIP_INIT as a function, expose it as hip_impl::hip_init()
so that it could be called from hipLaunchKernelGGL and other inlined
HIP functions

* Don't call hip_init from ihipPreLaunchKernel


[ROCm/hip commit: 24d08beef8]
2019-03-20 05:11:15 +00:00
Yaxun Sam Liu 8d6e48820f Change HIP dll name to amdhip64.dll on Windows
[ROCm/hip commit: e2bf34cd5e]
2019-03-19 16:27:18 -04:00
Michael LIAO d0680ef1ee [Device Function] Fix typos.
[ROCm/hip commit: e51fa174c8]
2019-03-19 15:32:19 -04:00
Yaxun Sam Liu 4f094c4d21 Add declaration of symbol related API for VDI
[ROCm/hip commit: e9bc4034d8]
2019-03-19 11:11:49 -04:00
Maneesh Gupta 98dcdae4f0 Merge pull request #969 from nicholasmalaya/patch-1
Update hip_faq.md

[ROCm/hip commit: b1752fc9a6]
2019-03-19 18:42:05 +05:30
Maneesh Gupta 88761d52ce Merge pull request #965 from mhbliao/hliao/master/immarg
[Device Function] Support immediate argument.

[ROCm/hip commit: 4094824b9e]
2019-03-19 18:41:31 +05:30
Maneesh Gupta 173c910f5d Merge pull request #954 from mhbliao/master
[hip] Re-implement hipLaunchKernelGGL as macros.

[ROCm/hip commit: 91ebfadabf]
2019-03-19 18:39:27 +05:30
Maneesh Gupta 9ac6005d35 hipExtMallocWithFlags implementation
Change-Id: Iee9e119796472200b2933d5e23be60813f33bc75


[ROCm/hip commit: 73ec5d54b5]
2019-03-19 11:59:22 +05:30
Nicholas Malaya 2569ed7c91 Update hip_faq.md
Making more clear what this list details. In particular, this list is intended to indicate what items for each CUDA release is supported, and which are not.

[ROCm/hip commit: 0bddfc09e1]
2019-03-18 14:51:18 -05:00
Michael LIAO f282ea815a [Device Function] Support immediate argument.
- `immarg`, immediate argument, is enabled on all AMDGPU intrinsics.
  Revise device functions using these intrinsics with immediate
  arguments.


[ROCm/hip commit: 950b6efe72]
2019-03-15 12:38:04 -04:00
Maneesh Gupta 7cc48a4836 Merge pull request #964 from eshcherb/roctracer-hip-frontend-190314
tracing callback layer update

[ROCm/hip commit: 38de9baf28]
2019-03-15 16:24:46 +05:30
Evgeny 36b5313d65 tracing callback layer update
[ROCm/hip commit: 31475c5ac8]
2019-03-14 22:43:52 -05:00
Maneesh Gupta f0fd33a301 Merge pull request #963 from gargrahul/add_module_get_global_test
Test hipModuleGetGlobal

[ROCm/hip commit: e2977fffa3]
2019-03-15 06:17:50 +05:30
Maneesh Gupta 1d61f5ff58 Merge pull request #958 from aaronenyeshi/cxxabi-mismatch-workaround
CXX11 ABI Mismatch Workaround

[ROCm/hip commit: 419127172e]
2019-03-15 06:15:46 +05:30