Graf commitů

3442 Commity

Autor SHA1 Zpráva Datum
Maneesh Gupta 035a3e5aa2 Merge pull request #1010 from ROCm-Developer-Tools/mangupta-patch-1
[ci] Disable hipLaunchParm.tst for now
2019-04-09 03:19:56 +00:00
Evgeny Mankov 1616929d37 Merge pull request #1011 from emankov/master
[HIPIFY][cmake][doc] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
2019-04-08 20:10:32 +03:00
Evgeny Mankov 53091e5378 [HIPIFY][cmake][doc] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
+ Fix the error in cmake: clang 8.0.0 supports CUDA 10.0, not 10.1; only trunk clang 9.0.0 supports CUDA 10.1.
+ Update Readme:
  1. clang's bug 36384 is gone, so latest stable release for Linux is clang 8.0.0, which supports CUDA up to 10.0;
  2. update dependencies, testing and building notes.
2019-04-08 20:05:21 +03:00
Maneesh Gupta 0a52c6661d [ci] Disable hipLaunchParm.tst for now
Due to a HCC bug in rocm-head, the test fails to compile. Disabling the test until the issue is resolved.

Change-Id: Ib4e7baf0b9c2cb5f5dbe38e6dd2bab894d28886a
2019-04-08 16:14:25 +05:30
Maneesh Gupta df09d42392 Merge pull request #1009 from yxsamliu/fix-mgpu
Fix regression on multi-gpu due to PR#997
2019-04-08 08:20:17 +00:00
Yaxun (Sam) Liu 271fdc4e4d Fix regression on multi-gpu due to PR#997 2019-04-05 22:54:41 -04:00
Evgeny Mankov 25bb74a2bb Merge pull request #1003 from acowley/patch-1
hipify: translate cudaErrorInvalidSymbol
2019-04-05 20:21:27 +03:00
Evgeny Mankov dc7f7d14c7 Merge pull request #1007 from emankov/master
[HIP][doc] Fix typos and minor text inaccuracies in HIP FAQ
2019-04-05 20:18:01 +03:00
Evgeny Mankov 6e70b8494b [HIPIFY][doc] Fix typos and minor text inaccuracies in HIP FAQ 2019-04-05 19:19:38 +03:00
Anthony Cowley e9b1881c60 hipify: translate cudaErrorInvalidSymbol 2019-04-02 08:54:37 -04:00
Maneesh Gupta c19874e0f1 Merge pull request #998 from yxsamliu/doc
hip-clang: update installation guide.
2019-04-02 05:08:18 +00:00
Maneesh Gupta 5b1e63ee2d Merge pull request #997 from yxsamliu/mgpu
hip-clang: fix kernel not found on multi-gpu
2019-04-02 05:07:31 +00:00
Evgeny Mankov 9f40e0aa97 Merge pull request #1001 from emankov/master
[HIPIFY][tests] Fix typo in test for CUDA 10.x
2019-04-01 19:53:42 +03:00
Evgeny Mankov 8bca9f1d7a [HIPIFY][tests] Fix typo in test for CUDA 10.x 2019-04-01 19:52:08 +03:00
Evgeny Mankov a7d0e40058 Merge pull request #1000 from emankov/master
[HIPIFY][cmake] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released
2019-04-01 19:49:46 +03:00
Evgeny Mankov 7c221450fa [HIPIFY][cmake] Update CMakeLists and Readme cause CUDA 10.1 and clang 8.0.0 are released 2019-04-01 19:44:52 +03:00
Yaxun Sam Liu 98b9e92908 hip-clang: fix kernel not found on multi-gpu
__hipRegisterFunction is called during by .init functions during program initialization.
It calls hipModuleGetFunction to locate kernel symbol in code objects. hipModuleGetFunction
assumes current device when locating kernel symbols. This works for HCC but not for hip-clang,
since hip-clang needs to locate kernel symbols for different devices without switching
between devices.

This patch introduces a new hsa agent parameter to ihipModuleGetFunction, which allows
__hipRegisterFunction to choose the correct hsa agent when locating kernel symbols. By
default it uses this_agent(), therefore this patch has no impact on HCC.
2019-03-31 10:08:20 -04:00
Yaxun (Sam) Liu ed2fbb27e4 Update INSTALL.md 2019-03-30 08:29:08 -04:00
Yaxun Sam Liu 409aa49218 hip-clang: update installation guide. 2019-03-30 08:24:49 -04:00
Wen-Heng (Jack) Chung 4b7177ac42 Make hipModuleGetGlobal be in HIP runtime so it can be discovered at runtime (#981)
* Make hipModuleGetGlobal be in HIP runtime so it can be discovered at runtime

In HIP PR #929, quite a few HIP public APIs were made as inline functions with
hidden visibility. It was necessary to support applications with shared
libraries with GPU kernels launched via hipLaunchKernelGGL(), after HIP runtime
is initialized.

In empirical tests, the implementation has been proved to be a bit too
excessive, especially for hipModuleGetGlobal(). The function is used by another
type of client applications which relies on the existence of this function
within HIP runtime so global symbols from HSA code objects loaded dynamically
at runtime can be retrieved programmtically.

This commit moves hipModuleGetGlobal() back to src/hip_module.cpp, and makes it
visible and not inline, to fulfill requirements for applications
aforementioned. It does not change the behavior of applications depending on
hipLaunchKernelGGL().

* Add HIP_INIT_API into the implementation of hipModuleGetGlobal

Address review comments.

* Fix failing HIP unit tests
2019-03-29 03:45:04 +00:00
Maneesh Gupta d0e5fbeb72 Merge pull request #992 from gargrahul/handle_d2d_memcpy2d
Handle D2D in memcpy2D
2019-03-28 04:41:36 +00:00
Maneesh Gupta 8af327e439 Merge pull request #991 from jeffdaily/per_agent_program_state
load program state once per agent
2019-03-28 04:40:46 +00:00
Jeff Daily 7aada87cbd improve program state commentary
Disambiguate calling many varibles "agent".
More detail in exception message.
Create and discard map placeholders; no need to call std::vector::clear() on map value.
2019-03-27 21:40:27 +00:00
Rahul Garg 0c55db8552 Handle D2D in memcpy2D 2019-03-28 02:21:45 +05:30
Jeff Daily c9117de8eb load program state once per agent 2019-03-27 18:19:10 +00:00
Maneesh Gupta 08e12e730e Merge pull request #987 from gargrahul/fix_hostmalloc_double_device_map
Avoid double mapping of devices to hostMalloc buffer
2019-03-27 05:23:47 +00:00
Maneesh Gupta e66dfa6ea3 Merge pull request #990 from mhbliao/hliao/master/sw
SWDEV-184380 Fix hcc compilation
2019-03-27 05:23:26 +00:00
Maneesh Gupta 3535ced23f Merge pull request #986 from mhbliao/hliao/master/build
[hip] Fix typo in macro `hipLaunchKernel`
2019-03-27 05:22:44 +00:00
Michael LIAO 939d153c1a SWDEV-184380 Fix hcc compilation
- `hcc` has no builtin. Need to invoke LLVM intrinsic directly.
2019-03-26 15:20:17 -04:00
Rahul Garg f0af073793 Let hipHostMalloc always share/map pinned host ptr 2019-03-26 10:19:13 +05:30
Rahul Garg 5e917d70f3 Avoid double mapping of devices to hostMalloc buffer 2019-03-25 23:07:05 +05:30
Michael LIAO b3497824ec [hip] Fix typo in macro hipLaunchKernel 2019-03-25 12:06:46 -04:00
Maneesh Gupta 30b5c02ec4 Merge pull request #970 from mangupta/swdev-172995
hipExtMallocWithFlags implementation
2019-03-25 07:46:53 +00:00
Maneesh Gupta eb03d50de9 Merge pull request #962 from gargrahul/add_2d_copy_fallback
Add 2D fallback to use copy kernel
2019-03-25 07:46:43 +00:00
Rahul Garg 9bbfbceb64 2D Fallback needs hcc workweek 19101 or higher 2019-03-25 12:07:28 +05:30
Maneesh Gupta cab119c8b2 hipExtMallocWithFlags needs hcc workweek 19115 or higher 2019-03-25 11:41:20 +05:30
Maneesh Gupta a75f375788 Merge pull request #982 from ROCm-Developer-Tools/hack_swdev-173477
HACK for SWDEV-173477
2019-03-22 09:14:38 +00:00
Maneesh Gupta 5bf08930a9 Merge pull request #980 from ntrost57/master
fixed loss of accuracy in hipCfma()
2019-03-22 09:14:09 +00:00
Wen-Heng (Jack) Chung d941f19399 HACK for SWDEV-173477
For code objects with global symbols of length 0, ROCR runtime would
ignore them even though they exist in the symbol table. Therefore the
result from read_agent_globals() can't be trusted entirely.

As a workaround to tame applications which depend on the existence of
global symbols with length 0, always return hipSuccess here.

This behavior shall be reverted once ROCR runtime has been fixed to
address SWDEV-173477
2019-03-21 17:18:16 +00:00
Nico Trost 7bb5a14a4e fixed loss of accuracy in hipCfma() 2019-03-21 10:30:10 +01:00
eshcherb f167e4daf6 adding hip_prof_gen verbose log (#977)
* adding hip_prof_gen verbose log

* adding stderr fatal error

* adding no error exit by default

* adding hip_prof_str regeneration dependencies

* adding more informative messages

* fixing error mesage
2019-03-21 05:28:18 +00:00
Maneesh Gupta 6ade928535 Merge pull request #972 from yxsamliu/global
Add declaration of symbol related API for VDI
2019-03-20 05:12:21 +00:00
Maneesh Gupta 87551e29f6 Merge pull request #973 from mhbliao/hliao/master/build
[Device Function] Fix typos.
2019-03-20 05:12:14 +00:00
Maneesh Gupta cfb2711768 Merge pull request #974 from yxsamliu/name2
Change HIP dll name to amdhip64.dll on Windows
2019-03-20 05:11:58 +00:00
eshcherb 673cdeea55 adding prof primitives generator (#967)
* adding prof primitives generator

* minor change, renaming

* minor cosmetic changes, comments correcting and dead code removing

* minor changes and renaming

* minor chane, fixing comments
2019-03-20 05:11:40 +00:00
Siu Chi Chan 24d08beef8 reimplement HIP_INIT as hip_impl::hip_init(), add hip_init() to some of the inlined API (#966)
* reimplement HIP_INIT as a function, expose it as hip_impl::hip_init()
so that it could be called from hipLaunchKernelGGL and other inlined
HIP functions

* Don't call hip_init from ihipPreLaunchKernel
2019-03-20 05:11:15 +00:00
Yaxun Sam Liu e2bf34cd5e Change HIP dll name to amdhip64.dll on Windows 2019-03-19 16:27:18 -04:00
Michael LIAO e51fa174c8 [Device Function] Fix typos. 2019-03-19 15:32:19 -04:00
Yaxun Sam Liu e9bc4034d8 Add declaration of symbol related API for VDI 2019-03-19 11:11:49 -04:00
Maneesh Gupta b1752fc9a6 Merge pull request #969 from nicholasmalaya/patch-1
Update hip_faq.md
2019-03-19 18:42:05 +05:30