Граф коммитов

1852 Коммитов

Автор SHA1 Сообщение Дата
Ben Sander bd434502db Lock streams when waiting on event completion or querying event safety.
[ROCm/hip commit: 6ff74d0e97]
2017-08-28 18:40:16 -05:00
Ben Sander 0f0414a5ba Merge pull request #165 from gargrahul/enable_strmsync_rcrdevnt_dtests
Enable hipStreamSync2 and record_event dtests

[ROCm/hip commit: 3ffe5c9189]
2017-08-28 15:51:35 -05:00
Rahul Garg 1d4213af20 Enable hipStreamSync2 and record_event dtests
[ROCm/hip commit: 4b98572f47]
2017-08-24 15:51:58 +05:30
Maneesh Gupta eac539eca8 Merge pull request #162 from gargrahul/correct_devicecnt_check
Device count range check on input device id function argument

[ROCm/hip commit: 4051b9fac0]
2017-08-23 14:45:00 +05:30
Maneesh Gupta d4f53c3628 Merge pull request #163 from gargrahul/pcibusid_dtests
PCI Bus ID related dtests

[ROCm/hip commit: 87c9cee86c]
2017-08-23 14:44:25 +05:30
Rahul Garg 82970c4dc1 PCI Bus ID related dtests
[ROCm/hip commit: 9441f3a50e]
2017-08-22 15:41:16 +05:30
Maneesh Gupta 7b99e32ec0 Merge pull request #159 from bensander/hipDispatchLatency
Refactor dispatch latency test and fix several bugs.

[ROCm/hip commit: 7bbde1fe22]
2017-08-22 14:49:14 +05:30
Rahul Garg e673304653 Device count check on input device id
[ROCm/hip commit: 38ecfb9384]
2017-08-22 14:05:44 +05:30
Ben Sander b3b134d65a Refactor dispatch latency test and fix several bugs.
[ROCm/hip commit: e87f7a4d00]
2017-08-17 08:46:58 -05:00
Maneesh Gupta 897449746f Merge pull request #156 from bensander/update_trace_api
Add some new HIP_TRACE_API options.

[ROCm/hip commit: f34b89b33f]
2017-08-17 12:02:29 +05:30
Maneesh Gupta 9088acab1e Merge pull request #158 from mangupta/bump_hip_version
Bump HIP version to 1.3 and update release notes

[ROCm/hip commit: c4737636d2]
2017-08-16 12:30:13 +05:30
Maneesh Gupta ad692f9fa2 Merge pull request #157 from mangupta/switch_archived_ci_artifacts
[ci] Archive packages based on hcc-ctu instead of hcc-1.6

[ROCm/hip commit: 05bde84918]
2017-08-16 11:19:20 +05:30
Maneesh Gupta 5a81e6d50d Bump HIP version to 1.3 and update release notes
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915


[ROCm/hip commit: 8c94bc0009]
2017-08-16 11:18:51 +05:30
Maneesh Gupta 049f36bc96 [ci] Archive packages based on hcc-ctu instead of hcc-1.6
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6


[ROCm/hip commit: c11b8737b9]
2017-08-16 10:57:07 +05:30
Ben Sander 269271535e Merge pull request #152 from scchan/fix_threadfence_system_test
add C++11 compilation flags and minor bug fixes

[ROCm/hip commit: 04385f12cb]
2017-08-15 23:04:49 -05:00
Ben Sander d3df8b5908 Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
[ci] Continue building remaining dtests on failure

[ROCm/hip commit: 1575fc2579]
2017-08-15 23:03:41 -05:00
Ben Sander 542bd863a2 Add some new HIP_TRACE_API options.
[ROCm/hip commit: e3161bb40e]
2017-08-16 04:00:33 +00:00
Ben Sander c26e1d8c3d Merge pull request #148 from weixingzhang/vmaddr1
[HIP Texture] The GPU virtual address for texture memory needs to be

[ROCm/hip commit: 8b7dde6519]
2017-08-15 14:19:45 -05:00
Maneesh Gupta c036a1b25c [texture] guard new HCC APIs under workweek
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976


[ROCm/hip commit: e40047f2bf]
2017-08-15 15:51:38 +05:30
Maneesh Gupta 478b7fb9ae [ci] Continue building remaining dtests on failure
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b


[ROCm/hip commit: ec3db6791e]
2017-08-11 15:57:20 +05:30
Maneesh Gupta 52a97377ac [dtests] Fix hip_threadfence_system test's BUILD command
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd


[ROCm/hip commit: 4bcb3372b5]
2017-08-11 14:49:17 +05:30
Siu Chi Chan 588ca89c92 add C++11 compilation flags and minor bug fixes
[ROCm/hip commit: 9a04c5cdd4]
2017-08-10 17:58:36 +00:00
Maneesh Gupta 0a25b3e972 Merge pull request #138 from scchan/threadfence_system
implement __threadfence_system

[ROCm/hip commit: f14e31d9d7]
2017-08-10 12:51:05 +05:30
Maneesh Gupta 7390368693 [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760


[ROCm/hip commit: 69e431f035]
2017-08-10 12:05:58 +05:30
Ben Sander 5709a38ef3 Merge pull request #146 from gargrahul/docs_update_memcpysymbol
docs update for memcpytosymbol support

[ROCm/hip commit: e7a7574cf3]
2017-08-09 17:48:48 -05:00
Ben Sander d4d9836a96 Merge pull request #145 from gargrahul/context_mgmt_changes
Updated context management logic:

[ROCm/hip commit: 660061c3b4]
2017-08-09 17:47:45 -05:00
Ben Sander d2454ffb9f Merge pull request #139 from gargrahul/disable_p2pcoherency_test
Disable failing test p2p_copy_coherency

[ROCm/hip commit: 4a34bce26f]
2017-08-09 16:54:33 -05:00
Maneesh Gupta 8856953fed Merge pull request #147 from mangupta/bump_workweek
Bump min hcc_workweek required for named kernel dispatch to 17312

[ROCm/hip commit: fa3f9c2474]
2017-08-09 14:10:07 +05:30
Weixing Zhang 5bf177340a [HIP Texture] The GPU virtual address for texture memory needs to be
aligned.

In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.


[ROCm/hip commit: 4264a4ce56]
2017-08-08 11:18:00 -04:00
Maneesh Gupta c050396510 Bump min hcc_workweek required for named kernel dispatch to 17312
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68


[ROCm/hip commit: 8d6acec135]
2017-08-08 11:08:55 +05:30
Rahul Garg d571697190 docs update for memcpytosymbol support
[ROCm/hip commit: eb25bd72db]
2017-08-08 09:13:12 +05:30
Rahul Garg 8d254a7113 Updated context management logic:
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice


[ROCm/hip commit: 3e84cf4aba]
2017-08-08 07:02:22 +05:30
Rahul Garg 63456bfea0 Disable failing test p2p_copy_coherency
[ROCm/hip commit: 2f617bac6d]
2017-08-02 14:47:13 +05:30
Siu Chi Chan 21c1d69e77 implement __threadfence_system
[ROCm/hip commit: 96e79f97e2]
2017-08-02 08:50:18 +00:00
Ben Sander f88f99276a Merge pull request #136 from mangupta/update_install_markdown
[doc] Add details of HIP build time dependencies to INSTALL.md

[ROCm/hip commit: 6d6c325626]
2017-07-31 11:05:30 -05:00
Ben Sander aba975dd29 Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
[hipcc] Cleanup amdgpu target logic

[ROCm/hip commit: 0fcd27586b]
2017-07-31 11:05:06 -05:00
Maneesh Gupta eba208b1fd [doc] Add details of HIP build time dependencies to INSTALL.md
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88


[ROCm/hip commit: e9c995959b]
2017-07-31 14:57:32 +05:30
Maneesh Gupta e7c823b4ed Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
fix hipMemcpy2DAsync

[ROCm/hip commit: fce1d0d7f0]
2017-07-31 10:27:18 +05:30
Maneesh Gupta 1be0719063 Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.

[ROCm/hip commit: 48573a037e]
2017-07-31 10:24:41 +05:30
Maneesh Gupta 5c9d90007f [hipcc] Fix amdgpu target selection logic
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gf803.
- Removed defaulting to any specific target if rocm_agent_enumerator
  fails. hipcc will report this and die if linking was required.

Change-Id: I76131867049fef92331807dd19a926406dcc1d02


[ROCm/hip commit: 85ff6e3ef4]
2017-07-31 10:11:19 +05:30
Ben Sander 4a234edea9 Some fixes to tracing.
[ROCm/hip commit: ecbb494a58]
2017-07-28 22:13:43 -05:00
Rahul Garg 7e9c13f8de fix hipMemcpy2DAsync
[ROCm/hip commit: 0053e4db03]
2017-07-29 06:50:56 +05:30
Ben Sander 8c5df22733 Merge pull request #119 from emankov/master
[HIPIFY] CUDA RT Textures support

[ROCm/hip commit: 98f09debbb]
2017-07-28 12:12:41 -05:00
Maneesh Gupta 514f5cb8fb [hipcc] Cleanup amdgpu target logic
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).

Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9


[ROCm/hip commit: 830e85060b]
2017-07-28 16:18:15 +05:30
Maneesh Gupta b763034c65 Merge pull request #127 from bensander/pass_module_kname
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile

[ROCm/hip commit: 375ec84bc6]
2017-07-28 10:19:15 +05:30
Ben Sander 1ff23db17b Add workweek check to make sure we have a new enough compiler
[ROCm/hip commit: 4980a6d3ab]
2017-07-27 23:00:58 -05:00
Maneesh Gupta 90fa6b18eb Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.

[ROCm/hip commit: fcc294ed41]
2017-07-28 09:15:56 +05:30
Maneesh Gupta 5ab79f8182 Merge pull request #121 from syifan/master
Allow hip cmake to link libraries

[ROCm/hip commit: 821f8b9e02]
2017-07-28 09:12:15 +05:30
Ben Sander 3ab70ad042 Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
[ROCm/hip commit: 3a4dfc0f85]
2017-07-27 22:00:15 -05:00
Evgeny Mankov 24fc06496b Update Cuda2Hip.cpp
[ROCm/hip commit: d614f53ad7]
2017-07-28 01:45:24 +03:00