Ben Sander
bd434502db
Lock streams when waiting on event completion or querying event safety.
...
[ROCm/hip commit: 6ff74d0e97 ]
2017-08-28 18:40:16 -05:00
Ben Sander
0f0414a5ba
Merge pull request #165 from gargrahul/enable_strmsync_rcrdevnt_dtests
...
Enable hipStreamSync2 and record_event dtests
[ROCm/hip commit: 3ffe5c9189 ]
2017-08-28 15:51:35 -05:00
Rahul Garg
1d4213af20
Enable hipStreamSync2 and record_event dtests
...
[ROCm/hip commit: 4b98572f47 ]
2017-08-24 15:51:58 +05:30
Maneesh Gupta
eac539eca8
Merge pull request #162 from gargrahul/correct_devicecnt_check
...
Device count range check on input device id function argument
[ROCm/hip commit: 4051b9fac0 ]
2017-08-23 14:45:00 +05:30
Maneesh Gupta
d4f53c3628
Merge pull request #163 from gargrahul/pcibusid_dtests
...
PCI Bus ID related dtests
[ROCm/hip commit: 87c9cee86c ]
2017-08-23 14:44:25 +05:30
Rahul Garg
82970c4dc1
PCI Bus ID related dtests
...
[ROCm/hip commit: 9441f3a50e ]
2017-08-22 15:41:16 +05:30
Maneesh Gupta
7b99e32ec0
Merge pull request #159 from bensander/hipDispatchLatency
...
Refactor dispatch latency test and fix several bugs.
[ROCm/hip commit: 7bbde1fe22 ]
2017-08-22 14:49:14 +05:30
Rahul Garg
e673304653
Device count check on input device id
...
[ROCm/hip commit: 38ecfb9384 ]
2017-08-22 14:05:44 +05:30
Ben Sander
b3b134d65a
Refactor dispatch latency test and fix several bugs.
...
[ROCm/hip commit: e87f7a4d00 ]
2017-08-17 08:46:58 -05:00
Maneesh Gupta
897449746f
Merge pull request #156 from bensander/update_trace_api
...
Add some new HIP_TRACE_API options.
[ROCm/hip commit: f34b89b33f ]
2017-08-17 12:02:29 +05:30
Maneesh Gupta
9088acab1e
Merge pull request #158 from mangupta/bump_hip_version
...
Bump HIP version to 1.3 and update release notes
[ROCm/hip commit: c4737636d2 ]
2017-08-16 12:30:13 +05:30
Maneesh Gupta
ad692f9fa2
Merge pull request #157 from mangupta/switch_archived_ci_artifacts
...
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
[ROCm/hip commit: 05bde84918 ]
2017-08-16 11:19:20 +05:30
Maneesh Gupta
5a81e6d50d
Bump HIP version to 1.3 and update release notes
...
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915
[ROCm/hip commit: 8c94bc0009 ]
2017-08-16 11:18:51 +05:30
Maneesh Gupta
049f36bc96
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
...
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6
[ROCm/hip commit: c11b8737b9 ]
2017-08-16 10:57:07 +05:30
Ben Sander
269271535e
Merge pull request #152 from scchan/fix_threadfence_system_test
...
add C++11 compilation flags and minor bug fixes
[ROCm/hip commit: 04385f12cb ]
2017-08-15 23:04:49 -05:00
Ben Sander
d3df8b5908
Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
...
[ci] Continue building remaining dtests on failure
[ROCm/hip commit: 1575fc2579 ]
2017-08-15 23:03:41 -05:00
Ben Sander
542bd863a2
Add some new HIP_TRACE_API options.
...
[ROCm/hip commit: e3161bb40e ]
2017-08-16 04:00:33 +00:00
Ben Sander
c26e1d8c3d
Merge pull request #148 from weixingzhang/vmaddr1
...
[HIP Texture] The GPU virtual address for texture memory needs to be
[ROCm/hip commit: 8b7dde6519 ]
2017-08-15 14:19:45 -05:00
Maneesh Gupta
c036a1b25c
[texture] guard new HCC APIs under workweek
...
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
[ROCm/hip commit: e40047f2bf ]
2017-08-15 15:51:38 +05:30
Maneesh Gupta
478b7fb9ae
[ci] Continue building remaining dtests on failure
...
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b
[ROCm/hip commit: ec3db6791e ]
2017-08-11 15:57:20 +05:30
Maneesh Gupta
52a97377ac
[dtests] Fix hip_threadfence_system test's BUILD command
...
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
[ROCm/hip commit: 4bcb3372b5 ]
2017-08-11 14:49:17 +05:30
Siu Chi Chan
588ca89c92
add C++11 compilation flags and minor bug fixes
...
[ROCm/hip commit: 9a04c5cdd4 ]
2017-08-10 17:58:36 +00:00
Maneesh Gupta
0a25b3e972
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
[ROCm/hip commit: f14e31d9d7 ]
2017-08-10 12:51:05 +05:30
Maneesh Gupta
7390368693
[dtests] Disable hip_threadfence_system.tst on nvcc
...
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.
Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
[ROCm/hip commit: 69e431f035 ]
2017-08-10 12:05:58 +05:30
Ben Sander
5709a38ef3
Merge pull request #146 from gargrahul/docs_update_memcpysymbol
...
docs update for memcpytosymbol support
[ROCm/hip commit: e7a7574cf3 ]
2017-08-09 17:48:48 -05:00
Ben Sander
d4d9836a96
Merge pull request #145 from gargrahul/context_mgmt_changes
...
Updated context management logic:
[ROCm/hip commit: 660061c3b4 ]
2017-08-09 17:47:45 -05:00
Ben Sander
d2454ffb9f
Merge pull request #139 from gargrahul/disable_p2pcoherency_test
...
Disable failing test p2p_copy_coherency
[ROCm/hip commit: 4a34bce26f ]
2017-08-09 16:54:33 -05:00
Maneesh Gupta
8856953fed
Merge pull request #147 from mangupta/bump_workweek
...
Bump min hcc_workweek required for named kernel dispatch to 17312
[ROCm/hip commit: fa3f9c2474 ]
2017-08-09 14:10:07 +05:30
Weixing Zhang
5bf177340a
[HIP Texture] The GPU virtual address for texture memory needs to be
...
aligned.
In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
[ROCm/hip commit: 4264a4ce56 ]
2017-08-08 11:18:00 -04:00
Maneesh Gupta
c050396510
Bump min hcc_workweek required for named kernel dispatch to 17312
...
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
[ROCm/hip commit: 8d6acec135 ]
2017-08-08 11:08:55 +05:30
Rahul Garg
d571697190
docs update for memcpytosymbol support
...
[ROCm/hip commit: eb25bd72db ]
2017-08-08 09:13:12 +05:30
Rahul Garg
8d254a7113
Updated context management logic:
...
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
[ROCm/hip commit: 3e84cf4aba ]
2017-08-08 07:02:22 +05:30
Rahul Garg
63456bfea0
Disable failing test p2p_copy_coherency
...
[ROCm/hip commit: 2f617bac6d ]
2017-08-02 14:47:13 +05:30
Siu Chi Chan
21c1d69e77
implement __threadfence_system
...
[ROCm/hip commit: 96e79f97e2 ]
2017-08-02 08:50:18 +00:00
Ben Sander
f88f99276a
Merge pull request #136 from mangupta/update_install_markdown
...
[doc] Add details of HIP build time dependencies to INSTALL.md
[ROCm/hip commit: 6d6c325626 ]
2017-07-31 11:05:30 -05:00
Ben Sander
aba975dd29
Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
...
[hipcc] Cleanup amdgpu target logic
[ROCm/hip commit: 0fcd27586b ]
2017-07-31 11:05:06 -05:00
Maneesh Gupta
eba208b1fd
[doc] Add details of HIP build time dependencies to INSTALL.md
...
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
[ROCm/hip commit: e9c995959b ]
2017-07-31 14:57:32 +05:30
Maneesh Gupta
e7c823b4ed
Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
...
fix hipMemcpy2DAsync
[ROCm/hip commit: fce1d0d7f0 ]
2017-07-31 10:27:18 +05:30
Maneesh Gupta
1be0719063
Merge pull request #135 from bensander/fix_tracing
...
Some fixes to tracing.
[ROCm/hip commit: 48573a037e ]
2017-07-31 10:24:41 +05:30
Maneesh Gupta
5c9d90007f
[hipcc] Fix amdgpu target selection logic
...
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gf803.
- Removed defaulting to any specific target if rocm_agent_enumerator
fails. hipcc will report this and die if linking was required.
Change-Id: I76131867049fef92331807dd19a926406dcc1d02
[ROCm/hip commit: 85ff6e3ef4 ]
2017-07-31 10:11:19 +05:30
Ben Sander
4a234edea9
Some fixes to tracing.
...
[ROCm/hip commit: ecbb494a58 ]
2017-07-28 22:13:43 -05:00
Rahul Garg
7e9c13f8de
fix hipMemcpy2DAsync
...
[ROCm/hip commit: 0053e4db03 ]
2017-07-29 06:50:56 +05:30
Ben Sander
8c5df22733
Merge pull request #119 from emankov/master
...
[HIPIFY] CUDA RT Textures support
[ROCm/hip commit: 98f09debbb ]
2017-07-28 12:12:41 -05:00
Maneesh Gupta
514f5cb8fb
[hipcc] Cleanup amdgpu target logic
...
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).
Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
[ROCm/hip commit: 830e85060b ]
2017-07-28 16:18:15 +05:30
Maneesh Gupta
b763034c65
Merge pull request #127 from bensander/pass_module_kname
...
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
[ROCm/hip commit: 375ec84bc6 ]
2017-07-28 10:19:15 +05:30
Ben Sander
1ff23db17b
Add workweek check to make sure we have a new enough compiler
...
[ROCm/hip commit: 4980a6d3ab ]
2017-07-27 23:00:58 -05:00
Maneesh Gupta
90fa6b18eb
Merge pull request #122 from bensander/enable_async_null_stream
...
Set HIP_SYNC_NULL_STREAM=0.
[ROCm/hip commit: fcc294ed41 ]
2017-07-28 09:15:56 +05:30
Maneesh Gupta
5ab79f8182
Merge pull request #121 from syifan/master
...
Allow hip cmake to link libraries
[ROCm/hip commit: 821f8b9e02 ]
2017-07-28 09:12:15 +05:30
Ben Sander
3ab70ad042
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
...
[ROCm/hip commit: 3a4dfc0f85 ]
2017-07-27 22:00:15 -05:00
Evgeny Mankov
24fc06496b
Update Cuda2Hip.cpp
...
[ROCm/hip commit: d614f53ad7 ]
2017-07-28 01:45:24 +03:00