Ben Sander
e2a25fcab4
Merge pull request #165 from gargrahul/enable_strmsync_rcrdevnt_dtests
...
Enable hipStreamSync2 and record_event dtests
2017-08-28 15:51:35 -05:00
Rahul Garg
3e4df05632
Enable hipStreamSync2 and record_event dtests
2017-08-24 15:51:58 +05:30
Maneesh Gupta
fbcc8764df
Merge pull request #162 from gargrahul/correct_devicecnt_check
...
Device count range check on input device id function argument
2017-08-23 14:45:00 +05:30
Maneesh Gupta
76700d0d81
Merge pull request #163 from gargrahul/pcibusid_dtests
...
PCI Bus ID related dtests
2017-08-23 14:44:25 +05:30
Rahul Garg
5a49396e3e
PCI Bus ID related dtests
2017-08-22 15:41:16 +05:30
Maneesh Gupta
eab54316ee
Merge pull request #159 from bensander/hipDispatchLatency
...
Refactor dispatch latency test and fix several bugs.
2017-08-22 14:49:14 +05:30
Rahul Garg
04a5a36ef7
Device count check on input device id
2017-08-22 14:05:44 +05:30
Ben Sander
9da4352142
Refactor dispatch latency test and fix several bugs.
2017-08-17 08:46:58 -05:00
Maneesh Gupta
ec84e8818e
Merge pull request #156 from bensander/update_trace_api
...
Add some new HIP_TRACE_API options.
2017-08-17 12:02:29 +05:30
Maneesh Gupta
f18167570d
Merge pull request #158 from mangupta/bump_hip_version
...
Bump HIP version to 1.3 and update release notes
2017-08-16 12:30:13 +05:30
Maneesh Gupta
dbfe12083e
Merge pull request #157 from mangupta/switch_archived_ci_artifacts
...
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
2017-08-16 11:19:20 +05:30
Maneesh Gupta
809a548bd4
Bump HIP version to 1.3 and update release notes
...
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915
2017-08-16 11:18:51 +05:30
Maneesh Gupta
59c0df4d5c
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
...
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6
2017-08-16 10:57:07 +05:30
Ben Sander
a0bbe2da97
Merge pull request #152 from scchan/fix_threadfence_system_test
...
add C++11 compilation flags and minor bug fixes
2017-08-15 23:04:49 -05:00
Ben Sander
6c9c7b9e22
Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
...
[ci] Continue building remaining dtests on failure
2017-08-15 23:03:41 -05:00
Ben Sander
d6aee9ad8f
Add some new HIP_TRACE_API options.
2017-08-16 04:00:33 +00:00
Ben Sander
94554500db
Merge pull request #148 from weixingzhang/vmaddr1
...
[HIP Texture] The GPU virtual address for texture memory needs to be
2017-08-15 14:19:45 -05:00
Maneesh Gupta
172a568aa6
[texture] guard new HCC APIs under workweek
...
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
2017-08-15 15:51:38 +05:30
Maneesh Gupta
c48b1dc8ff
[ci] Continue building remaining dtests on failure
...
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b
2017-08-11 15:57:20 +05:30
Maneesh Gupta
88ff8d9da4
[dtests] Fix hip_threadfence_system test's BUILD command
...
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan
8c2cda8f9e
add C++11 compilation flags and minor bug fixes
2017-08-10 17:58:36 +00:00
Maneesh Gupta
2bd79ab9f5
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Maneesh Gupta
0d537f9966
[dtests] Disable hip_threadfence_system.tst on nvcc
...
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.
Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Ben Sander
e810da61f3
Merge pull request #146 from gargrahul/docs_update_memcpysymbol
...
docs update for memcpytosymbol support
2017-08-09 17:48:48 -05:00
Ben Sander
dfc87a85bd
Merge pull request #145 from gargrahul/context_mgmt_changes
...
Updated context management logic:
2017-08-09 17:47:45 -05:00
Ben Sander
2a3ba24776
Merge pull request #139 from gargrahul/disable_p2pcoherency_test
...
Disable failing test p2p_copy_coherency
2017-08-09 16:54:33 -05:00
Maneesh Gupta
077c5a75af
Merge pull request #147 from mangupta/bump_workweek
...
Bump min hcc_workweek required for named kernel dispatch to 17312
2017-08-09 14:10:07 +05:30
Weixing Zhang
e4de2d1138
[HIP Texture] The GPU virtual address for texture memory needs to be
...
aligned.
In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
2017-08-08 11:18:00 -04:00
Maneesh Gupta
dabd6d2bc6
Bump min hcc_workweek required for named kernel dispatch to 17312
...
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg
a8512b7597
docs update for memcpytosymbol support
2017-08-08 09:13:12 +05:30
Rahul Garg
c4e9323877
Updated context management logic:
...
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
Rahul Garg
f916a7654e
Disable failing test p2p_copy_coherency
2017-08-02 14:47:13 +05:30
Siu Chi Chan
b6e22b5c93
implement __threadfence_system
2017-08-02 08:50:18 +00:00
Ben Sander
3061e394bf
Merge pull request #136 from mangupta/update_install_markdown
...
[doc] Add details of HIP build time dependencies to INSTALL.md
2017-07-31 11:05:30 -05:00
Ben Sander
3ed8c6c034
Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
...
[hipcc] Cleanup amdgpu target logic
2017-07-31 11:05:06 -05:00
Maneesh Gupta
87e71ae681
[doc] Add details of HIP build time dependencies to INSTALL.md
...
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
2017-07-31 14:57:32 +05:30
Maneesh Gupta
9ded1d02a1
Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
...
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta
06b51109c6
Merge pull request #135 from bensander/fix_tracing
...
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Maneesh Gupta
8f6c150134
[hipcc] Fix amdgpu target selection logic
...
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gf803.
- Removed defaulting to any specific target if rocm_agent_enumerator
fails. hipcc will report this and die if linking was required.
Change-Id: I76131867049fef92331807dd19a926406dcc1d02
2017-07-31 10:11:19 +05:30
Ben Sander
9e9f384899
Some fixes to tracing.
2017-07-28 22:13:43 -05:00
Rahul Garg
320ae86d44
fix hipMemcpy2DAsync
2017-07-29 06:50:56 +05:30
Ben Sander
fad59ea8c3
Merge pull request #119 from emankov/master
...
[HIPIFY] CUDA RT Textures support
2017-07-28 12:12:41 -05:00
Maneesh Gupta
2a2c7575eb
[hipcc] Cleanup amdgpu target logic
...
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).
Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
2017-07-28 16:18:15 +05:30
Maneesh Gupta
791063ad6e
Merge pull request #127 from bensander/pass_module_kname
...
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander
2781293e89
Add workweek check to make sure we have a new enough compiler
2017-07-27 23:00:58 -05:00
Maneesh Gupta
8330fb3fe0
Merge pull request #122 from bensander/enable_async_null_stream
...
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Maneesh Gupta
6835f5ac59
Merge pull request #121 from syifan/master
...
Allow hip cmake to link libraries
2017-07-28 09:12:15 +05:30
Ben Sander
a282351763
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-27 22:00:15 -05:00
Evgeny Mankov
6c5adf42dd
Update Cuda2Hip.cpp
2017-07-28 01:45:24 +03:00
Evgeny Mankov
4486f91350
Update CUDA_Runtime_API_functions_supported_by_HIP.md
2017-07-28 01:41:14 +03:00