Wykres commitów

1851 Commity

Autor SHA1 Wiadomość Data
Ben Sander e2a25fcab4 Merge pull request #165 from gargrahul/enable_strmsync_rcrdevnt_dtests
Enable hipStreamSync2 and record_event dtests
2017-08-28 15:51:35 -05:00
Rahul Garg 3e4df05632 Enable hipStreamSync2 and record_event dtests 2017-08-24 15:51:58 +05:30
Maneesh Gupta fbcc8764df Merge pull request #162 from gargrahul/correct_devicecnt_check
Device count range check on input device id function argument
2017-08-23 14:45:00 +05:30
Maneesh Gupta 76700d0d81 Merge pull request #163 from gargrahul/pcibusid_dtests
PCI Bus ID related dtests
2017-08-23 14:44:25 +05:30
Rahul Garg 5a49396e3e PCI Bus ID related dtests 2017-08-22 15:41:16 +05:30
Maneesh Gupta eab54316ee Merge pull request #159 from bensander/hipDispatchLatency
Refactor dispatch latency test and fix several bugs.
2017-08-22 14:49:14 +05:30
Rahul Garg 04a5a36ef7 Device count check on input device id 2017-08-22 14:05:44 +05:30
Ben Sander 9da4352142 Refactor dispatch latency test and fix several bugs. 2017-08-17 08:46:58 -05:00
Maneesh Gupta ec84e8818e Merge pull request #156 from bensander/update_trace_api
Add some new HIP_TRACE_API options.
2017-08-17 12:02:29 +05:30
Maneesh Gupta f18167570d Merge pull request #158 from mangupta/bump_hip_version
Bump HIP version to 1.3 and update release notes
2017-08-16 12:30:13 +05:30
Maneesh Gupta dbfe12083e Merge pull request #157 from mangupta/switch_archived_ci_artifacts
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
2017-08-16 11:19:20 +05:30
Maneesh Gupta 809a548bd4 Bump HIP version to 1.3 and update release notes
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915
2017-08-16 11:18:51 +05:30
Maneesh Gupta 59c0df4d5c [ci] Archive packages based on hcc-ctu instead of hcc-1.6
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6
2017-08-16 10:57:07 +05:30
Ben Sander a0bbe2da97 Merge pull request #152 from scchan/fix_threadfence_system_test
add C++11 compilation flags and minor bug fixes
2017-08-15 23:04:49 -05:00
Ben Sander 6c9c7b9e22 Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
[ci] Continue building remaining dtests on failure
2017-08-15 23:03:41 -05:00
Ben Sander d6aee9ad8f Add some new HIP_TRACE_API options. 2017-08-16 04:00:33 +00:00
Ben Sander 94554500db Merge pull request #148 from weixingzhang/vmaddr1
[HIP Texture] The GPU virtual address for texture memory needs to be
2017-08-15 14:19:45 -05:00
Maneesh Gupta 172a568aa6 [texture] guard new HCC APIs under workweek
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
2017-08-15 15:51:38 +05:30
Maneesh Gupta c48b1dc8ff [ci] Continue building remaining dtests on failure
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b
2017-08-11 15:57:20 +05:30
Maneesh Gupta 88ff8d9da4 [dtests] Fix hip_threadfence_system test's BUILD command
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan 8c2cda8f9e add C++11 compilation flags and minor bug fixes 2017-08-10 17:58:36 +00:00
Maneesh Gupta 2bd79ab9f5 Merge pull request #138 from scchan/threadfence_system
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Maneesh Gupta 0d537f9966 [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Ben Sander e810da61f3 Merge pull request #146 from gargrahul/docs_update_memcpysymbol
docs update for memcpytosymbol support
2017-08-09 17:48:48 -05:00
Ben Sander dfc87a85bd Merge pull request #145 from gargrahul/context_mgmt_changes
Updated context management logic:
2017-08-09 17:47:45 -05:00
Ben Sander 2a3ba24776 Merge pull request #139 from gargrahul/disable_p2pcoherency_test
Disable failing test p2p_copy_coherency
2017-08-09 16:54:33 -05:00
Maneesh Gupta 077c5a75af Merge pull request #147 from mangupta/bump_workweek
Bump min hcc_workweek required for named kernel dispatch to 17312
2017-08-09 14:10:07 +05:30
Weixing Zhang e4de2d1138 [HIP Texture] The GPU virtual address for texture memory needs to be
aligned.

In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
2017-08-08 11:18:00 -04:00
Maneesh Gupta dabd6d2bc6 Bump min hcc_workweek required for named kernel dispatch to 17312
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg a8512b7597 docs update for memcpytosymbol support 2017-08-08 09:13:12 +05:30
Rahul Garg c4e9323877 Updated context management logic:
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
Rahul Garg f916a7654e Disable failing test p2p_copy_coherency 2017-08-02 14:47:13 +05:30
Siu Chi Chan b6e22b5c93 implement __threadfence_system 2017-08-02 08:50:18 +00:00
Ben Sander 3061e394bf Merge pull request #136 from mangupta/update_install_markdown
[doc] Add details of HIP build time dependencies to INSTALL.md
2017-07-31 11:05:30 -05:00
Ben Sander 3ed8c6c034 Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
[hipcc] Cleanup amdgpu target logic
2017-07-31 11:05:06 -05:00
Maneesh Gupta 87e71ae681 [doc] Add details of HIP build time dependencies to INSTALL.md
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
2017-07-31 14:57:32 +05:30
Maneesh Gupta 9ded1d02a1 Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta 06b51109c6 Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Maneesh Gupta 8f6c150134 [hipcc] Fix amdgpu target selection logic
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gf803.
- Removed defaulting to any specific target if rocm_agent_enumerator
  fails. hipcc will report this and die if linking was required.

Change-Id: I76131867049fef92331807dd19a926406dcc1d02
2017-07-31 10:11:19 +05:30
Ben Sander 9e9f384899 Some fixes to tracing. 2017-07-28 22:13:43 -05:00
Rahul Garg 320ae86d44 fix hipMemcpy2DAsync 2017-07-29 06:50:56 +05:30
Ben Sander fad59ea8c3 Merge pull request #119 from emankov/master
[HIPIFY] CUDA RT Textures support
2017-07-28 12:12:41 -05:00
Maneesh Gupta 2a2c7575eb [hipcc] Cleanup amdgpu target logic
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).

Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
2017-07-28 16:18:15 +05:30
Maneesh Gupta 791063ad6e Merge pull request #127 from bensander/pass_module_kname
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander 2781293e89 Add workweek check to make sure we have a new enough compiler 2017-07-27 23:00:58 -05:00
Maneesh Gupta 8330fb3fe0 Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Maneesh Gupta 6835f5ac59 Merge pull request #121 from syifan/master
Allow hip cmake to link libraries
2017-07-28 09:12:15 +05:30
Ben Sander a282351763 Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile 2017-07-27 22:00:15 -05:00
Evgeny Mankov 6c5adf42dd Update Cuda2Hip.cpp 2017-07-28 01:45:24 +03:00
Evgeny Mankov 4486f91350 Update CUDA_Runtime_API_functions_supported_by_HIP.md 2017-07-28 01:41:14 +03:00