Commit graph

1850 commits

Author SHA1 Message Date
Rahul Garg 38c9febe21 Null check on input pointer arguments 2017-08-25 08:46:34 +05:30
Maneesh Gupta 4051b9fac0 Merge pull request #162 from gargrahul/correct_devicecnt_check
Device count range check on input device id function argument
2017-08-23 14:45:00 +05:30
Maneesh Gupta 87c9cee86c Merge pull request #163 from gargrahul/pcibusid_dtests
PCI Bus ID related dtests
2017-08-23 14:44:25 +05:30
Rahul Garg 9441f3a50e PCI Bus ID related dtests 2017-08-22 15:41:16 +05:30
Maneesh Gupta 7bbde1fe22 Merge pull request #159 from bensander/hipDispatchLatency
Refactor dispatch latency test and fix several bugs.
2017-08-22 14:49:14 +05:30
Rahul Garg 38ecfb9384 Device count check on input device id 2017-08-22 14:05:44 +05:30
Ben Sander e87f7a4d00 Refactor dispatch latency test and fix several bugs. 2017-08-17 08:46:58 -05:00
Maneesh Gupta f34b89b33f Merge pull request #156 from bensander/update_trace_api
Add some new HIP_TRACE_API options.
2017-08-17 12:02:29 +05:30
Maneesh Gupta c4737636d2 Merge pull request #158 from mangupta/bump_hip_version
Bump HIP version to 1.3 and update release notes
2017-08-16 12:30:13 +05:30
Maneesh Gupta 05bde84918 Merge pull request #157 from mangupta/switch_archived_ci_artifacts
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
2017-08-16 11:19:20 +05:30
Maneesh Gupta 8c94bc0009 Bump HIP version to 1.3 and update release notes
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915
2017-08-16 11:18:51 +05:30
Maneesh Gupta c11b8737b9 [ci] Archive packages based on hcc-ctu instead of hcc-1.6
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6
2017-08-16 10:57:07 +05:30
Ben Sander 04385f12cb Merge pull request #152 from scchan/fix_threadfence_system_test
add C++11 compilation flags and minor bug fixes
2017-08-15 23:04:49 -05:00
Ben Sander 1575fc2579 Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
[ci] Continue building remaining dtests on failure
2017-08-15 23:03:41 -05:00
Ben Sander e3161bb40e Add some new HIP_TRACE_API options. 2017-08-16 04:00:33 +00:00
Ben Sander 8b7dde6519 Merge pull request #148 from weixingzhang/vmaddr1
[HIP Texture] The GPU virtual address for texture memory needs to be aligned.
2017-08-15 14:19:45 -05:00
Maneesh Gupta e40047f2bf [texture] guard new HCC APIs under workweek
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
2017-08-15 15:51:38 +05:30
Maneesh Gupta ec3db6791e [ci] Continue building remaining dtests on failure
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b
2017-08-11 15:57:20 +05:30
Maneesh Gupta 4bcb3372b5 [dtests] Fix hip_threadfence_system test's BUILD command
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan 9a04c5cdd4 add C++11 compilation flags and minor bug fixes 2017-08-10 17:58:36 +00:00
Maneesh Gupta f14e31d9d7 Merge pull request #138 from scchan/threadfence_system
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Maneesh Gupta 69e431f035 [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Ben Sander e7a7574cf3 Merge pull request #146 from gargrahul/docs_update_memcpysymbol
docs update for memcpytosymbol support
2017-08-09 17:48:48 -05:00
Ben Sander 660061c3b4 Merge pull request #145 from gargrahul/context_mgmt_changes
Updated context management logic:
2017-08-09 17:47:45 -05:00
Ben Sander 4a34bce26f Merge pull request #139 from gargrahul/disable_p2pcoherency_test
Disable failing test p2p_copy_coherency
2017-08-09 16:54:33 -05:00
Maneesh Gupta fa3f9c2474 Merge pull request #147 from mangupta/bump_workweek
Bump min hcc_workweek required for named kernel dispatch to 17312
2017-08-09 14:10:07 +05:30
Weixing Zhang 4264a4ce56 [HIP Texture] The GPU virtual address for texture memory needs to be
aligned.

In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
2017-08-08 11:18:00 -04:00
Maneesh Gupta 8d6acec135 Bump min hcc_workweek required for named kernel dispatch to 17312
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg eb25bd72db docs update for memcpytosymbol support 2017-08-08 09:13:12 +05:30
Rahul Garg 3e84cf4aba Updated context management logic:
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
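The four context management rules in commit 3e84cf4aba can be read as a small per-thread state machine. The following is a minimal sketch of that reading, not the HIP implementation: `Ctx`, `ThreadCtxState`, and all member names are hypothetical, the return value of `popCurrent` on an empty stack is one interpretation of rule 3, and `hipCtxCreate` is omitted (under rule 4 it would reset the flag the same way `pushCurrent` does).

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a context handle; not the real hipCtx_t.
struct Ctx {
    int device;
    bool primary;
};

// Hypothetical model of the per-thread (TLS) context state described in the commit.
class ThreadCtxState {
public:
    explicit ThreadCtxState(int deviceCount) {
        assert(deviceCount > 0);
        for (int d = 0; d < deviceCount; ++d) primaries_.push_back(Ctx{d, true});
    }

    // Rule 1: setting the device raises a flag so that the next
    // getCurrent() returns the primary context of that device.
    void setDevice(int device) {
        assert(device >= 0 && static_cast<std::size_t>(device) < primaries_.size());
        currentDevice_ = device;
        usePrimary_ = true;
    }

    // Rule 2: return the primary context when the flag is set
    // or when the context stack is empty.
    const Ctx* getCurrent() const {
        if (usePrimary_ || stack_.empty()) return &primaries_[currentDevice_];
        return stack_.back();
    }

    // Rule 4: pushing a context clears the flag raised by setDevice().
    void pushCurrent(const Ctx* ctx) {
        stack_.push_back(ctx);
        usePrimary_ = false;
    }

    // Rule 4: replacing the current context also clears the flag.
    void setCurrent(const Ctx* ctx) {
        if (!stack_.empty()) stack_.pop_back();
        stack_.push_back(ctx);
        usePrimary_ = false;
    }

    // Rule 3 (assumed reading): pop the top of the stack, or fall back to
    // the primary context of the current device when the stack is empty.
    const Ctx* popCurrent() {
        if (stack_.empty()) return &primaries_[currentDevice_];
        const Ctx* top = stack_.back();
        stack_.pop_back();
        return top;
    }

private:
    std::vector<Ctx> primaries_;      // one primary context per device
    std::vector<const Ctx*> stack_;   // user-pushed contexts
    int currentDevice_ = 0;
    bool usePrimary_ = false;
};
```

In this model, a call to `setDevice` temporarily overrides whatever is on the stack until the next push or set, which is the behaviour rules 1 and 4 describe together.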
Rahul Garg 2f617bac6d Disable failing test p2p_copy_coherency 2017-08-02 14:47:13 +05:30
Siu Chi Chan 96e79f97e2 implement __threadfence_system 2017-08-02 08:50:18 +00:00
Ben Sander 6d6c325626 Merge pull request #136 from mangupta/update_install_markdown
[doc] Add details of HIP build time dependencies to INSTALL.md
2017-07-31 11:05:30 -05:00
Ben Sander 0fcd27586b Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
[hipcc] Cleanup amdgpu target logic
2017-07-31 11:05:06 -05:00
Maneesh Gupta e9c995959b [doc] Add details of HIP build time dependencies to INSTALL.md
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
2017-07-31 14:57:32 +05:30
Maneesh Gupta fce1d0d7f0 Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta 48573a037e Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Maneesh Gupta 85ff6e3ef4 [hipcc] Fix amdgpu target selection logic
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gfx803.
- Removed defaulting to any specific target if rocm_agent_enumerator
  fails. hipcc will report this and die if linking was required.

Change-Id: I76131867049fef92331807dd19a926406dcc1d02
2017-07-31 10:11:19 +05:30
Ben Sander ecbb494a58 Some fixes to tracing. 2017-07-28 22:13:43 -05:00
Rahul Garg 0053e4db03 fix hipMemcpy2DAsync 2017-07-29 06:50:56 +05:30
Ben Sander 98f09debbb Merge pull request #119 from emankov/master
[HIPIFY] CUDA RT Textures support
2017-07-28 12:12:41 -05:00
Maneesh Gupta 830e85060b [hipcc] Cleanup amdgpu target logic
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).

Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
2017-07-28 16:18:15 +05:30
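The target selection behaviour described in commits 830e85060b and 85ff6e3ef4 can be sketched as a simple precedence rule. This is an illustration only: hipcc itself is not written this way, and `selectTargets`, `mustAbort`, and the ordering of command line before environment variable are assumptions made for the example. The sketch also assumes that a user-supplied value is passed through unchanged rather than replaced by a default.

```cpp
#include <string>
#include <vector>

using Targets = std::vector<std::string>;

// Hypothetical helper: pick the GPU targets to build for.
// User-specified targets (command line, then environment variable) win outright;
// the enumerator result is used only when the user specified nothing,
// so the two sources are never merged into one fatbin.
Targets selectTargets(const Targets& cmdline,
                      const Targets& env,
                      const Targets& enumerated) {
    if (!cmdline.empty()) return cmdline;
    if (!env.empty()) return env;
    // May be empty if the enumerator failed; no default target is substituted.
    return enumerated;
}

// Hypothetical helper: with no target available, the build can only
// continue if no linking step is required.
bool mustAbort(const Targets& selected, bool linkingRequired) {
    return selected.empty() && linkingRequired;
}
```

With `targetA` on the command line and `targetB` from the enumerator, `selectTargets` returns only `targetA`, which is the case the first commit message singles out as previously broken.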
Maneesh Gupta 375ec84bc6 Merge pull request #127 from bensander/pass_module_kname
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander 4980a6d3ab Add workweek check to make sure we have a new enough compiler 2017-07-27 23:00:58 -05:00
Maneesh Gupta fcc294ed41 Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Maneesh Gupta 821f8b9e02 Merge pull request #121 from syifan/master
Allow hip cmake to link libraries
2017-07-28 09:12:15 +05:30
Ben Sander 3a4dfc0f85 Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile 2017-07-27 22:00:15 -05:00
Evgeny Mankov d614f53ad7 Update Cuda2Hip.cpp 2017-07-28 01:45:24 +03:00
Evgeny Mankov bcca169631 Update CUDA_Runtime_API_functions_supported_by_HIP.md 2017-07-28 01:41:14 +03:00
Evgeny Mankov 36a9ccacb6 Update CUDA_Runtime_API_functions_supported_by_HIP.md 2017-07-28 01:38:50 +03:00