Rahul Garg
38c9febe21
Null check on input pointer arguments
2017-08-25 08:46:34 +05:30
Maneesh Gupta
4051b9fac0
Merge pull request #162 from gargrahul/correct_devicecnt_check
...
Device count range check on input device id function argument
2017-08-23 14:45:00 +05:30
Maneesh Gupta
87c9cee86c
Merge pull request #163 from gargrahul/pcibusid_dtests
...
PCI Bus ID related dtests
2017-08-23 14:44:25 +05:30
Rahul Garg
9441f3a50e
PCI Bus ID related dtests
2017-08-22 15:41:16 +05:30
Maneesh Gupta
7bbde1fe22
Merge pull request #159 from bensander/hipDispatchLatency
...
Refactor dispatch latency test and fix several bugs.
2017-08-22 14:49:14 +05:30
Rahul Garg
38ecfb9384
Device count check on input device id
2017-08-22 14:05:44 +05:30
Ben Sander
e87f7a4d00
Refactor dispatch latency test and fix several bugs.
2017-08-17 08:46:58 -05:00
Maneesh Gupta
f34b89b33f
Merge pull request #156 from bensander/update_trace_api
...
Add some new HIP_TRACE_API options.
2017-08-17 12:02:29 +05:30
Maneesh Gupta
c4737636d2
Merge pull request #158 from mangupta/bump_hip_version
...
Bump HIP version to 1.3 and update release notes
2017-08-16 12:30:13 +05:30
Maneesh Gupta
05bde84918
Merge pull request #157 from mangupta/switch_archived_ci_artifacts
...
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
2017-08-16 11:19:20 +05:30
Maneesh Gupta
8c94bc0009
Bump HIP version to 1.3 and update release notes
...
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915
2017-08-16 11:18:51 +05:30
Maneesh Gupta
c11b8737b9
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
...
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6
2017-08-16 10:57:07 +05:30
Ben Sander
04385f12cb
Merge pull request #152 from scchan/fix_threadfence_system_test
...
add C++11 compilation flags and minor bug fixes
2017-08-15 23:04:49 -05:00
Ben Sander
1575fc2579
Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
...
[ci] Continue building remaining dtests on failure
2017-08-15 23:03:41 -05:00
Ben Sander
e3161bb40e
Add some new HIP_TRACE_API options.
2017-08-16 04:00:33 +00:00
Ben Sander
8b7dde6519
Merge pull request #148 from weixingzhang/vmaddr1
...
[HIP Texture] The GPU virtual address for texture memory needs to be
2017-08-15 14:19:45 -05:00
Maneesh Gupta
e40047f2bf
[texture] guard new HCC APIs under workweek
...
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
2017-08-15 15:51:38 +05:30
Maneesh Gupta
ec3db6791e
[ci] Continue building remaining dtests on failure
...
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b
2017-08-11 15:57:20 +05:30
Maneesh Gupta
4bcb3372b5
[dtests] Fix hip_threadfence_system test's BUILD command
...
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
2017-08-11 14:49:17 +05:30
Siu Chi Chan
9a04c5cdd4
add C++11 compilation flags and minor bug fixes
2017-08-10 17:58:36 +00:00
Maneesh Gupta
f14e31d9d7
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Maneesh Gupta
69e431f035
[dtests] Disable hip_threadfence_system.tst on nvcc
...
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.
Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
2017-08-10 12:05:58 +05:30
Ben Sander
e7a7574cf3
Merge pull request #146 from gargrahul/docs_update_memcpysymbol
...
docs update for memcpytosymbol support
2017-08-09 17:48:48 -05:00
Ben Sander
660061c3b4
Merge pull request #145 from gargrahul/context_mgmt_changes
...
Updated context management logic:
2017-08-09 17:47:45 -05:00
Ben Sander
4a34bce26f
Merge pull request #139 from gargrahul/disable_p2pcoherency_test
...
Disable failing test p2p_copy_coherency
2017-08-09 16:54:33 -05:00
Maneesh Gupta
fa3f9c2474
Merge pull request #147 from mangupta/bump_workweek
...
Bump min hcc_workweek required for named kernel dispatch to 17312
2017-08-09 14:10:07 +05:30
Weixing Zhang
4264a4ce56
[HIP Texture] The GPU virtual address for texture memory needs to be
...
aligned.
In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
2017-08-08 11:18:00 -04:00
Maneesh Gupta
8d6acec135
Bump min hcc_workweek required for named kernel dispatch to 17312
...
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg
eb25bd72db
docs update for memcpytosymbol support
2017-08-08 09:13:12 +05:30
Rahul Garg
3e84cf4aba
Updated context management logic:
...
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
Rahul Garg
2f617bac6d
Disable failing test p2p_copy_coherency
2017-08-02 14:47:13 +05:30
Siu Chi Chan
96e79f97e2
implement __threadfence_system
2017-08-02 08:50:18 +00:00
Ben Sander
6d6c325626
Merge pull request #136 from mangupta/update_install_markdown
...
[doc] Add details of HIP build time dependencies to INSTALL.md
2017-07-31 11:05:30 -05:00
Ben Sander
0fcd27586b
Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
...
[hipcc] Cleanup amdgpu target logic
2017-07-31 11:05:06 -05:00
Maneesh Gupta
e9c995959b
[doc] Add details of HIP build time dependencies to INSTALL.md
...
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
2017-07-31 14:57:32 +05:30
Maneesh Gupta
fce1d0d7f0
Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
...
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta
48573a037e
Merge pull request #135 from bensander/fix_tracing
...
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Maneesh Gupta
85ff6e3ef4
[hipcc] Fix amdgpu target selection logic
...
- Refactoring introduced a bug when user does not specify any target via
--amdgpu-target, but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gf803.
- Removed defaulting to any specific target if rocm_agent_enumerator
fails. hipcc will report this and die if linking was required.
Change-Id: I76131867049fef92331807dd19a926406dcc1d02
2017-07-31 10:11:19 +05:30
Ben Sander
ecbb494a58
Some fixes to tracing.
2017-07-28 22:13:43 -05:00
Rahul Garg
0053e4db03
fix hipMemcpy2DAsync
2017-07-29 06:50:56 +05:30
Ben Sander
98f09debbb
Merge pull request #119 from emankov/master
...
[HIPIFY] CUDA RT Textures support
2017-07-28 12:12:41 -05:00
Maneesh Gupta
830e85060b
[hipcc] Cleanup amdgpu target logic
...
Existing logic has a bug. If user specifies targetA via commandline
options, while enumerator returns targetB, hipcc will create a fatbin
containing targets targetA and targetB. enumerator should only be used
when no target is specified by user (commandline or env var).
Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
2017-07-28 16:18:15 +05:30
Maneesh Gupta
375ec84bc6
Merge pull request #127 from bensander/pass_module_kname
...
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander
4980a6d3ab
Add workweek check to make sure we have a new enough compiler
2017-07-27 23:00:58 -05:00
Maneesh Gupta
fcc294ed41
Merge pull request #122 from bensander/enable_async_null_stream
...
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Maneesh Gupta
821f8b9e02
Merge pull request #121 from syifan/master
...
Allow hip cmake to link libraries
2017-07-28 09:12:15 +05:30
Ben Sander
3a4dfc0f85
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-27 22:00:15 -05:00
Evgeny Mankov
d614f53ad7
Update Cuda2Hip.cpp
2017-07-28 01:45:24 +03:00
Evgeny Mankov
bcca169631
Update CUDA_Runtime_API_functions_supported_by_HIP.md
2017-07-28 01:41:14 +03:00
Evgeny Mankov
36a9ccacb6
Update CUDA_Runtime_API_functions_supported_by_HIP.md
2017-07-28 01:38:50 +03:00