Ben Sander
dcd8961a9d
Update docs.
...
- Review unsupported comments and refresh.
- Review experimental tag in API.
[ROCm/clr commit: a6d1778fff ]
2017-08-28 15:44:38 -05:00
Maneesh Gupta
ca9ca1f808
Merge pull request #162 from gargrahul/correct_devicecnt_check
...
Device count range check on input device id function argument
[ROCm/clr commit: fbcc8764df ]
2017-08-23 14:45:00 +05:30
Rahul Garg
a76d10b58e
PCI Bus ID related dtests
...
[ROCm/clr commit: 5a49396e3e ]
2017-08-22 15:41:16 +05:30
Rahul Garg
7c51575223
Device count check on input device id
...
[ROCm/clr commit: 04a5a36ef7 ]
2017-08-22 14:05:44 +05:30
Ben Sander
3901200dca
Refactor dispatch latency test and fix several bugs.
...
[ROCm/clr commit: 9da4352142 ]
2017-08-17 08:46:58 -05:00
Maneesh Gupta
ae1c58fd1d
Merge pull request #156 from bensander/update_trace_api
...
Add some new HIP_TRACE_API options.
[ROCm/clr commit: ec84e8818e ]
2017-08-17 12:02:29 +05:30
Maneesh Gupta
41f4db0878
Merge pull request #158 from mangupta/bump_hip_version
...
Bump HIP version to 1.3 and update release notes
[ROCm/clr commit: f18167570d ]
2017-08-16 12:30:13 +05:30
Maneesh Gupta
333984302c
Bump HIP version to 1.3 and update release notes
...
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915
[ROCm/clr commit: 809a548bd4 ]
2017-08-16 11:18:51 +05:30
Maneesh Gupta
57fbb0f290
[ci] Archive packages based on hcc-ctu instead of hcc-1.6
...
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6
[ROCm/clr commit: 59c0df4d5c ]
2017-08-16 10:57:07 +05:30
Ben Sander
c25b5ab653
Merge pull request #152 from scchan/fix_threadfence_system_test
...
add C++11 compilation flags and minor bug fixes
[ROCm/clr commit: a0bbe2da97 ]
2017-08-15 23:04:49 -05:00
Ben Sander
d47041017b
Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
...
[ci] Continue building remaining dtests on failure
[ROCm/clr commit: 6c9c7b9e22 ]
2017-08-15 23:03:41 -05:00
Ben Sander
967ca1a44e
Add some new HIP_TRACE_API options.
...
[ROCm/clr commit: d6aee9ad8f ]
2017-08-16 04:00:33 +00:00
Ben Sander
ce1eab116f
Merge pull request #148 from weixingzhang/vmaddr1
...
[HIP Texture] The GPU virtual address for texture memory needs to be
[ROCm/clr commit: 94554500db ]
2017-08-15 14:19:45 -05:00
Maneesh Gupta
d74896f873
[texture] guard new HCC APIs under workweek
...
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
[ROCm/clr commit: 172a568aa6 ]
2017-08-15 15:51:38 +05:30
Maneesh Gupta
3585157c01
[ci] Continue building remaining dtests on failure
...
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b
[ROCm/clr commit: c48b1dc8ff ]
2017-08-11 15:57:20 +05:30
Maneesh Gupta
bf29705489
[dtests] Fix hip_threadfence_system test's BUILD command
...
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd
[ROCm/clr commit: 88ff8d9da4 ]
2017-08-11 14:49:17 +05:30
Siu Chi Chan
a69503394d
add C++11 compilation flags and minor bug fixes
...
[ROCm/clr commit: 8c2cda8f9e ]
2017-08-10 17:58:36 +00:00
Maneesh Gupta
b37cc92f36
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
[ROCm/clr commit: 2bd79ab9f5 ]
2017-08-10 12:51:05 +05:30
Maneesh Gupta
a8dc5dc11b
[dtests] Disable hip_threadfence_system.tst on nvcc
...
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.
Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760
[ROCm/clr commit: 0d537f9966 ]
2017-08-10 12:05:58 +05:30
Ben Sander
b17e0394f5
Merge pull request #146 from gargrahul/docs_update_memcpysymbol
...
docs update for memcpytosymbol support
[ROCm/clr commit: e810da61f3 ]
2017-08-09 17:48:48 -05:00
Ben Sander
34775b1296
Merge pull request #145 from gargrahul/context_mgmt_changes
...
Updated context management logic:
[ROCm/clr commit: dfc87a85bd ]
2017-08-09 17:47:45 -05:00
Ben Sander
086cd96e56
Merge pull request #139 from gargrahul/disable_p2pcoherency_test
...
Disable failing test p2p_copy_coherency
[ROCm/clr commit: 2a3ba24776 ]
2017-08-09 16:54:33 -05:00
Weixing Zhang
a7f95bb8ee
[HIP Texture] The GPU virtual address for texture memory needs to be aligned.
...
In hcc_am, a larger buffer is allocated for alignment purposes, and
_unalignedDevicePointer is added to struct AmPointerInfo to hold the
originally allocated address.
[ROCm/clr commit: e4de2d1138 ]
2017-08-08 11:18:00 -04:00
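The over-allocation approach described in the commit above can be sketched as follows. This is a minimal model using plain integers as addresses, not the hcc_am implementation: `PointerInfo`, `align_up`, `allocate_aligned`, and the `raw_alloc` callback are hypothetical names chosen for illustration, and only the idea of keeping the original (unaligned) address next to the aligned one comes from the commit message.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PointerInfo:
    """Hypothetical stand-in for AmPointerInfo (illustration only)."""
    device_pointer: int            # aligned address handed to the caller
    unaligned_device_pointer: int  # address originally returned by the allocator
    size_bytes: int                # usable size requested by the caller


def align_up(addr: int, alignment: int) -> int:
    """Round addr up to the next multiple of a power-of-two alignment."""
    if alignment <= 0 or alignment & (alignment - 1):
        raise ValueError("alignment must be a positive power of two")
    return (addr + alignment - 1) & ~(alignment - 1)


def allocate_aligned(raw_alloc: Callable[[int], int],
                     size_bytes: int, alignment: int) -> PointerInfo:
    """Over-allocate so that an aligned region of size_bytes always fits."""
    # Assumption: padding by `alignment` bytes; the actual padding used
    # by hcc_am is not stated in the commit message.
    raw = raw_alloc(size_bytes + alignment)
    return PointerInfo(
        device_pointer=align_up(raw, alignment),
        unaligned_device_pointer=raw,  # needed later to free the real block
        size_bytes=size_bytes,
    )
```

The original address has to be retained because the allocator only knows how to release the block it actually returned, not the rounded-up pointer.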
Maneesh Gupta
51c48e4cef
Bump min hcc_workweek required for named kernel dispatch to 17312
...
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
[ROCm/clr commit: dabd6d2bc6 ]
2017-08-08 11:08:55 +05:30
Rahul Garg
93d30119d0
docs update for memcpytosymbol support
...
[ROCm/clr commit: a8512b7597 ]
2017-08-08 09:13:12 +05:30
Rahul Garg
687fac97dc
Updated context management logic:
...
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
[ROCm/clr commit: c4e9323877 ]
2017-08-08 07:02:22 +05:30
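The four rules in the commit above can be modelled with a small state machine. This is a sketch of the described behaviour, not the HIP runtime code: the `ContextModel` class and its method names are hypothetical, contexts are represented as plain tuples, and two details are assumptions because the commit message does not specify them: the flag stays set until one of the calls in rule 4 resets it, and popping an empty stack returns the primary context.

```python
class ContextModel:
    """Toy model of the per-thread context logic (illustration only)."""

    def __init__(self):
        self.current_device = 0
        self.stack = []            # models the TLS context stack
        self.use_primary = False   # flag set by hipSetDevice (rule 1)
        self._next_id = 0

    def primary(self):
        """Primary context of the current device."""
        return ("primary", self.current_device)

    def set_device(self, device):
        # Rule 1: remember that the primary context should be reported next.
        self.current_device = device
        self.use_primary = True

    def ctx_get_current(self):
        # Rules 1 and 2: primary context if flagged or if the stack is empty.
        if self.use_primary or not self.stack:
            return self.primary()
        return self.stack[-1]

    def ctx_push_current(self, ctx):
        # Rule 4: an explicit push resets the flag.
        self.stack.append(ctx)
        self.use_primary = False

    def ctx_set_current(self, ctx):
        # Rule 4: replace the top of the stack (or push if empty), reset flag.
        if self.stack:
            self.stack[-1] = ctx
        else:
            self.stack.append(ctx)
        self.use_primary = False

    def ctx_create(self, device):
        # Rule 4: a newly created context becomes current and resets the flag.
        ctx = ("created", device, self._next_id)
        self._next_id += 1
        self.ctx_push_current(ctx)
        return ctx

    def ctx_pop_current(self):
        # Rule 3: fall back to the primary context when nothing is left.
        if self.stack:
            return self.stack.pop()
        return self.primary()
```

For example, after `set_device(1)` the model reports `("primary", 1)` from `ctx_get_current()` even if an explicitly created context is still on the stack, until a push, set, or create call clears the flag.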
Rahul Garg
b32cf1b3e8
Disable failing test p2p_copy_coherency
...
[ROCm/clr commit: f916a7654e ]
2017-08-02 14:47:13 +05:30
Siu Chi Chan
6598eabd24
implement __threadfence_system
...
[ROCm/clr commit: b6e22b5c93 ]
2017-08-02 08:50:18 +00:00
Ben Sander
ddf72a1b2c
Merge pull request #136 from mangupta/update_install_markdown
...
[doc] Add details of HIP build time dependencies to INSTALL.md
[ROCm/clr commit: 3061e394bf ]
2017-07-31 11:05:30 -05:00
Ben Sander
10a2a2e3e6
Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
...
[hipcc] Cleanup amdgpu target logic
[ROCm/clr commit: 3ed8c6c034 ]
2017-07-31 11:05:06 -05:00
Maneesh Gupta
e090aef2d2
[doc] Add details of HIP build time dependencies to INSTALL.md
...
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88
[ROCm/clr commit: 87e71ae681 ]
2017-07-31 14:57:32 +05:30
Maneesh Gupta
d3d3cedc1d
Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
...
fix hipMemcpy2DAsync
[ROCm/clr commit: 9ded1d02a1 ]
2017-07-31 10:27:18 +05:30
Maneesh Gupta
602c3466f9
Merge pull request #135 from bensander/fix_tracing
...
Some fixes to tracing.
[ROCm/clr commit: 06b51109c6 ]
2017-07-31 10:24:41 +05:30
Maneesh Gupta
5e2f8804ba
[hipcc] Fix amdgpu target selection logic
...
- Refactoring introduced a bug when the user does not specify any target
via --amdgpu-target but has an invalid target specified in
HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
gfx803.
- Removed defaulting to any specific target if rocm_agent_enumerator
fails. hipcc will report this and die if linking is required.
Change-Id: I76131867049fef92331807dd19a926406dcc1d02
[ROCm/clr commit: 8f6c150134 ]
2017-07-31 10:11:19 +05:30
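The target selection rules from this commit and from the earlier "[hipcc] Cleanup amdgpu target logic" commit can be sketched roughly as below. This is an illustrative model, not the hipcc script: `select_targets` and the `enumerate_agents` callback (standing in for a call to rocm_agent_enumerator) are hypothetical names. Giving the command line precedence over HCC_AMDGPU_TARGET is an assumption, and validation of target names is deliberately left out because the commit messages do not describe it.

```python
from typing import Callable, List, Sequence


def select_targets(cli_targets: Sequence[str],
                   env_targets: Sequence[str],
                   enumerate_agents: Callable[[], Sequence[str]],
                   need_link: bool) -> List[str]:
    """Pick the GPU targets to build for.

    The enumerator is consulted only when the user gave no target,
    so user-specified and detected targets are never mixed.
    """
    if cli_targets:                 # --amdgpu-target on the command line
        return list(cli_targets)
    if env_targets:                 # HCC_AMDGPU_TARGET (assumed lower priority)
        return list(env_targets)

    detected = list(enumerate_agents())
    if detected:
        return detected

    # No silent fallback to a default target: report and stop if a
    # target is actually needed for linking.
    if need_link:
        raise RuntimeError("no GPU target specified and none detected")
    return []
```

With this ordering, a user who passes targetA on the command line gets a binary for targetA only, even if the enumerator would have reported targetB.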
Ben Sander
6079b4b4f6
Some fixes to tracing.
...
[ROCm/clr commit: 9e9f384899 ]
2017-07-28 22:13:43 -05:00
Rahul Garg
5a1578847e
fix hipMemcpy2DAsync
...
[ROCm/clr commit: 320ae86d44 ]
2017-07-29 06:50:56 +05:30
Ben Sander
5a891afc7c
Merge pull request #119 from emankov/master
...
[HIPIFY] CUDA RT Textures support
[ROCm/clr commit: fad59ea8c3 ]
2017-07-28 12:12:41 -05:00
Maneesh Gupta
8e35e7a287
[hipcc] Cleanup amdgpu target logic
...
The existing logic has a bug. If the user specifies targetA via command-line
options while the enumerator returns targetB, hipcc creates a fatbin
containing both targetA and targetB. The enumerator should only be used
when no target is specified by the user (command line or env var).
Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9
[ROCm/clr commit: 2a2c7575eb ]
2017-07-28 16:18:15 +05:30
Maneesh Gupta
2405c239e1
Merge pull request #127 from bensander/pass_module_kname
...
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
[ROCm/clr commit: 791063ad6e ]
2017-07-28 10:19:15 +05:30
Ben Sander
004080bff8
Add workweek check to make sure we have a new enough compiler
...
[ROCm/clr commit: 2781293e89 ]
2017-07-27 23:00:58 -05:00
Maneesh Gupta
8b7ce77fc0
Merge pull request #122 from bensander/enable_async_null_stream
...
Set HIP_SYNC_NULL_STREAM=0.
[ROCm/clr commit: 8330fb3fe0 ]
2017-07-28 09:15:56 +05:30
Maneesh Gupta
0f78021303
Merge pull request #121 from syifan/master
...
Allow hip cmake to link libraries
[ROCm/clr commit: 6835f5ac59 ]
2017-07-28 09:12:15 +05:30
Ben Sander
c8b72cb926
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
...
[ROCm/clr commit: a282351763 ]
2017-07-27 22:00:15 -05:00
Evgeny Mankov
6e010bec22
Update Cuda2Hip.cpp
...
[ROCm/clr commit: 6c5adf42dd ]
2017-07-28 01:45:24 +03:00
Evgeny Mankov
f9780654b8
Update CUDA_Runtime_API_functions_supported_by_HIP.md
...
[ROCm/clr commit: 4486f91350 ]
2017-07-28 01:41:14 +03:00
Evgeny Mankov
4940eea975
Update CUDA_Runtime_API_functions_supported_by_HIP.md
...
[ROCm/clr commit: 681dd9e14d ]
2017-07-28 01:38:50 +03:00
Ben Sander
82a347659c
Merge pull request #110 from sunway513/fix_hipEnvVar_test
...
Fix hipEnvVar test
[ROCm/clr commit: bd563bf761 ]
2017-07-27 13:03:43 -05:00
sunway513
67afe0ec04
Change to use hipDeviceGetPCIBusId api for hipEnvVarDriver test
...
[ROCm/clr commit: b7c34fb80f ]
2017-07-27 17:41:49 +00:00
Ben Sander
1aa0656d34
Set HIP_SYNC_NULL_STREAM=0.
...
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.
[ROCm/clr commit: cd42711134 ]
2017-07-27 11:11:54 -05:00
Yifan Sun
1edab9f118
Allow hip cmake to link libraries
...
[ROCm/clr commit: 679075e2f0 ]
2017-07-27 11:46:18 -04:00