Commit Graph

1779 Commits

Author SHA1 Message Date
Ben Sander dcd8961a9d Update docs.
- Review unsupported comments and refresh.
- Review experimental tag in API.


[ROCm/clr commit: a6d1778fff]
2017-08-28 15:44:38 -05:00
Maneesh Gupta ca9ca1f808 Merge pull request #162 from gargrahul/correct_devicecnt_check
Device count range check on input device id function argument

[ROCm/clr commit: fbcc8764df]
2017-08-23 14:45:00 +05:30
Rahul Garg a76d10b58e PCI Bus ID related dtests
[ROCm/clr commit: 5a49396e3e]
2017-08-22 15:41:16 +05:30
Rahul Garg 7c51575223 Device count check on input device id
[ROCm/clr commit: 04a5a36ef7]
2017-08-22 14:05:44 +05:30
Ben Sander 3901200dca Refactor dispatch latency test and fix several bugs.
[ROCm/clr commit: 9da4352142]
2017-08-17 08:46:58 -05:00
Maneesh Gupta ae1c58fd1d Merge pull request #156 from bensander/update_trace_api
Add some new HIP_TRACE_API options.

[ROCm/clr commit: ec84e8818e]
2017-08-17 12:02:29 +05:30
Maneesh Gupta 41f4db0878 Merge pull request #158 from mangupta/bump_hip_version
Bump HIP version to 1.3 and update release notes

[ROCm/clr commit: f18167570d]
2017-08-16 12:30:13 +05:30
Maneesh Gupta 333984302c Bump HIP version to 1.3 and update release notes
Change-Id: I53f89efca03472cb906bd56200a2bf2996453915


[ROCm/clr commit: 809a548bd4]
2017-08-16 11:18:51 +05:30
Maneesh Gupta 57fbb0f290 [ci] Archive packages based on hcc-ctu instead of hcc-1.6
Change-Id: Ie1ea6411ed19aae87b979fa001db3e8927fc11f6


[ROCm/clr commit: 59c0df4d5c]
2017-08-16 10:57:07 +05:30
Ben Sander c25b5ab653 Merge pull request #152 from scchan/fix_threadfence_system_test
add C++11 compilation flags and minor bug fixes

[ROCm/clr commit: a0bbe2da97]
2017-08-15 23:04:49 -05:00
Ben Sander d47041017b Merge pull request #153 from mangupta/ci_ignore_testbuild_failures
[ci] Continue building remaining dtests on failure

[ROCm/clr commit: 6c9c7b9e22]
2017-08-15 23:03:41 -05:00
Ben Sander 967ca1a44e Add some new HIP_TRACE_API options.
[ROCm/clr commit: d6aee9ad8f]
2017-08-16 04:00:33 +00:00
Ben Sander ce1eab116f Merge pull request #148 from weixingzhang/vmaddr1
[HIP Texture] The GPU virtual address for texture memory needs to be

[ROCm/clr commit: 94554500db]
2017-08-15 14:19:45 -05:00
Maneesh Gupta d74896f873 [texture] guard new HCC APIs under workweek
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976


[ROCm/clr commit: 172a568aa6]
2017-08-15 15:51:38 +05:30
Maneesh Gupta 3585157c01 [ci] Continue building remaining dtests on failure
Change-Id: Iab7fd53517e0b66dd5f71efb9e65bd2e9c91355b


[ROCm/clr commit: c48b1dc8ff]
2017-08-11 15:57:20 +05:30
Maneesh Gupta bf29705489 [dtests] Fix hip_threadfence_system test's BUILD command
Change-Id: I037c523369e4bddd7a386ffe117fe97a178900fd


[ROCm/clr commit: 88ff8d9da4]
2017-08-11 14:49:17 +05:30
Siu Chi Chan a69503394d add C++11 compilation flags and minor bug fixes
[ROCm/clr commit: 8c2cda8f9e]
2017-08-10 17:58:36 +00:00
Maneesh Gupta b37cc92f36 Merge pull request #138 from scchan/threadfence_system
implement __threadfence_system

[ROCm/clr commit: 2bd79ab9f5]
2017-08-10 12:51:05 +05:30
Maneesh Gupta a8dc5dc11b [dtests] Disable hip_threadfence_system.tst on nvcc
The test needs some more modifications before it is ready for building
on nvcc path. Disabling it for now so that the pull request can be
merged.

Change-Id: I19a894fcda7b2159f86a4e4e95a409c5655d3760


[ROCm/clr commit: 0d537f9966]
2017-08-10 12:05:58 +05:30
Ben Sander b17e0394f5 Merge pull request #146 from gargrahul/docs_update_memcpysymbol
docs update for memcpytosymbol support

[ROCm/clr commit: e810da61f3]
2017-08-09 17:48:48 -05:00
Ben Sander 34775b1296 Merge pull request #145 from gargrahul/context_mgmt_changes
Updated context management logic:

[ROCm/clr commit: dfc87a85bd]
2017-08-09 17:47:45 -05:00
Ben Sander 086cd96e56 Merge pull request #139 from gargrahul/disable_p2pcoherency_test
Disable failing test p2p_copy_coherency

[ROCm/clr commit: 2a3ba24776]
2017-08-09 16:54:33 -05:00
Weixing Zhang a7f95bb8ee [HIP Texture] The GPU virtual address for texture memory needs to be
aligned.

In hcc_am, a larger buffer is allocated for alignment purposes, and
_unalignedDevicePointer is added to struct AmPointerInfo to record the
originally allocated address.


[ROCm/clr commit: e4de2d1138]
2017-08-08 11:18:00 -04:00
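The over-allocate-and-round-up approach described in the commit message above can be sketched roughly as follows. This is only an illustration, not the hcc_am code: `PointerInfo`, `align_up`, `allocate_aligned`, and the `raw_alloc` callback are hypothetical names, and the 256-byte alignment in the usage note is an assumed value, not one taken from the commit.

```python
from dataclasses import dataclass


@dataclass
class PointerInfo:
    """Hypothetical stand-in for a tracking record; not the real AmPointerInfo."""
    device_pointer: int            # aligned address handed to the caller
    unaligned_device_pointer: int  # address the allocator originally returned
    size_bytes: int                # size the caller asked for


def align_up(address: int, alignment: int) -> int:
    """Round address up to the next multiple of alignment (a power of two)."""
    if alignment <= 0 or alignment & (alignment - 1):
        raise ValueError("alignment must be a positive power of two")
    return (address + alignment - 1) & ~(alignment - 1)


def allocate_aligned(raw_alloc, size_bytes: int, alignment: int) -> PointerInfo:
    """Allocate size_bytes so that the returned address is aligned.

    raw_alloc is an assumed callback that takes a byte count and returns
    an integer address with no alignment guarantee.
    """
    # Over-allocate by alignment - 1 bytes so an aligned start always fits.
    raw = raw_alloc(size_bytes + alignment - 1)
    # Keep the raw address so the buffer can later be freed correctly.
    return PointerInfo(align_up(raw, alignment), raw, size_bytes)
```

For instance, `allocate_aligned(lambda n: 0x1004, 100, 256)` would hand out `0x1100` while remembering `0x1004` as the address to release.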
Maneesh Gupta 51c48e4cef Bump min hcc_workweek required for named kernel dispatch to 17312
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68


[ROCm/clr commit: dabd6d2bc6]
2017-08-08 11:08:55 +05:30
Rahul Garg 93d30119d0 docs update for memcpytosymbol support
[ROCm/clr commit: a8512b7597]
2017-08-08 09:13:12 +05:30
Rahul Garg 687fac97dc Updated context management logic:
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice


[ROCm/clr commit: c4e9323877]
2017-08-08 07:02:22 +05:30
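The four rules in the commit message above can be modelled with a small single-threaded toy. It is a sketch of one reading of those rules, not HIP's implementation: the `ContextModel` class, its method names, and the string labels used for contexts are all hypothetical, and the thread-local stack is reduced to a plain list. What pop returns on an empty stack, and treating set-current as "replace the top of the stack", are assumptions made for the sketch.

```python
class ContextModel:
    """Toy model of the four context rules; contexts are plain string labels."""

    def __init__(self, device_count: int):
        # One primary context per device.
        self.primary = [f"primary-ctx-dev{d}" for d in range(device_count)]
        self.current_device = 0
        self.stack = []            # stands in for the per-thread context stack
        self.use_primary = False   # the flag that set_device raises (rule 1)

    def set_device(self, device: int):
        if not 0 <= device < len(self.primary):
            raise ValueError("invalid device id")
        self.current_device = device
        self.use_primary = True    # rule 1

    def ctx_get_current(self):
        # Rules 1 and 2: primary context if flagged or if the stack is empty.
        if self.use_primary or not self.stack:
            return self.primary[self.current_device]
        return self.stack[-1]

    def ctx_push_current(self, ctx):
        self.stack.append(ctx)
        self.use_primary = False   # rule 4

    def ctx_set_current(self, ctx):
        # Assumed behaviour: replace the top of the stack, or push if empty.
        if self.stack:
            self.stack[-1] = ctx
        else:
            self.stack.append(ctx)
        self.use_primary = False   # rule 4

    def ctx_create(self, name):
        # Assumed behaviour: a newly created context becomes current.
        self.ctx_push_current(name)
        return name

    def ctx_pop_current(self):
        # Rule 3: with nothing to pop, fall back to the primary context.
        if self.stack:
            return self.stack.pop()
        return self.primary[self.current_device]
```

In this model, pushing a context and then calling `set_device(1)` makes `ctx_get_current()` report device 1's primary context until the next push, set, or create clears the flag.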
Rahul Garg b32cf1b3e8 Disable failing test p2p_copy_coherency
[ROCm/clr commit: f916a7654e]
2017-08-02 14:47:13 +05:30
Siu Chi Chan 6598eabd24 implement __threadfence_system
[ROCm/clr commit: b6e22b5c93]
2017-08-02 08:50:18 +00:00
Ben Sander ddf72a1b2c Merge pull request #136 from mangupta/update_install_markdown
[doc] Add details of HIP build time dependencies to INSTALL.md

[ROCm/clr commit: 3061e394bf]
2017-07-31 11:05:30 -05:00
Ben Sander 10a2a2e3e6 Merge pull request #128 from mangupta/cleanup_amdgpu_target_logic
[hipcc] Cleanup amdgpu target logic

[ROCm/clr commit: 3ed8c6c034]
2017-07-31 11:05:06 -05:00
Maneesh Gupta e090aef2d2 [doc] Add details of HIP build time dependencies to INSTALL.md
Change-Id: Iee87d1345aba716d0ce49f7afb45ba073a16be88


[ROCm/clr commit: 87e71ae681]
2017-07-31 14:57:32 +05:30
Maneesh Gupta d3d3cedc1d Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
fix hipMemcpy2DAsync

[ROCm/clr commit: 9ded1d02a1]
2017-07-31 10:27:18 +05:30
Maneesh Gupta 602c3466f9 Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.

[ROCm/clr commit: 06b51109c6]
2017-07-31 10:24:41 +05:30
Maneesh Gupta 5e2f8804ba [hipcc] Fix amdgpu target selection logic
- Refactoring introduced a bug when the user does not specify any target via
  --amdgpu-target but has an invalid target specified in
  HCC_AMDGPU_TARGET. In this case the selection logic was defaulting to
  gfx803.
- Removed defaulting to any specific target if rocm_agent_enumerator
  fails. hipcc will report this and die if linking was required.

Change-Id: I76131867049fef92331807dd19a926406dcc1d02


[ROCm/clr commit: 8f6c150134]
2017-07-31 10:11:19 +05:30
Ben Sander 6079b4b4f6 Some fixes to tracing.
[ROCm/clr commit: 9e9f384899]
2017-07-28 22:13:43 -05:00
Rahul Garg 5a1578847e fix hipMemcpy2DAsync
[ROCm/clr commit: 320ae86d44]
2017-07-29 06:50:56 +05:30
Ben Sander 5a891afc7c Merge pull request #119 from emankov/master
[HIPIFY] CUDA RT Textures support

[ROCm/clr commit: fad59ea8c3]
2017-07-28 12:12:41 -05:00
Maneesh Gupta 8e35e7a287 [hipcc] Cleanup amdgpu target logic
The existing logic has a bug. If the user specifies targetA via command-line
options while the enumerator returns targetB, hipcc creates a fatbin
containing both targetA and targetB. The enumerator should only be used
when no target is specified by the user (command line or env var).

Change-Id: I6da857f86860c0e671b5988cd858644a08f723b9


[ROCm/clr commit: 2a2c7575eb]
2017-07-28 16:18:15 +05:30
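The intended behaviour described in the commit message above, where the enumerator is consulted only when the user names no target, might look like the sketch below. It is an illustration, not the hipcc code. The `select_targets` function and its parameters are hypothetical, and giving the command line precedence over the environment variable is an assumption of the sketch.

```python
def select_targets(cli_targets, env_targets, enumerate_targets):
    """Return the list of GPU targets to build for.

    cli_targets: targets named on the command line (may be empty)
    env_targets: targets taken from an environment variable (may be empty)
    enumerate_targets: zero-argument callable that queries installed GPUs
    """
    if cli_targets:
        # User choice wins; enumerator results are never merged in.
        return list(cli_targets)
    if env_targets:
        return list(env_targets)
    # Only consult the enumerator when the user named nothing.
    # No default target is applied if it finds none.
    return list(enumerate_targets())
```

With this shape, `select_targets(["targetA"], [], lambda: ["targetB"])` yields only `["targetA"]`, so the fatbin would no longer pick up the enumerated target as well.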
Maneesh Gupta 2405c239e1 Merge pull request #127 from bensander/pass_module_kname
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile

[ROCm/clr commit: 791063ad6e]
2017-07-28 10:19:15 +05:30
Ben Sander 004080bff8 Add workweek check to make sure we have a new enough compiler
[ROCm/clr commit: 2781293e89]
2017-07-27 23:00:58 -05:00
Maneesh Gupta 8b7ce77fc0 Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.

[ROCm/clr commit: 8330fb3fe0]
2017-07-28 09:15:56 +05:30
Maneesh Gupta 0f78021303 Merge pull request #121 from syifan/master
Allow hip cmake to link libraries

[ROCm/clr commit: 6835f5ac59]
2017-07-28 09:12:15 +05:30
Ben Sander c8b72cb926 Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
[ROCm/clr commit: a282351763]
2017-07-27 22:00:15 -05:00
Evgeny Mankov 6e010bec22 Update Cuda2Hip.cpp
[ROCm/clr commit: 6c5adf42dd]
2017-07-28 01:45:24 +03:00
Evgeny Mankov f9780654b8 Update CUDA_Runtime_API_functions_supported_by_HIP.md
[ROCm/clr commit: 4486f91350]
2017-07-28 01:41:14 +03:00
Evgeny Mankov 4940eea975 Update CUDA_Runtime_API_functions_supported_by_HIP.md
[ROCm/clr commit: 681dd9e14d]
2017-07-28 01:38:50 +03:00
Ben Sander 82a347659c Merge pull request #110 from sunway513/fix_hipEnvVar_test
Fix hipEnvVar test

[ROCm/clr commit: bd563bf761]
2017-07-27 13:03:43 -05:00
sunway513 67afe0ec04 Change to use hipDeviceGetPCIBusId api for hipEnvVarDriver test
[ROCm/clr commit: b7c34fb80f]
2017-07-27 17:41:49 +00:00
Ben Sander 1aa0656d34 Set HIP_SYNC_NULL_STREAM=0.
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.


[ROCm/clr commit: cd42711134]
2017-07-27 11:11:54 -05:00
Yifan Sun 1edab9f118 Allow hip cmake to link libraries
[ROCm/clr commit: 679075e2f0]
2017-07-27 11:46:18 -04:00