Ben Sander
fe32685fbc
Merge pull request #237 from bensander/use_ctxptr_for_p2p
...
Use ctxptr for p2p
2017-11-01 18:55:25 +01:00
Ben Sander
5a80c4108e
Merge pull request #245 from scchan/centos_fixes
...
various fixes for centos/rhel
2017-11-01 18:10:29 +01:00
Siu Chi Chan
c1a4ff7b20
Centos/RHEL - remove usage of constexpr since libc++ doesn't enable ctor for constexpr pair in C++11
2017-10-31 18:16:12 +00:00
Ben Sander
dc7d993a02
Add ns-level timer for HIP API routines
...
Refactor some miuses of ihipLogStatus, these should only be in top-level
HIP APIs and should be paired with HIP_API_INIT calls.
2017-10-30 20:20:51 +00:00
Ben Sander
7e8b39fc96
Merge pull request #222 from bensander/fix_device_prop
...
Fix device prop
2017-10-30 17:58:48 +01:00
Ben Sander
4c7b2be1c2
Check for null copyEngine before looking at peers.
2017-10-30 16:58:03 +00:00
Siu Chi Chan
cf98d0e37d
Merge remote-tracking branch 'origin/master' into HEAD
2017-10-27 01:18:28 -04:00
Ben Sander
8a64feef61
Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
...
Feature support globals for module api
2017-10-27 01:53:34 +02:00
Ben Sander
a417241507
Fix bug with peer-to-peer combined with context API
...
- Store context inside the tracker rather than using int deviceID that
was always mapped to primary context
- IsPeerWatcher now based on device IDs rather than specific peers.
2017-10-26 19:44:22 +00:00
Aditya Atluri
698721be34
Enhance debug for copy pointers
...
- show more pointer tracking fields
- show pointer info before and after "tailoring'
2017-10-26 19:44:22 +00:00
Siu Chi Chan
085d63099c
add HC_FEATURE_PRINTF around the printf buffer definition
2017-10-25 12:00:02 -04:00
Siu Chi Chan
4ed3e9e15f
printf support for module API
2017-10-24 00:55:41 -04:00
Siu Chi Chan
da7c37947c
replace __hcc_workweek__ with HC_FEATURE_PRINTF flag
2017-10-23 18:30:08 -04:00
Maneesh Gupta
281c543bc3
Make elfio headers private
...
Change-Id: I3ba174bb46e84a75380207d93a0da6fe3703689e
2017-10-23 10:24:36 +05:30
Ben Sander
b4c7876244
Remove printf
2017-10-20 13:24:04 -07:00
Ben Sander
ed85b15c3e
Update device properties.
...
- clear properties to defined initial state.
- enable some property flags which are now supported.
2017-10-20 15:52:13 +00:00
Ben Sander
e738fa66b5
Modify device properties to use pool API.
...
- Also better error code checking
2017-10-20 14:49:29 +00:00
Siu Chi Chan
a1956f64e6
hipDeviceReset(): make sure to reinitialize the printf buffer in hcc RT
2017-10-18 16:26:13 -04:00
Alex Voicu
b222448efd
This fixes incorrect usage of the reader object, which created arcane
...
mismatches when one reader accessed another's section.
2017-10-09 15:46:38 +01:00
Alex Voicu
582779c6c6
This adds cursory support for globals to the HIP module loading API. The
...
style is purposefully alien so as to signal that HIP experts should turn
it into HIP worthy code as soon as possible.
2017-10-09 13:27:11 +01:00
Rahul Garg
9e07b08ce5
Fixed hipTexture fetch issue
2017-09-25 06:54:29 +05:30
Wen-Heng (Jack) Chung
1efc99e69f
Bump device major version from 2 to 3
...
This would significantly improve performance for certain apps in kernel
selection logic.
2017-09-15 15:47:39 +00:00
Ben Sander
b54ab82694
Merge pull request #179 from gargrahul/fix_hipmallocarray
...
Fixed hipMallocArray for 1D cases
2017-09-14 12:59:33 -05:00
Ben Sander
1ea468e279
Merge branch 'master' into hip_init_alloc
2017-09-14 11:53:33 -05:00
Ben Sander
fff42fd591
Add HIP_INIT_ALLOC to init allocated memory.
2017-09-13 23:31:48 +00:00
Rahul Garg
baef2321c8
Fixed hipMallocArray for 1D cases
2017-09-12 21:52:11 +05:30
Ben Sander
fff74eee21
hipStreamQuery uses av::is_empty. Add HIP_FORCE_NULL_STREAM.
2017-08-31 03:00:14 +00:00
Ben Sander
ed8c3ba7e7
Refactor hipStreamWaitEvent
...
- Null streams use same flow as non-null.
- Add HIP_SYNC_STREAM_WAIT
- Resolve null stream.
2017-08-31 03:00:14 +00:00
Ben Sander
8add022539
Merge pull request #167 from bensander/event_safety
...
Event safety
2017-08-29 13:55:46 -05:00
Ben Sander
bc9ba7cd81
Lock streams when waiting on event completion or querying event safety.
2017-08-28 18:40:16 -05:00
Rahul Garg
765d7f3c7b
Null check on input pointer arguments
2017-08-25 08:46:34 +05:30
Rahul Garg
04a5a36ef7
Device count check on input device id
2017-08-22 14:05:44 +05:30
Ben Sander
d6aee9ad8f
Add some new HIP_TRACE_API options.
2017-08-16 04:00:33 +00:00
Ben Sander
94554500db
Merge pull request #148 from weixingzhang/vmaddr1
...
[HIP Texture] The GPU virtual address for texture memory needs to be
2017-08-15 14:19:45 -05:00
Maneesh Gupta
172a568aa6
[texture] guard new HCC APIs under workweek
...
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
2017-08-15 15:51:38 +05:30
Maneesh Gupta
2bd79ab9f5
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Ben Sander
dfc87a85bd
Merge pull request #145 from gargrahul/context_mgmt_changes
...
Updated context management logic:
2017-08-09 17:47:45 -05:00
Weixing Zhang
e4de2d1138
[HIP Texture] The GPU virtual address for texture memory needs to be
...
aligned.
In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.
2017-08-08 11:18:00 -04:00
Maneesh Gupta
dabd6d2bc6
Bump min hcc_workweek required for named kernel dispatch to 17312
...
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg
c4e9323877
Updated context management logic:
...
1) hipSetDevice sets a flag so that next call to hipCtxGetCurrent returns primary context on current device
2) hipCtxGetCurrent returns primary context on current device if TLS context stack is empty
3) hipCtxPopCurrent falls back to primary context on current device as default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
Siu Chi Chan
b6e22b5c93
implement __threadfence_system
2017-08-02 08:50:18 +00:00
Maneesh Gupta
9ded1d02a1
Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
...
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta
06b51109c6
Merge pull request #135 from bensander/fix_tracing
...
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Ben Sander
9e9f384899
Some fixes to tracing.
2017-07-28 22:13:43 -05:00
Rahul Garg
320ae86d44
fix hipMemcpy2DAsync
2017-07-29 06:50:56 +05:30
Maneesh Gupta
791063ad6e
Merge pull request #127 from bensander/pass_module_kname
...
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander
2781293e89
Add workweek check to make sure we have a new enough compiler
2017-07-27 23:00:58 -05:00
Maneesh Gupta
8330fb3fe0
Merge pull request #122 from bensander/enable_async_null_stream
...
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Ben Sander
a282351763
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-27 22:00:15 -05:00
Ben Sander
cd42711134
Set HIP_SYNC_NULL_STREAM=0.
...
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.
2017-07-27 11:11:54 -05:00