Граф коммитов

350 Коммитов

Автор SHA1 Сообщение Дата
Maneesh Gupta f44e77944d Merge pull request #491 from scchan/fix_wait
callback handling: don't need to wait for the thread to become ready

[ROCm/hip commit: 7311b60220]
2018-06-06 14:38:25 +05:30
Siu Chi Chan 417dde9d73 remove the _ready flag in ihipStreamCallback_t and the mutex that protects it.
[ROCm/hip commit: a1f3b587fb]
2018-06-04 17:29:04 -04:00
Rahul Garg 07115e0c02 Add integrated device property
[ROCm/hip commit: 1a02bc364f]
2018-06-02 13:11:16 +05:30
Maneesh Gupta 4f42ee762d Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344


[ROCm/hip commit: 1ba06f63c4]
2018-03-12 11:29:03 +05:30
Maneesh Gupta 02bedf46dc Implement hipStreamAddCallback
Change-Id: Ib851e4d86ba9c8406ca37b88162ea483ccbc9d36


[ROCm/hip commit: cebb070d30]
2017-12-19 16:06:14 +05:30
Ben Sander 905389741c Fix some cppcheck style issues.
[ROCm/hip commit: 9bba97fdcc]
2017-12-01 20:45:34 +00:00
Pierre d917c6b546 Fix missing MARKER_END
Logging status of hipCtxSynchronize was missing
Test if hip profiling is active for MARKER_END in ihipPostLaunchKernel
Add MARKER_END after the completion of a kernel launched through
the "grid launch"


[ROCm/hip commit: 6baaed8e48]
2017-11-13 16:13:19 -05:00
Ben Sander f80896d58b Make hipEvent_t thread safe.
Support re-recording of same event by different threads.

- Add criticalData structure to hipEvent_t, similar to mechanism used
  for streams, contexts, device.  Events are always locked
  after streams to avoid deadlock.
- ihipEvent_t::locked_copyCrit can be used to copy critical state
  including marker.  The critical state in the event can then
  be re-recorded.
- refactor hipEventElapsedTime.  Remmove stale debug code, native signal
  refs.


[ROCm/hip commit: 4a2e6f8955]
2017-11-06 23:49:25 +00:00
Ben Sander 2979a99888 Merge pull request #237 from bensander/use_ctxptr_for_p2p
Use ctxptr for p2p

[ROCm/hip commit: 09d866a639]
2017-11-01 18:55:25 +01:00
Ben Sander e88ef63bc8 Add ns-level timer for HIP API routines
Refactor some miuses of ihipLogStatus, these should only be in top-level
HIP APIs and should be paired with HIP_API_INIT calls.


[ROCm/hip commit: 7e908bdec8]
2017-10-30 20:20:51 +00:00
Ben Sander 51ee7807db Merge pull request #222 from bensander/fix_device_prop
Fix device prop

[ROCm/hip commit: 2e8ec71e40]
2017-10-30 17:58:48 +01:00
Ben Sander ccb8b441a7 Check for null copyEngine before looking at peers.
[ROCm/hip commit: d610f16c47]
2017-10-30 16:58:03 +00:00
Ben Sander d7153f792d Fix bug with peer-to-peer combined with context API
- Store context inside the tracker rather than using int deviceID that
  was always mapped to primary context
- IsPeerWatcher now based on device IDs rather than specific peers.


[ROCm/hip commit: 7d30f32332]
2017-10-26 19:44:22 +00:00
Aditya Atluri 3ddd48ab86 Enhance debug for copy pointers
- show more pointer tracking fields
- show pointer info before and after "tailoring'


[ROCm/hip commit: 5d646d0fe3]
2017-10-26 19:44:22 +00:00
Siu Chi Chan 2bf3a98d83 replace __hcc_workweek__ with HC_FEATURE_PRINTF flag
[ROCm/hip commit: 5b9ce032d6]
2017-10-23 18:30:08 -04:00
Ben Sander 3a0a22746d Remove printf
[ROCm/hip commit: dd24983571]
2017-10-20 13:24:04 -07:00
Ben Sander e249e3c30e Update device properties.
- clear properties to defined initial state.
- enable some property flags which are now supported.


[ROCm/hip commit: acf89b43d4]
2017-10-20 15:52:13 +00:00
Ben Sander 30872c6020 Modify device properties to use pool API.
- Also better error code checking


[ROCm/hip commit: c9f906c2ce]
2017-10-20 14:49:29 +00:00
Siu Chi Chan 4dc24498af hipDeviceReset(): make sure to reinitialize the printf buffer in hcc RT
[ROCm/hip commit: ccef1cbd6e]
2017-10-18 16:26:13 -04:00
Wen-Heng (Jack) Chung 9c0dfd3538 Bump device major version from 2 to 3
This would significantly improve performance for certain apps in kernel
selection logic.


[ROCm/hip commit: c74d3fe2cb]
2017-09-15 15:47:39 +00:00
Ben Sander b446d8ebdf Merge branch 'master' into hip_init_alloc
[ROCm/hip commit: fbd22c3e49]
2017-09-14 11:53:33 -05:00
Ben Sander 5418e3de45 Add HIP_INIT_ALLOC to init allocated memory.
[ROCm/hip commit: cea80cd8b3]
2017-09-13 23:31:48 +00:00
Ben Sander cbbb0826f2 hipStreamQuery uses av::is_empty. Add HIP_FORCE_NULL_STREAM.
[ROCm/hip commit: 4ac6d643c1]
2017-08-31 03:00:14 +00:00
Ben Sander 2a23d6dbe7 Refactor hipStreamWaitEvent
- Null streams use same flow as non-null.
- Add HIP_SYNC_STREAM_WAIT
- Resolve null stream.


[ROCm/hip commit: 882dab4536]
2017-08-31 03:00:14 +00:00
Ben Sander bd434502db Lock streams when waiting on event completion or querying event safety.
[ROCm/hip commit: 6ff74d0e97]
2017-08-28 18:40:16 -05:00
Maneesh Gupta c036a1b25c [texture] guard new HCC APIs under workweek
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976


[ROCm/hip commit: e40047f2bf]
2017-08-15 15:51:38 +05:30
Weixing Zhang 5bf177340a [HIP Texture] The GPU virtual address for texture memory needs to be
aligned.

In hcc_am, a bigger buffer will be allocated for alignment purpose
and _unalignedDevicePointer is added in struct AmPointerInfo for
original allocated address.


[ROCm/hip commit: 4264a4ce56]
2017-08-08 11:18:00 -04:00
Maneesh Gupta 1be0719063 Merge pull request #135 from bensander/fix_tracing
Some fixes to tracing.

[ROCm/hip commit: 48573a037e]
2017-07-31 10:24:41 +05:30
Ben Sander 4a234edea9 Some fixes to tracing.
[ROCm/hip commit: ecbb494a58]
2017-07-28 22:13:43 -05:00
Maneesh Gupta 90fa6b18eb Merge pull request #122 from bensander/enable_async_null_stream
Set HIP_SYNC_NULL_STREAM=0.

[ROCm/hip commit: fcc294ed41]
2017-07-28 09:15:56 +05:30
Ben Sander cba82d9e24 Set HIP_SYNC_NULL_STREAM=0.
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.


[ROCm/hip commit: 77fb9893b4]
2017-07-27 11:11:54 -05:00
Ben Sander 6834270f15 Make host memory allocations coherent by default.
Associated change is to optimize event recording so it uses
agent-scope releaes (since it was only using system-scope release
to support non-coherent host mem).

Flags and environment variables exist to obtain previous behavior
if desired.  Options are documented in new performance guide.


[ROCm/hip commit: 8b8e97ff28]
2017-07-26 19:20:34 -05:00
Ben Sander 149ec02e84 Enable HCC_OPT_FLUSH=1 (if HCC compiler new enough)
[ROCm/hip commit: 67f5d2de3d]
2017-07-24 18:57:19 -05:00
Wen-Heng (Jack) Chung a80bc4c3b6 Temporarily disables HCC_OPT_FLUSH
Change-Id: I290791e58dd52ab3823f6c3315e42b0d386e9d64


[ROCm/hip commit: 17555eabc0]
2017-07-12 16:08:36 +00:00
Ben Sander ba0215ad9f Set default HIP_SYNC_NULL_STREAM=1.
[ROCm/hip commit: 626b5191e8]
2017-06-30 19:01:14 -05:00
Aditya Atluri f7bfec2252 automate gcnarch detection
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77


[ROCm/hip commit: 3e610ab0fe]
2017-06-29 12:01:40 -05:00
Rahul Garg ff48608355 Fixed hipDeviceGetPCIBusId for HIP/HCC
Change-Id: I3688fa2476e1baada2d3c5fc3735cec3f15a1e21


[ROCm/hip commit: f4ea3b76c8]
2017-06-28 23:48:27 +05:30
Ben Sander 3ba979d2b7 Set default HIP_HIDDEN_FREE_MEM
[ROCm/hip commit: 148dbc1027]
2017-06-27 12:17:12 -05:00
Sun, Peng 84fbc3dfaa Add support of HIP_HIDDEN_FREE_MEM, to deduct the returned available
memory from hipMemGetInfo API, measured in MB.

Change-Id: I7a8260c12e032e04e26611db4c38c893a29f2653


[ROCm/hip commit: 1df08626c8]
2017-06-26 15:29:38 -05:00
Ben Sander ed1f242c60 Clean up old USE_* and RELEASE.md notes.
[ROCm/hip commit: 7912e61502]
2017-06-23 18:05:30 -05:00
Ben Sander 1d1e50fa77 Enable HCC_OPT_FLUSH=1.
Requires appropriate HCC with this support   :
commit 38e392b517a46a09a3b1c8f388e6a0db3741c510


[ROCm/hip commit: ac634bf334]
2017-06-07 00:15:05 -05:00
Ben Sander 65f91dbb0d Enable HIP_SYNC_NULL_STREAM=0 optimization.
[ROCm/hip commit: decf3eee18]
2017-06-05 08:50:41 -05:00
Ben Sander 445042f916 Fix HIP_SYNC_NULL_STREAM=0 mode.
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow


[ROCm/hip commit: 39c18e5e5f]
2017-06-05 08:50:22 -05:00
Ben Sander a1bdd5f585 Update tests, add p2p coherency test.
[ROCm/hip commit: d5c1616324]
2017-06-03 17:11:34 -05:00
Ben Sander b3e01ed74f Add event controls for release fences.
Env var : HIP_EVENT_SYS_RELEASE
Event allocation flags : hipEventReleaseToDevice, hipEventReleaseToSystem
   (remove hipEventDisableSystemRelease)

Update test for new functionality.


[ROCm/hip commit: ec7102f461]
2017-05-27 16:02:34 -05:00
Ben Sander e24627aa80 Cleanup hipEvent. (Intermediate checkpoint)
Support hipEventDisableSystemRelease flag.
Update test.
Remove stray printf


[ROCm/hip commit: 620eb30691]
2017-05-27 16:02:34 -05:00
Ben Sander b7b7c7b7ac Remove HIP_NUM_KERNELS_INFLIGHT. (redundant with HCC controls)
[ROCm/hip commit: 578d430bb3]
2017-05-24 01:03:28 -05:00
Ben Sander 5b9ce41633 Add hipHostMallocCoherent, hipHostMallocNonCoherent
Provide per-allocation control over coherent/non-coherent mem.
These overrid the default HIP_COHERENT_HOST_ALLOC setting.


[ROCm/hip commit: 75f691ec2f]
2017-05-24 00:48:10 -05:00
Ben Sander 07865c3a02 Remove HIP_MAX_QUEUES (replaced with HCC_MAX_QUEUES)
[ROCm/hip commit: d0ef9d8462]
2017-05-23 23:48:01 -05:00
Ben Sander 97bdbe6590 Use accelerator_scope for create_marker and create_blocking_marker.
As optimization when system-scope is not needed.


[ROCm/hip commit: 2b253a48b6]
2017-05-23 23:15:45 -05:00