Ben Sander
cc08b36647
Merge pull request #167 from bensander/event_safety
...
Event safety
2017-08-29 13:55:46 -05:00
Ben Sander
6ff74d0e97
Lock streams when waiting on event completion or querying event safety.
2017-08-28 18:40:16 -05:00
Rahul Garg
38c9febe21
Null check on input pointer arguments
2017-08-25 08:46:34 +05:30
Rahul Garg
38ecfb9384
Device count check on input device id
2017-08-22 14:05:44 +05:30
Ben Sander
e3161bb40e
Add some new HIP_TRACE_API options.
2017-08-16 04:00:33 +00:00
Ben Sander
8b7dde6519
Merge pull request #148 from weixingzhang/vmaddr1
...
[HIP Texture] The GPU virtual address for texture memory needs to be aligned.
2017-08-15 14:19:45 -05:00
Maneesh Gupta
e40047f2bf
[texture] guard new HCC APIs under workweek
...
Change-Id: I4f60a64fb0b0496ca1eb01ffe6ddda121c25d976
2017-08-15 15:51:38 +05:30
Maneesh Gupta
f14e31d9d7
Merge pull request #138 from scchan/threadfence_system
...
implement __threadfence_system
2017-08-10 12:51:05 +05:30
Ben Sander
660061c3b4
Merge pull request #145 from gargrahul/context_mgmt_changes
...
Updated context management logic:
2017-08-09 17:47:45 -05:00
Weixing Zhang
4264a4ce56
[HIP Texture] The GPU virtual address for texture memory needs to be aligned.
...
In hcc_am, a larger buffer is allocated so the address can be aligned,
and _unalignedDevicePointer is added to struct AmPointerInfo to hold the
originally allocated address.
2017-08-08 11:18:00 -04:00
Maneesh Gupta
8d6acec135
Bump min hcc_workweek required for named kernel dispatch to 17312
...
Change-Id: I8c7b58306b279ed113d03260e4bc6086bb8b4e68
2017-08-08 11:08:55 +05:30
Rahul Garg
3e84cf4aba
Updated context management logic:
...
1) hipSetDevice sets a flag so that the next call to hipCtxGetCurrent returns the primary context on the current device
2) hipCtxGetCurrent returns the primary context on the current device if the TLS context stack is empty
3) hipCtxPopCurrent falls back to the primary context on the current device as the default
4) hipCtxPushCurrent, hipCtxSetCurrent and hipCtxCreate reset the flag set in hipSetDevice
2017-08-08 07:02:22 +05:30
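The four rules in the commit above can be read as a small state machine. The sketch below is a toy Python model of that reading, not the HIP implementation: the class and method names are hypothetical, contexts are plain strings, and two points the commit message leaves open are filled by assumption (the flag stays set until rule 4 resets it, and popping an empty stack returns the primary context).

```python
class ContextModel:
    """Toy per-thread model of the context rules; all names are hypothetical."""

    def __init__(self, device_count):
        # One primary context per device, represented here as a string.
        self.primary = [f"primary-ctx-{d}" for d in range(device_count)]
        self.current_device = 0
        self.stack = []           # stands in for the TLS context stack
        self.use_primary = False  # the flag set by set_device (rule 1)

    def set_device(self, device):
        if not 0 <= device < len(self.primary):
            raise ValueError("invalid device id")
        self.current_device = device
        self.use_primary = True   # rule 1

    def ctx_get_current(self):
        # Rule 1 (flag set) and rule 2 (empty stack) both yield the primary context.
        if self.use_primary or not self.stack:
            return self.primary[self.current_device]
        return self.stack[-1]

    def ctx_push_current(self, ctx):
        self.stack.append(ctx)
        self.use_primary = False  # rule 4

    def ctx_set_current(self, ctx):
        # Assumption: replaces the top of the stack, or pushes if it is empty.
        if self.stack:
            self.stack[-1] = ctx
        else:
            self.stack.append(ctx)
        self.use_primary = False  # rule 4

    def ctx_create(self, name):
        # Assumption: a newly created context becomes current.
        self.stack.append(name)
        self.use_primary = False  # rule 4
        return name

    def ctx_pop_current(self):
        # Rule 3, as assumed here: with nothing to pop, fall back to the primary context.
        if self.stack:
            return self.stack.pop()
        return self.primary[self.current_device]
```

In this model, calling set_device(1) while a user context is still on the stack makes ctx_get_current report the primary context of device 1 until the next push, set, or create.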
Siu Chi Chan
96e79f97e2
implement __threadfence_system
2017-08-02 08:50:18 +00:00
Maneesh Gupta
fce1d0d7f0
Merge pull request #134 from gargrahul/fix_hipMemcpy2DAsync
...
fix hipMemcpy2DAsync
2017-07-31 10:27:18 +05:30
Maneesh Gupta
48573a037e
Merge pull request #135 from bensander/fix_tracing
...
Some fixes to tracing.
2017-07-31 10:24:41 +05:30
Ben Sander
ecbb494a58
Some fixes to tracing.
2017-07-28 22:13:43 -05:00
Rahul Garg
0053e4db03
fix hipMemcpy2DAsync
2017-07-29 06:50:56 +05:30
Maneesh Gupta
375ec84bc6
Merge pull request #127 from bensander/pass_module_kname
...
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-28 10:19:15 +05:30
Ben Sander
4980a6d3ab
Add workweek check to make sure we have a new enough compiler
2017-07-27 23:00:58 -05:00
Maneesh Gupta
fcc294ed41
Merge pull request #122 from bensander/enable_async_null_stream
...
Set HIP_SYNC_NULL_STREAM=0.
2017-07-28 09:15:56 +05:30
Ben Sander
3a4dfc0f85
Pass kernel name to HCC dispatch_hsa_kernel, for debug/profile
2017-07-27 22:00:15 -05:00
Ben Sander
77fb9893b4
Set HIP_SYNC_NULL_STREAM=0.
...
Optimizes null stream synchronization so it uses GPU-side dependency
resolution. Requires HCC __hcc_workweek__ > 17300.
2017-07-27 11:11:54 -05:00
Maneesh Gupta
526fcb1223
Merge pull request #116 from bensander/enable_coh_host
...
Make host memory allocations coherent by default.
2017-07-27 10:45:32 +05:30
Ben Sander
4f93b99f23
Merge pull request #100 from weixingzhang/texture
...
HIP Texture Support
2017-07-26 19:28:15 -05:00
Ben Sander
8b8e97ff28
Make host memory allocations coherent by default.
...
Associated change is to optimize event recording so it uses
agent-scope release (since it was only using system-scope release
to support non-coherent host mem).
Flags and environment variables exist to obtain previous behavior
if desired. Options are documented in new performance guide.
2017-07-26 19:20:34 -05:00
Ben Sander
67f5d2de3d
Enable HCC_OPT_FLUSH=1 (if HCC compiler new enough)
2017-07-24 18:57:19 -05:00
Maneesh Gupta
7ddb6b6b76
Don't crash for hipEventDestroy(NULL)
...
Change-Id: I3f4de1ad2712dbe8af81e10c5d3bef16d468c582
2017-07-18 11:01:02 +05:30
Weixing Zhang
e671cec367
HIP Texture Support
2017-07-17 15:16:12 -04:00
Maneesh Gupta
7066ff4478
remove unsupported erf(c)inv related host functionality
...
Change-Id: I665c33616359a0124b5552076359d2f8faa54930
2017-07-14 11:31:40 +05:30
Wen-Heng (Jack) Chung
17555eabc0
Temporarily disables HCC_OPT_FLUSH
...
Change-Id: I290791e58dd52ab3823f6c3315e42b0d386e9d64
2017-07-12 16:08:36 +00:00
Wen-Heng (Jack) Chung
35ebfda9d4
Fix hip_db header inclusion path
...
Change-Id: I8a95798301de73887dde3316dc593a1847002712
2017-07-12 16:08:20 +00:00
Wen-Heng (Jack) Chung
30ee797dd4
Remove explicit address spaces declarations
2017-07-07 12:46:29 -05:00
Ben Sander
0459f3db72
Add hipdbPrintMem - wrapper for hcc memory tracker.
2017-07-03 15:05:30 -05:00
Rahul Garg
34fcaa23bb
Fixed build error in hipDeviceGetByPCIBusId
...
Change-Id: I33be362910c4c5dca7435fb9c41f427b54f0e9a7
2017-07-01 07:56:30 +05:30
Ben Sander
626b5191e8
Set default HIP_SYNC_NULL_STREAM=1.
2017-06-30 19:01:14 -05:00
Rahul Garg
65cb024b61
Fixed hipDeviceGetByPCIBusId
...
Change-Id: Ia36bb9425671ef7659541c9aeedae4098456a31b
2017-06-30 10:11:41 +05:30
Aditya Atluri
3e610ab0fe
automate gcnarch detection
...
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Rahul Garg
f4ea3b76c8
Fixed hipDeviceGetPCIBusId for HIP/HCC
...
Change-Id: I3688fa2476e1baada2d3c5fc3735cec3f15a1e21
2017-06-28 23:48:27 +05:30
Ben Sander
148dbc1027
Set default HIP_HIDDEN_FREE_MEM
2017-06-27 12:17:12 -05:00
Sun, Peng
1df08626c8
Add support for HIP_HIDDEN_FREE_MEM, which deducts a given amount (in MB)
...
from the available memory returned by the hipMemGetInfo API.
Change-Id: I7a8260c12e032e04e26611db4c38c893a29f2653
2017-06-26 15:29:38 -05:00
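As a rough illustration of the deduction described in the commit above, the following Python sketch models the arithmetic only. The function name is hypothetical, and three details are assumptions not stated in the commit: 1 MB is taken as 1024 * 1024 bytes, an unset variable is treated as 0, and the result is clamped at zero.

```python
import os


def visible_free_bytes(actual_free_bytes, env=os.environ):
    """Return the free-memory figure after hiding HIP_HIDDEN_FREE_MEM megabytes.

    Hypothetical helper; models the described behavior, not the HIP source.
    """
    # Assumption: an unset variable means nothing is hidden.
    hidden_mb = int(env.get("HIP_HIDDEN_FREE_MEM", "0"))
    # Assumption: MB is interpreted as 2**20 bytes.
    hidden_bytes = hidden_mb * 1024 * 1024
    # Assumption: never report a negative amount of free memory.
    return max(actual_free_bytes - hidden_bytes, 0)
```

Passing the environment as a parameter keeps the sketch easy to exercise without touching the real process environment.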
Ben Sander
176ff824d1
Add option to pass names to HCC dispatch API (for debug)
2017-06-23 18:05:30 -05:00
Ben Sander
7912e61502
Clean up old USE_* and RELEASE.md notes.
2017-06-23 18:05:30 -05:00
Rahul Garg
3c73229916
Abort device function in HIP/HCC; needs a new HCC
...
Change-Id: I4195ab75e9b7b48c8b8128d6925ddc0fa5e9e009
2017-06-18 12:31:31 +05:30
Aditya Atluri
d1e28df226
fixed float2int functions
...
Change-Id: I67be79149f06daacf0f0d131bdedabf294126248
2017-06-16 09:07:06 -05:00
Rahul Garg
d24818bff6
Arguments validation in hipDeviceGetPCIBusId
...
Change-Id: I89770517c3ac94e4bf476344d27c18f03cfcde08
2017-06-15 00:21:47 +05:30
Rahul Garg
ba51d7f676
Validity check of input arguments in Ipc Mem APIs
...
Change-Id: Ia48e949d19f354f10c7e44cc2457fd4154bf6d76
2017-06-14 15:18:57 +05:30
Rahul Garg
a833b9a704
Input args NULL check in hipChooseDevice
...
Change-Id: I1a7b8cded2f81d739645bbf3dab2f04bb9c3c796
2017-06-13 13:35:50 +05:30
Sun, Peng
43df5ba660
Fix error related to undefined reference of __get_dynamicgroupbaseptr().
...
Change-Id: I14951e1725e35dd5f5e53805f81cdb58661f59f2
2017-06-08 19:24:32 -05:00
Ben Sander
99e9c7cca5
Use amHostCoherentFlag. Requires new HCC version.
2017-06-07 09:06:40 -05:00
Ben Sander
ac634bf334
Enable HCC_OPT_FLUSH=1.
...
Requires an HCC build that includes this support:
commit 38e392b517a46a09a3b1c8f388e6a0db3741c510
2017-06-07 00:15:05 -05:00