Ben Sander
ac634bf334
Enable HCC_OPT_FLUSH=1.
...
Requires appropriate HCC with this support :
commit 38e392b517a46a09a3b1c8f388e6a0db3741c510
2017-06-07 00:15:05 -05:00
Ben Sander
decf3eee18
Enable HIP_SYNC_NULL_STREAM=0 optimization.
2017-06-05 08:50:41 -05:00
Ben Sander
39c18e5e5f
Fix HIP_SYNC_NULL_STREAM=0 mode.
...
- Fix null-stream sync
- hipStreamDestroy of null stream returns hipErrorInvalidResourceHandle
- Update documentation.
- Add tests for null stream sync, hipEventElapsedTime.
- Rename internal enum hipEventStatusRecorded to hipEventStatusComplete
- refactor hipStreamWaitEvent to streamline control-flow
2017-06-05 08:50:22 -05:00
Ben Sander
d5c1616324
Update tests, add p2p coherency test.
2017-06-03 17:11:34 -05:00
Siu Chi Chan
741eb844fe
fix atomicCAS:remove load for the return value after CAS
2017-05-31 15:20:19 -04:00
Ben Sander
cb60763737
Set event->_stream on hipHccModuleLaunchKernel path if start/stop used
...
Ensure _stream is always non-null in recorded events.
Fixes isDefaultStream fault.
2017-05-30 21:55:46 -05:00
Maneesh Gupta
4ff01c971f
Disable normcdfinvf on __host__
...
Change-Id: If7bfc9826a09eb9b7675ea2a417b9418759b7912
2017-05-30 15:45:22 +05:30
Ben Sander
ec7102f461
Add event controls for release fences.
...
Env var : HIP_EVENT_SYS_RELEASE
Event allocation flags : hipEventReleaseToDevice, hipEventReleaseToSystem
(remove hipEventDisableSystemRelease)
Update test for new functionality.
2017-05-27 16:02:34 -05:00
Ben Sander
620eb30691
Cleanup hipEvent. (Intermediate checkpoint)
...
Support hipEventDisableSystemRelease flag.
Update test.
Remove stray printf
2017-05-27 16:02:34 -05:00
Ben Sander
b251d72917
Add isDefaultStream() accessor.
...
Fix code that checked for stream==nullptr after stream had been
resolved to a "true stream".
2017-05-26 13:46:48 -05:00
Siu Chi Chan
1dce01f9bb
fix hip_fast_dsqrt* to call a double fp sqrt function
2017-05-25 23:15:30 -04:00
Ben Sander
578d430bb3
Remove HIP_NUM_KERNELS_INFLIGHT. (redundant with HCC controls)
2017-05-24 01:03:28 -05:00
Ben Sander
75f691ec2f
Add hipHostMallocCoherent, hipHostMallocNonCoherent
...
Provide per-allocation control over coherent/non-coherent mem.
These overrid the default HIP_COHERENT_HOST_ALLOC setting.
2017-05-24 00:48:10 -05:00
Ben Sander
d0ef9d8462
Remove HIP_MAX_QUEUES (replaced with HCC_MAX_QUEUES)
2017-05-23 23:48:01 -05:00
Ben Sander
2b253a48b6
Use accelerator_scope for create_marker and create_blocking_marker.
...
As optimization when system-scope is not needed.
2017-05-23 23:15:45 -05:00
Ben Sander
7cfe07cff4
Fix trace category for hipHostMalloc
2017-05-23 23:15:45 -05:00
Aditya Atluri
0559fc69e9
fixed erfinv build error as it is implemented in hcc
...
Change-Id: I27a512147c53f658a63fdf3e90f5e9cfac09ada8
2017-05-23 09:32:19 -05:00
Evgeny Mankov
270f643c9c
[HIP] [HIPIFY] [FIX] cuModuleLoadDataEx -> hipModuleLoadDataEx
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/81
1. Do not use JIT options on HCC path, call hipModuleLoadData instead.
2. NVCC path is unchanged, to call cuModuleLoadDataEx with all options.
3. Get rid of manual hipification, based on #ifdef #else for NVCC/HIP.
4. Update documentation accordingly.
2017-05-19 17:22:14 +03:00
Ben Sander
46030bb2d2
Return precise address for hipHostGetDevicePointer.
2017-05-17 07:36:06 -05:00
Aditya Atluri
9dceccf136
changed vector types to make sure it generate proper llvm vector types
...
Change-Id: I6c4616dae137dc4eac35e5827dc5b7f3251e0247
2017-05-16 21:35:40 -05:00
Ben Sander
427f8472aa
Fix HIP_TRACE_API so kernel launch only printed when requested.
2017-05-16 19:04:25 -05:00
Ben Sander
27877f8854
Add initial HIP_SYNC_NULL_STREAM=0 mode.
...
This eliminates host-synchronization for null stream. Instead, the
null-stream uses GPU-side events to wait for other streams.
Default is OFF pending additional testing.
Add enhanced null-stream test.
Also refine HIP_TRACE_API.
2017-05-16 19:04:25 -05:00
Ben Sander
0edab14139
Add HIP_TRACE_API=4. Only display memory allocation/free apis.
2017-05-16 19:04:25 -05:00
Aditya Atluri
a43149135e
added gfx900 to hipDeviceProp_t
...
Change-Id: I49e7a32f218926fd55f1c94c5dc2366d6c8ac4ca
2017-05-12 21:43:34 -05:00
Ben Sander
0679831384
Remove old USE_ switches no longer needed.
2017-05-12 16:06:03 -05:00
Ben Sander
b306095ac2
Add hipEventDisableSystemRelease flag.
2017-05-12 16:06:03 -05:00
Ben Sander
c3ccaa01e5
hipHostMalloc allocation are mapped to all devices by default.
...
Support hipHostMallocPortable flag.
Default flags are hipHostMallocPortable | hipHostMallocMapped.
Also:
-refactor tests to move addCount and addCountReverse into HipTest
namespace.
-test multi-GPU host memory.
2017-05-10 17:34:36 -05:00
Ben Sander
687809104b
Fix some typos, add additional guidance for -BSymbolic
2017-05-05 17:29:04 -05:00
Rahul Garg
ec27c695c4
Added support for hipMemcpy2DAsync in HIP/HCC
...
Change-Id: Ia4a8306f2dc1e33a81a7195ec29aef652fcccc4b
2017-05-03 22:29:12 +05:30
Aditya Atluri
a5cb2d40ec
fixed hipFuncSetCacheConfig on rocm path
...
Change-Id: I937a3afbf115edc94a753a0beb2230ed60a6f021
2017-04-28 11:53:11 -05:00
Aditya Atluri
ab2eb420e2
fixed fast math expf and exp10f
...
Change-Id: I73963220f902efebb0a7404c5f8966dffb4c35ca
2017-04-26 19:01:10 -05:00
Ben Sander
5a52b79782
Tailor pointer info for src/dst before calling HCC copy routines.
...
HCC sometimes uses the srcPtrInfo or dstPtrInfo to determine the
pointer. Make sure these use the actual pointer and not the base
of the allocation.
2017-04-24 22:30:54 -05:00
Ben Sander
9e41e3c6e2
Fix hip debug for case where copyAgent is null (host-to-host)
2017-04-24 22:30:54 -05:00
Ben Sander
59df70662a
Fix hipMalloc to return error code if allocation fails.
2017-04-24 22:30:54 -05:00
Aditya Atluri
59ab3659ee
fixed build issues with hipPointerGetAttributes
...
Change-Id: I3f5fbc05bdaef720884ba949075928752a070377
2017-04-24 15:31:07 -05:00
Aditya Atluri
72bcfa438b
changed arguments for hipPointerGetAttributes
...
Change-Id: Ia7a7c4722c1f7d0a23f0e5cc3dd6dea6c01c1fd8
2017-04-24 15:24:16 -05:00
Ben Sander
22f3b91cad
Add integer abs (initial implementation, can be optimized with OCML)
...
Change-Id: I1f568c8c0e2333af1fda4c313dc48ea0c5b6ab00
2017-04-11 01:16:28 +00:00
Sun, Peng
6d4af1ab1f
update GGL to log launched kernel information
...
Change-Id: Ied0aa6055673c687071b4a579aecd17f0f3f09ce
2017-04-07 14:06:31 -05:00
Ben Sander
6656d33f75
Finish adding start/stop event support to hipHccModuleLaunchKernel.
...
Change interface to use hipEvent_t rather than hipEvent_t*
Change-Id: I259062dc087a13d51dc27f84e1e8861f332a104d
2017-04-06 21:02:50 -05:00
sunway513
cfa3155082
Refactor events and add initial event option for hipHccModuleLaunchKernel
...
- Change hipEvent_t to a class.
- Move event logic inside the class.
- Add _type to support Independent, StartCommand, StopCommand events.
StartCommand returns start timestamp from events.
Change-Id: I4ddd694f2645a3ff7170c9111dc1d3e39931ca21
2017-04-06 21:02:50 -05:00
Rahul Garg
fc61b793fe
mgpu IPC support fix
...
Change-Id: I12e4b2fd189c3658efd3b07defa18ece3853b0eb
2017-04-04 15:51:10 +05:30
Sun, Peng
c82c84949c
Fix warpSize, for related issue in hipeigen and torch
...
Change-Id: Ic66b24923a363304dca189011869ba7a0a6f8895
2017-04-01 15:42:53 -05:00
Sun, Peng
27bbeedabb
Add copy right in grid_launch.cpp
...
Change-Id: I7de3fc32f13182b5c41a4e44147b642ba15e8636
2017-04-01 15:12:00 -05:00
Sun, Peng
80a99350a3
Add grid_launch.cpp for GGL
...
Change-Id: I87ff9b3f1203d0909f998c96c839f7b321fc3f09
2017-04-01 14:57:47 -05:00
Aditya Atluri
3eed9aba5d
added debug support for HIP sample
...
Change-Id: Ia7265234082039b68114f7421f4dbcb7149d4d2b
2017-03-31 14:13:46 -05:00
Aditya Atluri
1ef7222c3a
Fixed copyright and header names
...
Change-Id: Id595c65ea3b7289e87be4c42db5d8a31905a4fdd
2017-03-31 12:40:29 -05:00
Aditya Atluri
b9091ba818
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Aditya Atluri
7ac438ed02
disabled metadata apis
...
Change-Id: Ifb8839c581644cccc2afcd18c38a866f649a4144
2017-03-28 10:46:31 -05:00
Aditya Atluri
7d49dcc030
Re-enabled metadata parsing in HIP
...
Change-Id: If8caa844571cb8581450df9ffdb76e2445c75f13
2017-03-27 11:00:39 -05:00
Rahul Garg
1d18006ab4
Fix for MemcpyFromSymbol on HIP/NVCC path
...
Change-Id: Ice38307f72870ae468cbf0861e104f0fa46dfd56
2017-03-27 00:35:10 +05:30