Commit Graph

187 Commits

Author SHA1 Message Date
foreman 0533bf5386 P4 to Git Change 1987598 by gandryey@gera-win10 on 2019/08/22 13:29:18
SWDEV-197289 - VDI tracing API integration in rocTracer
	- Add profiling activity callbacks for HIP API. The logic will utilize the  current profiling mechanism inside VDI

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_activity.cpp#1 add
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#19 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#246 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#341 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.cpp#426 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuvirtual.hpp#145 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#148 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#64 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#83 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.hpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/activity.cpp#1 add
... //depot/stg/opencl/drivers/opencl/runtime/platform/activity.hpp#1 add
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#97 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#95 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/prof_protocol.h#1 add


[ROCm/hip commit: 373d64478c]
2019-08-22 13:48:47 -04:00
foreman b10e8ac7c6 P4 to Git Change 1987582 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/22 13:20:28
SWDEV-185742 - [HIP] Add a svm path to avoid analyzing the mem objs
	and just memcpy the kernel args

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#82 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.hpp#25 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#96 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#94 edit


[ROCm/hip commit: 6577ecf5d3]
2019-08-22 13:26:57 -04:00
foreman b2509d2e2b P4 to Git Change 1987529 by kjayapra@4_HIPWS_RCCM_ROCM on 2019/08/22 12:32:42
SWDEV-200625 - Reverting back CL#1986255.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#38 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#35 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#39 edit


[ROCm/hip commit: ea22166353]
2019-08-22 12:41:42 -04:00
foreman 0898f2f413 P4 to Git Change 1986255 by kjayapra@4_HIPWS_RCCM_ROCM on 2019/08/20 17:06:44
SWDEV-198194 - Making some code common between static and dynamically created module handling.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#37 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#38 edit


[ROCm/hip commit: 15685fcc88]
2019-08-20 17:11:51 -04:00
foreman a984284070 P4 to Git Change 1984760 by kjayapra@4_HIPWS_RCCM_ROCM on 2019/08/16 18:33:19
SWDEV-188177 - Fixing parameters passed to ihipBindTexture in case of 1D image.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#16 edit


[ROCm/hip commit: 182ccd1ab0]
2019-08-16 18:38:15 -04:00
foreman 4f2b941c41 P4 to Git Change 1984568 by kjayapra@4_HIPWS_RCCM_ROCM on 2019/08/16 13:19:53
SWDEV-144570 - Code changes to support Texture header changes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#15 edit


[ROCm/hip commit: 2296b63a86]
2019-08-16 13:27:47 -04:00
foreman 3a6d027037 P4 to Git Change 1982056 by kjayapra@3_HIPWS_TXT_ROCM on 2019/08/12 21:16:51
SWDEV-199635 - Setting VarInfoCallBack before all program builds.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#36 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#33 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#58 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#97 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#106 edit


[ROCm/hip commit: c2ffca9c1d]
2019-08-12 21:22:47 -04:00
foreman 0bc4597204 P4 to Git Change 1982034 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/12 19:08:13
SWDEV-199606 - [HIP] hipStreamQuery issue
	Don't test against CL_COMMAND_MARKER but 0 instead (user visible=false)

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#6 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#22 edit


[ROCm/hip commit: 4e71cb437e]
2019-08-12 19:16:30 -04:00
foreman 7a63df9aec P4 to Git Change 1981324 by kjayapra@3_HIPWS_TXT_ROCM on 2019/08/11 18:44:40
SWDEV-188177 - Texture API implementation and support for extern variables.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#18 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#20 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#35 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#37 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#340 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#57 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.hpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#608 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#172 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.cpp#250 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpuprogram.hpp#79 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#152 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#41 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#96 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.hpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#133 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#105 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#48 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.cpp#102 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/program.hpp#46 edit


[ROCm/hip commit: 7e6c7a9b8a]
2019-08-11 18:53:11 -04:00
foreman 250a441899 P4 to Git Change 1981152 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/09 20:33:32
SWDEV-193430 - [HIP] Delay creating HostQueue till commands are enqueued.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#5 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#34 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#68 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#21 edit


[ROCm/hip commit: 92840c805e]
2019-08-09 20:41:13 -04:00
foreman 3bd691396e P4 to Git Change 1981065 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/09 16:31:44
SWDEV-198556 - [HIP] Use src/dstMemory->getContext instead of host_context.
	Also relax the check for P2P copies in case of hipMemcpy(hostMalloced, hipMalloced(dev1), dev0)

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#67 edit


[ROCm/hip commit: 867d521a08]
2019-08-09 16:34:38 -04:00
foreman fae2466d94 P4 to Git Change 1980493 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/08 18:57:21
SWDEV-198556 - [HIP] Override queue if src/dst's device doesn't match the queue's device.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#66 edit


[ROCm/hip commit: 8fdb248e42]
2019-08-08 19:03:46 -04:00
foreman 9f77fec459 P4 to Git Change 1979518 by kjayapra@3_HIPWS_TXHDR_ROCM on 2019/08/07 12:40:50
SWDEV-198194 - Adding lock in Unregistervar.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#36 edit


[ROCm/hip commit: 4092ab0f7a]
2019-08-07 12:45:50 -04:00
foreman 1a4364558a P4 to Git Change 1979482 by zukhan@zukhan-ocl-win10 on 2019/08/07 11:30:33
SWDEV-196326 - [hipclang-vdi-rocm]: [FBA-80]: Runtime error when all GPUs are hidden by ROCR_VISIBLE_DEVICES
	 - Adjusted hipGetDeviceCount to return "hipErrorNoDevice".
	 - This was done to match the same behaviour as HIP-HCC, and API spec for when no devices are found. Rather then return "count = 0" as it did.

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/17789/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device.cpp#22 edit


[ROCm/hip commit: 7f0373f2e3]
2019-08-07 11:44:31 -04:00
foreman 32919a1ecb P4 to Git Change 1979479 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/07 11:25:59
SWDEV-198424 - [HIP] Implemented missing fields for hipGetDeviceAttribute.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#17 edit


[ROCm/hip commit: 042a4c2bb2]
2019-08-07 11:30:59 -04:00
foreman b6bbedd924 P4 to Git Change 1979460 by kjayapra@3_HIPWS_TXHDR_ROCM on 2019/08/07 11:05:27
SWDEV-145570 - Code changes after header update.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#13 edit


[ROCm/hip commit: fa2d5186ba]
2019-08-07 11:07:16 -04:00
foreman a36e00c36d P4 to Git Change 1979123 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/06 17:15:10
SWDEV-198556 - [HIP] Gnarly bug due to macros:
	HIP_RETURN(ret) duplicates ret twice first by setting the last error
	then via LogDebugInfo. So if HIP_RETURN has a function as a parameter,
	the function would get called twice. So ihipMalloc and ihipMemcpy were
	being called twice (and perhaps more functions).
	Also logging the pointer returned by ihipMalloc so we can track memory
	in logs more easily.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#33 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#65 edit


[ROCm/hip commit: adf865621c]
2019-08-06 17:19:28 -04:00
foreman cd496a5a90 P4 to Git Change 1978068 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/03 12:16:57
SWDEV-198546 - [HIP] Implemented basic Ctx APIs that are deprecated.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#20 edit


[ROCm/hip commit: 44813761ba]
2019-08-03 12:23:25 -04:00
foreman 6ccfa38fa3 P4 to Git Change 1977240 by wchau@wc_hip_vdi on 2019/08/01 16:35:47
SWDEV-180872 - Runtime support changes for Cooperative Group Features
	- Taking into account of SGPRs usage to determine the block size

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#35 edit


[ROCm/hip commit: 99d54cb7bd]
2019-08-01 16:40:24 -04:00
foreman 121dddd188 P4 to Git Change 1977018 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/01 11:58:59
SWDEV-197462 - [HIP] Add HIP_INIT_API to hipFree to initialize current thread.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#64 edit


[ROCm/hip commit: ed497989b9]
2019-08-01 12:06:04 -04:00
foreman 3397e07ced P4 to Git Change 1975700 by jatang@jatang_win_pal_lc on 2019/07/30 11:21:33
SWDEV-1 - Fix HIP build.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/fixme.cpp#2 edit


[ROCm/hip commit: 00abf507c3]
2019-07-30 11:26:23 -04:00
foreman 31bb0d4fdb P4 to Git Change 1972936 by pghafari@pghafari-gera-win10 on 2019/07/24 12:22:38
SWDEV-197122 - Renamed filename typo

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/CMakeFiles.txt#2 move/delete
... //depot/stg/opencl/drivers/opencl/api/hip/CMakeLists.txt#1 move/add


[ROCm/hip commit: e4072ca0ec]
2019-07-24 12:29:10 -04:00
foreman 7f4b948a25 P4 to Git Change 1972922 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/24 11:59:16
SWDEV-196881 - [HIP] Fix hipStreamQuery implementation. Using isEmpty isn't correct.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#32 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#20 edit


[ROCm/hip commit: 01a2cf16e0]
2019-07-24 12:05:47 -04:00
foreman d6e574ffb2 P4 to Git Change 1972736 by pghafari@pghafari-gera-win10 on 2019/07/24 04:51:12
SWDEV-197122 - Initial CMake file for hip-vdi-rocr path

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/CMakeFiles.txt#1 add


[ROCm/hip commit: 49bdfeb9be]
2019-07-24 05:06:40 -04:00
foreman ab4f7de476 P4 to Git Change 1972494 by kjayapra@0_HIPWS_P2P1_ROCM on 2019/07/23 20:01:13
SWDEV-144570 - Handling variable register during hipModuleLoad.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#31 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.hpp#30 edit


[ROCm/hip commit: c837c6e769]
2019-07-23 20:09:59 -04:00
foreman 4fac385159 P4 to Git Change 1972329 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/23 15:16:26
SWDEV-197168 - [HIP] handle width or height or src or dst being 0

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#63 edit


[ROCm/hip commit: 6987b1896b]
2019-07-23 15:18:51 -04:00
foreman 4cd4f327df P4 to Git Change 1968576 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/16 15:48:18
SWDEV-145570 - [HIP] update code after header change

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#12 edit


[ROCm/hip commit: fd89a62ac3]
2019-07-16 15:54:22 -04:00
foreman 8725261ad6 P4 to Git Change 1966049 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/11 19:05:18
SWDEV-189500 - [HIP] Have to force async=false for host to device case as well

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#61 edit


[ROCm/hip commit: 763c294441]
2019-07-11 19:13:06 -04:00
foreman 0122ae9530 P4 to Git Change 1960615 by gandryey@gera-win10 on 2019/07/08 18:05:10
SWDEV-79445 - HIP generic changes and code clean-up
	- Correct elapsed time calculation. Use event start and end.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#13 edit


[ROCm/hip commit: ff4b99311a]
2019-07-08 18:14:01 -04:00
foreman e48868e55b P4 to Git Change 1940447 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/03 14:44:49
SWDEV-194872 - [HIP] CUDA and HCC sync after a DeviceToHost async copy.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#60 edit


[ROCm/hip commit: 4a26d801fb]
2019-07-03 14:51:15 -04:00
foreman d8324e6ed8 P4 to Git Change 1917740 by cpaquot@cpaquot-ocl-lc-lnx on 2019/06/28 12:03:07
SWDEV-193938 - [HIP] RCCL test fails
	Set default stream to null-stream
	In hipStreamWaitEvent if event_ is null, get the last queued command instead.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#4 edit


[ROCm/hip commit: e9f58233e1]
2019-06-28 12:09:09 -04:00
foreman e07ddab2d9 P4 to Git Change 1917620 by yaxunl@yaxunl-lc10 on 2019/06/28 07:57:46
SWDEV-145570 - Fix device name mismatch.

	Not only gfx906 can have device name with +xnack etc.
	Other devices e.g. gfx900 could have that too.
	Make the previous fix more generic.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#33 edit


[ROCm/hip commit: 97a079826e]
2019-06-28 08:06:23 -04:00
foreman 9176cd6664 P4 to Git Change 1879857 by gandryey@gera-win10 on 2019/06/20 18:06:07
SWDEV-184710 - Support hipLaunchCooperativeKernelMultiDevice()
	- Clean-up the loop for the launch on each device
	- Add hipExtLaunchMultiKernelMultiDevice()

	 http://ocltc.amd.com/reviews/r/17573/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#29 edit


[ROCm/hip commit: fd634d58f1]
2019-06-20 18:13:20 -04:00
foreman fcc31b820e P4 to Git Change 1815357 by cpaquot@cpaquot-ocl-lc-lnx on 2019/06/12 16:24:09
SWDEV-192384 - [HIP] Fixed case where start and stop events are the same for ElapsedTime.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#11 edit


[ROCm/hip commit: 975d4b53f7]
2019-06-13 15:40:39 -04:00
foreman 2c7de7cd62 P4 to Git Change 1815283 by gandryey@gera-win10 on 2019/06/12 15:26:45
SWDEV-180872 - Runtime support changes for Cooperative Group Features
	- Correct the name in the def files

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#17 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#19 edit


[ROCm/hip commit: 4e8767e7f5]
2019-06-13 13:00:48 -04:00
foreman 8e5d4c3ac4 P4 to Git Change 1815146 by cpaquot@cpaquot-ocl-lc-lnx on 2019/06/12 13:43:33
SWDEV-192333 - [HIP] Implemented hipExtMallocWithFlags and hipExtGetLinkTypeAndHopCount

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#16 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#18 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#59 edit


[ROCm/hip commit: b15b82ffb9]
2019-06-13 12:47:17 -04:00
foreman 869eff62bb P4 to Git Change 1809454 by kjayapra@0_HIPWS_TRT2_ROCM on 2019/06/11 17:45:36
SWDEV-191936 - Implementation of hipMemcpyPeer & hipMemcpyPeerAsync + fix to check source P2PAgents for P2P. RocM Verified.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_peer.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#76 edit


[ROCm/hip commit: c2b13cce1d]
2019-06-12 10:48:40 -04:00
foreman 71fba5bf67 P4 to Git Change 1809277 by gandryey@gera-win10 on 2019/06/11 17:34:13
SWDEV-180872 - Runtime support changes for Cooperative Group Features
	- Initial implementation of the core functionality. Disabled by default. Use GPU_ENABLE_COOP_GROUPS=1 to enable the feature.
	- Runtime uses device queue for cooperative executions with a synchronization on the launched queue.
	- The current implementation is pure runtime change and it can work if only one app uses this feature. No ROCr/KFD support was added or tested
	- Only inline assembler was tested

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#17 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#338 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#606 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#171 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#142 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palschedcl.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#135 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#61 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocschedcl.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#75 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#94 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#92 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#311 edit


[ROCm/hip commit: 17943639e4]
2019-06-12 10:00:38 -04:00
foreman f11c7b6bc3 P4 to Git Change 1792745 by vsytchen@vsytchen-remote-ocl-win10 on 2019/06/06 11:18:13
SWDEV-145570 - Simplify pinned memory allocation logic

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/17467/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#58 edit


[ROCm/hip commit: 21b1c1a183]
2019-06-06 11:51:22 -04:00
foreman c195272bac P4 to Git Change 1791519 by vsytchen@vsytchen-remote-ocl-win10 on 2019/06/04 10:06:43
SWDEV-189383 - [HIP CQE][HIPonPAL][WIN] hipDeviceMalloc, hip_test_ldg, hipHostRegister, hipModule, hipStreamSync2 tests failed on VEGA10.

	1. For pinned memory allocations add the host pointer and all of its respective device pointers to the memory object map.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#57 edit


[ROCm/hip commit: 0e11975282]
2019-06-04 10:24:11 -04:00
foreman 8025d4046b P4 to Git Change 1789054 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/29 14:02:50
SWDEV-190565 - [HIP] Don't use clSetEventWaitList and just add the event to the list in HIP.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_common.hpp#24 edit


[ROCm/hip commit: 78905ef7e9]
2019-05-29 14:38:26 -04:00
foreman 98c937d333 P4 to Git Change 1788615 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/28 18:54:55
SWDEV-190565 - [HIP] Allow null stream as argument to hipStreamWaitEvent.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#19 edit


[ROCm/hip commit: 3330d5e7a4]
2019-05-28 19:06:58 -04:00
foreman 775fbfb503 P4 to Git Change 1788476 by kjayapra@1_HIPWS_TRT2_PAL on 2019/05/28 14:44:38
SWDEV-190190 - Implemented API hipCanAccessPeerDevice

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_peer.cpp#5 edit


[ROCm/hip commit: 0f9c66c7d9]
2019-05-28 15:08:10 -04:00
foreman fec9e6e867 P4 to Git Change 1787806 by yaxunl@yaxunl-lc10 on 2019/05/27 10:53:56
SWDEV-145570 - Support loading fat binary generated through --genco by hipModuleLoad.

	hip-clang --genco generates fat binary instead of code object. To support that
	we need to extract code object from fat binary in hipModuleLoadData. This is
	needed for hipRTC since multiple GPU archs may be passed.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#308 edit


[ROCm/hip commit: 37cbce4df7]
2019-05-27 20:11:08 -04:00
foreman d38311e23e P4 to Git Change 1783301 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/15 11:57:57
SWDEV-189488 - [HIP] Caffe2 TensorTest.TensorSerializationMultiDevices fails
	1. Make sure to set attributes->device to current device for host malloc'd
	2. Return hipSuccess for hipDeviceCanAccessPeer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_peer.cpp#4 edit


[ROCm/hip commit: 15d1df9255]
2019-05-15 12:22:50 -04:00
foreman b83b1af9ad P4 to Git Change 1781540 by kpyzhov@hip-vdi-pal-win on 2019/05/10 17:10:30
SWDEV-188234 - Corrected reading binary module file in hipModuleLoad().

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#26 edit


[ROCm/hip commit: 680e1a6552]
2019-05-10 19:19:11 -04:00
foreman 1489d1addd P4 to Git Change 1781384 by michliao@hliao-dev-00-hip.rocm-workspace on 2019/05/10 14:00:04
SWDEV-189216 - Export `hipExtModuleLaunchKernel`

	RB: http://ocltc.amd.com/reviews/r/17332/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#16 edit


[ROCm/hip commit: 0949beeeb2]
2019-05-10 15:50:37 -04:00
foreman 09deb3b408 P4 to Git Change 1780704 by yaxunl@yaxunl-lc10 on 2019/05/09 13:26:13
SWDEV-145570 - Fix device name mismatch for gfx906.

	For now hip-clang can only emits gfx906 ISA with conservative configurations, i.e. with ecc on and xnack on, therefore it is always gfx906. It is still under discussion how to encode the target id for xnack off or ecc off.

	Therefore, the reasonable solution for now is just allow code object marked as gfx906 to be loaded on any device name that starts with gfx906. We will have more detailed control once hip-clang is able to emit code object for gfx906 with ecc off or xnack off.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#30 edit


[ROCm/hip commit: 1d60371d01]
2019-05-09 14:19:54 -04:00
foreman e94aa7fb8c P4 to Git Change 1778985 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/06 17:21:57
SWDEV-188668 - [HIP] Implemented hipExtModuleLaunchKernel. Ignoring the flag for now.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#14 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#25 edit


[ROCm/hip commit: 1e856beba9]
2019-05-06 18:02:54 -04:00
foreman dcc5c0c214 P4 to Git Change 1778971 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/06 17:03:12
SWDEV-187125 - [HIP] Protect hip::Event with lock and add threadId to logs.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#30 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#18 edit


[ROCm/hip commit: d41643c39f]
2019-05-06 17:43:06 -04:00