Commit Graph

172 Commits

Author SHA1 Message Date
foreman fa2d5186ba P4 to Git Change 1979460 by kjayapra@3_HIPWS_TXHDR_ROCM on 2019/08/07 11:05:27
SWDEV-145570 - Code changes after header update.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#13 edit
2019-08-07 11:07:16 -04:00
foreman adf865621c P4 to Git Change 1979123 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/06 17:15:10
SWDEV-198556 - [HIP] Gnarly bug due to macros:
	HIP_RETURN(ret) evaluates ret twice: first when setting the last error,
	then again via LogDebugInfo. So if HIP_RETURN is given a function call as
	its parameter, that function gets called twice. As a result, ihipMalloc
	and ihipMemcpy were being called twice (and perhaps other functions too).
	Also log the pointer returned by ihipMalloc so we can track memory
	in logs more easily.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#33 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#65 edit
2019-08-06 17:19:28 -04:00
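
The double evaluation described in change 1979123 above can be reproduced with a minimal sketch. Everything here is hypothetical and for illustration only: `RETURN_BUGGY`, `RETURN_FIXED`, `fake_alloc` and the globals are stand-ins, not the actual HIP_RETURN macro or ihipMalloc, whose definitions are not shown in this log. The assumed fix is to evaluate the argument once and reuse the stored value.

```cpp
// Hypothetical globals standing in for "last error" and "logged value".
static int g_last_error = 0;
static int g_logged = 0;
static int g_calls = 0;

// Hypothetical stand-in for a function with side effects (e.g. an allocator).
int fake_alloc() {
  ++g_calls;
  return 0;
}

// Buggy shape: the macro argument is expanded twice, so a function call
// passed as the argument runs twice.
#define RETURN_BUGGY(ret)   \
  do {                      \
    g_last_error = (ret);   \
    g_logged = (ret);       \
    return g_last_error;    \
  } while (0)

// Assumed fix: evaluate the argument once and reuse the stored result.
#define RETURN_FIXED(ret)     \
  do {                        \
    g_last_error = (ret);     \
    g_logged = g_last_error;  \
    return g_last_error;      \
  } while (0)

int api_buggy() { RETURN_BUGGY(fake_alloc()); }
int api_fixed() { RETURN_FIXED(fake_alloc()); }

// Counts how many times fake_alloc runs during one API call.
int calls_made_by(int (*api)()) {
  g_calls = 0;
  api();
  return g_calls;
}
```

With this sketch, `calls_made_by(api_buggy)` counts two calls to `fake_alloc` and `calls_made_by(api_fixed)` counts one.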
foreman 44813761ba P4 to Git Change 1978068 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/03 12:16:57
SWDEV-198546 - [HIP] Implemented basic Ctx APIs that are deprecated.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#20 edit
2019-08-03 12:23:25 -04:00
foreman 99d54cb7bd P4 to Git Change 1977240 by wchau@wc_hip_vdi on 2019/08/01 16:35:47
SWDEV-180872 - Runtime support changes for Cooperative Group Features
	- Take SGPR usage into account when determining the block size

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#35 edit
2019-08-01 16:40:24 -04:00
foreman ed497989b9 P4 to Git Change 1977018 by cpaquot@cpaquot-ocl-lc-lnx on 2019/08/01 11:58:59
SWDEV-197462 - [HIP] Add HIP_INIT_API to hipFree to initialize current thread.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#64 edit
2019-08-01 12:06:04 -04:00
foreman 00abf507c3 P4 to Git Change 1975700 by jatang@jatang_win_pal_lc on 2019/07/30 11:21:33
SWDEV-1 - Fix HIP build.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/fixme.cpp#2 edit
2019-07-30 11:26:23 -04:00
foreman e4072ca0ec P4 to Git Change 1972936 by pghafari@pghafari-gera-win10 on 2019/07/24 12:22:38
SWDEV-197122 - Renamed file to fix typo in its name

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/CMakeFiles.txt#2 move/delete
... //depot/stg/opencl/drivers/opencl/api/hip/CMakeLists.txt#1 move/add
2019-07-24 12:29:10 -04:00
foreman 01a2cf16e0 P4 to Git Change 1972922 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/24 11:59:16
SWDEV-196881 - [HIP] Fix hipStreamQuery implementation. Using isEmpty isn't correct.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#32 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#20 edit
2019-07-24 12:05:47 -04:00
foreman 49bdfeb9be P4 to Git Change 1972736 by pghafari@pghafari-gera-win10 on 2019/07/24 04:51:12
SWDEV-197122 - Initial CMake file for hip-vdi-rocr path

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/CMakeFiles.txt#1 add
2019-07-24 05:06:40 -04:00
foreman c837c6e769 P4 to Git Change 1972494 by kjayapra@0_HIPWS_P2P1_ROCM on 2019/07/23 20:01:13
SWDEV-144570 - Handling variable register during hipModuleLoad.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#31 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#30 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#34 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.hpp#30 edit
2019-07-23 20:09:59 -04:00
foreman 6987b1896b P4 to Git Change 1972329 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/23 15:16:26
SWDEV-197168 - [HIP] handle width or height or src or dst being 0

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#63 edit
2019-07-23 15:18:51 -04:00
foreman fd89a62ac3 P4 to Git Change 1968576 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/16 15:48:18
SWDEV-145570 - [HIP] update code after header change

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#62 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#12 edit
2019-07-16 15:54:22 -04:00
foreman 763c294441 P4 to Git Change 1966049 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/11 19:05:18
SWDEV-189500 - [HIP] Have to force async=false for host to device case as well

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#61 edit
2019-07-11 19:13:06 -04:00
foreman ff4b99311a P4 to Git Change 1960615 by gandryey@gera-win10 on 2019/07/08 18:05:10
SWDEV-79445 - HIP generic changes and code clean-up
	- Correct elapsed time calculation. Use event start and end.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#13 edit
2019-07-08 18:14:01 -04:00
foreman 4a26d801fb P4 to Git Change 1940447 by cpaquot@cpaquot-ocl-lc-lnx on 2019/07/03 14:44:49
SWDEV-194872 - [HIP] CUDA and HCC sync after a DeviceToHost async copy.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#60 edit
2019-07-03 14:51:15 -04:00
foreman e9f58233e1 P4 to Git Change 1917740 by cpaquot@cpaquot-ocl-lc-lnx on 2019/06/28 12:03:07
SWDEV-193938 - [HIP] RCCL test fails
	Set default stream to null-stream
	In hipStreamWaitEvent if event_ is null, get the last queued command instead.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#4 edit
2019-06-28 12:09:09 -04:00
foreman 97a079826e P4 to Git Change 1917620 by yaxunl@yaxunl-lc10 on 2019/06/28 07:57:46
SWDEV-145570 - Fix device name mismatch.

	gfx906 is not the only device whose name can carry +xnack etc.
	Other devices, e.g. gfx900, can have that too.
	Make the previous fix more generic.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#33 edit
2019-06-28 08:06:23 -04:00
foreman fd634d58f1 P4 to Git Change 1879857 by gandryey@gera-win10 on 2019/06/20 18:06:07
SWDEV-184710 - Support hipLaunchCooperativeKernelMultiDevice()
	- Clean-up the loop for the launch on each device
	- Add hipExtLaunchMultiKernelMultiDevice()

	 http://ocltc.amd.com/reviews/r/17573/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#29 edit
2019-06-20 18:13:20 -04:00
foreman 975d4b53f7 P4 to Git Change 1815357 by cpaquot@cpaquot-ocl-lc-lnx on 2019/06/12 16:24:09
SWDEV-192384 - [HIP] Fixed case where start and stop events are the same for ElapsedTime.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#11 edit
2019-06-13 15:40:39 -04:00
foreman 4e8767e7f5 P4 to Git Change 1815283 by gandryey@gera-win10 on 2019/06/12 15:26:45
SWDEV-180872 - Runtime support changes for Cooperative Group Features
	- Correct the name in the def files

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#17 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#19 edit
2019-06-13 13:00:48 -04:00
foreman b15b82ffb9 P4 to Git Change 1815146 by cpaquot@cpaquot-ocl-lc-lnx on 2019/06/12 13:43:33
SWDEV-192333 - [HIP] Implemented hipExtMallocWithFlags and hipExtGetLinkTypeAndHopCount

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#16 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#16 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#18 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#59 edit
2019-06-13 12:47:17 -04:00
foreman c2b13cce1d P4 to Git Change 1809454 by kjayapra@0_HIPWS_TRT2_ROCM on 2019/06/11 17:45:36
SWDEV-191936 - Implementation of hipMemcpyPeer & hipMemcpyPeerAsync + fix to check source P2PAgents for P2P. Verified on ROCm.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_peer.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#76 edit
2019-06-12 10:48:40 -04:00
foreman 17943639e4 P4 to Git Change 1809277 by gandryey@gera-win10 on 2019/06/11 17:34:13
SWDEV-180872 - Runtime support changes for Cooperative Group Features
	- Initial implementation of the core functionality. Disabled by default. Use GPU_ENABLE_COOP_GROUPS=1 to enable the feature.
	- Runtime uses a device queue for cooperative executions, with a synchronization on the launched queue.
	- The current implementation is a pure runtime change and works only if a single app uses this feature. No ROCr/KFD support was added or tested.
	- Only inline assembler was tested.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device.cpp#20 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#17 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#28 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#338 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.cpp#606 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/gpu/gpudevice.hpp#171 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.hpp#9 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#142 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.hpp#39 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palschedcl.cpp#6 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#135 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#61 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.cpp#32 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocblit.hpp#12 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#127 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocschedcl.cpp#3 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#75 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#94 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.hpp#92 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#311 edit
2019-06-12 10:00:38 -04:00
foreman 21b1c1a183 P4 to Git Change 1792745 by vsytchen@vsytchen-remote-ocl-win10 on 2019/06/06 11:18:13
SWDEV-145570 - Simplify pinned memory allocation logic

	ReviewBoardURL = http://ocltc.amd.com/reviews/r/17467/diff/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#58 edit
2019-06-06 11:51:22 -04:00
foreman 0e11975282 P4 to Git Change 1791519 by vsytchen@vsytchen-remote-ocl-win10 on 2019/06/04 10:06:43
SWDEV-189383 - [HIP CQE][HIPonPAL][WIN] hipDeviceMalloc, hip_test_ldg, hipHostRegister, hipModule, hipStreamSync2 tests failed on VEGA10.

	1. For pinned memory allocations add the host pointer and all of its respective device pointers to the memory object map.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#57 edit
2019-06-04 10:24:11 -04:00
foreman 78905ef7e9 P4 to Git Change 1789054 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/29 14:02:50
SWDEV-190565 - [HIP] Don't use clSetEventWaitList and just add the event to the list in HIP.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_common.hpp#24 edit
2019-05-29 14:38:26 -04:00
foreman 3330d5e7a4 P4 to Git Change 1788615 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/28 18:54:55
SWDEV-190565 - [HIP] Allow null stream as argument to hipStreamWaitEvent.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#19 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#19 edit
2019-05-28 19:06:58 -04:00
foreman 0f9c66c7d9 P4 to Git Change 1788476 by kjayapra@1_HIPWS_TRT2_PAL on 2019/05/28 14:44:38
SWDEV-190190 - Implemented API hipCanAccessPeerDevice

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_peer.cpp#5 edit
2019-05-28 15:08:10 -04:00
foreman 37cbce4df7 P4 to Git Change 1787806 by yaxunl@yaxunl-lc10 on 2019/05/27 10:53:56
SWDEV-145570 - Support loading fat binary generated through --genco by hipModuleLoad.

	hip-clang --genco generates fat binary instead of code object. To support that
	we need to extract code object from fat binary in hipModuleLoadData. This is
	needed for hipRTC since multiple GPU archs may be passed.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#308 edit
2019-05-27 20:11:08 -04:00
foreman 15d1df9255 P4 to Git Change 1783301 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/15 11:57:57
SWDEV-189488 - [HIP] Caffe2 TensorTest.TensorSerializationMultiDevices fails
	1. Make sure to set attributes->device to current device for host-malloc'd memory
	2. Return hipSuccess for hipDeviceCanAccessPeer

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#56 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_peer.cpp#4 edit
2019-05-15 12:22:50 -04:00
foreman 680e1a6552 P4 to Git Change 1781540 by kpyzhov@hip-vdi-pal-win on 2019/05/10 17:10:30
SWDEV-188234 - Corrected reading binary module file in hipModuleLoad().

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#26 edit
2019-05-10 19:19:11 -04:00
foreman 0949beeeb2 P4 to Git Change 1781384 by michliao@hliao-dev-00-hip.rocm-workspace on 2019/05/10 14:00:04
SWDEV-189216 - Export `hipExtModuleLaunchKernel`

	RB: http://ocltc.amd.com/reviews/r/17332/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#16 edit
2019-05-10 15:50:37 -04:00
foreman 1d60371d01 P4 to Git Change 1780704 by yaxunl@yaxunl-lc10 on 2019/05/09 13:26:13
SWDEV-145570 - Fix device name mismatch for gfx906.

	For now hip-clang can only emit gfx906 ISA with conservative configurations, i.e. with ecc on and xnack on, so the target is always plain gfx906. It is still under discussion how to encode the target id for xnack off or ecc off.

	Therefore, the reasonable solution for now is to allow a code object marked as gfx906 to be loaded on any device whose name starts with gfx906. We will have more detailed control once hip-clang is able to emit code objects for gfx906 with ecc off or xnack off.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#30 edit
2019-05-09 14:19:54 -04:00
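
The prefix rule described in change 1780704 above ("any device name that starts with gfx906") can be sketched as follows. This is a minimal illustration, not the code in hip_platform.cpp: the function name `target_matches_device` and the example device strings are assumptions.

```cpp
#include <string>

// Hypothetical helper: returns true when the device name begins with the
// target recorded in the code object, e.g. target "gfx906" against a device
// name that carries feature suffixes such as "+xnack".
bool target_matches_device(const std::string& target,
                           const std::string& device_name) {
  // compare() on the first target.size() characters of device_name; a
  // device name shorter than the target can never compare equal.
  return device_name.compare(0, target.size(), target) == 0;
}
```

A plain prefix test is deliberately loose: a target of `gfx90` would also match `gfx906`, so a stricter check (for example, requiring the next character to be `+` or the end of the string) may be preferable in practice.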
foreman 1e856beba9 P4 to Git Change 1778985 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/06 17:21:57
SWDEV-188668 - [HIP] Implemented hipExtModuleLaunchKernel. Ignoring the flag for now.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#14 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#15 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#25 edit
2019-05-06 18:02:54 -04:00
foreman d41643c39f P4 to Git Change 1778971 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/06 17:03:12
SWDEV-187125 - [HIP] Protect hip::Event with lock and add threadId to logs.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.cpp#9 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_event.hpp#3 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#30 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#24 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#18 edit
2019-05-06 17:43:06 -04:00
foreman 09920e6024 P4 to Git Change 1778949 by michliao@hliao-dev-00-hip.rocm-workspace on 2019/05/06 16:36:53
SWDEV-144570 - Fix build failure after switching to gcc-7

	- Hex representation of float needs gnu++11. We'd better not rely on
	  that. Change the float in hex format into an alternative representation.

	RBT: http://ocltc.amd.com/reviews/r/17300/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#29 edit
2019-05-06 17:20:00 -04:00
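
As a hedged illustration of change 1778949 above: hexadecimal floating-point literals such as `0x1.0p-24f` are only standard from C++17 on and otherwise depend on a GNU extension. One portable alternative is to build the value with `std::ldexp`. The constant below is hypothetical; the log does not say which value or which replacement hip_platform.cpp actually uses.

```cpp
#include <cmath>

// Hypothetical constant: 2^-24, which a hex float literal would spell as
// 0x1.0p-24f. std::ldexp(x, e) computes x * 2^e exactly for this input,
// so no GNU extension or C++17 literal syntax is required.
inline float two_to_minus_24() {
  return std::ldexp(1.0f, -24);
}
```

A decimal literal with enough digits (here `5.9604644775390625e-08f`) is another option, since powers of two in this range are exactly representable as `float`.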
foreman 3ad9c3f984 P4 to Git Change 1777225 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/01 18:47:25
SWDEV-188233 - [HIP] Implemented hipGetChannelDesc.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_texture.cpp#11 edit
2019-05-01 19:53:29 -04:00
foreman ba93e1ae21 P4 to Git Change 1777158 by cpaquot@cpaquot-ocl-lc-lnx on 2019/05/01 17:12:01
SWDEV-188219 - [HIP] Add hipStreamCreateWithPriority. NOP right now.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#14 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#13 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#14 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/commandqueue.hpp#21 edit
2019-05-01 18:43:47 -04:00
foreman 345f3f989a P4 to Git Change 1775995 by gandryey@gera-w8 on 2019/04/29 13:46:53
SWDEV-79445 - OCL generic changes and code clean-up
	- Enable P2P extension for PAL path, currently it's staging copy only
	- Fix P2P staging copy

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#55 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.cpp#243 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#336 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#130 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#77 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#131 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.hpp#59 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#124 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.hpp#36 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocvirtual.cpp#74 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#92 edit
2019-04-29 13:58:26 -04:00
foreman 9011aa9282 P4 to Git Change 1775366 by kjayapra@99_HIPWS_SLV_CHECKIN on 2019/04/26 14:48:23
SWDEV-144570 - hipFuncGetAttributes() API changes.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#29 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#23 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#28 edit
2019-04-26 15:15:48 -04:00
foreman 80fb280643 P4 to Git Change 1772785 by cpaquot@cpaquot-ocl-lc-lnx on 2019/04/22 14:31:18
SWDEV-144570 - [HIP] Lazy build kernels to avoid overfilling dev memory.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#28 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#27 edit
2019-04-22 15:19:24 -04:00
foreman 84679ca20e P4 to Git Change 1772349 by mshivama@mshivama_tf on 2019/04/20 07:35:24
SWDEV-187020 - Basic "Hello World" Tensorflow program fails to execute on HIP/VDI/PAL/LINUX

	Make sure that the size of `dest` is big enough to hold `src` including the trailing zero byte

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device.cpp#19 edit
2019-04-21 21:50:04 -04:00
foreman 7a1d02c8a1 P4 to Git Change 1772193 by mshivama@mshivama_tf on 2019/04/19 09:39:39
SWDEV-187020 - Basic "Hello World" Tensorflow program fails to execute on HIP/VDI/PAL/LINUX

	The API hipDeviceGetName() should pass the length of the device name as the argument to strncpy()
	instead of the total (dest) memory size passed in by its caller. Also, a bit of code clean-up.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_device.cpp#18 edit
2019-04-19 09:49:37 -04:00
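
The two SWDEV-187020 changes above (1772193 and 1772349) together describe a bounded string copy: copy only the name's own length, and first check that the destination can hold it plus the trailing zero byte. A minimal sketch under those assumptions follows; `copy_name` is a hypothetical helper, not the actual hipDeviceGetName() implementation.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical helper: copies src into dest only if dest_size can hold the
// whole string including its terminating zero byte. Returns false (leaving
// dest untouched) otherwise.
bool copy_name(char* dest, std::size_t dest_size, const char* src) {
  if (dest == nullptr || src == nullptr) {
    return false;
  }
  const std::size_t len = std::strlen(src);
  if (dest_size < len + 1) {
    return false;  // not enough room for the string plus '\0'
  }
  std::memcpy(dest, src, len + 1);  // len + 1 copies the terminator too
  return true;
}
```

Rejecting an undersized buffer, rather than truncating, is a design choice of this sketch; the log does not state which behaviour the runtime chose.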
foreman 17b5ecc6dc P4 to Git Change 1771752 by cpaquot@cpaquot-ocl-lc-lnx on 2019/04/18 11:54:18
SWDEV-145570 - [HIP] Use isAsync flag for async cases.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#54 edit
2019-04-18 12:15:25 -04:00
foreman 881618fc1b P4 to Git Change 1771336 by cpaquot@cpaquot-ocl-lc-lnx on 2019/04/17 18:19:42
SWDEV-145570 - [HIP] Use staging buffer to copy peer to peer.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#18 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#27 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#53 edit
2019-04-17 18:38:30 -04:00
foreman ac884527f4 P4 to Git Change 1766904 by michliao@hliao-dev-00-hip.rocm-workspace on 2019/04/08 14:42:29
SWDEV-145570 - Check host_context when matching GPU device.

	- In CL#1766264, `host_context` is introduced for mGPU support. Need to
	  match that context specially when trying to match GPU device context.

	The following tests passed:
	$ python test_dataloader.py TestDictDataLoader.test_pin_memory
	.
	----------------------------------------------------------------------
	Ran 1 test in 0.004s

	OK
	$ python test_dataloader.py TestDataLoader.test_sequential_pin_memory
	.
	----------------------------------------------------------------------
	Ran 1 test in 0.063s

	OK
	$ python test_dataloader.py TestDataLoader.test_shuffle_pin_memory
	.
	----------------------------------------------------------------------
	Ran 1 test in 0.174s

	OK
	$ python test_dataloader.py TestStringDataLoader.test_shuffle_pin_memory
	.
	----------------------------------------------------------------------
	Ran 1 test in 0.104s

	OK
	$ python test_torch.py TestTorch.test_pin_memory
	.
	----------------------------------------------------------------------
	Ran 1 test in 0.124s

	OK

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#52 edit
2019-04-08 15:23:35 -04:00
foreman 0f67004f12 P4 to Git Change 1766349 by michliao@hliao-dev-00-hip.rocm-workspace on 2019/04/05 14:24:45
SWDEV-144570 - Fix pointer attribute query.

	- For memory not registered with runtime, return
	  `hipErrorInvalidValue`. That's the behavior expected to check whether
	  a host buffer is pinned.
	- Return `hipErrorInvalidDevice` in case a registered memory object
	  cannot find its matching device.

	RB: http://ocltc.amd.com/reviews/r/17094/

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#51 edit
2019-04-05 14:51:37 -04:00
foreman 8753616c97 P4 to Git Change 1766264 by cpaquot@cpaquot-ocl-lc-lnx on 2019/04/05 11:38:18
SWDEV-145570 - [HIP] Use a context with all devices in system for host register
	hipHostRegister and hipMemcpy 0x10 and 0x20 fail in mGPU systems because
	we only register the memory on the current device. But in HIP, the registering
	needs to happen on all devices.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#26 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#50 edit
2019-04-05 11:58:25 -04:00
foreman 22c104f84f P4 to Git Change 1766115 by cpaquot@cpaquot-ocl-lc-lnx on 2019/04/04 18:29:12
SWDEV-145570 - Don't assert when LOG_LEVEL=3 and return value isn't HIP_SUCCESS.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#25 edit
2019-04-04 18:41:09 -04:00
foreman e3910b720c P4 to Git Change 1766106 by kjayapra@99_HIPWS_SLV_CHECKIN on 2019/04/04 18:07:26
SWDEV-144570 - Implementation of hipMemcpyToSymbol and similar functions for PAL.

Affected files ...

... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#24 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#22 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#26 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#89 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.hpp#37 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.cpp#102 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocprogram.hpp#46 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.cpp#135 edit
2019-04-04 18:22:40 -04:00