SWDEV-2 - Change OpenCL version number from 2910 to 2911.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2658 edit
SWDEV-2 - Change OpenCL version number from 2909 to 2910.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2657 edit
SWDEV-145570 - Support loading fat binary generated through --genco by hipModuleLoad.
hip-clang --genco generates a fat binary instead of a code object. To support that,
we need to extract the code object from the fat binary in hipModuleLoadData. This is
needed for hipRTC, since multiple GPU archs may be passed.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_module.cpp#27 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#31 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/devprogram.cpp#42 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#308 edit
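A minimal sketch of the extraction step described in SWDEV-145570, assuming the fat binary uses the Clang offload bundle layout (magic string, entry count, then offset/size/ID records) and a little-endian host. CodeObjectView, extractCodeObject and the target ID string are hypothetical names for illustration, not the runtime's actual interface.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <string>

// Magic string that opens a Clang offload bundle (assumed fat binary format).
static const char kBundleMagic[] = "__CLANG_OFFLOAD_BUNDLE__";
static const size_t kBundleMagicLen = sizeof(kBundleMagic) - 1;

// Hypothetical result type: a non-owning view into the caller's image.
struct CodeObjectView {
  const unsigned char* data = nullptr;
  uint64_t size = 0;
};

// Reads one host-endian 64-bit field and advances pos; fails on truncation.
static bool readU64(const unsigned char* buf, size_t len, size_t& pos, uint64_t& out) {
  if (pos > len || len - pos < sizeof(uint64_t)) return false;
  std::memcpy(&out, buf + pos, sizeof(uint64_t));
  pos += sizeof(uint64_t);
  return true;
}

// Finds the bundle entry whose ID equals target and returns a view of it.
// Returns false if the image is not a bundle, is truncated, or has no match.
bool extractCodeObject(const void* image, size_t len, const std::string& target,
                       CodeObjectView& out) {
  assert(image != nullptr);
  const unsigned char* buf = static_cast<const unsigned char*>(image);
  if (len < kBundleMagicLen || std::memcmp(buf, kBundleMagic, kBundleMagicLen) != 0) {
    return false;  // not a bundle; a caller could treat it as a plain code object
  }
  size_t pos = kBundleMagicLen;
  uint64_t numEntries = 0;
  if (!readU64(buf, len, pos, numEntries)) return false;
  for (uint64_t i = 0; i < numEntries; ++i) {
    uint64_t offset = 0, size = 0, idLen = 0;
    if (!readU64(buf, len, pos, offset) || !readU64(buf, len, pos, size) ||
        !readU64(buf, len, pos, idLen)) {
      return false;
    }
    if (idLen > len - pos) return false;  // ID string runs past the image
    const std::string id(reinterpret_cast<const char*>(buf + pos),
                         static_cast<size_t>(idLen));
    pos += static_cast<size_t>(idLen);
    if (offset > len || size > len - offset) return false;  // payload out of range
    if (id == target) {
      out.data = buf + offset;
      out.size = size;
      return true;
    }
  }
  return false;
}
```

A loader could call this once per device with that device's target ID and fall back to treating the image as a single code object when the magic string is absent.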
SWDEV-2 - Change OpenCL version number from 2908 to 2909.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2656 edit
SWDEV-2 - Change OpenCL version number from 2907 to 2908.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2655 edit
SWDEV-2 - Change OpenCL version number from 2906 to 2907.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2654 edit
SWDEV-2 - Change OpenCL version number from 2905 to 2906.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2653 edit
SWDEV-190142 - Switch os::TimeNanos to use CLOCK_MONOTONIC_RAW.
Avoids NTP adjustment and re-aligns CPU clock usage with ROCm.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/os/os_posix.cpp#45 edit
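For reference, reading CLOCK_MONOTONIC_RAW (SWDEV-190142) can be sketched as below. timeNanosRaw is a hypothetical stand-in for os::TimeNanos, and the clock ID is Linux-specific.

```cpp
#include <cassert>
#include <cstdint>
#include <time.h>

// Returns nanoseconds from the raw monotonic clock, which is not subject
// to NTP rate adjustment. Hypothetical stand-in for os::TimeNanos.
uint64_t timeNanosRaw() {
  struct timespec ts;
  const int rc = clock_gettime(CLOCK_MONOTONIC_RAW, &ts);
  assert(rc == 0);
  (void)rc;  // silence the unused-variable warning when NDEBUG is defined
  return static_cast<uint64_t>(ts.tv_sec) * 1000000000ULL +
         static_cast<uint64_t>(ts.tv_nsec);
}
```

Timestamps taken this way are only comparable with other CLOCK_MONOTONIC_RAW readings, which is presumably why aligning with ROCm matters.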
SWDEV-189990, SWDEV-190337 - Update OpenCL/PAL gfx10 counter blocks again to match GPUPerfAPI.
SWDEV-190337:
- Update the GFX10 block list (gfx10BlockIsPal) to match GPUPerfAPI, which expects the OpenCL block lists to match OpenGL (UGL). For TA/TD/TCP, nearly all GFX10 ASICs require only 10 or 12 instances (Arden would require 14 and Mero would require 8, but it is unclear whether OCL supports those), but we use 16 instances to match UGL.
- Make sure the blockIdToIndexSelect array contains all the blocks supported by PAL (add a static_assert to ensure this)
- Refactor the PCIndexSelect enum. This enum is used to determine how to sum up counters across multiple block instances. The following types are now supported:
  - Instance -- no autosumming; instances have a one-to-one correlation with PAL.
  - ShaderEngine -- the block is instanced per shader engine, and OpenCL will autosum counters across all PAL instances, providing a single value for all of PAL's instances.
  - ShaderArray -- the block is instanced per shader array, and OpenCL will autosum counters across shader arrays, providing a single value for each instance within a shader array. For example, if a block has four instances per shader array, PAL would expose 16 instances total on Navi10 (2 SEs, 2 SAs per SE), but OpenCL will expose four instances.
  - ComputeUnit -- the block is instanced per compute unit, and OpenCL will autosum counters across shader arrays, providing a single value for each compute-unit-per-shader-array. For example, if a block is instanced per compute unit, then PAL would expose 40 instances on a 40CU Navi10. OpenCL would support 10 instances (2 CUs-per-WGP, 5 WGPs-per-SA), autosummed across shader arrays.
SWDEV-189990:
- Revert the GFX9 and GFX10 tests to the MCVML2 counter they used prior to CL 1766829. This is counter index 2, which the test calls "BigK bank 0 hits". In that change list, I updated the counter index from 2 to 14, since index 14 is the actual counter that represents "BigK bank 0 hits"; counter index 2 is the number of hits, not "BigK" hits. That change caused the test regression reported in SWDEV-189990. With the code reverted to counter 2, the expected value in the test should be correct again. A better fix might be to change the description in the source from "BigK bank 0 hits" to "bank 0 hits", but for now this just restores what the test did before.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palcounters.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLPerfCounters.cpp#47 edit
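The ShaderArray and ComputeUnit autosumming modes from SWDEV-190337 can be illustrated with a short sketch. It assumes PAL instances are ordered shader-array-major (instance index = shaderArray * instancesPerShaderArray + localInstance), which the change description does not state; autosumAcrossShaderArrays is a hypothetical helper, not the code in palcounters.cpp.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Collapses PAL per-instance counter values into one value per instance
// within a shader array by summing across shader arrays.
// Assumed layout: palValues[sa * instancesPerShaderArray + local].
std::vector<uint64_t> autosumAcrossShaderArrays(const std::vector<uint64_t>& palValues,
                                                size_t instancesPerShaderArray) {
  assert(instancesPerShaderArray > 0);
  assert(palValues.size() % instancesPerShaderArray == 0);
  std::vector<uint64_t> result(instancesPerShaderArray, 0);
  for (size_t i = 0; i < palValues.size(); ++i) {
    result[i % instancesPerShaderArray] += palValues[i];
  }
  return result;
}
```

With the Navi10 numbers above, 16 PAL instances at four per shader array collapse to four OpenCL instances, and 40 per-CU instances at 10 per shader array collapse to 10.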
SWDEV-2 - Change OpenCL version number from 2904 to 2905.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2652 edit
SWDEV-2 - Change OpenCL version number from 2903 to 2904.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2651 edit
SWDEV-2 - Change OpenCL version number from 2902 to 2903.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2650 edit
SWDEV-189453 - [Navi10][OpenCl][x32][Converter] Process hang
- Use the argument size from the caller. With the LC path on 32-bit, the two sizes differ, and the runtime has to use the caller's size, which matches the host bitness, because the optimized path updates 32-bit values only.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palblit.cpp#30 edit
SWDEV-2 - Change OpenCL version number from 2901 to 2902.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2649 edit
SWDEV-2 - Change OpenCL version number from 2900 to 2901.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2648 edit
SWDEV-189541 - [HIP] Make sure maxSvmSize is a power of two.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#136 edit
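One way to enforce the power-of-two constraint from SWDEV-189541 is sketched below. The change description does not say whether the size is rounded up or down; rounding down is assumed here so the result never exceeds the original limit. Both helper names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// True when exactly one bit of v is set.
bool isPowerOfTwo(uint64_t v) { return v != 0 && (v & (v - 1)) == 0; }

// Returns the largest power of two that is <= v (assumed rounding direction).
uint64_t roundDownToPowerOfTwo(uint64_t v) {
  assert(v != 0);
  uint64_t p = 1;
  while (p <= v / 2) {
    p <<= 1;  // cannot overflow: p stays <= v
  }
  return p;
}
```

Rounding up instead would need an overflow check for values above 2^63.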
SWDEV-2 - Change OpenCL version number from 2899 to 2900.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2647 edit
SWDEV-2 - Change OpenCL version number from 2898 to 2899.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2646 edit
SWDEV-189140 - Add P2P support in PAL path
- PAL requires a P2P resource to be opened on the device that uses it. Add a new interface to open the resource.
- Add hidden P2P device object creation to amd::Memory. It can be activated with an OCL context that has a single device.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/opencl/amdocl/cl_p2p_amd.cpp#2 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/device.hpp#337 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#134 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#25 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.cpp#74 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palresource.hpp#28 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.cpp#80 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palsettings.hpp#23 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palvirtual.cpp#133 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdevice.cpp#126 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/command.cpp#93 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.cpp#136 edit
... //depot/stg/opencl/drivers/opencl/runtime/platform/memory.hpp#109 edit
... //depot/stg/opencl/drivers/opencl/runtime/utils/flags.hpp#306 edit
SWDEV-2 - Change OpenCL version number from 2897 to 2898.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2645 edit
SWDEV-2 - Change OpenCL version number from 2896 to 2897.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2644 edit
SWDEV-2 - Change OpenCL version number from 2895 to 2896.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2643 edit
SWDEV-2 - Change OpenCL version number from 2894 to 2895.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2642 edit
SWDEV-2 - Change OpenCL version number from 2893 to 2894.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2641 edit
SWDEV-188605 - [CQE OCL][QR][mGPU][Vega][DTB-Blocker][Windows][RS5] Observed Failure of OCLP2PBuffer sub test of ocltst_runtime both 64 & 32 bit due to faulty CL 1775995
- Count offline GPU devices during PAL device detection
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldevice.cpp#132 edit
SWDEV-2 - Change OpenCL version number from 2892 to 2893.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2640 edit
SWDEV-2 - Change OpenCL version number from 2891 to 2892.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2639 edit
SWDEV-2 - Change OpenCL version number from 2890 to 2891.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2638 edit
SWDEV-2 - Change OpenCL version number from 2889 to 2890.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2637 edit
SWDEV-2 - Change OpenCL version number from 2888 to 2889.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2636 edit
SWDEV-2 - Change OpenCL version number from 2887 to 2888.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2635 edit
SWDEV-187956 - [CQE OCL][NAVI 10][WIN][LNX] Assertion failure observed with Subtests of OpenMM on PAL/LC path
- Correct the number of SIMDs and SIMD width for Navi10.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/paldefs.hpp#51 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rocdefs.hpp#24 edit
... //depot/stg/opencl/drivers/opencl/tests/ocltst/module/runtime/OCLDeviceQueries.cpp#54 edit
SWDEV-2 - Change OpenCL version number from 2886 to 2887.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2634 edit
SWDEV-181817 - Remove the HostMemDirectAccess flag for non-tiled 3D images, so that the unimplemented pal::Resource::mapLayers() is not hit by the Fill Image path.
ocltc.amd.com/reviews/r/17238/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palmemory.cpp#23 edit
SWDEV-2 - Change OpenCL version number from 2885 to 2886.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2633 edit
SWDEV-2 - Change OpenCL version number from 2884 to 2885.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2632 edit
SWDEV-185753 - [CQE OCL][QR][DTB-Blocker][Vega][Navi][PCO][Windows] observed Failure with Multiple applications on all asics (Navi, Vega, PCO, RV2 ) due to Faulty CL 1766106
- Keep SVM allocation of code segment for HIP only.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/pal/palprogram.cpp#92 edit
SWDEV-2 - Change OpenCL version number from 2883 to 2884.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2631 edit
SWDEV-187076 - Fix parameter initialization on COMPGR path.
- If 'Args' or '.args' cannot be found, treat the kernel as having no
arguments instead of reporting an error.
RB: http://ocltc.amd.com/reviews/r/17177/
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/device/devkernel.cpp#22 edit
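The lookup behavior from SWDEV-187076 can be sketched as follows. KernelMetadata is a hypothetical stand-in for the parsed kernel metadata (a plain map from key to argument descriptions), and kernelArgs is an illustrative name; the real COMGR-based code in devkernel.cpp is not shown in this log.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for parsed kernel metadata: key -> argument names.
using KernelMetadata = std::map<std::string, std::vector<std::string>>;

// Returns the kernel's arguments. A kernel with neither an 'Args' nor a
// '.args' entry is treated as taking no arguments rather than as malformed.
std::vector<std::string> kernelArgs(const KernelMetadata& md) {
  for (const char* key : {"Args", ".args"}) {
    const auto it = md.find(key);
    if (it != md.end()) {
      return it->second;
    }
  }
  return {};  // missing key: zero arguments, not an error
}
```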
SWDEV-2 - Change OpenCL version number from 2882 to 2883.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2630 edit
SWDEV-2 - Change OpenCL version number from 2881 to 2882.
Affected files ...
... //depot/stg/opencl/drivers/opencl/runtime/utils/versions.hpp#2629 edit