Commit Graph

4404 Commits

Author SHA1 Message Date
Konstantin Zhuravlyov 511065e301 SWDEV-320486 - Correctly set HSAIL_ELFTOOLCHAIN_DIR
Change-Id: Icb39b376eaf3a8132b210bd1bd4eeecaa97bea9d
2022-01-26 18:38:25 -05:00
Payam b473a80f6a SWDEV-245531 - GLInterop adding svm support
Change-Id: Id7f2a7a214f5ad692b4a7acf26cfb1dc423ed54a
2022-01-26 08:17:58 -05:00
Chauncey Hui 7727ea394c SWDEV-2 - Change OpenCL version number from 3414 to 3415 2022-01-26 03:00:06 -05:00
German Andryeyev a543d4a860 SWDEV-318505 - Update HSAIL xnack path
Report proper target id for xnack in HSAIL path. Runtime
will use ISA table and report hsailName().
Fix offline compilation path for PAL.

Change-Id: Ic0250bf6b9c193d867aec9800a319da1bf00c3ee
2022-01-24 09:27:32 -05:00
Chauncey Hui e6225e07d0 SWDEV-2 - Change OpenCL version number from 3413 to 3414 2022-01-19 03:00:08 -05:00
Alex Xie 15101e704b SWDEV-312368 - Segmentation fault in test_gl
When OCL failed to obtain function pointer from GL, we should not call it.

Change-Id: I50f69d270ce445386906a286e44c4e8c83722302
2022-01-14 17:35:31 -05:00
Chauncey Hui d04fdd9356 SWDEV-2 - Change OpenCL version number from 3412 to 3413 2022-01-13 03:00:05 -05:00
Satyanvesh Dittakavi e20dd61932 SWDEV-306939 - Fix vdi errors/warnings by CppCheck
Change-Id: I56d910f8363787f1050d5d7e8064ed553c5827fd
2022-01-12 00:22:16 -05:00
Chauncey Hui 8b5b2a2d79 SWDEV-2 - Change OpenCL version number from 3411 to 3412 2022-01-08 03:00:04 -05:00
German Andryeyev 5f98028a48 SWDEV-316665 - Add ATI_BITS_32 define to cmake build
Some logic in ROCclr relies on ATI_BITS_32 define

Change-Id: I2f0ef535e8d7717f9aa8646bdc80c09880bd2bee
2022-01-06 13:14:19 -05:00
Chauncey Hui f892de68a3 SWDEV-2 - Change OpenCL version number from 3410 to 3411 2022-01-05 03:00:05 -05:00
German Andryeyev e0a4e0df0e SWDEV-317061 - Fix 32-bit HSAIL complib load
Use __stdcall decl to match the library defines

Change-Id: Id4178443d5a9bb4e5401d80b3b5d15c7bbea0330
2022-01-04 16:48:18 -05:00
Chauncey Hui c04d650b67 SWDEV-2 - Change OpenCL version number from 3409 to 3410 2021-12-29 03:00:09 -05:00
haoyuan2 3d5d9e2691 SWDEV-316150 - align with OCL to use wavefront64 on Navi21/22/23/24
Change-Id: Ic47bc7cc6300c217c02c2bbbda9940a3b5c9597d
2021-12-28 12:39:18 -08:00
Chauncey Hui a6948a31f2 SWDEV-2 - Change OpenCL version number from 3408 to 3409 2021-12-19 03:00:03 -05:00
Saleel Kudchadker 1fbd75b825 SWDEV-313306 - Fix Co-operative groups dtests
Add a state indicator to retain ExternalSignals when needed.
Co-operative group launch uses external signals to indicate a dependency
to the next command.

Change-Id: I6d0daa006e2377c3bbf4aeca0fd5b63c7ac8fbbb
2021-12-17 12:41:37 -08:00
Chauncey Hui e5122e3bc7 SWDEV-2 - Change OpenCL version number from 3407 to 3408 2021-12-15 03:00:05 -05:00
Saleel Kudchadker 3239222516 SWDEV-313306 - Clear external signals
Crash was due to the fact that external signal structure was stale even
after destroyign the command. That is because we skipped wait due to a
missing check.
Detect external signals and dispatch a barrier in ReleaseGpuMemoryFence.
Also clear external_signals_ at ProfilingBegin.

Change-Id: I991387edcfe928b511bf5e780988ee131321ed5a
2021-12-13 23:03:33 -08:00
Chauncey Hui 1b5de6e562 SWDEV-2 - Change OpenCL version number from 3406 to 3407 2021-12-11 03:00:08 -05:00
German Andryeyev f613831471 SWDEV-300515 - Apply CU granularity on the CU count
Report granularity for possible app query

Change-Id: I98857c6f4cc7ae590927ea35ce57d181abe7860b
2021-12-10 10:47:28 -05:00
Chauncey Hui 7abd2827b5 SWDEV-2 - Change OpenCL version number from 3405 to 3406 2021-12-10 03:00:08 -05:00
haoyuan2 439af94dd9 SWDEV-290298 - add a flag to indicate the primary context active status
Change-Id: Ia31790706d3f855bc1eedf5ef874e471
2021-12-09 23:28:54 -05:00
Sarbojit Sarkar aedbad0109 SWDEV-314254 - Fix for hipMemcpy3D test crash
Change-Id: Iac70bfe0d351cfb5b56fefc9a6487d3f26f2b4ef
2021-12-09 11:46:52 -05:00
Chauncey Hui 84af5eed4e SWDEV-2 - Change OpenCL version number from 3404 to 3405 2021-12-09 03:00:06 -05:00
German Andryeyev 008133cf41 SWDEV-305016 - Improve MGPU scaling in Tensorflow
Add a threshold for ROCR/SDMA P2P transfers. ROCR copy path
requires extra barriers in compute for synchronization. That costs
extra performance with tiny transfers.
Reduce active wait time to 10us. Tensorflow uses extra thread
per GPU with constant hipEventQuery() calls. Longer active waits
in ROCr affect CPU performance.

Change-Id: I9020358438615fa2d4617f862f00a562f0a588e7
2021-12-08 11:59:37 -05:00
Chauncey Hui 587cda0c91 SWDEV-2 - Change OpenCL version number from 3403 to 3404 2021-12-08 03:00:05 -05:00
Julia Jiang 376ea1e293 SWDEV-308644 - reorganize extra blit kernel in PAL stack
Change-Id: I9d853e8d417ef75b522184d83646ec4b9fa8669b
2021-12-07 14:55:20 -05:00
Christophe Paquot 5243552768 SWDEV-307109 - Don't force Persistent for HIP
With SAM on, don't force Persistent for allocations
in HIP. This makes ROCCLR go down paths we don't want
for HIP.

Change-Id: If54cc16fa891d4cfdc761c6ab21ad707627e822a
2021-12-06 12:16:08 -05:00
Chauncey Hui 1a677effa8 SWDEV-2 - Change OpenCL version number from 3402 to 3403 2021-12-03 03:00:05 -05:00
Satyanvesh Dittakavi 9dabdcdc3e SWDEV-309286 - save the max system mem size in device info
Change-Id: I9955625aca6ceda059aef6354d909de636b610ba
2021-12-02 10:59:07 -05:00
Chauncey Hui 1df0ddefe4 SWDEV-2 - Change OpenCL version number from 3401 to 3402 2021-12-02 03:00:03 -05:00
Saleel Kudchadker 3f82b99f5d SWDEV-308843 - Increase MaxPinnedXferSize to 128
This allows experimenting with env var GPU_PINNED_XFER_SIZE which is
still at a default of 32MB

Change-Id: I85ade700ed58d498eba29d1737601dc74d4c26a4
2021-12-01 20:37:56 -05:00
anusha GodavarthySurya 102aa9d6d9 SWDEV-284895 - Adding kind metadata and launch init/fini marked kernels
Change-Id: If2b21c4b98567632c426943e0b69aca8d6f1ec2a
2021-12-01 08:17:44 -08:00
Chauncey Hui 3d56881a14 SWDEV-2 - Change OpenCL version number from 3400 to 3401 2021-11-30 03:00:03 -05:00
Sarbojit Sarkar 02dc6f9f9a SWDEV-310181 - Fix for AtoH Memcpy tests failure
Change-Id: Ibf8c8c01257f0516088d50d5c9f82040ed8fa067
2021-11-29 22:55:23 -05:00
kjayapra-amd d4ad981c0c SWDEV-312822 - Fix the globalWorkSize to number of sizeof(var) instead of bytes.
Change-Id: Ic6b2bbb2e8d4cb6aa8d906d4b93cd06a176160d8
2021-11-29 17:36:11 -05:00
German Andryeyev 102c19adf3 SWDEV-294669 - Avoid stall when the new signal was created
Stall in the host thread could occur earlier than the app expects.
Make sure rutnime can grow the signals to the queue size without
any stall. Also adding a new signal to the end of the pool could
break the dependency chain on signal reuse. The new logic will
insert the new signal after current to keep the chain intact.

Change-Id: I9c90b98515907db8b677528263c3e88cd9581a14
2021-11-29 10:08:06 -05:00
Chauncey Hui bfcb47ca72 SWDEV-2 - Change OpenCL version number from 3399 to 3400 2021-11-29 03:00:05 -05:00
kjayapra-amd 2e9bc8f793 SWDEV-312822 - Revert "SWDEV-310187 - Change flag to keep track of aligned sizes instead of expanded patterns."
This reverts commit 8307886644.

Change-Id: I022c2a8375f9929e9723cec66e1e0b960263fc39
2021-11-28 23:39:40 -05:00
Chauncey Hui 2c23a1091f SWDEV-2 - Change OpenCL version number from 3398 to 3399 2021-11-27 03:00:08 -05:00
German Andryeyev 6f2e7c3199 SWDEV-313126 - Use data() method for the base array address
Reference for the first element can trigger an assert with
_GLIBCXX_ASSERTIONS build

Change-Id: I59c63c052831307edfe5dcc6384798a43e9596dd
2021-11-26 09:51:57 -05:00
Chauncey Hui e4be77cbef SWDEV-2 - Change OpenCL version number from 3397 to 3398 2021-11-26 03:00:05 -05:00
Julia Jiang f5c9ad5b1d SWDEV-308644 - merge roc blit kernels
Change-Id: I378e511959fe17c03fa45066022e9670a4d181f0
2021-11-25 10:07:51 -05:00
Chauncey Hui 964f5bc502 SWDEV-2 - Change OpenCL version number from 3396 to 3397 2021-11-25 03:00:04 -05:00
Satyanvesh Dittakavi c56317b2e0 SWDEV-292714 - Add unique id for each allocation to support HIP_POINTER_ATTRIBUTE_BUFFER_ID
Change-Id: Ibb3fcb2d0bbbef03525fc884d5b3e9b5f6c11423
2021-11-24 06:10:50 -05:00
Chauncey Hui 0ab4ad1310 SWDEV-2 - Change OpenCL version number from 3395 to 3396 2021-11-24 03:00:04 -05:00
Saleel Kudchadker 1e6cc5a2fa SWDEV-299893 - Fix PAL/lnx build
Change-Id: If4408a81d2dc8686c8e805a7697a7d9b2589c864
2021-11-23 08:09:33 -08:00
Chauncey Hui f526e9ea89 SWDEV-2 - Change OpenCL version number from 3394 to 3395 2021-11-23 03:00:03 -05:00
Sarbojit Sarkar 2afeacc858 SWDEV-310181 - Fix for mGPU dtest failure
Change-Id: Id0898bd45e23f2d637bef25a3e69f26d9dc40785
2021-11-22 01:01:47 -05:00
Chauncey Hui 7fdeb37239 SWDEV-2 - Change OpenCL version number from 3393 to 3394 2021-11-20 03:00:06 -05:00