İşleme Grafiği

4772 İşleme

Yazar SHA1 Mesaj Tarih
jiabaxie 8278e86918 SWDEV-366886 - force svm alloc for rocm
Change-Id: If85a6aaeb5861ff1550a0bc3f0a25073ff8145e7
2023-01-18 16:26:34 -05:00
German c8927cd84e SWDEV-377991 - Remove Liquidflash extension
Initial check-in to untie dependencies with HIP and OCL repos

Change-Id: I363b63954c3f118f40a6ed893545d6a4ac44144c
2023-01-18 13:16:20 -05:00
Alex Xie 60d413a5b9 SWDEV-374668 - Observed one subtest failure in ocltst32
Don't force sysmem allocation

Change-Id: I792f0da231c6e8058f1bfb81ba1f88ecb59b7c8e
2023-01-18 12:02:27 -05:00
Chauncey Hui 241bfcc033 SWDEV-2 - Change OpenCL version number from 3556 to 3557 2023-01-18 03:00:09 -05:00
Anusha GodavarthySurya 2b50636809 SWDEV-373173 - Correct max VGPRs and VGPR Granularity in Occupancy calculation for different asics
Change-Id: I6f595428ed31733cdc04e65e7c3e383f454ab72b
2023-01-18 00:22:02 -05:00
Tao Sang cb30ce4e06 SWDEV-377423 - Fix kernel attribute missing
Fix missing issue of kernel attributes including vec_type_hint,
work_group_size_hint and reqd_work_group_size.
Make WorkGroupInfo's meta attributes initialized before other parameters
are initialized.
This way workGroupInfo_'s compileSizeHint_, compileSize_ and
compileVecTypeHint_ will be valid when they are used to create kernel
signature in Kernel::createSignature().
Fix a typo of ".workgorup_size_hint".

Change-Id: I4a1ede2210a25596ad7a935cd4debb896e0147f8
2023-01-17 14:50:35 -05:00
Jiabao Xie 6c44353107 SWDEV-366886 - force alloc from svm
Change-Id: I6a89aa46d944832e59491bb96ba3669e5031b636
2023-01-17 12:12:27 -05:00
Anusha GodavarthySurya 0bd0c29b7d SWDEV-373173 - Add kernel metadata for WGP mode
Change-Id: Ic270e90c46938b975513ff2bc19b7bb6b9485f9d
2023-01-17 10:51:22 -05:00
Chauncey Hui 4595d88b04 SWDEV-2 - Change OpenCL version number from 3555 to 3556 2023-01-17 03:00:06 -05:00
German Andryeyev 5d83345c20 SWDEV-355281 - Add offset calculation
Calculate memory location offset for requested memory

Change-Id: I7919523df1790d9eaad5827ee5bd98e4263a7554
2023-01-16 14:19:40 -05:00
Jaydeep Patel d97b4e8c74 SWDEV-374360 - Handle free for external memory.
Change-Id: I4a1ede2210a255960d7a935cd4debb806e0147f6
2023-01-16 13:06:36 -05:00
Saleel Kudchadker 033d4c0463 SWDEV-345213 - Fix staged line-by-line copy path
- Address an old bug in offset calculation that was causing out of bound
access.
- Improve logging

Change-Id: Iebdf34dddaa5e987cc72184a2152918adc6a96e0
2023-01-16 11:04:30 -05:00
Juan Manuel MARTINEZ CAAMAÑO 8ab3fd58cf SWDEV-287018 - Remove requiredDump argument from Program::linkLLVMBitcode
Weirdly, the `requiredDump` argument to linkLLVMBitcode was used to enable/disable
the keeping temporary bytecode files (those generated by -save-temps=all) after linking.

This patch removes this argument as there is no obvious benefit from keepeing it
(the user would only rely on -save-temps=all to control this).

Change-Id: I0c00486f95eb1d4e296b5247c488407c47f0b2d9
2023-01-16 03:34:21 -05:00
Chauncey Hui 097e8f270a SWDEV-2 - Change OpenCL version number from 3554 to 3555 2023-01-16 03:00:05 -05:00
Ajay ecea27eb2d SWDEV-372757 - thread check workaround for windows hang
Change-Id: Ie9f87b88dd0f3078ad1919edc336f297f6b40373
2023-01-13 04:05:35 -05:00
Chauncey Hui 8a973ee882 SWDEV-2 - Change OpenCL version number from 3553 to 3554 2023-01-13 03:00:09 -05:00
Anusha GodavarthySurya 710749c291 SWDEV-374778 - Select SDMA path when HSA_XNACK is enabled
Change-Id: I052675186238eb24b910a0a558d6b8ede36d1413
2023-01-12 23:34:21 -05:00
Ioannis Assiouras 4d49204fb9 SWDEV-374401 - Surface the ammount of used constant memory by kernel
Change-Id: I6de0e46482f27b9068582fdaee0e562f9f71f9f0
2023-01-12 05:22:37 -05:00
Saleel Kudchadker 77ceb7e9ad SWDEV-371123 - Fix barrier value packet header
- Use correct header for vendor packet
- Pass one dependent signal when submitting a marker if there is one

Change-Id: I4efc70dd5204b559de26f899d0637f50421c8834
2023-01-12 04:06:03 -05:00
Chauncey Hui 6b4577c8d7 SWDEV-2 - Change OpenCL version number from 3552 to 3553 2023-01-12 03:00:09 -05:00
jatang 6647882773 SWDEV-287018 - Only generate _optimized.bc when both requiredDump and DUMP_BC_LINKED are true
The current code generates a _optimized.bc regardlessly, so put back the original logic.

Change-Id: I3f84d10934b3e983f5f828af8d0943449a6e1d94
2023-01-11 17:51:35 -05:00
Chauncey Hui ed10cb76c1 SWDEV-2 - Change OpenCL version number from 3551 to 3552 2023-01-10 03:00:08 -05:00
Anusha GodavarthySurya 274f2de391 SWDEV-364576 - initialize device malloc heap state using blit kernel
Change-Id: I5d0172aff7d2c04b322a4d828b8a2b438158b80f
2023-01-07 06:53:53 +00:00
Chauncey Hui 69ffa7bcb8 SWDEV-2 - Change OpenCL version number from 3550 to 3551 2023-01-06 03:00:08 -05:00
Saleel Kudchadker 6651aab28d SWDEV-371123 - Fix typo for log print
Change-Id: I18297a032d6fc9a030863eac7c525e86e9fdcd29
2023-01-05 17:46:22 -05:00
kjayapra-amd 79f0b8b794 SWDEV-241902 - Disable COMGR unbundler and use legacy Runtime unbundler.
Change-Id: I04128a40f67f3e5ce0db935a0a733b7044d75c34
2023-01-03 09:31:11 -08:00
Chauncey Hui 2eef0f8dd3 SWDEV-2 - Change OpenCL version number from 3549 to 3550 2022-12-29 03:00:08 -05:00
jatang 05a8eaaf0f SWDEV-372791 - Default UniformWorkGroupSize to true for CL1.x
Change-Id: If03396b5a7a3a69e05fdb4c43791a0d84d9f4127
2022-12-28 12:59:08 -05:00
Chauncey Hui 6c1a250331 SWDEV-2 - Change OpenCL version number from 3548 to 3549 2022-12-28 03:00:09 -05:00
Jaydeep Patel 070ae4e6d4 SWDEV-374370 - Propogate element size to blit kernel.
Change-Id: I06d1ae6feebd238e9a63c617eb4c4dcf485d9ee0
2022-12-26 09:33:50 +00:00
Chauncey Hui b8faaf8458 SWDEV-2 - Change OpenCL version number from 3547 to 3548 2022-12-21 03:00:11 -05:00
Jaydeep Patel 9076d9a518 SWDEV-366087 - Pass pitch and slice pitch to blit kernel from rect struct.
Change-Id: I1ffe54929db59a40e2a1ae19c125f8d8e81b07ec
2022-12-20 16:43:49 +00:00
Chauncey Hui d37bf7d22b SWDEV-2 - Change OpenCL version number from 3546 to 3547 2022-12-16 03:00:08 -05:00
Saleel Kudchadker e0384f9f6b SWDEV-373334 - Use copyMetadata for blit decisions
- Check isAsync flag for small host copies on large bar as it synchronizes
- Use CopyEngine Preference hint if HMM is enabled.

Change-Id: I1ffc4b2604ed03cf5979cdc454178648c5ae5cba
2022-12-15 17:09:02 -05:00
Chauncey Hui 7e3e528441 SWDEV-2 - Change OpenCL version number from 3545 to 3546 2022-12-15 03:00:08 -05:00
Jacob Lambert 50d77d17fd SWDEV-356171 - Adding new Comgr mangled name fetching APIs
amd::Comgr::populate_mangled_names()
  amd::Comgr::get_mangled_name()

Change-Id: Ie6ebf3aae76632f7e7cfef3c90642b2e3cd10b48
2022-12-14 12:40:18 -05:00
jatang a4fadb191a SWDEV-1 - Fix another build warning.
Change-Id: Ifed33732752e9d9ec74a244a0104bf571a9fc6c5
2022-12-14 10:14:49 -05:00
Chauncey Hui 5aee3d3018 SWDEV-2 - Change OpenCL version number from 3544 to 3545 2022-12-14 03:00:06 -05:00
jatang e8cd1e93e8 SWDEV-1 - Not to truncate size_t in print when it's 64bit
Change-Id: Iab8b7eb128c767b6808b19cbeac5169b7a97f1ad
2022-12-13 09:13:33 -05:00
Chauncey Hui edb515eabf SWDEV-2 - Change OpenCL version number from 3543 to 3544 2022-12-13 03:00:08 -05:00
kjayapra-amd e56a611b92 SWDEV-371904 - Adding pseudo fine grain flag to hsa memory allocation for device fine grained memory.
Change-Id: I8cada90f0e3880dfbc5bf5a3fac4554e7a0cb08e
2022-12-11 08:15:17 -05:00
Chauncey Hui dc6bbe303b SWDEV-2 - Change OpenCL version number from 3542 to 3543 2022-12-10 03:00:08 -05:00
Todd tiantuo Li 9168415ca2 SWDEV-354868 - Queue::Create() for RT queue should fail when number of reserved CU's for RT queue is zero (most likely due to being aligned down by dedicatedCuGranularity).
Change-Id: I234e7ff83cb312bf44f5ad4b1a897c079f5106a9
2022-12-09 16:45:10 -05:00
Chauncey Hui 9c8cfc7f98 SWDEV-2 - Change OpenCL version number from 3541 to 3542 2022-12-09 03:00:07 -05:00
Ioannis Assiouras 72b45e2a1f SWDEV-369581 - Convey copy API metadata to ROCclr
Change-Id: I569462d6d268700d419510255e201bf7d80d6714
2022-12-09 00:27:15 -05:00
pghafari 402504f548 SWDEV-366279 - updating p2p error msg
Change-Id: I294e7770efd11f511dd5817eb28bd3c97d9d4926
2022-12-09 00:05:04 -05:00
Jaydeep Patel cd7034c9bb SWDEV-370537 - PAL does 64kb alignment so no need to do it in runtime. Reverting alignment to 4kb.
Change-Id: If3eaac65bf63bb9b8b73a3cad1bb34d357f1adeb
2022-12-08 14:54:46 -05:00
German 8cf53e7dff SWDEV-370901 - Force sysmem allocation
Adobe AE allocates/destroys system memory per each frame
in some effects, which is a very expensive operation.
Disable pinning and force sysmem allocation in VidMM.
That allows to trigger resource cache in runtime and
avoid paging operations, caused by pinning.
The actual benchmark doesn't have significant improvement
from the change, but GPU view traces are much cleaner.

Change-Id: I92cdd87d37c64611788f05250f71d6fe489b4d7e
2022-12-08 13:06:51 -05:00
kjayapra-amd 7319dc36c1 SWDEV-363865 - Use COMGR path to unbundle code object in HIPRTC.
Change-Id: Ie6a48ab2792bb3f7ae1e62d716ecc35b8e24fe16
2022-12-08 08:34:33 -05:00
Chauncey Hui cbc66bf89d SWDEV-2 - Change OpenCL version number from 3540 to 3541 2022-12-08 03:00:09 -05:00