Commit graph

4760 Commits

Autor SHA1 Nachricht Datum
Chauncey Hui 257336ce69 SWDEV-2 - Change OpenCL version number from 3555 to 3556
[ROCm/clr commit: 4595d88b04]
2023-01-17 03:00:06 -05:00
German Andryeyev 8ad3120e25 SWDEV-355281 - Add offset calculation
Calculate memory location offset for requested memory

Change-Id: I7919523df1790d9eaad5827ee5bd98e4263a7554


[ROCm/clr commit: 5d83345c20]
2023-01-16 14:19:40 -05:00
Jaydeep Patel 19c80916fa SWDEV-374360 - Handle free for external memory.
Change-Id: I4a1ede2210a255960d7a935cd4debb806e0147f6


[ROCm/clr commit: d97b4e8c74]
2023-01-16 13:06:36 -05:00
Saleel Kudchadker 772def2cd1 SWDEV-345213 - Fix staged line-by-line copy path
- Address an old bug in offset calculation that was causing out of bound
access.
- Improve logging

Change-Id: Iebdf34dddaa5e987cc72184a2152918adc6a96e0


[ROCm/clr commit: 033d4c0463]
2023-01-16 11:04:30 -05:00
Juan Manuel MARTINEZ CAAMAÑO 16a5bfd654 SWDEV-287018 - Remove requiredDump argument from Program::linkLLVMBitcode
Weirdly, the `requiredDump` argument to linkLLVMBitcode was used to enable/disable
the keeping temporary bytecode files (those generated by -save-temps=all) after linking.

This patch removes this argument as there is no obvious benefit from keepeing it
(the user would only rely on -save-temps=all to control this).

Change-Id: I0c00486f95eb1d4e296b5247c488407c47f0b2d9


[ROCm/clr commit: 8ab3fd58cf]
2023-01-16 03:34:21 -05:00
Chauncey Hui 9e755a3aea SWDEV-2 - Change OpenCL version number from 3554 to 3555
[ROCm/clr commit: 097e8f270a]
2023-01-16 03:00:05 -05:00
Ajay 3d12929eb8 SWDEV-372757 - thread check workaround for windows hang
Change-Id: Ie9f87b88dd0f3078ad1919edc336f297f6b40373


[ROCm/clr commit: ecea27eb2d]
2023-01-13 04:05:35 -05:00
Chauncey Hui 5fd3eb836c SWDEV-2 - Change OpenCL version number from 3553 to 3554
[ROCm/clr commit: 8a973ee882]
2023-01-13 03:00:09 -05:00
Anusha GodavarthySurya e8c8fd8c3f SWDEV-374778 - Select SDMA path when HSA_XNACK is enabled
Change-Id: I052675186238eb24b910a0a558d6b8ede36d1413


[ROCm/clr commit: 710749c291]
2023-01-12 23:34:21 -05:00
Ioannis Assiouras ab81da6544 SWDEV-374401 - Surface the ammount of used constant memory by kernel
Change-Id: I6de0e46482f27b9068582fdaee0e562f9f71f9f0


[ROCm/clr commit: 4d49204fb9]
2023-01-12 05:22:37 -05:00
Saleel Kudchadker f903a6c2c9 SWDEV-371123 - Fix barrier value packet header
- Use correct header for vendor packet
- Pass one dependent signal when submitting a marker if there is one

Change-Id: I4efc70dd5204b559de26f899d0637f50421c8834


[ROCm/clr commit: 77ceb7e9ad]
2023-01-12 04:06:03 -05:00
Chauncey Hui 8a163fef1b SWDEV-2 - Change OpenCL version number from 3552 to 3553
[ROCm/clr commit: 6b4577c8d7]
2023-01-12 03:00:09 -05:00
jatang 1f5f1c8708 SWDEV-287018 - Only generate _optimized.bc when both requiredDump and DUMP_BC_LINKED are true
The current code generates a _optimized.bc regardlessly, so put back the original logic.

Change-Id: I3f84d10934b3e983f5f828af8d0943449a6e1d94


[ROCm/clr commit: 6647882773]
2023-01-11 17:51:35 -05:00
Chauncey Hui 94a74a794b SWDEV-2 - Change OpenCL version number from 3551 to 3552
[ROCm/clr commit: ed10cb76c1]
2023-01-10 03:00:08 -05:00
Anusha GodavarthySurya 379781ea65 SWDEV-364576 - initialize device malloc heap state using blit kernel
Change-Id: I5d0172aff7d2c04b322a4d828b8a2b438158b80f


[ROCm/clr commit: 274f2de391]
2023-01-07 06:53:53 +00:00
Chauncey Hui 9e9e09366b SWDEV-2 - Change OpenCL version number from 3550 to 3551
[ROCm/clr commit: 69ffa7bcb8]
2023-01-06 03:00:08 -05:00
Saleel Kudchadker 4a92208bb7 SWDEV-371123 - Fix typo for log print
Change-Id: I18297a032d6fc9a030863eac7c525e86e9fdcd29


[ROCm/clr commit: 6651aab28d]
2023-01-05 17:46:22 -05:00
kjayapra-amd 8b6dc59393 SWDEV-241902 - Disable COMGR unbundler and use legacy Runtime unbundler.
Change-Id: I04128a40f67f3e5ce0db935a0a733b7044d75c34


[ROCm/clr commit: 79f0b8b794]
2023-01-03 09:31:11 -08:00
Chauncey Hui 80e1f21379 SWDEV-2 - Change OpenCL version number from 3549 to 3550
[ROCm/clr commit: 2eef0f8dd3]
2022-12-29 03:00:08 -05:00
jatang fca7b357b8 SWDEV-372791 - Default UniformWorkGroupSize to true for CL1.x
Change-Id: If03396b5a7a3a69e05fdb4c43791a0d84d9f4127


[ROCm/clr commit: 05a8eaaf0f]
2022-12-28 12:59:08 -05:00
Chauncey Hui 59f6fb9aab SWDEV-2 - Change OpenCL version number from 3548 to 3549
[ROCm/clr commit: 6c1a250331]
2022-12-28 03:00:09 -05:00
Jaydeep Patel f1eae835e3 SWDEV-374370 - Propogate element size to blit kernel.
Change-Id: I06d1ae6feebd238e9a63c617eb4c4dcf485d9ee0


[ROCm/clr commit: 070ae4e6d4]
2022-12-26 09:33:50 +00:00
Chauncey Hui 2dbee131bd SWDEV-2 - Change OpenCL version number from 3547 to 3548
[ROCm/clr commit: b8faaf8458]
2022-12-21 03:00:11 -05:00
Jaydeep Patel 7df45ffc38 SWDEV-366087 - Pass pitch and slice pitch to blit kernel from rect struct.
Change-Id: I1ffe54929db59a40e2a1ae19c125f8d8e81b07ec


[ROCm/clr commit: 9076d9a518]
2022-12-20 16:43:49 +00:00
Chauncey Hui 5d8905d50c SWDEV-2 - Change OpenCL version number from 3546 to 3547
[ROCm/clr commit: d37bf7d22b]
2022-12-16 03:00:08 -05:00
Saleel Kudchadker ed4c04b150 SWDEV-373334 - Use copyMetadata for blit decisions
- Check isAsync flag for small host copies on large bar as it synchronizes
- Use CopyEngine Preference hint if HMM is enabled.

Change-Id: I1ffc4b2604ed03cf5979cdc454178648c5ae5cba


[ROCm/clr commit: e0384f9f6b]
2022-12-15 17:09:02 -05:00
Chauncey Hui 8d648e4efe SWDEV-2 - Change OpenCL version number from 3545 to 3546
[ROCm/clr commit: 7e3e528441]
2022-12-15 03:00:08 -05:00
Jacob Lambert 87e6e1357b SWDEV-356171 - Adding new Comgr mangled name fetching APIs
amd::Comgr::populate_mangled_names()
  amd::Comgr::get_mangled_name()

Change-Id: Ie6ebf3aae76632f7e7cfef3c90642b2e3cd10b48


[ROCm/clr commit: 50d77d17fd]
2022-12-14 12:40:18 -05:00
jatang b9cb9b64ed SWDEV-1 - Fix another build warning.
Change-Id: Ifed33732752e9d9ec74a244a0104bf571a9fc6c5


[ROCm/clr commit: a4fadb191a]
2022-12-14 10:14:49 -05:00
Chauncey Hui 4c2154acfc SWDEV-2 - Change OpenCL version number from 3544 to 3545
[ROCm/clr commit: 5aee3d3018]
2022-12-14 03:00:06 -05:00
jatang 3707f5d671 SWDEV-1 - Not to truncate size_t in print when it's 64bit
Change-Id: Iab8b7eb128c767b6808b19cbeac5169b7a97f1ad


[ROCm/clr commit: e8cd1e93e8]
2022-12-13 09:13:33 -05:00
Chauncey Hui db15e282f1 SWDEV-2 - Change OpenCL version number from 3543 to 3544
[ROCm/clr commit: edb515eabf]
2022-12-13 03:00:08 -05:00
kjayapra-amd e295beb8ba SWDEV-371904 - Adding pseudo fine grain flag to hsa memory allocation for device fine grained memory.
Change-Id: I8cada90f0e3880dfbc5bf5a3fac4554e7a0cb08e


[ROCm/clr commit: e56a611b92]
2022-12-11 08:15:17 -05:00
Chauncey Hui 9cd8e38967 SWDEV-2 - Change OpenCL version number from 3542 to 3543
[ROCm/clr commit: dc6bbe303b]
2022-12-10 03:00:08 -05:00
Todd tiantuo Li bf1d819ac7 SWDEV-354868 - Queue::Create() for RT queue should fail when number of reserved CU's for RT queue is zero (most likely due to being aligned down by dedicatedCuGranularity).
Change-Id: I234e7ff83cb312bf44f5ad4b1a897c079f5106a9


[ROCm/clr commit: 9168415ca2]
2022-12-09 16:45:10 -05:00
Chauncey Hui 2f501eff81 SWDEV-2 - Change OpenCL version number from 3541 to 3542
[ROCm/clr commit: 9c8cfc7f98]
2022-12-09 03:00:07 -05:00
Ioannis Assiouras 733c8d1d1c SWDEV-369581 - Convey copy API metadata to ROCclr
Change-Id: I569462d6d268700d419510255e201bf7d80d6714


[ROCm/clr commit: 72b45e2a1f]
2022-12-09 00:27:15 -05:00
pghafari 4e6a699eae SWDEV-366279 - updating p2p error msg
Change-Id: I294e7770efd11f511dd5817eb28bd3c97d9d4926


[ROCm/clr commit: 402504f548]
2022-12-09 00:05:04 -05:00
Jaydeep Patel be5d6fb005 SWDEV-370537 - PAL does 64kb alignment so no need to do it in runtime. Reverting alignment to 4kb.
Change-Id: If3eaac65bf63bb9b8b73a3cad1bb34d357f1adeb


[ROCm/clr commit: cd7034c9bb]
2022-12-08 14:54:46 -05:00
German ebf7fad258 SWDEV-370901 - Force sysmem allocation
Adobe AE allocates/destroys system memory per each frame
in some effects, which is a very expensive operation.
Disable pinning and force sysmem allocation in VidMM.
That allows to trigger resource cache in runtime and
avoid paging operations, caused by pinning.
The actual benchmark doesn't have significant improvement
from the change, but GPU view traces are much cleaner.

Change-Id: I92cdd87d37c64611788f05250f71d6fe489b4d7e


[ROCm/clr commit: 8cf53e7dff]
2022-12-08 13:06:51 -05:00
kjayapra-amd e47e613948 SWDEV-363865 - Use COMGR path to unbundle code object in HIPRTC.
Change-Id: Ie6a48ab2792bb3f7ae1e62d716ecc35b8e24fe16


[ROCm/clr commit: 7319dc36c1]
2022-12-08 08:34:33 -05:00
Chauncey Hui 22ca7dc91e SWDEV-2 - Change OpenCL version number from 3540 to 3541
[ROCm/clr commit: cbc66bf89d]
2022-12-08 03:00:09 -05:00
Saleel Kudchadker 7ba49616e9 SWDEV-371123 - Use barrier value packet for event records
Change-Id: I5e5e5e89e0d96a2430b4682d168b76848fa5b94e


[ROCm/clr commit: 4f64d89026]
2022-12-07 17:57:36 -05:00
Chauncey Hui 9323ba1345 SWDEV-2 - Change OpenCL version number from 3539 to 3540
[ROCm/clr commit: c8e66f4cf8]
2022-12-07 03:00:08 -05:00
Tao Sang ea9808ac40 SWDEV-370659 - Add lock for HSAIL only
Add lock for HSAIL only in order to fix test failures
in math brute force and integer_ops tests.

Change-Id: I5f14cdcaa4ee9867fdae63fff197a0f21ee5f1d4


[ROCm/clr commit: f29d3bc3ac]
2022-12-06 15:50:04 -05:00
Chauncey Hui 1e49759297 SWDEV-2 - Change OpenCL version number from 3538 to 3539
[ROCm/clr commit: a4576e0a60]
2022-12-06 03:00:08 -05:00
Ioannis Assiouras 57b1f696ec SWDEV-369548 - Added getProcessId function in Os:: and change log functions to use that.
Change-Id: Ib64ee4d65a5212af7a4a95f80173b7ff8382340f


[ROCm/clr commit: 7631ba8b4d]
2022-12-05 23:59:25 -05:00
Tao Sang 0efc782e69 SWDEV-347733 - Remove luxmark workaround for gfx11
Make gfx11 always use LC by default.
Fix switch order of Navi3x.
Change-Id: I5774563edf4d1aed29a4e92723630adbfb75c48c


[ROCm/clr commit: 8c582a02ef]
2022-12-05 18:38:08 -05:00
Sourabh Betigeri 7aa958a8f7 SWDEV-305894 - Cooperative groups grid and multi grid sync support for gfx940+
Change-Id: I35d72f1cb50c3a96eee56a612b72d641852b145f


[ROCm/clr commit: 5d7f3f9f3c]
2022-12-05 16:30:30 -05:00
Chauncey Hui 40956fe9a7 SWDEV-2 - Change OpenCL version number from 3537 to 3538
[ROCm/clr commit: 3296dd8636]
2022-12-03 03:00:06 -05:00