Граф коммитов

4738 Коммитов

Автор SHA1 Сообщение Дата
Chauncey Hui 2dbee131bd SWDEV-2 - Change OpenCL version number from 3547 to 3548
[ROCm/clr commit: b8faaf8458]
2022-12-21 03:00:11 -05:00
Jaydeep Patel 7df45ffc38 SWDEV-366087 - Pass pitch and slice pitch to blit kernel from rect struct.
Change-Id: I1ffe54929db59a40e2a1ae19c125f8d8e81b07ec


[ROCm/clr commit: 9076d9a518]
2022-12-20 16:43:49 +00:00
Chauncey Hui 5d8905d50c SWDEV-2 - Change OpenCL version number from 3546 to 3547
[ROCm/clr commit: d37bf7d22b]
2022-12-16 03:00:08 -05:00
Saleel Kudchadker ed4c04b150 SWDEV-373334 - Use copyMetadata for blit decisions
- Check isAsync flag for small host copies on large bar as it synchronizes
- Use CopyEngine Preference hint if HMM is enabled.

Change-Id: I1ffc4b2604ed03cf5979cdc454178648c5ae5cba


[ROCm/clr commit: e0384f9f6b]
2022-12-15 17:09:02 -05:00
Chauncey Hui 8d648e4efe SWDEV-2 - Change OpenCL version number from 3545 to 3546
[ROCm/clr commit: 7e3e528441]
2022-12-15 03:00:08 -05:00
Jacob Lambert 87e6e1357b SWDEV-356171 - Adding new Comgr mangled name fetching APIs
amd::Comgr::populate_mangled_names()
  amd::Comgr::get_mangled_name()

Change-Id: Ie6ebf3aae76632f7e7cfef3c90642b2e3cd10b48


[ROCm/clr commit: 50d77d17fd]
2022-12-14 12:40:18 -05:00
jatang b9cb9b64ed SWDEV-1 - Fix another build warning.
Change-Id: Ifed33732752e9d9ec74a244a0104bf571a9fc6c5


[ROCm/clr commit: a4fadb191a]
2022-12-14 10:14:49 -05:00
Chauncey Hui 4c2154acfc SWDEV-2 - Change OpenCL version number from 3544 to 3545
[ROCm/clr commit: 5aee3d3018]
2022-12-14 03:00:06 -05:00
jatang 3707f5d671 SWDEV-1 - Not to truncate size_t in print when it's 64bit
Change-Id: Iab8b7eb128c767b6808b19cbeac5169b7a97f1ad


[ROCm/clr commit: e8cd1e93e8]
2022-12-13 09:13:33 -05:00
Chauncey Hui db15e282f1 SWDEV-2 - Change OpenCL version number from 3543 to 3544
[ROCm/clr commit: edb515eabf]
2022-12-13 03:00:08 -05:00
kjayapra-amd e295beb8ba SWDEV-371904 - Adding pseudo fine grain flag to hsa memory allocation for device fine grained memory.
Change-Id: I8cada90f0e3880dfbc5bf5a3fac4554e7a0cb08e


[ROCm/clr commit: e56a611b92]
2022-12-11 08:15:17 -05:00
Chauncey Hui 9cd8e38967 SWDEV-2 - Change OpenCL version number from 3542 to 3543
[ROCm/clr commit: dc6bbe303b]
2022-12-10 03:00:08 -05:00
Todd tiantuo Li bf1d819ac7 SWDEV-354868 - Queue::Create() for RT queue should fail when number of reserved CU's for RT queue is zero (most likely due to being aligned down by dedicatedCuGranularity).
Change-Id: I234e7ff83cb312bf44f5ad4b1a897c079f5106a9


[ROCm/clr commit: 9168415ca2]
2022-12-09 16:45:10 -05:00
Chauncey Hui 2f501eff81 SWDEV-2 - Change OpenCL version number from 3541 to 3542
[ROCm/clr commit: 9c8cfc7f98]
2022-12-09 03:00:07 -05:00
Ioannis Assiouras 733c8d1d1c SWDEV-369581 - Convey copy API metadata to ROCclr
Change-Id: I569462d6d268700d419510255e201bf7d80d6714


[ROCm/clr commit: 72b45e2a1f]
2022-12-09 00:27:15 -05:00
pghafari 4e6a699eae SWDEV-366279 - updating p2p error msg
Change-Id: I294e7770efd11f511dd5817eb28bd3c97d9d4926


[ROCm/clr commit: 402504f548]
2022-12-09 00:05:04 -05:00
Jaydeep Patel be5d6fb005 SWDEV-370537 - PAL does 64kb alignment so no need to do it in runtime. Reverting alignment to 4kb.
Change-Id: If3eaac65bf63bb9b8b73a3cad1bb34d357f1adeb


[ROCm/clr commit: cd7034c9bb]
2022-12-08 14:54:46 -05:00
German ebf7fad258 SWDEV-370901 - Force sysmem allocation
Adobe AE allocates/destroys system memory per each frame
in some effects, which is a very expensive operation.
Disable pinning and force sysmem allocation in VidMM.
That allows to trigger resource cache in runtime and
avoid paging operations, caused by pinning.
The actual benchmark doesn't have significant improvement
from the change, but GPU view traces are much cleaner.

Change-Id: I92cdd87d37c64611788f05250f71d6fe489b4d7e


[ROCm/clr commit: 8cf53e7dff]
2022-12-08 13:06:51 -05:00
kjayapra-amd e47e613948 SWDEV-363865 - Use COMGR path to unbundle code object in HIPRTC.
Change-Id: Ie6a48ab2792bb3f7ae1e62d716ecc35b8e24fe16


[ROCm/clr commit: 7319dc36c1]
2022-12-08 08:34:33 -05:00
Chauncey Hui 22ca7dc91e SWDEV-2 - Change OpenCL version number from 3540 to 3541
[ROCm/clr commit: cbc66bf89d]
2022-12-08 03:00:09 -05:00
Saleel Kudchadker 7ba49616e9 SWDEV-371123 - Use barrier value packet for event records
Change-Id: I5e5e5e89e0d96a2430b4682d168b76848fa5b94e


[ROCm/clr commit: 4f64d89026]
2022-12-07 17:57:36 -05:00
Chauncey Hui 9323ba1345 SWDEV-2 - Change OpenCL version number from 3539 to 3540
[ROCm/clr commit: c8e66f4cf8]
2022-12-07 03:00:08 -05:00
Tao Sang ea9808ac40 SWDEV-370659 - Add lock for HSAIL only
Add lock for HSAIL only in order to fix test failures
in math brute force and integer_ops tests.

Change-Id: I5f14cdcaa4ee9867fdae63fff197a0f21ee5f1d4


[ROCm/clr commit: f29d3bc3ac]
2022-12-06 15:50:04 -05:00
Chauncey Hui 1e49759297 SWDEV-2 - Change OpenCL version number from 3538 to 3539
[ROCm/clr commit: a4576e0a60]
2022-12-06 03:00:08 -05:00
Ioannis Assiouras 57b1f696ec SWDEV-369548 - Added getProcessId function in Os:: and change log functions to use that.
Change-Id: Ib64ee4d65a5212af7a4a95f80173b7ff8382340f


[ROCm/clr commit: 7631ba8b4d]
2022-12-05 23:59:25 -05:00
Tao Sang 0efc782e69 SWDEV-347733 - Remove luxmark workaround for gfx11
Make gfx11 always use LC by default.
Fix switch order of Navi3x.
Change-Id: I5774563edf4d1aed29a4e92723630adbfb75c48c


[ROCm/clr commit: 8c582a02ef]
2022-12-05 18:38:08 -05:00
Sourabh Betigeri 7aa958a8f7 SWDEV-305894 - Cooperative groups grid and multi grid sync support for gfx940+
Change-Id: I35d72f1cb50c3a96eee56a612b72d641852b145f


[ROCm/clr commit: 5d7f3f9f3c]
2022-12-05 16:30:30 -05:00
Chauncey Hui 40956fe9a7 SWDEV-2 - Change OpenCL version number from 3537 to 3538
[ROCm/clr commit: 3296dd8636]
2022-12-03 03:00:06 -05:00
Saleel Kudchadker f14b550e87 SWDEV-371123 - Refactor dispatchBarrierValuePacket
Change-Id: I846bd3f60dd8db125e2ca5475e9d25fdd567922b


[ROCm/clr commit: 10ecf8f159]
2022-12-02 12:22:31 -05:00
Chauncey Hui 7eaabcd8bd SWDEV-2 - Change OpenCL version number from 3536 to 3537
[ROCm/clr commit: fb42848728]
2022-12-02 03:00:08 -05:00
Jason Tang 5cc5cf4f11 SWDEV-1 - Fix a build error.
Change-Id: I5f14cdca04ee9867f0ae63fff197a0f21ee5f1d2


[ROCm/clr commit: 56fe793134]
2022-12-01 10:31:34 -05:00
Chauncey Hui 3b77cda4aa SWDEV-2 - Change OpenCL version number from 3535 to 3536
[ROCm/clr commit: 5b19eb7210]
2022-12-01 03:00:07 -05:00
Jason Tang 310f43e9e3 SWDEV-1 - Fix a build warning.
Change-Id: I085fcc68ef3aaebf85515ea45fca99d7ec845721


[ROCm/clr commit: c825d06454]
2022-11-30 11:07:15 -05:00
Chauncey Hui e4e12ab1c4 SWDEV-2 - Change OpenCL version number from 3534 to 3535
[ROCm/clr commit: e12c3bd819]
2022-11-30 03:00:07 -05:00
Ioannis Assiouras 941dc51bfd SWDEV-369547 - Ensure that ipc_mem_detach is not called for non-ipcShared MemObj.
Change-Id: Ia73b60b9f08d593301879e0f72c395edbc215112


[ROCm/clr commit: b445717f72]
2022-11-30 02:16:13 -05:00
Chauncey Hui ac1a33714d SWDEV-2 - Change OpenCL version number from 3533 to 3534
[ROCm/clr commit: e74835795a]
2022-11-29 03:00:07 -05:00
kjayapra-amd 1fea133877 SWDEV-358915 - Wait for 10 micro secs, if the next engine is SDMA.
Change-Id: Ib3ed91510e62090f47d4f55be8de635caaee083d


[ROCm/clr commit: 61b6a94627]
2022-11-28 12:14:16 -05:00
Anusha GodavarthySurya a93695d9f0 SWDEV-354074 - cache hsa queue and delete when device is destroyed
Change-Id: I26365521d785f0bc612e32bdcdb6caacb3af9a11


[ROCm/clr commit: 446a3fc688]
2022-11-23 07:17:45 +00:00
Chauncey Hui 037841e4b6 SWDEV-2 - Change OpenCL version number from 3532 to 3533
[ROCm/clr commit: dd810ecfbc]
2022-11-22 03:00:19 -05:00
Tao Sang c0a12bbbb5 SWDEV-306410 - Remove program lock
Remove global program lock in order to fix too
long kernel launch overhead with multi-threads
on MGPUs.
This patch depends on a compiler patch that makes
LC thread safe.
Change-Id: Ic8a7374d19112764d6de5d483ec5d07a56661d1b


[ROCm/clr commit: 3b2a8f3c8b]
2022-11-20 14:42:24 -05:00
Chauncey Hui d992a8aeca SWDEV-2 - Change OpenCL version number from 3531 to 3532
[ROCm/clr commit: c6c10d0127]
2022-11-20 03:00:08 -05:00
Alex Xie 1ac90df635 SWDEV-363271 - Add some performance counter logs
Change-Id: I368ba4afe47357eb6051b2a06f762b8050e5a560


[ROCm/clr commit: 9e00c674c8]
2022-11-19 16:27:14 -05:00
Chauncey Hui 67ab900b7c SWDEV-2 - Change OpenCL version number from 3530 to 3531
[ROCm/clr commit: 686ad01a24]
2022-11-19 03:00:08 -05:00
German e7d9a3f450 SWDEV-368308 - Remove HW debug extension
Change-Id: If0c68023c09f0dac9111d52ecc0ad63719aa4e70


[ROCm/clr commit: e5a36ab1ad]
2022-11-18 10:29:44 -05:00
Chauncey Hui b6cdeae044 SWDEV-2 - Change OpenCL version number from 3529 to 3530
[ROCm/clr commit: 80d444b2f1]
2022-11-18 03:00:08 -05:00
German 4b6a6ba8e8 SWDEV-363074 - Adjust staging copy limits in Windows
Pinned copy can cause big performance drops, because slow pinning under Windows.
Use up to 128MB for staging transfers. Change staging buffer size to 4MB.
Linux path should still have the old defaults.

Change-Id: I954edceb3ec89e8e670be116aa2d0a9564c8b11c


[ROCm/clr commit: 79d12df147]
2022-11-17 14:48:16 -05:00
kjayapra-amd c5c1b7fd56 SWDEV-241902 - Using COMGR APIs for extracing code object.
Change-Id: I96f1cee9c28f89f0b0fa5af0c2ae7966beb3207d


[ROCm/clr commit: 0d9e1de4e3]
2022-11-17 11:09:59 -05:00
German cb4aa60a37 SWDEV-328670 - Enable ROCr interop
Change-Id: I932afa2a0e75b1f1771c4be1e984b32966636afb


[ROCm/clr commit: b40823ccc1]
2022-11-17 09:40:34 -05:00
Juan Manuel MARTINEZ CAAMAÑO dfc604419d SWDEV-286150 - [NFC] Avoid copying the entire devicePrograms map
Change-Id: I059f979d9bcdf6604aa3630b40fd47475b75fc30


[ROCm/clr commit: bab23480d3]
2022-11-17 03:15:55 -05:00
Juan Manuel MARTINEZ CAAMAÑO ed84e3c9fa SWDEV-287018 - Generate tempraries for _optimized.bc and _linked.bc if -save-temp-all is passed.
Change-Id: Ie8a7af66f0764f0a94948d45d7deed786f799c96


[ROCm/clr commit: 9b8005d758]
2022-11-15 07:49:07 -05:00