Commit graph

4738 commits

Author SHA1 Message Date
Chauncey Hui 7e3e528441 SWDEV-2 - Change OpenCL version number from 3545 to 3546 2022-12-15 03:00:08 -05:00
Jacob Lambert 50d77d17fd SWDEV-356171 - Adding new Comgr mangled name fetching APIs
amd::Comgr::populate_mangled_names()
amd::Comgr::get_mangled_name()

Change-Id: Ie6ebf3aae76632f7e7cfef3c90642b2e3cd10b48
2022-12-14 12:40:18 -05:00
jatang a4fadb191a SWDEV-1 - Fix another build warning.
Change-Id: Ifed33732752e9d9ec74a244a0104bf571a9fc6c5
2022-12-14 10:14:49 -05:00
Chauncey Hui 5aee3d3018 SWDEV-2 - Change OpenCL version number from 3544 to 3545 2022-12-14 03:00:06 -05:00
jatang e8cd1e93e8 SWDEV-1 - Do not truncate size_t in print when it is 64-bit
Change-Id: Iab8b7eb128c767b6808b19cbeac5169b7a97f1ad
2022-12-13 09:13:33 -05:00
Chauncey Hui edb515eabf SWDEV-2 - Change OpenCL version number from 3543 to 3544 2022-12-13 03:00:08 -05:00
kjayapra-amd e56a611b92 SWDEV-371904 - Adding pseudo fine grain flag to hsa memory allocation for device fine grained memory.
Change-Id: I8cada90f0e3880dfbc5bf5a3fac4554e7a0cb08e
2022-12-11 08:15:17 -05:00
Chauncey Hui dc6bbe303b SWDEV-2 - Change OpenCL version number from 3542 to 3543 2022-12-10 03:00:08 -05:00
Todd tiantuo Li 9168415ca2 SWDEV-354868 - Queue::Create() for RT queue should fail when the number of reserved CUs for the RT queue is zero (most likely due to being aligned down by dedicatedCuGranularity).
Change-Id: I234e7ff83cb312bf44f5ad4b1a897c079f5106a9
2022-12-09 16:45:10 -05:00
Chauncey Hui 9c8cfc7f98 SWDEV-2 - Change OpenCL version number from 3541 to 3542 2022-12-09 03:00:07 -05:00
Ioannis Assiouras 72b45e2a1f SWDEV-369581 - Convey copy API metadata to ROCclr
Change-Id: I569462d6d268700d419510255e201bf7d80d6714
2022-12-09 00:27:15 -05:00
pghafari 402504f548 SWDEV-366279 - updating p2p error msg
Change-Id: I294e7770efd11f511dd5817eb28bd3c97d9d4926
2022-12-09 00:05:04 -05:00
Jaydeep Patel cd7034c9bb SWDEV-370537 - PAL does 64kb alignment so no need to do it in runtime. Reverting alignment to 4kb.
Change-Id: If3eaac65bf63bb9b8b73a3cad1bb34d357f1adeb
2022-12-08 14:54:46 -05:00
German 8cf53e7dff SWDEV-370901 - Force sysmem allocation
Adobe AE allocates and destroys system memory for each frame
in some effects, which is a very expensive operation.
Disable pinning and force sysmem allocation in VidMM.
That allows the resource cache in the runtime to be triggered and
avoids paging operations caused by pinning.
The actual benchmark doesn't show a significant improvement
from the change, but GPU view traces are much cleaner.

Change-Id: I92cdd87d37c64611788f05250f71d6fe489b4d7e
2022-12-08 13:06:51 -05:00
kjayapra-amd 7319dc36c1 SWDEV-363865 - Use COMGR path to unbundle code object in HIPRTC.
Change-Id: Ie6a48ab2792bb3f7ae1e62d716ecc35b8e24fe16
2022-12-08 08:34:33 -05:00
Chauncey Hui cbc66bf89d SWDEV-2 - Change OpenCL version number from 3540 to 3541 2022-12-08 03:00:09 -05:00
Saleel Kudchadker 4f64d89026 SWDEV-371123 - Use barrier value packet for event records
Change-Id: I5e5e5e89e0d96a2430b4682d168b76848fa5b94e
2022-12-07 17:57:36 -05:00
Chauncey Hui c8e66f4cf8 SWDEV-2 - Change OpenCL version number from 3539 to 3540 2022-12-07 03:00:08 -05:00
Tao Sang f29d3bc3ac SWDEV-370659 - Add lock for HSAIL only
Add lock for HSAIL only in order to fix test failures
in math brute force and integer_ops tests.

Change-Id: I5f14cdcaa4ee9867fdae63fff197a0f21ee5f1d4
2022-12-06 15:50:04 -05:00
Chauncey Hui a4576e0a60 SWDEV-2 - Change OpenCL version number from 3538 to 3539 2022-12-06 03:00:08 -05:00
Ioannis Assiouras 7631ba8b4d SWDEV-369548 - Added getProcessId function in Os:: and change log functions to use that.
Change-Id: Ib64ee4d65a5212af7a4a95f80173b7ff8382340f
2022-12-05 23:59:25 -05:00
Tao Sang 8c582a02ef SWDEV-347733 - Remove luxmark workaround for gfx11
Make gfx11 always use LC by default.
Fix switch order of Navi3x.
Change-Id: I5774563edf4d1aed29a4e92723630adbfb75c48c
2022-12-05 18:38:08 -05:00
Sourabh Betigeri 5d7f3f9f3c SWDEV-305894 - Cooperative groups grid and multi grid sync support for gfx940+
Change-Id: I35d72f1cb50c3a96eee56a612b72d641852b145f
2022-12-05 16:30:30 -05:00
Chauncey Hui 3296dd8636 SWDEV-2 - Change OpenCL version number from 3537 to 3538 2022-12-03 03:00:06 -05:00
Saleel Kudchadker 10ecf8f159 SWDEV-371123 - Refactor dispatchBarrierValuePacket
Change-Id: I846bd3f60dd8db125e2ca5475e9d25fdd567922b
2022-12-02 12:22:31 -05:00
Chauncey Hui fb42848728 SWDEV-2 - Change OpenCL version number from 3536 to 3537 2022-12-02 03:00:08 -05:00
Jason Tang 56fe793134 SWDEV-1 - Fix a build error.
Change-Id: I5f14cdca04ee9867f0ae63fff197a0f21ee5f1d2
2022-12-01 10:31:34 -05:00
Chauncey Hui 5b19eb7210 SWDEV-2 - Change OpenCL version number from 3535 to 3536 2022-12-01 03:00:07 -05:00
Jason Tang c825d06454 SWDEV-1 - Fix a build warning.
Change-Id: I085fcc68ef3aaebf85515ea45fca99d7ec845721
2022-11-30 11:07:15 -05:00
Chauncey Hui e12c3bd819 SWDEV-2 - Change OpenCL version number from 3534 to 3535 2022-11-30 03:00:07 -05:00
Ioannis Assiouras b445717f72 SWDEV-369547 - Ensure that ipc_mem_detach is not called for non-ipcShared MemObj.
Change-Id: Ia73b60b9f08d593301879e0f72c395edbc215112
2022-11-30 02:16:13 -05:00
Chauncey Hui e74835795a SWDEV-2 - Change OpenCL version number from 3533 to 3534 2022-11-29 03:00:07 -05:00
kjayapra-amd 61b6a94627 SWDEV-358915 - Wait for 10 microseconds if the next engine is SDMA.
Change-Id: Ib3ed91510e62090f47d4f55be8de635caaee083d
2022-11-28 12:14:16 -05:00
Anusha GodavarthySurya 446a3fc688 SWDEV-354074 - cache hsa queue and delete when device is destroyed
Change-Id: I26365521d785f0bc612e32bdcdb6caacb3af9a11
2022-11-23 07:17:45 +00:00
Chauncey Hui dd810ecfbc SWDEV-2 - Change OpenCL version number from 3532 to 3533 2022-11-22 03:00:19 -05:00
Tao Sang 3b2a8f3c8b SWDEV-306410 - Remove program lock
Remove the global program lock in order to fix excessive
kernel launch overhead with multiple threads
on MGPUs.
This patch depends on a compiler patch that makes
LC thread safe.
Change-Id: Ic8a7374d19112764d6de5d483ec5d07a56661d1b
2022-11-20 14:42:24 -05:00
Chauncey Hui c6c10d0127 SWDEV-2 - Change OpenCL version number from 3531 to 3532 2022-11-20 03:00:08 -05:00
Alex Xie 9e00c674c8 SWDEV-363271 - Add some performance counter logs
Change-Id: I368ba4afe47357eb6051b2a06f762b8050e5a560
2022-11-19 16:27:14 -05:00
Chauncey Hui 686ad01a24 SWDEV-2 - Change OpenCL version number from 3530 to 3531 2022-11-19 03:00:08 -05:00
German e5a36ab1ad SWDEV-368308 - Remove HW debug extension
Change-Id: If0c68023c09f0dac9111d52ecc0ad63719aa4e70
2022-11-18 10:29:44 -05:00
Chauncey Hui 80d444b2f1 SWDEV-2 - Change OpenCL version number from 3529 to 3530 2022-11-18 03:00:08 -05:00
German 79d12df147 SWDEV-363074 - Adjust staging copy limits in Windows
Pinned copy can cause big performance drops because of slow pinning under Windows.
Use up to 128MB for staging transfers. Change staging buffer size to 4MB.
Linux path should still have the old defaults.

Change-Id: I954edceb3ec89e8e670be116aa2d0a9564c8b11c
2022-11-17 14:48:16 -05:00
kjayapra-amd 0d9e1de4e3 SWDEV-241902 - Using COMGR APIs for extracting code object.
Change-Id: I96f1cee9c28f89f0b0fa5af0c2ae7966beb3207d
2022-11-17 11:09:59 -05:00
German b40823ccc1 SWDEV-328670 - Enable ROCr interop
Change-Id: I932afa2a0e75b1f1771c4be1e984b32966636afb
2022-11-17 09:40:34 -05:00
Juan Manuel MARTINEZ CAAMAÑO bab23480d3 SWDEV-286150 - [NFC] Avoid copying the entire devicePrograms map
Change-Id: I059f979d9bcdf6604aa3630b40fd47475b75fc30
2022-11-17 03:15:55 -05:00
Juan Manuel MARTINEZ CAAMAÑO 9b8005d758 SWDEV-287018 - Generate temporaries for _optimized.bc and _linked.bc if -save-temp-all is passed.
Change-Id: Ie8a7af66f0764f0a94948d45d7deed786f799c96
2022-11-15 07:49:07 -05:00
Chauncey Hui 8246c73e3a SWDEV-2 - Change OpenCL version number from 3528 to 3529 2022-11-12 03:00:08 -05:00
kjayapra-amd 7f1fb925ff SWDEV-361374 - Adding support for hipPointerSetAttributes
Change-Id: I3ec9627f43b3cbe0aa299c8aa9cd96f8fbd74925
2022-11-11 12:07:26 -05:00
Chauncey Hui 820d90c6c0 SWDEV-2 - Change OpenCL version number from 3527 to 3528 2022-11-11 03:00:06 -05:00
Jason Tang 2c09557d60 SWDEV-1 - Add LOG_ERROR output for Pal::CreatePlatform failure
Change-Id: Id5c3fdcd8f822bfa67ae570c09ede7ddc8a936ef
2022-11-10 09:30:10 -05:00