Chauncey Hui
2dbee131bd
SWDEV-2 - Change OpenCL version number from 3547 to 3548
...
[ROCm/clr commit: b8faaf8458 ]
2022-12-21 03:00:11 -05:00
Jaydeep Patel
7df45ffc38
SWDEV-366087 - Pass pitch and slice pitch to blit kernel from rect struct.
...
Change-Id: I1ffe54929db59a40e2a1ae19c125f8d8e81b07ec
[ROCm/clr commit: 9076d9a518 ]
2022-12-20 16:43:49 +00:00
Chauncey Hui
5d8905d50c
SWDEV-2 - Change OpenCL version number from 3546 to 3547
...
[ROCm/clr commit: d37bf7d22b ]
2022-12-16 03:00:08 -05:00
Saleel Kudchadker
ed4c04b150
SWDEV-373334 - Use copyMetadata for blit decisions
...
- Check isAsync flag for small host copies on large bar as it synchronizes
- Use CopyEngine Preference hint if HMM is enabled.
Change-Id: I1ffc4b2604ed03cf5979cdc454178648c5ae5cba
[ROCm/clr commit: e0384f9f6b ]
2022-12-15 17:09:02 -05:00
Chauncey Hui
8d648e4efe
SWDEV-2 - Change OpenCL version number from 3545 to 3546
...
[ROCm/clr commit: 7e3e528441 ]
2022-12-15 03:00:08 -05:00
Jacob Lambert
87e6e1357b
SWDEV-356171 - Adding new Comgr mangled name fetching APIs
...
amd::Comgr::populate_mangled_names()
amd::Comgr::get_mangled_name()
Change-Id: Ie6ebf3aae76632f7e7cfef3c90642b2e3cd10b48
[ROCm/clr commit: 50d77d17fd ]
2022-12-14 12:40:18 -05:00
jatang
b9cb9b64ed
SWDEV-1 - Fix another build warning.
...
Change-Id: Ifed33732752e9d9ec74a244a0104bf571a9fc6c5
[ROCm/clr commit: a4fadb191a ]
2022-12-14 10:14:49 -05:00
Chauncey Hui
4c2154acfc
SWDEV-2 - Change OpenCL version number from 3544 to 3545
...
[ROCm/clr commit: 5aee3d3018 ]
2022-12-14 03:00:06 -05:00
jatang
3707f5d671
SWDEV-1 - Not to truncate size_t in print when it's 64bit
...
Change-Id: Iab8b7eb128c767b6808b19cbeac5169b7a97f1ad
[ROCm/clr commit: e8cd1e93e8 ]
2022-12-13 09:13:33 -05:00
Chauncey Hui
db15e282f1
SWDEV-2 - Change OpenCL version number from 3543 to 3544
...
[ROCm/clr commit: edb515eabf ]
2022-12-13 03:00:08 -05:00
kjayapra-amd
e295beb8ba
SWDEV-371904 - Adding pseudo fine grain flag to hsa memory allocation for device fine grained memory.
...
Change-Id: I8cada90f0e3880dfbc5bf5a3fac4554e7a0cb08e
[ROCm/clr commit: e56a611b92 ]
2022-12-11 08:15:17 -05:00
Chauncey Hui
9cd8e38967
SWDEV-2 - Change OpenCL version number from 3542 to 3543
...
[ROCm/clr commit: dc6bbe303b ]
2022-12-10 03:00:08 -05:00
Todd tiantuo Li
bf1d819ac7
SWDEV-354868 - Queue::Create() for RT queue should fail when number of reserved CU's for RT queue is zero (most likely due to being aligned down by dedicatedCuGranularity).
...
Change-Id: I234e7ff83cb312bf44f5ad4b1a897c079f5106a9
[ROCm/clr commit: 9168415ca2 ]
2022-12-09 16:45:10 -05:00
Chauncey Hui
2f501eff81
SWDEV-2 - Change OpenCL version number from 3541 to 3542
...
[ROCm/clr commit: 9c8cfc7f98 ]
2022-12-09 03:00:07 -05:00
Ioannis Assiouras
733c8d1d1c
SWDEV-369581 - Convey copy API metadata to ROCclr
...
Change-Id: I569462d6d268700d419510255e201bf7d80d6714
[ROCm/clr commit: 72b45e2a1f ]
2022-12-09 00:27:15 -05:00
pghafari
4e6a699eae
SWDEV-366279 - updating p2p error msg
...
Change-Id: I294e7770efd11f511dd5817eb28bd3c97d9d4926
[ROCm/clr commit: 402504f548 ]
2022-12-09 00:05:04 -05:00
Jaydeep Patel
be5d6fb005
SWDEV-370537 - PAL does 64kb alignment so no need to do it in runtime. Reverting alignment to 4kb.
...
Change-Id: If3eaac65bf63bb9b8b73a3cad1bb34d357f1adeb
[ROCm/clr commit: cd7034c9bb ]
2022-12-08 14:54:46 -05:00
German
ebf7fad258
SWDEV-370901 - Force sysmem allocation
...
Adobe AE allocates/destroys system memory per each frame
in some effects, which is a very expensive operation.
Disable pinning and force sysmem allocation in VidMM.
That allows to trigger resource cache in runtime and
avoid paging operations, caused by pinning.
The actual benchmark doesn't have significant improvement
from the change, but GPU view traces are much cleaner.
Change-Id: I92cdd87d37c64611788f05250f71d6fe489b4d7e
[ROCm/clr commit: 8cf53e7dff ]
2022-12-08 13:06:51 -05:00
kjayapra-amd
e47e613948
SWDEV-363865 - Use COMGR path to unbundle code object in HIPRTC.
...
Change-Id: Ie6a48ab2792bb3f7ae1e62d716ecc35b8e24fe16
[ROCm/clr commit: 7319dc36c1 ]
2022-12-08 08:34:33 -05:00
Chauncey Hui
22ca7dc91e
SWDEV-2 - Change OpenCL version number from 3540 to 3541
...
[ROCm/clr commit: cbc66bf89d ]
2022-12-08 03:00:09 -05:00
Saleel Kudchadker
7ba49616e9
SWDEV-371123 - Use barrier value packet for event records
...
Change-Id: I5e5e5e89e0d96a2430b4682d168b76848fa5b94e
[ROCm/clr commit: 4f64d89026 ]
2022-12-07 17:57:36 -05:00
Chauncey Hui
9323ba1345
SWDEV-2 - Change OpenCL version number from 3539 to 3540
...
[ROCm/clr commit: c8e66f4cf8 ]
2022-12-07 03:00:08 -05:00
Tao Sang
ea9808ac40
SWDEV-370659 - Add lock for HSAIL only
...
Add lock for HSAIL only in order to fix test failures
in math brute force and integer_ops tests.
Change-Id: I5f14cdcaa4ee9867fdae63fff197a0f21ee5f1d4
[ROCm/clr commit: f29d3bc3ac ]
2022-12-06 15:50:04 -05:00
Chauncey Hui
1e49759297
SWDEV-2 - Change OpenCL version number from 3538 to 3539
...
[ROCm/clr commit: a4576e0a60 ]
2022-12-06 03:00:08 -05:00
Ioannis Assiouras
57b1f696ec
SWDEV-369548 - Added getProcessId function in Os:: and change log functions to use that.
...
Change-Id: Ib64ee4d65a5212af7a4a95f80173b7ff8382340f
[ROCm/clr commit: 7631ba8b4d ]
2022-12-05 23:59:25 -05:00
Tao Sang
0efc782e69
SWDEV-347733 - Remove luxmark workaround for gfx11
...
Make gfx11 always use LC by default.
Fix switch order of Navi3x.
Change-Id: I5774563edf4d1aed29a4e92723630adbfb75c48c
[ROCm/clr commit: 8c582a02ef ]
2022-12-05 18:38:08 -05:00
Sourabh Betigeri
7aa958a8f7
SWDEV-305894 - Cooperative groups grid and multi grid sync support for gfx940+
...
Change-Id: I35d72f1cb50c3a96eee56a612b72d641852b145f
[ROCm/clr commit: 5d7f3f9f3c ]
2022-12-05 16:30:30 -05:00
Chauncey Hui
40956fe9a7
SWDEV-2 - Change OpenCL version number from 3537 to 3538
...
[ROCm/clr commit: 3296dd8636 ]
2022-12-03 03:00:06 -05:00
Saleel Kudchadker
f14b550e87
SWDEV-371123 - Refactor dispatchBarrierValuePacket
...
Change-Id: I846bd3f60dd8db125e2ca5475e9d25fdd567922b
[ROCm/clr commit: 10ecf8f159 ]
2022-12-02 12:22:31 -05:00
Chauncey Hui
7eaabcd8bd
SWDEV-2 - Change OpenCL version number from 3536 to 3537
...
[ROCm/clr commit: fb42848728 ]
2022-12-02 03:00:08 -05:00
Jason Tang
5cc5cf4f11
SWDEV-1 - Fix a build error.
...
Change-Id: I5f14cdca04ee9867f0ae63fff197a0f21ee5f1d2
[ROCm/clr commit: 56fe793134 ]
2022-12-01 10:31:34 -05:00
Chauncey Hui
3b77cda4aa
SWDEV-2 - Change OpenCL version number from 3535 to 3536
...
[ROCm/clr commit: 5b19eb7210 ]
2022-12-01 03:00:07 -05:00
Jason Tang
310f43e9e3
SWDEV-1 - Fix a build warning.
...
Change-Id: I085fcc68ef3aaebf85515ea45fca99d7ec845721
[ROCm/clr commit: c825d06454 ]
2022-11-30 11:07:15 -05:00
Chauncey Hui
e4e12ab1c4
SWDEV-2 - Change OpenCL version number from 3534 to 3535
...
[ROCm/clr commit: e12c3bd819 ]
2022-11-30 03:00:07 -05:00
Ioannis Assiouras
941dc51bfd
SWDEV-369547 - Ensure that ipc_mem_detach is not called for non-ipcShared MemObj.
...
Change-Id: Ia73b60b9f08d593301879e0f72c395edbc215112
[ROCm/clr commit: b445717f72 ]
2022-11-30 02:16:13 -05:00
Chauncey Hui
ac1a33714d
SWDEV-2 - Change OpenCL version number from 3533 to 3534
...
[ROCm/clr commit: e74835795a ]
2022-11-29 03:00:07 -05:00
kjayapra-amd
1fea133877
SWDEV-358915 - Wait for 10 micro secs, if the next engine is SDMA.
...
Change-Id: Ib3ed91510e62090f47d4f55be8de635caaee083d
[ROCm/clr commit: 61b6a94627 ]
2022-11-28 12:14:16 -05:00
Anusha GodavarthySurya
a93695d9f0
SWDEV-354074 - cache hsa queue and delete when device is destroyed
...
Change-Id: I26365521d785f0bc612e32bdcdb6caacb3af9a11
[ROCm/clr commit: 446a3fc688 ]
2022-11-23 07:17:45 +00:00
Chauncey Hui
037841e4b6
SWDEV-2 - Change OpenCL version number from 3532 to 3533
...
[ROCm/clr commit: dd810ecfbc ]
2022-11-22 03:00:19 -05:00
Tao Sang
c0a12bbbb5
SWDEV-306410 - Remove program lock
...
Remove global program lock in order to fix too
long kernel launch overhead with multi-threads
on MGPUs.
This patch depends on a compiler patch that makes
LC thread safe.
Change-Id: Ic8a7374d19112764d6de5d483ec5d07a56661d1b
[ROCm/clr commit: 3b2a8f3c8b ]
2022-11-20 14:42:24 -05:00
Chauncey Hui
d992a8aeca
SWDEV-2 - Change OpenCL version number from 3531 to 3532
...
[ROCm/clr commit: c6c10d0127 ]
2022-11-20 03:00:08 -05:00
Alex Xie
1ac90df635
SWDEV-363271 - Add some performance counter logs
...
Change-Id: I368ba4afe47357eb6051b2a06f762b8050e5a560
[ROCm/clr commit: 9e00c674c8 ]
2022-11-19 16:27:14 -05:00
Chauncey Hui
67ab900b7c
SWDEV-2 - Change OpenCL version number from 3530 to 3531
...
[ROCm/clr commit: 686ad01a24 ]
2022-11-19 03:00:08 -05:00
German
e7d9a3f450
SWDEV-368308 - Remove HW debug extension
...
Change-Id: If0c68023c09f0dac9111d52ecc0ad63719aa4e70
[ROCm/clr commit: e5a36ab1ad ]
2022-11-18 10:29:44 -05:00
Chauncey Hui
b6cdeae044
SWDEV-2 - Change OpenCL version number from 3529 to 3530
...
[ROCm/clr commit: 80d444b2f1 ]
2022-11-18 03:00:08 -05:00
German
4b6a6ba8e8
SWDEV-363074 - Adjust staging copy limits in Windows
...
Pinned copy can cause big performance drops, because slow pinning under Windows.
Use up to 128MB for staging transfers. Change staging buffer size to 4MB.
Linux path should still have the old defaults.
Change-Id: I954edceb3ec89e8e670be116aa2d0a9564c8b11c
[ROCm/clr commit: 79d12df147 ]
2022-11-17 14:48:16 -05:00
kjayapra-amd
c5c1b7fd56
SWDEV-241902 - Using COMGR APIs for extracing code object.
...
Change-Id: I96f1cee9c28f89f0b0fa5af0c2ae7966beb3207d
[ROCm/clr commit: 0d9e1de4e3 ]
2022-11-17 11:09:59 -05:00
German
cb4aa60a37
SWDEV-328670 - Enable ROCr interop
...
Change-Id: I932afa2a0e75b1f1771c4be1e984b32966636afb
[ROCm/clr commit: b40823ccc1 ]
2022-11-17 09:40:34 -05:00
Juan Manuel MARTINEZ CAAMAÑO
dfc604419d
SWDEV-286150 - [NFC] Avoid copying the entire devicePrograms map
...
Change-Id: I059f979d9bcdf6604aa3630b40fd47475b75fc30
[ROCm/clr commit: bab23480d3 ]
2022-11-17 03:15:55 -05:00
Juan Manuel MARTINEZ CAAMAÑO
ed84e3c9fa
SWDEV-287018 - Generate tempraries for _optimized.bc and _linked.bc if -save-temp-all is passed.
...
Change-Id: Ie8a7af66f0764f0a94948d45d7deed786f799c96
[ROCm/clr commit: 9b8005d758 ]
2022-11-15 07:49:07 -05:00