Jayaprakash, Karthik
4ea2d9a5ee
SWDEV-531711 - Report correct error code based on device failure. ( #286 )
...
[ROCm/clr commit: f5b8db33f1 ]
2025-05-17 06:33:13 -04:00
Brzak, Branislav
dc39d67017
SWDEV-508742 - Make clCreatePipe spec compliant ( #80 )
...
[ROCm/clr commit: 7698d799ce ]
2025-05-16 15:18:35 +05:30
Belton-Schure, Aidan
c04355bbe3
Add __syncwarp operation ( #160 )
...
Change-Id: I6a3783beafdbb9f11a3b37333f4ff3f5be27ea54
[ROCm/clr commit: c50610b44d ]
2025-05-15 14:20:13 +05:30
Patel, Jaydeepkumar
7b6e59cbea
SWDEV-530803 - Use current device id while cloning graph node. ( #313 )
...
[ROCm/clr commit: 32eb6a5d89 ]
2025-05-15 09:06:15 +05:30
Andryeyev, German
b13eec7049
SWDEV-345024 - Retain the program on Fini kernel execution ( #307 )
...
Fini kernel is executed during the invocation of amd::Program destructor,
but the dispatch logic can retain/release the reference counter and
cause double free. Avoid double free with an extra retain() call
[ROCm/clr commit: bddb8f14d1 ]
2025-05-14 21:21:26 +05:30
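A minimal sketch of the double-free hazard described in the SWDEV-345024 entry above, assuming a simple intrusive reference counter. `RefCounted`, `dispatch`, and `teardown` are hypothetical names for illustration, not the amd::Program API, and destruction is only counted rather than performed.

```cpp
#include <cassert>

// Hypothetical stand-in for a reference-counted runtime object.
class RefCounted {
 public:
  void retain() { ++count_; }
  // Returns true when this release dropped the count to zero, which is
  // the point where a real implementation would destroy the object.
  bool release() { return --count_ == 0; }
  int count() const { return count_; }

 private:
  int count_ = 1;  // the creator owns the first reference
};

// Hypothetical dispatch path: it holds a reference while the kernel runs.
// Returns true if the dispatch itself would have triggered destruction.
bool dispatch(RefCounted& obj) {
  obj.retain();
  return obj.release();
}

// Models teardown: the last reference is released (destruction starts),
// then a final kernel is dispatched from inside the destruction path.
// Returns how many times destruction would be requested.
int teardown(bool extra_retain) {
  RefCounted program;
  int destroy_requests = 0;
  if (program.release()) ++destroy_requests;  // count 1 -> 0
  if (extra_retain) program.retain();         // guard: count 0 -> 1
  if (dispatch(program)) ++destroy_requests;  // unguarded: 0 -> 1 -> 0 again
  return destroy_requests;
}
```

In this model `teardown(false)` requests destruction twice (the double free), while `teardown(true)` requests it once, because the guard reference keeps the counter above zero during the dispatch.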
Xie, Pengda
d0b5bee680
SWDEV-527781 - Remove Stream Validation in HIP APIs
...
[ROCm/clr commit: 0457b634f8 ]
2025-05-13 13:45:27 -07:00
Assiouras, Ioannis
71f19d7017
SWDEV-529449 - Bug fix when retrieving a memobj from the IPC mem handle
...
[ROCm/clr commit: f7482ef0a6 ]
2025-05-13 19:18:22 +01:00
Hernandez, Gerardo
f6e2ce709e
SWDEV-491314 - Re-enable cross-lane sync builtins ( #94 )
...
* Enables warp sync builtins by default
* Removes HIP_ENABLE_WARP_SYNC_BUILTINS; that macro no longer has any effect. The builtins can now be disabled by defining the macro HIP_DISABLE_WARP_SYNC_BUILTINS instead.
[ROCm/clr commit: 5606debd8e ]
2025-05-13 16:35:58 +01:00
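The opt-out switch described in the SWDEV-491314 entry above can be pictured as a preprocessor gate. This is a sketch of the general pattern only: HIP_DISABLE_WARP_SYNC_BUILTINS is the macro named in the commit message, while `SKETCH_HAS_WARP_SYNC` and `warp_sync_builtins_enabled` are hypothetical names, and the actual HIP headers may be structured differently.

```cpp
#include <cassert>

// Opt-out gate: the feature is on unless the user defines the disable
// macro (for example with -DHIP_DISABLE_WARP_SYNC_BUILTINS on the
// compiler command line). SKETCH_HAS_WARP_SYNC is a hypothetical name.
#if !defined(HIP_DISABLE_WARP_SYNC_BUILTINS)
#define SKETCH_HAS_WARP_SYNC 1
#else
#define SKETCH_HAS_WARP_SYNC 0
#endif

// Hypothetical helper that reports which branch of the gate was taken.
constexpr bool warp_sync_builtins_enabled() {
  return SKETCH_HAS_WARP_SYNC != 0;
}
```

With no macro defined the helper reports `true`, matching the new default. Under this pattern, defining the old HIP_ENABLE_WARP_SYNC_BUILTINS changes nothing, because the gate no longer checks it.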
Hila, Nino
92b0f9d114
Update palamida.yml ( #266 )
...
* Add palamida.yml - removing url
[ROCm/clr commit: 29df3ae6e9 ]
2025-05-12 21:39:21 -07:00
Jayaprakash, Karthik
a128beeeb5
SWDEV-506467 - Fixing compilation issue seen on clang compilation for ASAN. ( #253 )
...
[ROCm/clr commit: 876de49b11 ]
2025-05-12 17:16:56 -04:00
Brzak, Branislav
cc6bfe71fc
SWDEV-528683 - Hardcode valid wavefront compile time options ( #306 )
...
[ROCm/clr commit: f9199ac205 ]
2025-05-12 19:29:39 +02:00
Andryeyev, German
b9669ea266
SWDEV-531678 - Remove split path from the dispatch ( #283 )
...
The split path for blit kernels is no longer necessary, since the new blit kernels
don't use the copy size as the global workload
[ROCm/clr commit: da198ac5b2 ]
2025-05-12 12:50:32 -04:00
Jayaprakash, Karthik
5dff283f4e
SWDEV-526855 - Modify the SIMDPerCU calculation for gfx1250/1. ( #275 )
...
[ROCm/clr commit: acb1f7e8d5 ]
2025-05-12 11:09:03 -04:00
Arandjelovic, Marko
3d862af4df
SWDEV-512344 - Unmap all subbuffers ( #214 )
...
[ROCm/clr commit: c5ced8c3a2 ]
2025-05-12 16:56:10 +02:00
Arandjelovic, Marko
558a26cf98
SWDEV-511204 - Mapped virtual memory should use device instead of host context ( #213 )
...
Since the sub-buffer (virtual memory that is mapped to device memory) is associated with device memory, it should use the device context instead of the host context. The original implementation caused hipMemcpyPeer to not take the P2P path, as the memory object was treated as host memory.
[ROCm/clr commit: a7492c516d ]
2025-05-12 16:55:25 +02:00
Patel, Jaydeepkumar
5bd5f3aa6e
SWDEV-521135 - Make common way to set/parse UUID bytes from PAL props. ( #63 )
...
[ROCm/clr commit: 6858b0fca1 ]
2025-05-12 17:00:30 +05:30
Six, Lancelot
fe14cc349b
SWDEV-517078: Fix gfx11 trap handler ( #212 )
...
Fix incorrect edits done when porting the 2nd level trap handler from
the hsa-runtime.
Change-Id: I7bc5160be47b8f669efe05c4d194bc3c47fc0661
[ROCm/clr commit: c35e9643ec ]
2025-05-11 01:12:28 +01:00
Xie, AlexBin
142e347aac
SWDEV-528860 - reserve some memory in visible frame buffer ( #251 )
...
[ROCm/clr commit: faac50c77a ]
2025-05-09 20:08:23 -04:00
Huang, AnZhong
3decd4f975
SWDEV-527299 - Support HIP_POINTER_ATTRIBUTE_CONTEXT ( #180 )
...
* SWDEV-527299 - Support HIP_POINTER_ATTRIBUTE_CONTEXT
As HIP enables UVA by default, it seems we can simply expose the context to support this feature.
[ROCm/clr commit: b434fbe2bd ]
2025-05-09 17:34:16 +08:00
Chaudhary, Jatin Jaikishan
a8630e866d
SWDEV-525933 - add constexpr operators for fp16/bf16 ( #199 )
...
[ROCm/clr commit: 2f73e1385b ]
2025-05-09 09:53:58 +01:00
Xie, Jiabao(Jimbo)
e1d2194b75
SWDEV-528913 - support gfx950 in rocsetting ( #217 )
...
* SWDEV-528913 - support gfx950 in rocsetting
---------
Co-authored-by: Jimbo Xie <jiabaxie@amd.com>
[ROCm/clr commit: a320a3f214 ]
2025-05-07 15:44:49 -04:00
Lambert, Jacob
dc1c1e3199
SWDEV-518221 - Don't link against libamd_comgr.so at runtime
...
Convention is to always link against .so.* at runtime.
Having it link against .so will break on systems that package
the .so files in their dev/devel package.
This issue was found when building ROCm 6.4 for Fedora.
Committing on behalf of GitHub user Mystro256
[ROCm/clr commit: 6b12154583 ]
2025-05-07 11:56:41 -07:00
Zhang, Victor
fbabd2b69d
SWDEV-528142 - add error check for KernelParameters::capture ( #276 )
...
* SWDEV-528142 - add error check for KernelParameters::capture
* Update kernel.cpp
---------
Co-authored-by: victzhan <victzhan@amd.com>
[ROCm/clr commit: f960433dcd ]
2025-05-07 09:52:09 -04:00
Jayaprakash, Karthik
cde2a250ec
SWDEV-493805 - Cleaning up launch parameters arguments. ( #241 )
...
[ROCm/clr commit: fa55557f46 ]
2025-05-06 15:06:13 -04:00
Dittakavi, Satyanvesh
086a1c289a
SWDEV-529831 - Return error if the program is empty ( #257 )
...
[ROCm/clr commit: 607f8f26fd ]
2025-05-06 15:12:12 +05:30
Chaudhary, Jatin Jaikishan
b5f67d4804
SWDEV-529854 - __hmax/__hmin should handle NaNs ( #246 )
...
[ROCm/clr commit: a71c6eb1a0 ]
2025-05-06 09:42:15 +01:00
Chaudhary, Jatin Jaikishan
a12739ecd9
SWDEV-529927 - add missing operations for fp16/bf16 ( #238 )
...
[ROCm/clr commit: b1ebf33850 ]
2025-05-06 09:41:21 +01:00
Andryeyev, German
3ea758a2d4
SWDEV-528808 - Release all HW queues even if only one is idle ( #240 )
...
PyTorch may not explicitly idle each queue. Thus, some queues can be considered busy
even though they are actually idle
[ROCm/clr commit: 65a0181a7c ]
2025-05-05 19:09:01 -04:00
Guan, Zichuan
ee91a1e94a
Disable HIP_PLATFORM auto-detect if already defined ( #254 )
...
Co-authored-by: Stella Laurenzo <stellaraccident@gmail.com>
[ROCm/clr commit: 3775298655 ]
2025-05-05 15:37:53 -04:00
Arsenault, Matthew
13d8f9adae
SWDEV-1 - Stop using ocml rounding functions ( #228 )
...
Directly use the builtins. Use the elementwise versions since there's
no implied errno, regardless of -f[no]-math-errno.
I didn't change the cases unnecessarily casting. The bfloat and vector
cases should work directly.
[ROCm/clr commit: 1db9a7d48b ]
2025-05-05 19:35:12 +02:00
Andryeyev, German
c512258e45
SWDEV-528808 - Disable dynamic queue by default ( #256 )
...
Dynamic queue management will be disabled by default and
the original sort logic is restored
[ROCm/clr commit: 9b018165ce ]
2025-05-05 10:56:35 -04:00
Searles, Mark
e480220c6a
Fix typos in warning msgs ( #231 )
...
[ROCm/clr commit: cd9bc61559 ]
2025-05-02 14:31:42 -07:00
Chaudhary, Jatin Jaikishan
f619372ae6
SWDEV-514560 - add fp6 header implementation ( #54 )
...
Co-authored-by: rahul manocha <rmanocha_amdeng>
[ROCm/clr commit: 12febe6782 ]
2025-05-01 15:17:38 +01:00
Assiouras, Ioannis
3d4ff304d7
SWDEV-521011 - Fix alignment in PalResource::CreateSvm
...
[ROCm/clr commit: 9d6a0d1a4d ]
2025-05-01 02:22:49 +01:00
Andryeyev, German
13c7977d50
SWDEV-526836 - add PipelineStageBlt flag ( #229 )
...
CP sync requires PipelineStageBlt flag.
[ROCm/clr commit: 84a4f293f4 ]
2025-04-30 14:27:41 -04:00
Assiouras, Ioannis
4efd624960
SWDEV-525593, SWDEV-527293 - Acquire active queue after xferQueue is created ( #165 )
...
For xferQueue, VirtualGPU::create is called after ProfilingBegin,
so the active queue needs to be acquired.
[ROCm/clr commit: d3fb8eda8b ]
2025-04-30 09:21:11 +01:00
Godavarthy Surya, Anusha
e4a499f22e
SWDEV-522841 - Graph nodes must be created/launched on device where they are captured/created ( #108 )
...
[ROCm/clr commit: 2538d7f02b ]
2025-04-29 22:20:39 +05:30
Jiang, Julia
6ab34e0924
SWDEV-522634 - Fix device properties in hipInfo ( #203 )
...
[ROCm/clr commit: eb62fe9f62 ]
2025-04-29 11:29:47 -04:00
Sang, Tao
68deb3d10a
SWDEV-520352 - Remove HostThread and legacy monitor ( #230 )
...
* SWDEV-520352 - Remove HostThread and legacy monitor
Remove HostThread, semaphore and legacy monitor.
Make the original thread and command queue logic stricter.
Add more comments to make the logic clearer.
Some other minor improvements.
Also part of SWDEV-458943.
[ROCm/clr commit: 96cadbc9e9 ]
2025-04-29 09:55:24 -04:00
GunaShekar, Ajay
c4567a9188
SWDEV-523028 - print PAL failure return values in logs ( #81 )
...
* print PAL failure return values in logs
* dump kernel info in case of PAL failure
[ROCm/clr commit: 99ef573399 ]
2025-04-29 11:23:43 +05:30
Jayaprakash, Karthik
0071d33754
SWDEV-522707 - Set phys_mem_handle type to sizeof(size_t) to avoid blocking address range. ( #105 )
...
[ROCm/clr commit: 6811fd90b8 ]
2025-04-29 11:19:16 +05:30
Jayaprakash, Karthik
49a527c826
SWDEV-506467 - Skip Abort in case of crash from the device. ( #60 )
...
Change-Id: I964b2f2647d068202e9c38fcddb1337da754df8d
[ROCm/clr commit: b2388dfb88 ]
2025-04-29 11:19:02 +05:30
Betigeri, Sourabh
ae0640131e
SWDEV-528351 - Removes unused code and asserts to improve coverage ( #219 )
...
[ROCm/clr commit: 9cf3f1e461 ]
2025-04-28 14:40:35 -07:00
Critchley, Paul
4d3978e094
SWDEV-523611 - [Tools][OCL] OpenCL fails to capture with PalTrace ( #198 )
...
Finalize DevDriver initialization after device creation
[ROCm/clr commit: 7e9d5eab7c ]
2025-04-28 08:02:34 -07:00
Godavarthy Surya, Anusha
ff69bcc903
SWDEV-469422 - Avoid use of hipStream_t in internal methods ( #69 )
...
Change-Id: Ifd5362f371c846a88241927383cb95cf046548ef
[ROCm/clr commit: fb92683d86 ]
2025-04-28 15:09:11 +05:30
Godavarthy Surya, Anusha
0eb2e5e8f2
SWDEV-469422 - hipGraphNodeDOTAttribute change std::string members to const char* ( #70 )
...
Compiler creates global variables for every unique string
Change-Id: I4cf8dd3e763d16740096e345da67a7ef72f61515
[ROCm/clr commit: bbcb1f9c70 ]
2025-04-28 14:57:36 +05:30
Assiouras, Ioannis
875468bbfb
SWDEV-526188 - Fix race condition in StatCO::getStatFunc()
...
Make sure that a newly created FatBinaryInfo is assigned to modules only after extractFatBinary has been called for the object.
[ROCm/clr commit: 1099e0a131 ]
2025-04-27 21:14:01 +01:00
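A minimal sketch of the ordering fix described in the SWDEV-526188 entry above: finish initializing the object first, and only then publish it where other threads can see it. `BinaryInfo`, `extract`, and `ModuleTable` are hypothetical stand-ins, not the real FatBinaryInfo or StatCO interfaces, and the single mutex is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for the per-device binary descriptor.
struct BinaryInfo {
  std::string image;
  bool extracted = false;
  // Placeholder for the real extraction step.
  void extract() {
    image = "code-object";
    extracted = true;
  }
};

// Hypothetical per-device table of lazily created descriptors.
class ModuleTable {
 public:
  explicit ModuleTable(std::size_t devices) : modules_(devices) {}

  std::shared_ptr<const BinaryInfo> get(std::size_t device) {
    std::lock_guard<std::mutex> lock(mutex_);
    auto& slot = modules_.at(device);
    if (!slot) {
      // Build and fully initialize a local object first...
      auto info = std::make_shared<BinaryInfo>();
      info->extract();
      // ...and only then store it in the shared slot, so no reader can
      // ever observe a descriptor that has not been extracted yet.
      slot = std::move(info);
    }
    return slot;
  }

 private:
  std::mutex mutex_;
  std::vector<std::shared_ptr<const BinaryInfo>> modules_;
};
```

A caller would construct `ModuleTable table(2)` and call `table.get(0)` from any thread. Every returned descriptor has `extracted == true`, and repeated calls for the same device return the same object.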
Kudchadker, Saleel
cd14def193
SWDEV-521647 - Fix tracking of hw_event ( #206 )
...
- When a command may have two packets (like the device heap
initializer) and there is no signal on the main kernel packet, the
tracking was broken because it marked the HW event of the command as
the first packet's signal.
- If no completion signal is attached to the second packet, clear the
HW event for the command.
[ROCm/clr commit: 072fb0804e ]
2025-04-25 08:46:44 -07:00
Kudchadker, Saleel
1b1d6b841e
SWDEV-510186 - Improve logging ( #220 )
...
- Print all arguments in logs; this is useful for debugging
[ROCm/clr commit: ce24936970 ]
2025-04-25 08:40:31 -07:00
Li, Todd tiantuo
8706df3726
SWDEV-511055 - fix HIP PAL memory allocation workaround for APU ( #40 )
...
[ROCm/clr commit: 95cdc83eaf ]
2025-04-24 15:07:16 -07:00