Patel, Jaydeepkumar
6858b0fca1
SWDEV-521135 - Make common way to set/parse UUID bytes from PAL props. ( #63 )
2025-05-12 17:00:30 +05:30
Six, Lancelot
c35e9643ec
SWDEV-517078: Fix gfx11 trap handler ( #212 )
...
Fix incorrect edits done when porting the 2nd level trap handler from
the hsa-runtime.
Change-Id: I7bc5160be47b8f669efe05c4d194bc3c47fc0661
2025-05-11 01:12:28 +01:00
Xie, AlexBin
faac50c77a
SWDEV-528860 - reserve some memory in visible frame buffer ( #251 )
2025-05-09 20:08:23 -04:00
Huang, AnZhong
b434fbe2bd
SWDEV-527299 - Support HIP_POINTER_ATTRIBUTE_CONTEXT ( #180 )
...
* SWDEV-527299 - Support HIP_POINTER_ATTRIBUTE_CONTEXT
As HIP enables UVA by default, it seems we can simply expose the context to support this feature.
2025-05-09 17:34:16 +08:00
Chaudhary, Jatin Jaikishan
2f73e1385b
SWDEV-525933 - add constexpr operators for fp16/bf16 ( #199 )
2025-05-09 09:53:58 +01:00
Xie, Jiabao(Jimbo)
a320a3f214
SWDEV-528913 - support gfx950 in rocsetting ( #217 )
...
* SWDEV-528913 - support gfx950 in rocsetting
---------
Co-authored-by: Jimbo Xie <jiabaxie@amd.com>
2025-05-07 15:44:49 -04:00
Lambert, Jacob
6b12154583
SWDEV-518221 - Don't link against libamd_comgr.so at runtime
...
Convention is to always link against .so.* at runtime.
Having it link against .so will break on systems that package
the .so files in their dev/devel package.
This issue was found when building ROCm 6.4 for Fedora.
Committing on behalf of GitHub user Mystro256
2025-05-07 11:56:41 -07:00
Zhang, Victor
f960433dcd
SWDEV-528142 - add error check for KernelParameters::capture ( #276 )
...
* SWDEV-528142 - add error check for KernelParameters::capture
* Update kernel.cpp
---------
Co-authored-by: victzhan <victzhan@amd.com>
2025-05-07 09:52:09 -04:00
Jayaprakash, Karthik
fa55557f46
SWDEV-493805 - Cleaning up launch parameters arguments. ( #241 )
2025-05-06 15:06:13 -04:00
Dittakavi, Satyanvesh
607f8f26fd
SWDEV-529831 - Return error if the program is empty ( #257 )
2025-05-06 15:12:12 +05:30
Chaudhary, Jatin Jaikishan
a71c6eb1a0
SWDEV-529854 - __hmax/__hmin should handle NaNs ( #246 )
2025-05-06 09:42:15 +01:00
Chaudhary, Jatin Jaikishan
b1ebf33850
SWDEV-529927 - add missing operations for fp16/bf16 ( #238 )
2025-05-06 09:41:21 +01:00
Andryeyev, German
65a0181a7c
SWDEV-528808 - Release all HW queues even if only one is idle ( #240 )
...
PyTorch may not explicitly idle each queue, so some queues can be considered busy
while actually being idle.
2025-05-05 19:09:01 -04:00
Guan, Zichuan
3775298655
Disable HIP_PLATFORM auto-detect if already defined ( #254 )
...
Co-authored-by: Stella Laurenzo <stellaraccident@gmail.com>
2025-05-05 15:37:53 -04:00
Arsenault, Matthew
1db9a7d48b
SWDEV-1 - Stop using ocml rounding functions ( #228 )
...
Directly use the builtins. Use the elementwise versions since there's
no implied errno, regardless of -f[no]-math-errno.
I didn't change the cases that cast unnecessarily. The bfloat and vector
cases should work directly.
2025-05-05 19:35:12 +02:00
Andryeyev, German
9b018165ce
SWDEV-528808 - Disable dynamic queue by default ( #256 )
...
Dynamic queue management will be disabled by default and
the original sort logic is restored
2025-05-05 10:56:35 -04:00
Searles, Mark
cd9bc61559
Fix typos in warning msgs ( #231 )
2025-05-02 14:31:42 -07:00
Chaudhary, Jatin Jaikishan
12febe6782
SWDEV-514560 - add fp6 header implementation ( #54 )
...
Co-authored-by: rahul manocha <rmanocha_amdeng>
2025-05-01 15:17:38 +01:00
Assiouras, Ioannis
9d6a0d1a4d
SWDEV-521011 - Fix alignment in PalResource::CreateSvm
2025-05-01 02:22:49 +01:00
Andryeyev, German
84a4f293f4
SWDEV-526836 - add PipelineStageBlt flag ( #229 )
...
CP sync requires PipelineStageBlt flag.
2025-04-30 14:27:41 -04:00
Assiouras, Ioannis
d3fb8eda8b
SWDEV-525593, SWDEV-527293 - Acquire active queue after xferQueue is created ( #165 )
...
For xferQueue VirtualGPU::create is called after ProfilingBegin
so the active queue needs to be acquired.
2025-04-30 09:21:11 +01:00
Godavarthy Surya, Anusha
2538d7f02b
SWDEV-522841 - Graph nodes must be created/launched on device where they are captured/created ( #108 )
2025-04-29 22:20:39 +05:30
Jiang, Julia
eb62fe9f62
SWDEV-522634 - Fix device properties in hipInfo ( #203 )
2025-04-29 11:29:47 -04:00
Sang, Tao
96cadbc9e9
SWDEV-520352 - Remove HostThread and legacy monitor ( #230 )
...
* SWDEV-520352 - Remove HostThread and legacy monitor
Remove HostThread, semaphore and legacy monitor.
Make the original thread and command queue logic stricter.
Add more comments to make the logic clearer.
Some other minor improvements.
Also part of SWDEV-458943.
2025-04-29 09:55:24 -04:00
GunaShekar, Ajay
99ef573399
SWDEV-523028 - print PAL failure return values in logs ( #81 )
...
* print PAL failure return values in logs
* dump kernel info in case of PAL failure
2025-04-29 11:23:43 +05:30
Jayaprakash, Karthik
6811fd90b8
SWDEV-522707 - Set phys_mem_handle type to sizeof(size_t) to avoid blocking address range. ( #105 )
2025-04-29 11:19:16 +05:30
Jayaprakash, Karthik
b2388dfb88
SWDEV-506467 - Skip Abort in case of crash from the device. ( #60 )
...
Change-Id: I964b2f2647d068202e9c38fcddb1337da754df8d
2025-04-29 11:19:02 +05:30
Betigeri, Sourabh
9cf3f1e461
SWDEV-528351 - Removes unused code and asserts to improve coverage ( #219 )
2025-04-28 14:40:35 -07:00
Critchley, Paul
7e9d5eab7c
SWDEV-523611 - [Tools][OCL] OpenCL fails to capture with PalTrace ( #198 )
...
Finalize DevDriver initialization after device creation
2025-04-28 08:02:34 -07:00
Godavarthy Surya, Anusha
fb92683d86
SWDEV-469422 - Avoid use of hipStream_t in internal methods ( #69 )
...
Change-Id: Ifd5362f371c846a88241927383cb95cf046548ef
2025-04-28 15:09:11 +05:30
Godavarthy Surya, Anusha
bbcb1f9c70
SWDEV-469422 - hipGraphNodeDOTAttribute change std::string members to const char* ( #70 )
...
Compiler creates global variables for every unique string
Change-Id: I4cf8dd3e763d16740096e345da67a7ef72f61515
2025-04-28 14:57:36 +05:30
Assiouras, Ioannis
1099e0a131
SWDEV-526188 - Fix race condition in StatCO::getStatFunc()
...
Make sure that a newly created FatBinaryInfo is assigned to modules only after extractFatBinary has been called for the object.
2025-04-27 21:14:01 +01:00
Kudchadker, Saleel
072fb0804e
SWDEV-521647 - Fix tracking of hw_event ( #206 )
...
- When a command may have two packets (like the device heap
initializer) and there is no signal on the main kernel packet, the
tracking was broken because it marked the first packet's signal as the
HW event of the command.
- If no completion signal is attached to the second packet,
clear the HW event for the command.
2025-04-25 08:46:44 -07:00
Kudchadker, Saleel
ce24936970
SWDEV-510186 - Improve logging ( #220 )
...
- Print all arguments in logs; this is useful for debugging
2025-04-25 08:40:31 -07:00
Li, Todd tiantuo
95cdc83eaf
SWDEV-511055 - fix HIP PAL memory allocation workaround for APU ( #40 )
2025-04-24 15:07:16 -07:00
Sang, Tao
1113eff3f9
SWDEV-493275 - Support scratch limit ( #20 )
...
Support programmatic query and change of scratch limit on
AMD devices.
Change-Id: Id5da355a77366f97868e462847f3916e87fd2af6
2025-04-24 17:15:25 -04:00
Critchley, Paul
4f2a4b12a9
SWDEV-527731 - [Ubertrace] OpenCL driver reports wrong Instrumentation API Version ( #211 )
2025-04-24 14:06:17 -07:00
Godavarthy Surya, Anusha
e5ce544c45
SWDEV-469423 - hipStreamEndCapture graph* can be nullptr ( #170 )
2025-04-24 13:57:09 +05:30
Hila, Nino
38d48c9a7d
Add palamida.yml ( #215 )
2025-04-23 13:15:09 -07:00
Sang, Tao
27aad09bd4
SWDEV-518831 - fix streams' sync issue in mthreads ( #123 )
...
* SWDEV-518831 - fix streams' sync issue in mthreads
1. Fix sync issue between the null stream and non-null streams in
multithreaded use.
2. Remove assert(GetSubmissionBatch() == nullptr) as it
is invalid in multithreaded use.
3. Update getActiveQueues() to deal with the state of
being terminated.
2025-04-23 15:08:07 -04:00
Sang, Tao
78f92901d8
SWDEV-516050 - Fix monitor hang in OCL ( #75 )
...
Fix monitor hang in cts integer_ops.
Improve notify().
Won't affect notifyAll() or HIP in direct
dispatch mode.
Change-Id: I95a458358e1cab9c76aefde117db09cdbd1fd3af
2025-04-23 14:34:53 -04:00
Xie, Jiabao(Jimbo)
9a8c9e70b2
SWDEV-441487 - add gfx1150/1 support to amd-staging clr ( #182 )
...
Co-authored-by: Jimbo Xie <jiabaxie@amd.com>
2025-04-23 20:43:03 +05:30
GunaShekar, Ajay
64d6f5714a
SWDEV-523281 - CHANGELOG.md and negative test return values : hipLaunchKernelEx, hipLaunchKernelExC, hipDrvLaunchKernelEx ( #155 )
2025-04-22 21:47:37 +05:30
Andryeyev, German
a5c860f3b0
SWDEV-497841 - Enable memory manager by default ( #149 )
2025-04-22 21:20:37 +05:30
Andryeyev, German
a3effa16f1
SWDEV-523300 - Add the new option to build HIP ( #179 )
...
Add a new CMake option, AMD_COMPUTE_WIN, to build HIP on Windows
from the public GitHub repo. AMD_COMPUTE_WIN should point to a special
repo with the PAL static libs
2025-04-22 21:05:04 +05:30
Hernandez, Gerardo
1a8d766836
SWDEV-420237 - Fix reduce sync operations when masks are divergent ( #181 )
...
Do not use __ockl_activelane_u32() to calculate the index of the lane within the mask, as that would not work with divergent masks that have other bits on before the associated lane.
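A minimal host-side sketch of the idea, not the code from this change: a lane's index within a mask can be computed as the number of mask bits set below that lane, which does not depend on which lanes happen to be active. The names popcount64 and laneRankInMask are hypothetical, and a 64-bit mask with lane < 64 is assumed.

```cpp
#include <cstdint>

// Count the set bits of a 64-bit value (portable, no compiler builtins).
constexpr unsigned popcount64(std::uint64_t v) {
  unsigned n = 0;
  for (; v != 0; v &= v - 1) ++n;  // each step clears the lowest set bit
  return n;
}

// Hypothetical helper: rank of `lane` among the lanes selected by `mask`,
// i.e. the number of mask bits set below `lane`. Assumes lane < 64.
constexpr unsigned laneRankInMask(std::uint64_t mask, unsigned lane) {
  const std::uint64_t below = (std::uint64_t{1} << lane) - 1;  // bits [0, lane)
  return popcount64(mask & below);
}

// Mask 0xF3 selects lanes 0, 1, 4, 5, 6, 7. Lane 4 is the third lane in the
// mask (rank 2), even if lanes 0 and 1 are not executing alongside it.
static_assert(laneRankInMask(0xF3, 4) == 2, "two mask bits are set below lane 4");
```

In the example, a count taken over only the lanes currently executing (say lanes 4 to 7) would give 0 for lane 4, which is the kind of mismatch the message describes for divergent masks.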
2025-04-22 19:47:58 +05:30
Godavarthy Surya, Anusha
bf28bbd9ab
SWDEV-508538 - Optimize mem access and pack structure ( #71 )
...
Change-Id: Ib05b8891a6d228fc3266918a000d332fddc7438b
2025-04-21 13:43:25 +05:30
Brzak, Branislav
99142c3dd9
SWDEV-526612 - Add missing copyright notices ( #201 )
2025-04-18 20:54:27 +05:30
Ramirez, Lucas
d020598a0f
SWDEV-524612 - Consider "1" a truthy value for WGPMode ( #187 )
...
The compiler currently serializes the workgroup_processor_mode COMGR metadata boolean field as "0"/"1" instead of "false"/"true". Consider "1" a truthy value during parsing.
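A minimal sketch of the parsing rule described above, assuming the field arrives as a string. The helper name parseMetadataBool is hypothetical and is not the function used in the runtime; treating every other spelling as false is also an assumption.

```cpp
#include <string_view>

// Hypothetical helper: interpret a serialized boolean metadata field.
// Accepts both "true" and "1" as truthy; anything else (including
// "false" and "0") is treated as false in this sketch.
constexpr bool parseMetadataBool(std::string_view text) {
  return text == "true" || text == "1";
}

static_assert(parseMetadataBool("1"), "\"1\" is accepted as truthy");
static_assert(!parseMetadataBool("0"), "\"0\" is treated as false");
```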
2025-04-17 11:50:07 +02:00
Brzak, Branislav
d00b2a0953
SWDEV-525423 - In COMGR Loader don't open file if image is already mapped ( #193 )
2025-04-16 11:00:54 +02:00