rocm-systems

Author	SHA1	Message	Date
Allen Hubbe	e2dcf99456	gda: fix getmem_nbi_wg source and dest (#311 ) A copy paste mistake in a previous commit caused source and dest to be reversed. Correct the source and dest params. Fixes: `6de67d5d7c` Signed-off-by: Allen Hubbe <allen.hubbe@amd.com>	2025-11-06 16:21:20 -06:00
Sourabh U Betigeri	90d5dc6b3a	SWDEV-564408 - Reduces hip-tests runtime Pt 1 (#1695 ) * SWDEV-564408 - Reduces hip-tests runtime Pt 1 * Update cmd_options.hh	2025-11-06 13:45:36 -08:00
Pratik Basyal	fdb557c88a	[Systems-Profiler] Officially unsupported OS removed (#1740 ) * Fedora and CentOS removed * David's feedback incorporated Co-authored-by: David Galiffi <David.Galiffi@amd.com> --------- Co-authored-by: David Galiffi <David.Galiffi@amd.com>	2025-11-06 16:06:28 -05:00
Kian Cossettini	f4d0aeb8f3	Adjust host thread count for OpenMP-VV tests (#1742 ) Reducing test time	2025-11-06 16:04:47 -05:00
alex-breslow-amd	bd614458c3	[gfx950] Turn On Single Node One Slice Optimization for gfx950 and MI300A (#2017 ) * Internal benchmarking shows nice single-node performance uplift for MI300A and MI350 [ROCm/rccl commit: `56e0b4e445`]	2025-11-06 12:12:45 -08:00
alex-breslow-amd	56e0b4e445	[gfx950] Turn On Single Node One Slice Optimization for gfx950 and MI300A (#2017 ) * Internal benchmarking shows nice single-node performance uplift for MI300A and MI350	2025-11-06 12:12:45 -08:00
MachineTom	3bb8c2ac50	SWDEV-564392 - Clean up image tests (#1694 ) Remove unnecessary checking. Enable all disabled tests. Move Mipmap test files into Windows section.	2025-11-06 15:07:53 -05:00
Yiltan	7348bac9bf	Memset queues (#313 ) [ROCm/rocshmem commit: `cd9b5ee806`]	2025-11-06 14:16:53 -05:00
Yiltan	cd9b5ee806	Memset queues (#313 )	2025-11-06 14:16:53 -05:00
Joseph Macaranas	524f62ae67	TheRock CI Workflow Updates 20251106 (#1743 ) - Update the pinned SHA for TheRock in CI workflows. - Update the version for actions in those same workflows. - Comment out the rm .patch line and provide details on its use.	2025-11-06 12:06:44 -05:00
Poag, Charis	d73726698b	[SWDEV-562295] Fix Dmesg errors when using CLI (#822 ) * Changes: - Modified attempting to open files to check permissions -> check read access only. Do not try to open all paths, may cause driver issues. Read access is sufficient to check permissions. Reason: GPUs which support partitioning (memory/compute), logical devices will not be valid until configured. See `sudo amd-smi set -h` or applicable APIs to configure on supported hardware. Example error dmesg output: [965358.883112] amdgpu 0000:15:00.0: amdgpu: renderD153 partition 1 not valid! [965358.883283] amdgpu 0000:15:00.0: amdgpu: renderD154 partition 2 not valid! [965358.883438] amdgpu 0000:15:00.0: amdgpu: renderD155 partition 3 not valid! [965358.883594] amdgpu 0000:15:00.0: amdgpu: renderD156 partition 4 not valid! [965358.883749] amdgpu 0000:15:00.0: amdgpu: renderD157 partition 5 not valid! [965358.883904] amdgpu 0000:15:00.0: amdgpu: renderD158 partition 6 not valid! [965358.884060] amdgpu 0000:15:00.0: amdgpu: renderD159 partition 7 not valid! --------- Signed-off-by: Charis Poag <Charis.Poag@amd.com>	2025-11-06 10:24:14 -06:00
Poag, Charis	ced0642b4b	[SWDEV-562295] Fix Dmesg errors when using CLI (#822 ) * Changes: - Modified attempting to open files to check permissions -> check read access only. Do not try to open all paths, may cause driver issues. Read access is sufficient to check permissions. Reason: GPUs which support partitioning (memory/compute), logical devices will not be valid until configured. See `sudo amd-smi set -h` or applicable APIs to configure on supported hardware. Example error dmesg output: [965358.883112] amdgpu 0000:15:00.0: amdgpu: renderD153 partition 1 not valid! [965358.883283] amdgpu 0000:15:00.0: amdgpu: renderD154 partition 2 not valid! [965358.883438] amdgpu 0000:15:00.0: amdgpu: renderD155 partition 3 not valid! [965358.883594] amdgpu 0000:15:00.0: amdgpu: renderD156 partition 4 not valid! [965358.883749] amdgpu 0000:15:00.0: amdgpu: renderD157 partition 5 not valid! [965358.883904] amdgpu 0000:15:00.0: amdgpu: renderD158 partition 6 not valid! [965358.884060] amdgpu 0000:15:00.0: amdgpu: renderD159 partition 7 not valid! --------- Signed-off-by: Charis Poag <Charis.Poag@amd.com> [ROCm/amdsmi commit: `d73726698b`]	2025-11-06 10:24:14 -06:00
jamessiddeley-amd	37bbb58a19	[rocprof-compute] fix unit regex 'ns' in analyze mode (#1689 ) * fix unit regex in analyze mode * ruff format	2025-11-06 11:13:10 -05:00
Galantsev, Dmitrii	8bdf951d32	Add numbers to .so because wheels dont allow symlinks (#820 ) Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>	2025-11-06 03:57:31 -06:00
Galantsev, Dmitrii	181659ea1f	Add numbers to .so because wheels dont allow symlinks (#820 ) Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com> [ROCm/amdsmi commit: `8bdf951d32`]	2025-11-06 03:57:31 -06:00
marandje	0ad05ed515	SWDEV-556947 - Parse the HIP version from the Git tag (#1135 )	2025-11-06 10:18:26 +01:00
Galantsev, Dmitrii	aac09912ec	Add downloaded gtest as fallback Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>	2025-11-06 01:26:40 -06:00
Galantsev, Dmitrii	4e8d89306e	Add downloaded gtest as fallback Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com> [ROCm/amdsmi commit: `aac09912ec`]	2025-11-06 01:26:40 -06:00
Satyanvesh Dittakavi	478cee0f68	SWDEV-559525 - Add the HIP_POINTER_ATTRIBUTE_IS_LEGACY_HIP_IPC_CAPABLE attribute support (#1647 ) * SWDEV-559525 - Add the HIP_POINTER_ATTRIBUTE_IS_LEGACY_HIP_IPC_CAPABLE attribute implementation * Update indentation in hip_memory.cpp	2025-11-06 12:07:32 +05:30
systems-assistant[bot]	27f85500f8	Update amdgpu-windows-interop with latest changes 20251105 (#1728 ) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-11-05 22:09:25 -05:00
habajpai-amd	ea31a0bf18	rocprofiler-sdk: fix per-record group_by_queue scoping (#1676 ) * rocprofiler-sdk: fix per-record group_by_queue scoping * added under resolved issues to CHANGELOG.md --------- Co-authored-by: David Galiffi <David.Galiffi@amd.com>	2025-11-05 21:46:44 -05:00
Aryan Salmanpour	f03eebf563	Update libva path configuration for TheRock (#670 ) [ROCm/rocdecode commit: `c389675f31`]	2025-11-05 15:13:05 -08:00
Aryan Salmanpour	c389675f31	Update libva path configuration for TheRock (#670 )	2025-11-05 15:13:05 -08:00
Aryan Salmanpour	b621dd081b	Update libva path configuration for TheRock (#199 ) [ROCm/rocjpeg commit: `20a125f845`]	2025-11-05 15:11:26 -08:00
Aryan Salmanpour	20a125f845	Update libva path configuration for TheRock (#199 )	2025-11-05 15:11:26 -08:00
Xie, AlexBin	c877be2afe	rocr: make sure the member variable is constructed (#1387 ) Signed-off-by: Alex Xie <AlexBin.Xie@amd.com>	2025-11-05 17:19:33 -05:00
alexxu-amd	a330fb6b91	fix latest docs doesn't get synchronized issue (#1714 )	2025-11-05 17:08:19 -05:00
Yiltan	bf19d70a29	Added ibv_wrapper which opens library using dlopen (#309 ) [ROCm/rocshmem commit: `110f9c8793`]	2025-11-05 16:12:44 -05:00
Yiltan	110f9c8793	Added ibv_wrapper which opens library using dlopen (#309 )	2025-11-05 16:12:44 -05:00
Joseph Macaranas	865a8d4d59	Revert "Update amdgpu-windows-interop with latest changes (#1718 )" (#1725 ) This reverts commit `321e497048`.	2025-11-05 15:38:23 -05:00
systems-assistant[bot]	321e497048	Update amdgpu-windows-interop with latest changes (#1718 ) Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-11-05 21:13:32 +01:00
lancesix	280cda3196	clr: SWDEV-547890 - Maintain an MQD for the emulated AQL queue (#1669 ) * clr: SWDEV-547890 - Maintain an MQD for the emulated AQL queue To simplify the shader debugger implementation, maintain the relevant parts of the emulated AQL queue's MQD (amd_queue_t): read_dispatch_id, write_dispatch_id, compute_tmpring_size. With this MQD, the shader debugger can handle the emulated AQL queue the same way it does the real AQL queue, no specialization is required. * clr: SWDEV-547890 - Conservatively update the MQD's read_dispatch_id The read_dispatch_id cannot be smaller than the current aql_packet_id - hsa_queue.size for the debugger to work correctly. The read_dispatch_id really should be updated when the CmdBuf is marked as complete. Left a FIXME to address it in a future commit. --------- Co-authored-by: Laurent Morichetti <laurent.morichetti@amd.com>	2025-11-05 17:39:33 +00:00
Allen Hubbe	e8a7371007	gda ionic: use all threads in wave operations (#295 ) Use all available threads for polling the cq to increase the maximum message rate. Even when posting a single wqe in the wave, use all available threads for polling the cq to reserve space in the sq. Changes were needed in the rocshmem abstraction to avoid disabling gpu threads, like taking turns or using only the first thread in a wave or wavefront. To avoid breaking other gda implementations, reimplement turn-based or single thread strategy in post_wqe_rma_turn and post_wqe_rma_single. Signed-off-by: Allen Hubbe <allen.hubbe@amd.com> [ROCm/rocshmem commit: `6de67d5d7c`]	2025-11-05 11:01:14 -06:00
Allen Hubbe	6de67d5d7c	gda ionic: use all threads in wave operations (#295 ) Use all available threads for polling the cq to increase the maximum message rate. Even when posting a single wqe in the wave, use all available threads for polling the cq to reserve space in the sq. Changes were needed in the rocshmem abstraction to avoid disabling gpu threads, like taking turns or using only the first thread in a wave or wavefront. To avoid breaking other gda implementations, reimplement turn-based or single thread strategy in post_wqe_rma_turn and post_wqe_rma_single. Signed-off-by: Allen Hubbe <allen.hubbe@amd.com>	2025-11-05 11:01:14 -06:00
Rakesh Roy	8797bb0150	Revert "SWDEV-562996 - Build fix: Ubertrace callback calling convention mismatch on x86 (#1587 )" (#1717 ) This reverts commit `8d31383dfe`. Reason for revert: It is breaking TheRock build on Windows	2025-11-05 11:48:02 -05:00
Apurv Mishra	eded1f3529	rocrtst: Add check for SVM support in Runtime (#1687 ) Signed-off-by: Apurv Mishra <Apurv.Mishra@amd.com> Approved-by: David Yat Sin <David.YatSin@amd.com>	2025-11-05 11:36:38 -05:00
Galantsev, Dmitrii	982737a852	Fix missing iomanip and cstdio in tests Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com>	2025-11-05 10:14:19 -06:00
Galantsev, Dmitrii	87ace88e72	Fix missing iomanip and cstdio in tests Signed-off-by: Galantsev, Dmitrii <dmitrii.galantsev@amd.com> [ROCm/amdsmi commit: `982737a852`]	2025-11-05 10:14:19 -06:00
Aurelien Bouteiller	51cf7c6c05	python venv madness round 2: use ensurepip if installed (#308 ) When creating a python venv during the install_dependencies script, we try to use ensurepip if it is installed, as it deals better with cases where multiple venvs are active simultaneously. (as seen in CI buildbot) [ROCm/rocshmem commit: `b7a6d86c6b`]	2025-11-05 10:52:22 -05:00
Aurelien Bouteiller	b7a6d86c6b	python venv madness round 2: use ensurepip if installed (#308 ) When creating a python venv during the install_dependencies script, we try to use ensurepip if it is installed, as it deals better with cases where multiple venvs are active simultaneously. (as seen in CI buildbot)	2025-11-05 10:52:22 -05:00
Aurelien Bouteiller	76e8750d88	Add backend type query method, use it to disable 32bit amo testers on gda (#307 ) * Add backend type query method, use it to disable 32bit amo testers on gda * The infrateam testers work [ROCm/rocshmem commit: `8c175315f2`]	2025-11-05 10:24:07 -05:00
Aurelien Bouteiller	8c175315f2	Add backend type query method, use it to disable 32bit amo testers on gda (#307 ) * Add backend type query method, use it to disable 32bit amo testers on gda * The infrateam testers work	2025-11-05 10:24:07 -05:00
MachineTom	3b1c0c3464	SWDEV-558845 - Support image in rocr on Windows (#1582 ) Enable image build in Windows. Remove some useless codes that fail building in Windows. Some minor improvement. Temporarily exclude mipmap test files. Prevent negative tests affect some tests. Move some catch info log codes into failed cases.	2025-11-05 09:33:41 -05:00
yangsu13	3c17717624	librocdxg: Remove std::call_once to fix repeated libdxcore.so loading failures The use of std::call_once caused the initialization flag to be set permanently, preventing proper re-attempts to load libdxcore.so when needed. This change removes the once_flag mechanism and relies solely on dxcore_handle_ checks to manage library loading, allowing proper re-initialization attempts. Signed-off-by: yangsu13 <Yang.Su2@amd.com> Reviewed-by: Flora Cui <flora.cui@amd.com>	2025-11-05 18:53:37 +08:00
Flora Cui	1bc5af684c	update hsa header Signed-off-by: Flora Cui <flora.cui@amd.com>	2025-11-05 18:53:37 +08:00
Flora Cui	a32d71124f	librocdxg: implement stub hsaKmtAisReadWriteFile Signed-off-by: Flora Cui <flora.cui@amd.com>	2025-11-05 18:53:37 +08:00
Chengjun Yao	b5dd613ccd	librocdxg: Integrate DXCore loader into WDDM thunks Replace direct D3DKMT API calls with DXCORE_CALL macro in WDDM thunk layer. This enables dynamic loading of DXCore functions while maintaining the same API interface. Updated thunk functions: - MapGpuVirtualAddress, CreateAllocation, DestroyAllocation - ReserveGpuVirtualAddress, FreeGpuVirtualAddress - MakeResident, Evict, ShareObjects - QueryResourceInfoFromNtHandle, OpenResourceFromNtHandle All existing functionality is preserved while adding flexibility for runtime DXCore availability detection. Signed-off-by: Chengjun Yao <Chengjun.Yao@amd.com> Signed-off-by: Yang Su <Yang.Su2@amd.com> Reviewed-by: Shi.Leslie <Yuliang.Shi@amd.com>	2025-11-05 18:53:37 +08:00
Chengjun Yao	15ff43c4ce	librocdxg: Integrate DXCore loader into WDDM thunks Replace direct D3DKMT API calls with DXCORE_CALL macro in WDDM thunk layer. This enables dynamic loading of DXCore functions while maintaining the same API interface. Updated thunk functions: - MapGpuVirtualAddress, CreateAllocation, DestroyAllocation - ReserveGpuVirtualAddress, FreeGpuVirtualAddress - MakeResident, Evict, ShareObjects - QueryResourceInfoFromNtHandle, OpenResourceFromNtHandle All existing functionality is preserved while adding flexibility for runtime DXCore availability detection. Signed-off-by: Chengjun Yao <Chengjun.Yao@amd.com> Signed-off-by: Yang Su <Yang.Su2@amd.com> Reviewed-by: Shi.Leslie <Yuliang.Shi@amd.com>	2025-11-05 18:53:37 +08:00
Flora Cui	bf818a2e75	librocdxg: update rocr queue type to amd_queue_v2_t Signed-off-by: Flora Cui <flora.cui@amd.com>	2025-11-05 18:53:37 +08:00
Chengjun Yao	c8fbb93f03	librocdxg: Remove static DXCore library dependency Remove static linking to libdxcore library from CMakeLists.txt. This prepares for dynamic loading implementation and eliminates hard dependency on DXCore being present at build time. The DXCore functionality will be loaded dynamically at runtime in subsequent patches, making the library more flexible for different deployment scenarios. Signed-off-by: Chengjun Yao <Chengjun.Yao@amd.com> Signed-off-by: Yang Su <Yang.Su2@amd.com> Signed-off-by: Flora Cui <flora.cui@amd.com> Reviewed-by: Shi.Leslie <Yuliang.Shi@amd.com>	2025-11-05 18:53:37 +08:00


76333 Commits