rocm-systems

Author	SHA1	Message	Date
Bertan Dogancay	bed7cdf863	[GEN/BUILD] Refactor generate.py and reduce build time for older archs (#2006 )	2025-10-30 11:45:53 -04:00
Dmitrii	e0ec72ccdd	[rdc] Bump rocprofiler-sdk requirement to 1.1.0 (#1610 ) Fixes RDC builds broken by #1563	2025-10-30 10:06:45 -04:00
marandje	cfbb2230ea	SWDEV-491296 - Fix Unit_hipMemImportFromShareableHandle_Capture (#1564 )	2025-10-30 15:06:26 +01:00
Nilesh M Negi	03d37f6305	Fix gfx950 gating conditions to match ROCm 7.0.2 (#2003 ) [ROCm/rccl commit: `8444b3c6e9`]	2025-10-29 23:27:04 -05:00
Nilesh M Negi	8444b3c6e9	Fix gfx950 gating conditions to match ROCm 7.0.2 (#2003 )	2025-10-29 23:27:04 -05:00
Mustafa Abduljabbar	eb0b1387b7	[Device] Adjust threadblock size for gfx950 to increase LL64/Simple performance for AR, RS and AG (#1978 ) * Add initial commit to increase tb size to 512 * Fix LL perf issue when subset of NCCL_MAX_NTHREADS is used Adding a constant to barrier_generic logic from using fallback logic when nthreads < NCCL_MAX_NTHREADS and nthreads == blockDim.X * Adjust nthreads for LL * Opt threads for reduce_scatter upper small range * Add macro for single node * Restrict MSCCL to 256 threads to prevent mem access fault * Support pre-MI350 compatibility * Partially refactor threadblock size override * Use const macros instead of numerals * opt out of unused function [ROCm/rccl commit: `12f51ba8bf`]	2025-10-29 23:24:32 -05:00
Mustafa Abduljabbar	12f51ba8bf	[Device] Adjust threadblock size for gfx950 to increase LL64/Simple performance for AR, RS and AG (#1978 ) * Add initial commit to increase tb size to 512 * Fix LL perf issue when subset of NCCL_MAX_NTHREADS is used Adding a constant to barrier_generic logic from using fallback logic when nthreads < NCCL_MAX_NTHREADS and nthreads == blockDim.X * Adjust nthreads for LL * Opt threads for reduce_scatter upper small range * Add macro for single node * Restrict MSCCL to 256 threads to prevent mem access fault * Support pre-MI350 compatibility * Partially refactor threadblock size override * Use const macros instead of numerals * opt out of unused function	2025-10-29 23:24:32 -05:00
Charis Poag	0a5fdc944f	[SWDEV-560847] Fix Vram type not showing newer types * Changes: - Allows `amd-smi static --vram` (`amdsmi_get_gpu_vram_info()`) to read the following types: DDR5, LPDDR4, LPDDR5, and HBM3E. Change-Id: I1eddf9dcb574e1868541cc5063ae95cb6d6e1c59 Signed-off-by: Charis Poag <Charis.Poag@amd.com>	2025-10-29 16:13:42 -05:00
Charis Poag	4df843f110	[SWDEV-560847] Fix Vram type not showing newer types * Changes: - Allows `amd-smi static --vram` (`amdsmi_get_gpu_vram_info()`) to read the following types: DDR5, LPDDR4, LPDDR5, and HBM3E. Change-Id: I1eddf9dcb574e1868541cc5063ae95cb6d6e1c59 Signed-off-by: Charis Poag <Charis.Poag@amd.com> [ROCm/amdsmi commit: `0a5fdc944f`]	2025-10-29 16:13:42 -05:00
Allan Xavier	51971426bd	Allowed GPU enumeration to continue with non-contiguous render nodes Signed-off-by: gabrpham_amdeng <Gabriel.Pham@amd.com>	2025-10-29 15:31:56 -05:00
Allan Xavier	9b4a9acd27	Allowed GPU enumeration to continue with non-contiguous render nodes Signed-off-by: gabrpham_amdeng <Gabriel.Pham@amd.com> [ROCm/amdsmi commit: `51971426bd`]	2025-10-29 15:31:56 -05:00
Bindhiya Kanangot Balakrishnan	8dd4a4997b	[SWDEV-563281] Add json and csv output for xgmi status Added json and csv output format support for newly added xgmi link_status. Aligned legend. Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com>	2025-10-29 15:25:15 -05:00
Bindhiya Kanangot Balakrishnan	d5691b7ed9	[SWDEV-563281] Add json and csv output for xgmi status Added json and csv output format support for newly added xgmi link_status. Aligned legend. Signed-off-by: Bindhiya Kanangot Balakrishnan <Bindhiya.KanangotBalakrishnan@amd.com> [ROCm/amdsmi commit: `8dd4a4997b`]	2025-10-29 15:25:15 -05:00
gilbertlee-amd	555a5f1892	Fixing install script hip_compiler bug and improving logging on fallback (#156 ) * Fixing install script hip_compiler bug and improving logging on fallback [ROCm/rccl-tests commit: `6405c76e68`]	2025-10-29 10:57:56 -06:00
gilbertlee-amd	6405c76e68	Fixing install script hip_compiler bug and improving logging on fallback (#156 ) * Fixing install script hip_compiler bug and improving logging on fallback	2025-10-29 10:57:56 -06:00
Bertan Dogancay	4c7afea115	[Tools/Replayer] Fix prohibited calls during capture mode (#1938 ) [ROCm/rccl commit: `b703ffdfa4`]	2025-10-29 12:19:32 -04:00
Bertan Dogancay	b703ffdfa4	[Tools/Replayer] Fix prohibited calls during capture mode (#1938 )	2025-10-29 12:19:32 -04:00
cadolphe-amd	458c25c3a0	SWDEV-556658 - Update Unit_TexObjectCreate_TypePitch2D_IncompleteInit to align with API (#1144 )	2025-10-29 11:36:45 -04:00
xuchen-amd	b774f28181	[rocprofiler-compute] Remove grafana and mongodb integration (#978 ) * Remove grafana and mongodb integration * Remove grafana documentation assets * clarify changelog --------- Co-authored-by: Vignesh Edithal <Vignesh.Edithal@amd.com>	2025-10-29 11:32:06 -04:00
dsicarov-amd	4915496bf9	SWDEV-533237 Add hipOccupancyAvailableDynamicSMemPerBlock API (#899 ) * SWDEV-533237 Add initial support for hipOccupancyAvailableDynamicSMemPerBlock API * SWDEV-533237 Add hipOccupancyAvailableDynamicSMemPerBlock wrapper for nvidia * SWDEV-533237 Add implementation of hipOccupancyAvailableDynamicSMemPerBlock API * SWDEV-533237 Add LDSAlignment field in Isa table --------- Co-authored-by: Rahul Manocha <rmanocha@amd.com>	2025-10-29 10:58:42 +01:00
Istvan Kiss	197f73dac9	Sync HIP documentation 2025-10-20 (#1258 ) * Add examples to tools folder * Correct P2P memory access section * Sync porting guide * Add HIP Graph tutorial * Add hint about using amdgpu-dkms for IPC API * Add a few more env variables	2025-10-29 07:42:06 +01:00
Geo Min	8e98b80deb	[TheRock CI] Fixing patches for rocm-systems (#1460 ) * Fixing patches for rocm-systems * Adding all * Adding remaining projects * Submodule bump * adding compiler * adding test commit hash * Adding artifact group * adding update for artifact group * Adding new commit hash	2025-10-28 19:47:17 -07:00
Pham, Gabriel	9e3537d778	Added set --pcie command and added more pcie info to static --bus output (#481 ) * Added amd-smi set --pcie command * Removed current pcie level due to it not being static * Added pcie information to static --bus --------- Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com> Signed-off-by: Maisam Arif <Maisam.Arif@amd.com>	2025-10-28 14:55:55 -05:00
Pham, Gabriel	87b2fd73b8	Added set --pcie command and added more pcie info to static --bus output (#481 ) * Added amd-smi set --pcie command * Removed current pcie level due to it not being static * Added pcie information to static --bus --------- Signed-off-by: Pham, Gabriel <Gabriel.Pham@amd.com> Signed-off-by: Maisam Arif <Maisam.Arif@amd.com> [ROCm/amdsmi commit: `9e3537d778`]	2025-10-28 14:55:55 -05:00
Pryor, Adam	2144cfbba4	[SWDEV-357472] Add evicted_ms metric (#620 ) - Added evicted_time metric for kfd processes. - Time that queues are evicted on a GPU in milliseconds - Added to CLI in `amd-smi monitor -q` and `amd-smi process` - Added to C API and Python API: - amdsmi_get_gpu_process_list() - amdsmi_get_gpu_compute_process_info() - amdsmi_get_gpu_compute_process_info_by_pid() --------- Signed-off-by: Pryor, Adam <Adam.Pryor@amd.com>	2025-10-28 14:49:03 -05:00
Pryor, Adam	354886f4ff	[SWDEV-357472] Add evicted_ms metric (#620 ) - Added evicted_time metric for kfd processes. - Time that queues are evicted on a GPU in milliseconds - Added to CLI in `amd-smi monitor -q` and `amd-smi process` - Added to C API and Python API: - amdsmi_get_gpu_process_list() - amdsmi_get_gpu_compute_process_info() - amdsmi_get_gpu_compute_process_info_by_pid() --------- Signed-off-by: Pryor, Adam <Adam.Pryor@amd.com> [ROCm/amdsmi commit: `2144cfbba4`]	2025-10-28 14:49:03 -05:00
corey-derochie-amd	c5cdee4fa5	Updated Changelog with 7.1.1 and 7.2.0 stub sections (#2008 ) * Missing ROCm 7.0 & 7.1.0 Changelog entries (#1976) * Update CHANGELOG.md * Update CHANGELOG.md * Apply suggestions from code review Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md --------- Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> * Added ROCm 7.2.0 section. * Update CHANGELOG.md * Apply suggestion from @corey-derochie-amd --------- Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> [ROCm/rccl commit: `561ad2fe05`]	2025-10-28 13:41:22 -06:00
corey-derochie-amd	561ad2fe05	Updated Changelog with 7.1.1 and 7.2.0 stub sections (#2008 ) * Missing ROCm 7.0 & 7.1.0 Changelog entries (#1976) * Update CHANGELOG.md * Update CHANGELOG.md * Apply suggestions from code review Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> * Update CHANGELOG.md * Update CHANGELOG.md * Update CHANGELOG.md --------- Co-authored-by: Jeffrey Novotny <jnovotny@amd.com> * Added ROCm 7.2.0 section. * Update CHANGELOG.md * Apply suggestion from @corey-derochie-amd --------- Co-authored-by: Jeffrey Novotny <jnovotny@amd.com>	2025-10-28 13:41:22 -06:00
Ajay GunaShekar	22213c0ec3	SWDEV-559569 - enable fixed tests (#1363 )	2025-10-28 12:17:15 -07:00
Atul Kulkarni	f2287e8f97	Removed RCCL_EXPOSE_STATIC duplicate definition. (#1988 ) [ROCm/rccl commit: `cc867dbaf2`]	2025-10-28 13:01:48 -05:00
Atul Kulkarni	cc867dbaf2	Removed RCCL_EXPOSE_STATIC duplicate definition. (#1988 )	2025-10-28 13:01:48 -05:00
Atul Kulkarni	884138205d	Added ROCM_VERSION restriction to alloc unit tests (#1989 ) [ROCm/rccl commit: `26dc7abb32`]	2025-10-28 12:54:34 -05:00
Atul Kulkarni	26dc7abb32	Added ROCM_VERSION restriction to alloc unit tests (#1989 )	2025-10-28 12:54:34 -05:00
Edgar Gabriel	0ad710e537	minor change to MPI detection logic (#294 ) somehow the test whether we requested MPI support or not stopped working, although no obvious code change can be located. Make the if-statement more stringent by explicitly testing whether USE_MPI_SUPPORT is "ON". [ROCm/rocshmem commit: `c0285ac0ce`]	2025-10-28 12:54:26 -05:00
Edgar Gabriel	c0285ac0ce	minor change to MPI detection logic (#294 ) somehow the test whether we requested MPI support or not stopped working, although no obvious code change can be located. Make the if-statement more stringent by explicitly testing whether USE_MPI_SUPPORT is "ON".	2025-10-28 12:54:26 -05:00
alex-breslow-amd	f7405b8739	Remove nontemporality from stores, put in casts to global address space (#1982 ) * Implements casting key loads and stores to address_space(1) so that vector global load and store instructions are emitted by the compiler instead of more costly flat loads and stores * Removes nontemporality from some key stores for gfx950. [ROCm/rccl commit: `e69b11eba5`]	2025-10-28 10:34:48 -07:00
alex-breslow-amd	e69b11eba5	Remove nontemporality from stores, put in casts to global address space (#1982 ) * Implements casting key loads and stores to address_space(1) so that vector global load and store instructions are emitted by the compiler instead of more costly flat loads and stores * Removes nontemporality from some key stores for gfx950.	2025-10-28 10:34:48 -07:00
David Galiffi	3d7a5eec0e	Setup `rocprofsys_root` environment variable (#1561 ) * Setup `rocprofsys_root` environment variable * Update `CHANGELOGS` * Fixed formatting * Add rocpd output and validation to python tests * Refactoring environment setup	2025-10-28 13:06:07 -04:00
Venkateshwar Reddy Kandula	c5bd693478	[rocprofiler-sdk] Disable HIP/CLR build in rocprofiler-sdk CI jobs (#1574 ) * disable HIP/CLR build * misc. fix	2025-10-28 11:42:11 -05:00
dependabot[bot]	c19037e946	Docs - Bump rocm-docs-core[api_reference] from 1.26.0 to 1.27.0 in /docs/sphinx (#198 ) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.26.0 to 1.27.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.26.0...v1.27.0) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-version: 1.27.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> [ROCm/rocjpeg commit: `f0d55cf80d`]	2025-10-28 09:33:41 -07:00
dependabot[bot]	f0d55cf80d	Docs - Bump rocm-docs-core[api_reference] from 1.26.0 to 1.27.0 in /docs/sphinx (#198 ) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.26.0 to 1.27.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.26.0...v1.27.0) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-version: 1.27.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-28 09:33:41 -07:00
Gopesh Bhardwaj	2be2945228	Version bump and CHANGELOG update for 7.1 (#1563 )	2025-10-28 11:53:32 -04:00
Swati Rawat	f0f008d494	Update using-rocprofv3-process-attachment.rst (#1534 )	2025-10-28 11:52:23 -04:00
akolliasAMD	6f6719dbab	renamed memcpy to memcpy_lane (#296 ) [ROCm/rocshmem commit: `87d87cc881`]	2025-10-28 09:33:13 -06:00
akolliasAMD	87d87cc881	renamed memcpy to memcpy_lane (#296 )	2025-10-28 09:33:13 -06:00
ywang103-amd	99183ffd92	fix failure of pc sampling and unit tests (#1526 )	2025-10-28 11:30:32 -04:00
corey-derochie-amd	44160d34a4	Updated CODEOWNERS to instead use RCCL-Reviewers team (#2010 ) * Updated CODEOWNERS to instead use RCCL-Reviewers team * Apply suggestion from @nileshnegi Co-authored-by: Nilesh M Negi <Nilesh.Negi@amd.com> --------- Co-authored-by: Nilesh M Negi <Nilesh.Negi@amd.com> [ROCm/rccl commit: `f290e302d3`]	2025-10-28 09:27:26 -06:00
corey-derochie-amd	f290e302d3	Updated CODEOWNERS to instead use RCCL-Reviewers team (#2010 ) * Updated CODEOWNERS to instead use RCCL-Reviewers team * Apply suggestion from @nileshnegi Co-authored-by: Nilesh M Negi <Nilesh.Negi@amd.com> --------- Co-authored-by: Nilesh M Negi <Nilesh.Negi@amd.com>	2025-10-28 09:27:26 -06:00
dependabot[bot]	6f222c11a6	Bump rocm-docs-core[api_reference] from 1.26.0 to 1.27.0 in /docs/sphinx (#790 ) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.26.0 to 1.27.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.26.0...v1.27.0) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-version: 1.27.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-28 09:59:49 -05:00
dependabot[bot]	f36affe4d5	Bump rocm-docs-core[api_reference] from 1.26.0 to 1.27.0 in /docs/sphinx (#790 ) Bumps [rocm-docs-core[api_reference]](https://github.com/ROCm/rocm-docs-core) from 1.26.0 to 1.27.0. - [Release notes](https://github.com/ROCm/rocm-docs-core/releases) - [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md) - [Commits](https://github.com/ROCm/rocm-docs-core/compare/v1.26.0...v1.27.0) --- updated-dependencies: - dependency-name: rocm-docs-core[api_reference] dependency-version: 1.27.0 dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> [ROCm/amdsmi commit: `6f222c11a6`]	2025-10-28 09:59:49 -05:00

76333 Commits