rocm-systems

Yazar	SHA1	Mesaj	Tarih
Edgar Gabriel	d37af80d7e	add support for GPUs using wavefront size of 32 (#285 ) * add gfx1100 support Add support for Radeon 7900 GPUs (RX and PRO), and 7800 PRO. I was contemplating to add gfx1101 and gfx1102 GPUs as well, but those are the lower end models that are more unlikely to be used for compute intensive jobs. In addition, I do not have access to them to test the support. * update WF_SIZe for different options Radeon systems use a WarpSize of 32, unlike current Instinct systems, which use a warp size of 64. For the device side, a gfx specific ifdef is sufficient. For the host side, we need to query the device properties. * adjust functional tests to wf_size of 32 * update unit tests to handle wf_size of 32 * address reviewer comments [ROCm/rocshmem commit: `d0c2845031`]	2025-10-22 16:04:58 -05:00
Aurelien Bouteiller	8837414042	Cleanup/wg init (#260 ) * remove wg_init and wg_finalize from functional tests * Remove wg_init and wg_finalize from examples * deprecate wg_init/finalize * Updated docs * Typo in documentation --------- Co-authored-by: Yiltan <yiltan@amd.com> [ROCm/rocshmem commit: `6e7277b544`]	2025-10-07 14:34:18 -04:00
Avinash Kethineedi	81b55c3769	functional_tests: use `size_t` for size variable (#190 ) Changed the data type of `size` to `size_t` in all functional tests to ensure consistency with rocSHMEM APIs. [ROCm/rocshmem commit: `7a5c6f86d7`]	2025-07-03 13:26:54 -05:00
Avinash Kethineedi	c4de6833f6	Add SPDX license identifiers and update copyright headers (#85 ) * Update copyright information and add SPDX license identifier * Update AUTHORS * Remove `sos_tests` [ROCm/rocshmem commit: `f6ef19f5a9`]	2025-04-15 15:37:53 -05:00
Avinash Kethineedi	e16bb62767	Update RMA functional tests (#50 ) * Update primitive tests for multi-workgroup support * Update workgroup primitive tests for multi-workgroup support * Update workfront primitive tests for multi-workgroup support * Update team based primitive tests for multi-workgroup support * Update RMA functional tests to capture timing after quiet call - Modified RMA functional tests to record the time after a `quiet` call in thread, wavefront, and workgroup RMA calls. * Improve error handling and memory management - Replaced `cout` with `cerr` for improved error reporting. - Ensured all allocated memory is freed when `rocshmem_malloc` fails. * Update start time in primitive tests and latency calculations - Modified primitive tests to capture the earliest start time. - Updated latency calculations in functional tests. * Remove `GetSwarmTester` * Update start time in team primitive tests * Invoke quiet call from a single thread within a block on a rocshmem context [ROCm/rocshmem commit: `aa3121a967`]	2025-03-18 14:39:57 -05:00
avinashkethineedi	dba989733f	Update bandwidth and latency calculations - Refined bandwidth and latency calculations for improved accuracy [ROCm/rocshmem commit: `c155636da4`]	2025-02-17 06:18:46 +00:00
Yiltan Temucin	48605db5de	Remove comparisons of signed to unsigned values [ROCm/rocshmem commit: `fa0858833e`]	2024-12-12 10:21:08 -06:00
Brandon Potter	913ce47ef1	Use new naming scheme [ROCm/rocshmem commit: `fd8dbc7fb6`]	2024-11-25 14:25:29 -06:00
Brandon Potter	ad4ab69c19	Transfer files from RAD repository [ROCm/rocshmem commit: `ea8f264a11`]	2024-07-01 09:57:08 -05:00

9 İşleme