Avinash Kethineedi
65b4ff4c41
Add multi work-group support for collective functional tests ( #45 )
...
- Added multi-work group support for the All-to-all, Fcollect, Broadcast, Barrier and Sync collective functional tests
- Renamed All-to-all and Fcollect tests to TeamAlltoAll and TeamFcollect
[ROCm/rocshmem commit: 57d60aa727 ]
2025-02-19 10:31:53 -06:00
Yiltan
e1ed36e58f
Sync develop with amd-mainline ( #46 )
...
* Update install_dependencies.sh
* Updated to ROCm repos
* Merge pull request #37 from ROCm/depBuild
locked specific version on ompi and ucx
* locked specific version on ompi and ucx
* [IPC] Fix ROCSHMEM_SIGNAL_ADD
* Generate CMake Package Configuration Files
---------
Co-authored-by: akolliasAMD <99202231+akolliasAMD@users.noreply.github.com >
Co-authored-by: akolliasAMD <akollias@amd.com >
[ROCm/rocshmem commit: 785e31aa48 ]
2025-02-18 12:30:34 -05:00
avinashkethineedi
6c70aee32e
Remove rocshmem_timer function
...
[ROCm/rocshmem commit: 21dbd5cc5e ]
2025-02-17 17:10:51 +00:00
avinashkethineedi
dba989733f
Update bandwidth and latency calculations
...
- Refined bandwidth and latency calculations for improved accuracy
[ROCm/rocshmem commit: c155636da4 ]
2025-02-17 06:18:46 +00:00
avinashkethineedi
539e991b2a
RO Backend: Add support for char, signed char and unsigned char
...
[ROCm/rocshmem commit: 540cd4b918 ]
2025-02-12 20:10:03 +00:00
Yiltan
1f3881fa21
Merge pull request #38 from Yiltan/ro/implement-sigops
...
Implements Signalling Operations for RO
[ROCm/rocshmem commit: 495cd6970b ]
2025-02-10 15:10:07 -05:00
Yiltan
87e049f9c9
Merge pull request #34 from Yiltan/sigops-test-fix
...
Fix Signalling Operations Functional Test
[ROCm/rocshmem commit: 94144f4460 ]
2025-02-10 14:56:45 -05:00
Yiltan
f64c76b31c
Merge pull request #39 from Yiltan/ro/fix-teamreduce
...
Fix Team reduction intra-node
[ROCm/rocshmem commit: 944444cf12 ]
2025-02-10 14:56:27 -05:00
Yiltan Hassan Temucin
76981d6374
Fix Team reduction intra-node
...
[ROCm/rocshmem commit: 022b2c27e7 ]
2025-02-07 08:39:35 -06:00
Avinash Kethineedi
4d919faf55
Merge pull request #36 from avinashkethineedi/fix/rocshmem-ctx-wg-team-sync
...
Fix `rocshmem_ctx_wg_team_sync` API
[ROCm/rocshmem commit: d97e5ba2c8 ]
2025-02-06 13:41:16 -06:00
Yiltan Hassan Temucin
e50460af83
Use the precalculated num_warps variable
...
[ROCm/rocshmem commit: b83ff2fa84 ]
2025-02-06 13:21:25 -06:00
Yiltan Hassan Temucin
c4f2ccd48f
[RO] implemented signaling operations
...
[ROCm/rocshmem commit: f1c25f7e19 ]
2025-02-06 10:17:32 -06:00
Yiltan Hassan Temucin
90b8f191d6
[RO] added MPI_UNSIGNED_LONG as type
...
[ROCm/rocshmem commit: 21171deeb8 ]
2025-02-06 10:17:32 -06:00
Yiltan Hassan Temucin
257610bdc5
Validate signal after put signal operations
...
[ROCm/rocshmem commit: 8d74c7b73e ]
2025-02-06 08:17:22 -06:00
avinashkethineedi
fca7471d67
Fix rocshmem_ctx_wg_team_sync API
...
- Updated `rocshmem_ctx_wg_team_sync` to utilize a team-specific memory buffer for synchronization
[ROCm/rocshmem commit: c5b548c398 ]
2025-02-05 19:09:07 +00:00
avinashkethineedi
71af1b366d
Fix rocshmem_ctx_my_pe and rocshmem_ctx_n_pes APIs to return PE numbering and size relative to the team in a team-specific context.
...
[ROCm/rocshmem commit: e311400d15 ]
2025-02-05 03:41:40 +00:00
Yiltan Hassan Temucin
9317172fab
Fix sigops functional test
...
- Ensure quiet is called on the correct context
[ROCm/rocshmem commit: bae1641311 ]
2025-02-04 13:30:31 -08:00
Avinash Kethineedi
2214d21491
Merge pull request #31 from avinashkethineedi/rocshmem_g
...
Implement `rocshmem_g` API and optimize memory usage
[ROCm/rocshmem commit: 248972b30b ]
2025-02-04 11:15:41 -06:00
Yiltan Hassan Temucin
ffdce76fe4
[IPC] Fix ROCSHMEM_SIGNAL_ADD
...
[ROCm/rocshmem commit: fd3eaa3f69 ]
2025-02-03 09:59:28 -08:00
avinashkethineedi
5af3fdeacb
Implement rocshmem_g API and optimize memory usage
...
- Implement `rocshmem_g` API
- Free up memory space allocated for `rocshmem_g` and atomic operations' return values
[ROCm/rocshmem commit: 757d7e53ca ]
2025-02-02 05:56:46 +00:00
Yiltan
86c3c5ff39
Merge pull request #29 from Yiltan/multi-node
...
Updated RO builds script and functional test driver
[ROCm/rocshmem commit: 7e5b533904 ]
2025-01-27 14:44:22 -05:00
Yiltan
c39e737eef
Merge pull request #28 from Yiltan/cmakefiles-create
...
Generate CMake Package Configuration Files
[ROCm/rocshmem commit: f851411ac5 ]
2025-01-24 10:50:27 -05:00
Yiltan
3a071f1d69
Merge pull request #25 from mawad-amd/muhaawad/build_examples_option
...
Add `BUILD_EXAMPLES` CMake option
[ROCm/rocshmem commit: a458ea2ef4 ]
2025-01-24 10:50:10 -05:00
Yiltan Hassan Temucin
adf66d04f4
Updated RO builds script and functional test driver for multi-node support
...
[ROCm/rocshmem commit: 3a8b0d4647 ]
2025-01-23 16:46:19 -06:00
Yiltan Hassan Temucin
90f2bf1ec8
Generate CMake Package Configuration Files
...
[ROCm/rocshmem commit: 00824385ba ]
2025-01-22 11:24:41 -06:00
Yiltan
fc8007bec6
Minor fixes for packaging
...
[ROCm/rocshmem commit: fa90f4b0ac ]
2025-01-20 18:15:07 +00:00
Muhammad Awad
6688cf8fd6
Add BUILD_EXAMPLES CMake option
...
Signed-off-by: Muhammad Awad <MuhammadAbdelghaffar.Awad@amd.com >
[ROCm/rocshmem commit: 7a6b3261a3 ]
2025-01-18 15:26:20 -06:00
Yiltan
61f5c61ab1
Update scripts/install_dependencies.sh
...
Co-authored-by: Avinash Kethineedi <avinash.kethineedi@amd.com >
[ROCm/rocshmem commit: 0fb673e186 ]
2025-01-16 13:38:08 -05:00
Yiltan Temucin
b30ab63d2d
Added script to install dependencies
...
[ROCm/rocshmem commit: 5de0371bec ]
2025-01-16 10:06:39 -06:00
avinashkethineedi
15c5ff048c
Replace raw pointers for host_interface with shared_ptr to enable automatic memory handling
...
[ROCm/rocshmem commit: 1ef2d3a6b7 ]
2025-01-13 20:58:43 +00:00
avinashkethineedi
7a0bf44b83
Updated driver.sh and tester.hpp with sequential numbering for test identification
...
* Enabled Ping Pong tests
* Removed test commands for multi-workgroup collective tests
[ROCm/rocshmem commit: 23172c9150 ]
2024-12-26 21:28:21 +00:00
avinashkethineedi
ca4142de78
Updated default case of functional tests with empty test
...
[ROCm/rocshmem commit: e40e6a63fa ]
2024-12-26 19:33:23 +00:00
Edgar Gabriel
3cf91156f9
Merge pull request #20 from edgargabriel/topic/remove-internal-dir
...
remove internal directory
[ROCm/rocshmem commit: 12aeab1a59 ]
2024-12-26 09:21:09 -06:00
Yiltan Temucin
3bf7d2eb87
IPC backend now aborts with rocshmem global_exit()
...
[ROCm/rocshmem commit: c0e4a32ca2 ]
2024-12-23 11:03:04 -06:00
Avinash Kethineedi
2e59d60264
Merge pull request #19 from avinashkethineedi/teams_split_API
...
[ROCm/rocshmem commit: c5902afe28 ]
2024-12-23 20:42:09 +05:30
Edgar Gabriel
fe9b96cd75
remove internal directory.
...
[ROCm/rocshmem commit: 8121616034 ]
2024-12-23 14:47:50 +00:00
avinashkethineedi
7a8d464b3c
Fix rocshmem_team_split_strided API
...
[ROCm/rocshmem commit: cb8b9094b4 ]
2024-12-21 18:16:42 +00:00
Yiltan Temucin
d8a16962bf
Commented function that fails functional tests
...
[ROCm/rocshmem commit: 83a588ee2b ]
2024-12-20 14:48:54 -06:00
Brandon Potter
05f260bfc8
Merge pull request #16 from BKP/develop
...
Fix signal calculation bug for fine-tiled unit tests
[ROCm/rocshmem commit: 7fd66663c5 ]
2024-12-20 08:40:56 -06:00
Edgar Gabriel
9fdd0a98b7
update the link for MPI/UCX compilation
...
use the official page from rocmdocs.amd.com
[ROCm/rocshmem commit: 57f22cd8c3 ]
2024-12-19 20:59:13 +00:00
Brandon Potter
413114da9f
Fix signal calculation bug for fine-tiled unit tests
...
[ROCm/rocshmem commit: b1f6621f33 ]
2024-12-19 18:34:47 +00:00
Yiltan Temucin
b622528559
Bump rocSHMEM version
...
[ROCm/rocshmem commit: 3185428ad8 ]
2024-12-16 08:54:12 -06:00
Yiltan Temucin
9928cc24ef
Updated README.md
...
[ROCm/rocshmem commit: 28b41ea648 ]
2024-12-16 08:54:12 -06:00
Yiltan
d14682c6bd
Merge pull request #63 from Yiltan/sign-compare
...
Remove comparisons of signed to unsigned values
[ROCm/rocshmem commit: 1648aa7b2c ]
2024-12-13 17:23:19 -05:00
Yiltan Temucin
d2474f26ff
Reordered backends in readme.md
...
[ROCm/rocshmem commit: cb0f6d5731 ]
2024-12-13 13:28:54 -06:00
Yiltan Temucin
5db3d6fdfc
Removed comment on HSA_FORCE_FINE_GRAIN_PCIE
...
[ROCm/rocshmem commit: 4a3d02eb18 ]
2024-12-12 15:31:11 -06:00
Yiltan Temucin
8b008aacfd
Updated README
...
[ROCm/rocshmem commit: 4d558b345d ]
2024-12-12 13:02:23 -06:00
Yiltan Temucin
48605db5de
Remove comparisons of signed to unsigned values
...
[ROCm/rocshmem commit: fa0858833e ]
2024-12-12 10:21:08 -06:00
Yiltan Temucin
e73a3d7738
Renamed utlis to utils
...
[ROCm/rocshmem commit: 7347dee522 ]
2024-12-06 15:50:26 -06:00
Yiltan Temucin
758d2c78f0
Added timeout to unit tests
...
[ROCm/rocshmem commit: 98c164d72e ]
2024-12-06 15:50:22 -06:00