Avinash Kethineedi
65b4ff4c41
Add multi work-group support for collective functional tests ( #45 )
...
- Added multi-work group support for the All-to-all, Fcollect, Broadcast, Barrier and Sync collective functional tests
- Renamed All-to-all and Fcollect tests to TeamAlltoAll and TeamFcollect
[ROCm/rocshmem commit: 57d60aa727 ]
2025-02-19 10:31:53 -06:00
Yiltan
e1ed36e58f
Sync develop with amd-mainline ( #46 )
...
* Update install_dependencies.sh
* Updated to ROCm repos
* Merge pull request #37 from ROCm/depBuild
locked specific version on ompi and ucx
* locked specific version on ompi and ucx
* [IPC] Fix ROCSHMEM_SIGNAL_ADD
* Generate CMake Package Configuration Files
---------
Co-authored-by: akolliasAMD <99202231+akolliasAMD@users.noreply.github.com >
Co-authored-by: akolliasAMD <akollias@amd.com >
[ROCm/rocshmem commit: 785e31aa48 ]
2025-02-18 12:30:34 -05:00
Avinash Kethineedi
f2990df7f2
Merge pull request #44 from avinashkethineedi/fix/time_calculations
...
Update bandwidth and latency calculations
[ROCm/rocshmem commit: f8701a44fa ]
2025-02-17 12:48:33 -06:00
avinashkethineedi
6c70aee32e
Remove rocshmem_timer function
...
[ROCm/rocshmem commit: 21dbd5cc5e ]
2025-02-17 17:10:51 +00:00
avinashkethineedi
dba989733f
Update bandwidth and latency calculations
...
- Refined bandwidth and latency calculations for improved accuracy
[ROCm/rocshmem commit: c155636da4 ]
2025-02-17 06:18:46 +00:00
Avinash Kethineedi
04889cb71c
Merge pull request #40 from avinashkethineedi/RO_data_types
...
RO Backend: Add support for char, signed char and unsigned char
[ROCm/rocshmem commit: 40bd8a38a0 ]
2025-02-12 14:40:05 -06:00
avinashkethineedi
539e991b2a
RO Backend: Add support for char, signed char and unsigned char
...
[ROCm/rocshmem commit: 540cd4b918 ]
2025-02-12 20:10:03 +00:00
Yiltan
1f3881fa21
Merge pull request #38 from Yiltan/ro/implement-sigops
...
Implements Signalling Operations for RO
[ROCm/rocshmem commit: 495cd6970b ]
2025-02-10 15:10:07 -05:00
Yiltan
87e049f9c9
Merge pull request #34 from Yiltan/sigops-test-fix
...
Fix Signalling Operations Functional Test
[ROCm/rocshmem commit: 94144f4460 ]
2025-02-10 14:56:45 -05:00
Yiltan
f64c76b31c
Merge pull request #39 from Yiltan/ro/fix-teamreduce
...
Fix Team reduction intra-node
[ROCm/rocshmem commit: 944444cf12 ]
2025-02-10 14:56:27 -05:00
Yiltan Hassan Temucin
76981d6374
Fix Team reduction intra-node
...
[ROCm/rocshmem commit: 022b2c27e7 ]
2025-02-07 08:39:35 -06:00
Avinash Kethineedi
4d919faf55
Merge pull request #36 from avinashkethineedi/fix/rocshmem-ctx-wg-team-sync
...
Fix `rocshmem_ctx_wg_team_sync` API
[ROCm/rocshmem commit: d97e5ba2c8 ]
2025-02-06 13:41:16 -06:00
Avinash Kethineedi
614d5c7c81
Merge pull request #35 from avinashkethineedi/fix/team-context-pe-numbering
...
Fix `rocshmem_ctx_my_pe` and `rocshmem_ctx_n_pes` APIs
[ROCm/rocshmem commit: 5861346a8e ]
2025-02-06 13:39:28 -06:00
Yiltan Hassan Temucin
e50460af83
Use the precalculated num_warps variable
...
[ROCm/rocshmem commit: b83ff2fa84 ]
2025-02-06 13:21:25 -06:00
Yiltan Hassan Temucin
c4f2ccd48f
[RO] implemented signaling operations
...
[ROCm/rocshmem commit: f1c25f7e19 ]
2025-02-06 10:17:32 -06:00
Yiltan Hassan Temucin
90b8f191d6
[RO] added MPI_UNSIGNED_LONG as type
...
[ROCm/rocshmem commit: 21171deeb8 ]
2025-02-06 10:17:32 -06:00
Yiltan Hassan Temucin
257610bdc5
Validate signal after put signal operations
...
[ROCm/rocshmem commit: 8d74c7b73e ]
2025-02-06 08:17:22 -06:00
avinashkethineedi
fca7471d67
Fix rocshmem_ctx_wg_team_sync API
...
- Updated `rocshmem_ctx_wg_team_sync` to utilize a team-specific memory buffer for synchronization
[ROCm/rocshmem commit: c5b548c398 ]
2025-02-05 19:09:07 +00:00
avinashkethineedi
71af1b366d
Fix rocshmem_ctx_my_pe and rocshmem_ctx_n_pes APIs to return PE numbering and size relative to the team in a team-specific context.
...
[ROCm/rocshmem commit: e311400d15 ]
2025-02-05 03:41:40 +00:00
Yiltan Hassan Temucin
9317172fab
Fix sigops functional test
...
- Ensure quiet is called on the correct context
[ROCm/rocshmem commit: bae1641311 ]
2025-02-04 13:30:31 -08:00
Avinash Kethineedi
2214d21491
Merge pull request #31 from avinashkethineedi/rocshmem_g
...
Implement `rocshmem_g` API and optimize memory usage
[ROCm/rocshmem commit: 248972b30b ]
2025-02-04 11:15:41 -06:00
Yiltan
f967be4f54
Merge pull request #32 from Yiltan/ipc/sigop-bug
...
[IPC] Fix ROCSHMEM_SIGNAL_ADD
[ROCm/rocshmem commit: 2d9d09ea01 ]
2025-02-03 16:48:05 -05:00
Yiltan Hassan Temucin
ffdce76fe4
[IPC] Fix ROCSHMEM_SIGNAL_ADD
...
[ROCm/rocshmem commit: fd3eaa3f69 ]
2025-02-03 09:59:28 -08:00
avinashkethineedi
5af3fdeacb
Implement rocshmem_g API and optimize memory usage
...
- Implement `rocshmem_g` API
- Free up memory space allocated for `rocshmem_g` and atomic operations' return values
[ROCm/rocshmem commit: 757d7e53ca ]
2025-02-02 05:56:46 +00:00
Yiltan
86c3c5ff39
Merge pull request #29 from Yiltan/multi-node
...
Updated RO builds script and functional test driver
[ROCm/rocshmem commit: 7e5b533904 ]
2025-01-27 14:44:22 -05:00
Yiltan
c39e737eef
Merge pull request #28 from Yiltan/cmakefiles-create
...
Generate CMake Package Configuration Files
[ROCm/rocshmem commit: f851411ac5 ]
2025-01-24 10:50:27 -05:00
Yiltan
3a071f1d69
Merge pull request #25 from mawad-amd/muhaawad/build_examples_option
...
Add `BUILD_EXAMPLES` CMake option
[ROCm/rocshmem commit: a458ea2ef4 ]
2025-01-24 10:50:10 -05:00
Yiltan Hassan Temucin
adf66d04f4
Updated RO builds script and functional test driver for multi-node support
...
[ROCm/rocshmem commit: 3a8b0d4647 ]
2025-01-23 16:46:19 -06:00
Yiltan Hassan Temucin
90f2bf1ec8
Generate CMake Package Configuration Files
...
[ROCm/rocshmem commit: 00824385ba ]
2025-01-22 11:24:41 -06:00
Yiltan
b159f0ecb4
Merge pull request #27 from ROCm/package-bug-fix
...
Minor fixes for packaging
[ROCm/rocshmem commit: bacced0cc3 ]
2025-01-21 09:13:29 -05:00
Yiltan
fc8007bec6
Minor fixes for packaging
...
[ROCm/rocshmem commit: fa90f4b0ac ]
2025-01-20 18:15:07 +00:00
Yiltan
498f76f72c
Merge pull request #24 from Yiltan/install-script
...
Added script to install dependencies
[ROCm/rocshmem commit: 469e2a0167 ]
2025-01-20 11:02:47 -05:00
Muhammad Awad
6688cf8fd6
Add BUILD_EXAMPLES CMake option
...
Signed-off-by: Muhammad Awad <MuhammadAbdelghaffar.Awad@amd.com >
[ROCm/rocshmem commit: 7a6b3261a3 ]
2025-01-18 15:26:20 -06:00
Yiltan
61f5c61ab1
Update scripts/install_dependencies.sh
...
Co-authored-by: Avinash Kethineedi <avinash.kethineedi@amd.com >
[ROCm/rocshmem commit: 0fb673e186 ]
2025-01-16 13:38:08 -05:00
Yiltan Temucin
b30ab63d2d
Added script to install dependencies
...
[ROCm/rocshmem commit: 5de0371bec ]
2025-01-16 10:06:39 -06:00
Avinash Kethineedi
7b7f8b2998
Merge pull request #23 from avinashkethineedi/bugfix/memory-usage
...
Automatic Memory Management with `shared_ptr` for host interface
[ROCm/rocshmem commit: 17b7afdcba ]
2025-01-15 02:34:14 +05:30
avinashkethineedi
15c5ff048c
Replace raw pointers for host_interface with shared_ptr to enable automatic memory handling
...
[ROCm/rocshmem commit: 1ef2d3a6b7 ]
2025-01-13 20:58:43 +00:00
Avinash Kethineedi
9b162d4065
Merge pull request #22 from avinashkethineedi/functional_tests
...
[ROCm/rocshmem commit: 4a3c3d54fb ]
2024-12-27 20:49:55 +05:30
avinashkethineedi
7a0bf44b83
Updated driver.sh and tester.hpp with sequential numbering for test identification
...
* Enabled Ping Pong tests
* Removed test commands for multi-workgroup collective tests
[ROCm/rocshmem commit: 23172c9150 ]
2024-12-26 21:28:21 +00:00
avinashkethineedi
ca4142de78
Updated default case of functional tests with empty test
...
[ROCm/rocshmem commit: e40e6a63fa ]
2024-12-26 19:33:23 +00:00
Edgar Gabriel
3cf91156f9
Merge pull request #20 from edgargabriel/topic/remove-internal-dir
...
remove internal directory
[ROCm/rocshmem commit: 12aeab1a59 ]
2024-12-26 09:21:09 -06:00
Yiltan
4692053198
Merge pull request #21 from Yiltan/global-exit-fix
...
IPC backend now aborts with rocshmem_global_exit()
[ROCm/rocshmem commit: c87fa8183a ]
2024-12-24 08:52:04 -05:00
Yiltan Temucin
3bf7d2eb87
IPC backend now aborts with rocshmem global_exit()
...
[ROCm/rocshmem commit: c0e4a32ca2 ]
2024-12-23 11:03:04 -06:00
Avinash Kethineedi
2e59d60264
Merge pull request #19 from avinashkethineedi/teams_split_API
...
[ROCm/rocshmem commit: c5902afe28 ]
2024-12-23 20:42:09 +05:30
Yiltan
04aa793a83
Merge pull request #18 from Yiltan/functional-test-fix
...
Commented function that fails functional tests
[ROCm/rocshmem commit: 3995f1136e ]
2024-12-23 09:59:52 -05:00
Edgar Gabriel
fe9b96cd75
remove internal directory.
...
[ROCm/rocshmem commit: 8121616034 ]
2024-12-23 14:47:50 +00:00
avinashkethineedi
7a8d464b3c
Fix rocshmem_team_split_strided API
...
[ROCm/rocshmem commit: cb8b9094b4 ]
2024-12-21 18:16:42 +00:00
Yiltan Temucin
d8a16962bf
Commented function that fails functional tests
...
[ROCm/rocshmem commit: 83a588ee2b ]
2024-12-20 14:48:54 -06:00
Brandon Potter
05f260bfc8
Merge pull request #16 from BKP/develop
...
Fix signal calculation bug for fine-tiled unit tests
[ROCm/rocshmem commit: 7fd66663c5 ]
2024-12-20 08:40:56 -06:00
Edgar Gabriel
069c28f6e2
Merge pull request #17 from edgargabriel/topic/doc-update
...
update the link for MPI/UCX compilation
[ROCm/rocshmem commit: ee6b4ada01 ]
2024-12-19 15:24:41 -06:00