Wenkai Du
d28e1cb44f
Merge remote-tracking branch 'nccl/master' into develop
2022-04-18 11:15:25 -07:00
nunnikri
b83efe9c5c
Installing rccl.h wrapper to /opt/rocm-xxx/include path ( #532 )
...
* Fixing the broken library soft link
* Installing rccl.h wrapper to /opt/rocm-xxx/include path.
This missing wrapper was causing compilation errors with pytorch. Fixing it
2022-04-09 07:55:39 -07:00
nunnikri
acfb0210ea
Fixing the broken library soft link ( #529 )
2022-04-07 15:19:33 -07:00
Liam Wrubleski
a8f1e61f48
Packages for test and benchmark executables on all supported OSes using CPack. ( #512 )
2022-03-21 15:04:14 -06:00
akolliasAMD
65ea3d80db
Added alltoallv test and optional args variable on collective args ( #514 )
...
* Added alltoallv test and optional args variable on collective args
2022-03-18 13:55:11 -04:00
Nirmal Unnikrishnan
115461cc04
File reorganization with backward compatibility
...
Updated the header file location and export path
2022-03-10 01:28:41 +00:00
Nirmal Unnikrishnan
676a4737c1
File reorganization as per the new defined standard
...
The header files will in /opt/rocm-xxx/include/rccl
Libraries and cmake will be in /opt/rocm-xxx/lib folder.
Added wrappers for header files using rocm-cmake functions for backward compatibility.
2022-03-08 17:32:02 +00:00
Ziyue Yang
b569c0a1db
Add Pivot AllToAll algorithm for Rome model ( #503 )
...
* add a2a pivot interface
* remove debug info
* address comments
* fix bug
* remove custom script
* address comments
* fix bug
2022-02-20 21:09:47 -08:00
Wenkai Du
3a919c1f49
Merge remote-tracking branch 'nccl/master' into develop
2021-11-11 14:22:12 -08:00
Wenkai Du
29170a8b5f
Support different protocols and algorithms in all reduce only build ( #455 )
...
* Support different protocols and algorithms in all reduce only build
* Restore deleted line in error
2021-11-02 08:39:08 -07:00
Wenkai Du
4643a17f83
Check rocm_smi64Config.h on older ROCm build ( #452 )
2021-10-28 07:26:28 -07:00
Wenkai Du
ec36c4c326
Enable timing profiling mode ( #447 )
2021-10-27 08:21:48 -07:00
Stanley Tsang
7e55b211c5
Build AllReduce only mode ( #443 )
...
* Initial commit of all_reduce_only support
* Working AllReduce only build
* Removing printfs and restoring release build
* Restore P2P index
* Updates to build_allreduce_only mode.
* cleaning up macro ifdefs
2021-10-26 17:36:46 -06:00
Liam Wrubleski
97d9cf40e7
Setup runtime and development packages ( #407 )
...
* changes to enable devel package
* Update rocm-cmake version & build
2021-07-26 15:06:17 -06:00
Wenkai Du
56155ff5b6
Use rocm_smi_lib for getting topology information ( #402 )
...
* Use rocm_smi_lib for getting topology information
* Add rocm-smi-lib dependency to RCCL package
2021-07-08 13:23:11 -07:00
Eiden Yoshida
5c3e7d8b67
Fix static builds ( #393 )
2021-06-23 09:19:48 -06:00
Wenkai Du
e75bc53e06
Deduct ROCM_PATH from CXX unless specified ( #400 )
2021-06-22 13:29:08 -07:00
Wenkai Du
59d2867b01
Remove hard coded /opt/rocm from cmake ( #396 )
2021-06-21 08:29:23 -07:00
Eiden Yoshida
fb267ea333
Move address-sanitizer build above addition of rccl library in CMakeLists ( #392 )
2021-06-11 14:43:54 -06:00
Eiden Yoshida
eea7b24058
Add address sanitizer build option ( #389 )
2021-06-10 09:14:54 -06:00
Wenkai Du
a4ea1fed5b
Merge remote-tracking branch 'nccl/master' into develop
2021-05-05 16:01:01 -07:00
Wenkai Du
ad54a14a5c
Add libdl linking option ( #358 )
2021-04-26 15:24:58 -07:00
Wenkai Du
1fe031402a
Add gfx90a target ( #344 )
...
* Add gfx90a target
* Support gfx90a topology
Co-authored-by: Eiden Yoshida <eiden.yoshida@amd.com >
2021-04-14 09:29:00 -06:00
Wenkai Du
3f18540f50
Remove link to NUMA lib as it is no longer needed ( #346 )
2021-04-12 09:53:17 -07:00
Wenkai Du
def8b4ca0d
Move RCCL changelog and Copyright out of /usr/share ( #343 )
2021-04-09 14:08:40 -07:00
Wenkai Du
1d2946ee4b
Rework network port trimming code ( #338 )
...
* Rework network port trimming code
* Move Rome related changes to separate source files
2021-03-31 10:25:59 -07:00
Wenkai Du
244e25d980
Hide non-public symbols from library ( #332 )
...
* Hide non-public symbols from library
* Move flag outside of parallel-jobs check
2021-03-18 18:08:08 -07:00
Wenkai Du
8e180cf087
Revert "Port alltoall[v]" ( #325 )
...
This reverts commit f4d5d3d620 .
2021-03-06 13:59:31 -08:00
Wenkai Du
ab1e7a0318
Merge remote-tracking branch 'origin/develop' into 2.8.3
2021-02-04 20:02:34 -05:00
gilbertlee-amd
3e62ceddc5
Clique kernel support ( #295 ) ( #15 )
...
* Adding experimental clique-based kernels (opt-in only)
Co-authored-by: Stanley Tsang <stanley.tsang@amd.com >
Co-authored-by: Gilbert Lee <gilbert.lee@amd.com >
Co-authored-by: Wenkai Du <43822138+wenkaidu@users.noreply.github.com >
Co-authored-by: Stanley Tsang <stanley.tsang@amd.com >
Co-authored-by: Wenkai Du <43822138+wenkaidu@users.noreply.github.com >
2021-01-28 09:45:01 -07:00
Wenkai Du
f4d5d3d620
Port alltoall[v]
2021-01-14 19:28:01 -05:00
Wenkai Du
d469947641
Merge remote-tracking branch 'nccl/master' into no-target-id
2021-01-14 19:27:53 -05:00
Wenkai Du
882d52ad7e
Adding backward compatibility for target-id syntax for AMDGPU_TARGETS ( #306 )
2020-12-04 13:55:56 -08:00
gilbertlee-amd
41bcfb8878
Clique kernel support ( #295 )
...
* Adding experimental clique-based kernels (opt-in only)
Co-authored-by: Stanley Tsang <stanley.tsang@amd.com >
Co-authored-by: Gilbert Lee <gilbert.lee@amd.com >
Co-authored-by: Wenkai Du <43822138+wenkaidu@users.noreply.github.com >
2020-11-10 15:44:10 -07:00
Wenkai Du
1fdb216f87
Use target id of xnack off ( #298 )
2020-11-10 11:10:48 -08:00
Wenkai Du
2ecfc62ec8
Fix lintian errors ( #287 )
2020-10-21 16:20:53 -07:00
Eiden Yoshida
205b5507b4
Update sramecc and xnack to ANY ( #284 )
...
Co-authored-by: Tony <Tony.Tye@amd.com >
Co-authored-by: Wenkai Du<Wenkai.Du@amd.com >
2020-10-16 00:25:18 -06:00
gilbertlee-amd
84a2541e01
Revert "Initial support for clique-based kernels ( #276 )" ( #280 )
...
This reverts commit 2b8184808d .
2020-10-15 11:30:18 -07:00
gilbertlee-amd
2b8184808d
Initial support for clique-based kernels ( #276 )
...
* Initial support for clique-based kernels
2020-10-13 11:22:04 -06:00
Wenkai Du
b871ea3c0c
Add Alltoallv RCCL kernel implementation ( #269 )
...
* Add alltoallv API and implementation
* Extend Rome P2P channel limit to multinode and alltoall kernels
* topo_expl: fix compilation and sync up with main
* gtest: use RCCL alltoallv API
* Code review changes
2020-09-30 16:25:36 -07:00
nunnikri
aa985bfb7e
SWDEV-253325 : Chaning amdgpu-target to cuda-gpu-arch ( #268 )
2020-09-25 15:44:56 -06:00
Stanley Tsang
8c90aefb6d
Adding the ability to force install dependencies (namely gtest); gtest library installation fix for centos ( #265 )
...
* Adding the ability to force install dependencies (namely gtest); gtest library installation fix for centos
* Removing potentially unneccessary dependencies from install script
2020-09-10 17:27:22 -06:00
Stanley Tsang
f2e5db7bf7
Adding XNACK flags. ( #264 )
...
* Adding XNACK flags.
2020-09-08 11:36:30 -06:00
Aaron Enye Shi
958b213428
Add RCCL Static Lib Creation with -fgpu-rdc
...
RCCL uses -fgpu-rdc to compile its source objects. When linking
the RCCL static library, the link and archive step must do through
hipcc and uses the flag --emit-static-lib. When compiling
UnitTests, the librccl.a must be consumed through -l and -L.
2020-09-03 11:25:41 -04:00
Stanley Tsang
c5d4d9eb76
Adding static library building option. ( #244 )
...
* Adding static library building option.
* Disabling running tests for static build
* Removing static packaging in CI
Co-authored-by: Saad Rahim <saad.rahim@amd.com >
2020-08-06 11:19:43 -06:00
saadrahim
0dc019e35f
Download GTest if not found in system ( #237 )
...
Co-authored-by: Stanley Tsang <stanley.tsang@amd.com >
2020-08-06 09:36:58 -06:00
Wenkai Du
35c5a7fe45
Fix RCCL build package name ( #236 )
2020-07-20 14:43:00 -07:00
saadrahim
7f93aa7e53
Changing dependency to hip-rocclr ( #228 )
2020-07-14 17:49:56 -06:00
Wenkai Du
1addf4f196
Match RCCL package name to API version ( #229 )
2020-07-07 13:30:39 -07:00
Wenkai Du
84f8ba3bb0
Revert use posix_memalign for network buffer allocation on host memory ( #222 )
2020-06-24 11:25:55 -07:00