Wenkai Du
7882b2f0c5
topo_expl: add a few more single node models
...
[ROCm/rccl commit: 32388d60a9 ]
2020-03-02 11:43:03 -08:00
Wenkai Du
2a66deb694
Merge pull request #179 from wenkaidu/search
...
Use fraction of system maxWidth as steps for searching
[ROCm/rccl commit: 8b5bc8bca2 ]
2020-02-28 11:05:46 -08:00
Wenkai Du
6509f36fc9
Merge pull request #180 from wenkaidu/2.5.7
...
Sync up with NCCL 2.5.7
[ROCm/rccl commit: bbbe3dd628 ]
2020-02-27 15:36:01 -08:00
Wenkai Du
b750defc28
Merge remote-tracking branch 'remotes/nccl/master'
...
[ROCm/rccl commit: 8e73a2ad60 ]
2020-02-27 12:53:03 -08:00
Wenkai Du
a8f784e269
Merge pull request #178 from wenkaidu/topo_visual
...
Add topology visualizer tool
[ROCm/rccl commit: 63af7ab5cb ]
2020-02-27 09:31:34 -08:00
Wenkai Du
a36c2ecbc4
Add topology visualizer tool
...
[ROCm/rccl commit: 498d5029ad ]
2020-02-26 15:23:34 -08:00
Wenkai Du
9439b4edce
Merge pull request #177 from wenkaidu/topo_expl
...
topo_expl: use bandwidth numbers defined in graph in CPU models
[ROCm/rccl commit: ae834349ec ]
2020-02-26 14:37:45 -08:00
Wenkai Du
3886f9bea8
topo_expl: use bandwidth numbers defined in graph in CPU models
...
[ROCm/rccl commit: 934b6de557 ]
2020-02-26 14:17:36 -08:00
Wenkai Du
e4aad9b56d
Merge pull request #176 from wenkaidu/rome_bw
...
Revise PCI BW numbers on Rome
[ROCm/rccl commit: 67b4bea41f ]
2020-02-26 13:47:39 -08:00
Wenkai Du
45a7541582
Revise PCI BW numbers on Rome
...
[ROCm/rccl commit: d2adc61bf6 ]
2020-02-26 13:17:49 -08:00
Wenkai Du
b4be0ff3b8
Use fraction of system maxWidth as steps for searching
...
This reverts previous workaround of deducting only half of width
from paths.
[ROCm/rccl commit: 8391637613 ]
2020-02-26 09:14:35 -08:00
Wenkai Du
a11aba0bfa
Merge pull request #175 from wenkaidu/ll_abort
...
Fix abort handling in LL primitives
[ROCm/rccl commit: 9bf1edb554 ]
2020-02-25 14:12:33 -08:00
Wenkai Du
5747c3cac1
Fix abort handling in LL primitives
...
[ROCm/rccl commit: 077c3cda74 ]
2020-02-25 13:42:54 -08:00
Wenkai Du
81af08cfde
Merge pull request #174 from wenkaidu/max_speed
...
Fix system maxSpeed and maxWidth calculation
[ROCm/rccl commit: 1569f93f26 ]
2020-02-24 15:42:24 -08:00
Wenkai Du
d640f38d56
Fix system maxSpeed and maxWidth calculation
...
[ROCm/rccl commit: 9b80b3633f ]
2020-02-24 15:18:57 -08:00
Wenkai Du
ef2eb457ca
Merge pull request #173 from wenkaidu/cr8
...
Fix incorrect CR8 detection
[ROCm/rccl commit: 8fb18e0ae7 ]
2020-02-21 10:24:41 -08:00
Wenkai Du
93d448e2fe
Fix incorrect CR8 detection
...
Also change level of ring graph print to help debugging
[ROCm/rccl commit: f54dc58113 ]
2020-02-21 10:09:49 -08:00
Wenkai Du
cf4bce4ad3
Merge pull request #172 from wenkaidu/topo_expl
...
Add topology explorer
[ROCm/rccl commit: 5b3856f2ed ]
2020-02-20 15:16:55 -08:00
Wenkai Du
00f421ccbd
Add topology explorer
...
[ROCm/rccl commit: 55f8e2dec7 ]
2020-02-19 14:42:06 -08:00
Wenkai Du
9dad3e0a90
Merge pull request #167 from wenkaidu/cr8
...
Generate 8G6L chordal ring from reference
[ROCm/rccl commit: 9110820470 ]
2020-02-18 14:59:23 -08:00
Eiden Yoshida
d6d1f700f6
Fix hipclang argument in CI ( #171 )
...
[ROCm/rccl commit: 428f1f1555 ]
2020-02-18 13:17:52 -07:00
Eiden Yoshida
eb823a7621
Refactor Jenkinsfiles to allow use of new docker containers ( #170 )
...
[ROCm/rccl commit: edb863de62 ]
2020-02-18 11:25:29 -07:00
Sylvain Jeaugey
6034c27655
Fix Allgather operations above 4G with multiple GPUs per process.
...
Fixes nccl-tests#37.
Direct offsets were still on 32 bits in the low-level primitives.
[ROCm/rccl commit: c38f174bd4 ]
2020-02-12 11:11:55 -08:00
Wenkai Du
8432e8a921
Generate 8G6L chordal ring from reference
...
[ROCm/rccl commit: abcfbf1231 ]
2020-02-11 22:01:12 +00:00
Wenkai Du
062bc75d9b
Merge pull request #166 from wenkaidu/hcc_ver
...
Bump up HCC version for -hc-function-calls switch
[ROCm/rccl commit: f4ba41e80a ]
2020-02-11 13:20:50 -08:00
Wenkai Du
ded8d0d389
Bump up HCC version for -hc-function-calls switch
...
[ROCm/rccl commit: 3d092f32b8 ]
2020-02-11 19:37:13 +00:00
Wenkai Du
6b2d7de200
Add ring bandwidth correction factor
...
[ROCm/rccl commit: d1dae2721d ]
2020-01-30 09:52:27 -08:00
Stanley Tsang
e5419407c4
Updating copyright notices for 2020.
...
[ROCm/rccl commit: 20fa04d9b6 ]
2020-01-29 15:28:08 -08:00
Wenkai Du
e6b5933d7e
Merge remote-tracking branch 'remotes/rccl/master' into rccl_2.5.6_cleanup
...
[ROCm/rccl commit: fe6d012eb0 ]
2020-01-29 15:28:03 -08:00
Wenkai Du
622b49e80a
Split primitive class to smaller structures
...
[ROCm/rccl commit: 486fd436af ]
2020-01-29 15:27:23 -08:00
Wenkai Du
d2fbcfea02
Misc fixes and improvements for 2.5.6
...
1. Fix RCCL unit test
2. Add ROME detection and tuning
3. Change default P2P level
4. Fix search algorithm for XGMI
5. Remove explicit channel duplication with implicit by using half of link speed
6. Add collective trace support
7. Correct Intel Skylake CPU detection and bandwidth
8. Fix topo connect function
9. Disable GDR read and remove unreachable code
10. Disable LL128 kernels
11. Add tuning parameters
12. Use original clock64() implementation which returns RTC counter value
13. Print out timestamp of collective trace
14. Do not use struct ncclColl in kernel launch parameter
15. Fix abort handling and add tracing
17. Add __launch_bounds__ to kernel functions
18. Remove unused abortCount
19. Unset default MIN_NRINGS and MIN_NCHANNELS
20. Do not allocate shared memory when not using LL128 kernels
21. Correct time print out in tuning log
[ROCm/rccl commit: 1e55645d97 ]
2020-01-29 15:27:05 -08:00
Sylvain Jeaugey
40958b6445
2.5.7-1
...
[ROCm/rccl commit: 3701130b3c ]
2020-01-16 15:40:57 -08:00
Sylvain Jeaugey
8bdd1b487e
Merge pull request #283 from lukeyeager/topo-trim-net-links
...
Topo trim net links
[ROCm/rccl commit: 44c34e5d10 ]
2020-01-16 15:40:36 -08:00
paulfreddy
bbb0c59cd4
Changes for multiple ROCm installation ( #164 )
...
* Changes for multiple ROCm installation
1. Set version to 2.10.1
2. Add CMAKE_INSTALL_PREFIX to neccessary places
3. Cleanup, fix rpath, use prefix in install.sh
* Changes for multiple ROCm installation
1. Set soversion to match release version
2. Add CMAKE_INSTALL_PREFIX to neccessary places
3. Cleanup, fix rpath, use prefix in install.sh
* Changes for multiple ROCm installation
1. Set soversion to match release version
2. Add CMAKE_INSTALL_PREFIX to neccessary places
3. Cleanup, fix rpath, use prefix in install.sh
[ROCm/rccl commit: 15c917244d ]
2020-01-08 21:28:16 -08:00
Luke Yeager
d91217b16f
[topology] remove NET links when trimming system
...
This fixes a memory leak.
[ROCm/rccl commit: 7a18fe0784 ]
2020-01-07 13:29:57 -08:00
Luke Yeager
91ff39bedb
[build] Allow setting CXXFLAGS on the command line
...
[ROCm/rccl commit: c7ba70ff90 ]
2020-01-07 13:29:42 -08:00
Gilbert Lee
5783917a75
Changing single sync mode to time all iterations instead of just last
...
[ROCm/rccl commit: e5074ce94d ]
2019-12-20 17:08:39 -08:00
gilbertlee-amd
71635198b8
Removing OpenMP from unit tests ( #163 )
...
[ROCm/rccl commit: 000bce6f27 ]
2019-12-20 11:41:56 -07:00
gilbertlee-amd
a461b6d139
Adding new sleep after sync capability for data fabric profiling ( #162 )
...
Fixing missing header include for ROCM 3.0 changes
[ROCm/rccl commit: 2f4269d06d ]
2019-12-12 15:20:54 -07:00
Christian Sigg
ff74ebdcea
Fix clang build ( #274 )
...
The attribute is called `optnone`, not `noopt`.
[ROCm/rccl commit: 3899f6e0f2 ]
2019-12-09 09:31:13 -08:00
Ke Wen
6413a29ce8
Merge branch 'master' into HEAD
...
[ROCm/rccl commit: 44b5652617 ]
2019-12-06 18:28:11 -08:00
Ke Wen
8dc42618e4
2.5.6-2
...
Fix PPC64 Debian packaging
[ROCm/rccl commit: 6bb953d4e6 ]
2019-12-06 18:26:39 -08:00
saadrahim
26e161a7a2
Package fix ( #161 )
...
* Fixing RHEL dependency on rocm-dev
[ROCm/rccl commit: 0092b35132 ]
2019-12-06 16:06:50 -07:00
saadrahim
13de181fbc
Changing package dependency to rocm-dev ( #160 )
...
[ROCm/rccl commit: bd59b6f880 ]
2019-12-06 14:00:25 -07:00
Sylvain Jeaugey
e5a17ee58d
Fix clang compilation
...
[ROCm/rccl commit: aa15dfb29c ]
2019-12-06 09:55:54 -08:00
Christian Sigg
4984d5ce0b
Fix clang build ( #271 )
...
Clang doesn't understand `optimize("O0")`. It has `noopt`, which GCC doesn't understand. Wrap the difference in a macro.
[ROCm/rccl commit: 8c564e9b57 ]
2019-12-06 09:14:55 -08:00
Wenkai Du
35ad901dfe
Merge pull request #158 from wenkaidu/p2p
...
Change default P2P level
[ROCm/rccl commit: 9e10cde644 ]
2019-12-04 16:30:58 -08:00
Wenkai Du
b25dd83e7e
Merge pull request #157 from wenkaidu/readme
...
Change manual build instructions to fit most common usage
[ROCm/rccl commit: e9ca3a8029 ]
2019-12-04 14:50:41 -08:00
Wenkai Du
272d22fbe3
Change default P2P level
...
[ROCm/rccl commit: 90e928bcd5 ]
2019-12-04 21:05:10 +00:00
Wenkai Du
d7d4175df0
Merge remote-tracking branch 'remotes/nccl/master' into rccl_2.5.6
...
[ROCm/rccl commit: 6648c81dc6 ]
2019-12-03 15:42:04 -08:00