2
0
Gráfico de cometimentos

346 Cometimentos

Autor(a) SHA1 Mensagem Data
Wenkai Du 7882b2f0c5 topo_expl: add a few more single node models
[ROCm/rccl commit: 32388d60a9]
2020-03-02 11:43:03 -08:00
Wenkai Du 2a66deb694 Merge pull request #179 from wenkaidu/search
Use fraction of system maxWidth as steps for searching

[ROCm/rccl commit: 8b5bc8bca2]
2020-02-28 11:05:46 -08:00
Wenkai Du 6509f36fc9 Merge pull request #180 from wenkaidu/2.5.7
Sync up with NCCL 2.5.7

[ROCm/rccl commit: bbbe3dd628]
2020-02-27 15:36:01 -08:00
Wenkai Du b750defc28 Merge remote-tracking branch 'remotes/nccl/master'
[ROCm/rccl commit: 8e73a2ad60]
2020-02-27 12:53:03 -08:00
Wenkai Du a8f784e269 Merge pull request #178 from wenkaidu/topo_visual
Add topology visualizer tool

[ROCm/rccl commit: 63af7ab5cb]
2020-02-27 09:31:34 -08:00
Wenkai Du a36c2ecbc4 Add topology visualizer tool
[ROCm/rccl commit: 498d5029ad]
2020-02-26 15:23:34 -08:00
Wenkai Du 9439b4edce Merge pull request #177 from wenkaidu/topo_expl
topo_expl: use bandwidth numbers defined in graph in CPU models

[ROCm/rccl commit: ae834349ec]
2020-02-26 14:37:45 -08:00
Wenkai Du 3886f9bea8 topo_expl: use bandwidth numbers defined in graph in CPU models
[ROCm/rccl commit: 934b6de557]
2020-02-26 14:17:36 -08:00
Wenkai Du e4aad9b56d Merge pull request #176 from wenkaidu/rome_bw
Revise PCI BW numbers on Rome

[ROCm/rccl commit: 67b4bea41f]
2020-02-26 13:47:39 -08:00
Wenkai Du 45a7541582 Revise PCI BW numbers on Rome
[ROCm/rccl commit: d2adc61bf6]
2020-02-26 13:17:49 -08:00
Wenkai Du b4be0ff3b8 Use fraction of system maxWidth as steps for searching
This reverts previous workaround of deducting only half of width
from paths.


[ROCm/rccl commit: 8391637613]
2020-02-26 09:14:35 -08:00
Wenkai Du a11aba0bfa Merge pull request #175 from wenkaidu/ll_abort
Fix abort handling in LL primitives

[ROCm/rccl commit: 9bf1edb554]
2020-02-25 14:12:33 -08:00
Wenkai Du 5747c3cac1 Fix abort handling in LL primitives
[ROCm/rccl commit: 077c3cda74]
2020-02-25 13:42:54 -08:00
Wenkai Du 81af08cfde Merge pull request #174 from wenkaidu/max_speed
Fix system maxSpeed and maxWidth calculation

[ROCm/rccl commit: 1569f93f26]
2020-02-24 15:42:24 -08:00
Wenkai Du d640f38d56 Fix system maxSpeed and maxWidth calculation
[ROCm/rccl commit: 9b80b3633f]
2020-02-24 15:18:57 -08:00
Wenkai Du ef2eb457ca Merge pull request #173 from wenkaidu/cr8
Fix incorrect CR8 detection

[ROCm/rccl commit: 8fb18e0ae7]
2020-02-21 10:24:41 -08:00
Wenkai Du 93d448e2fe Fix incorrect CR8 detection
Also change level of ring graph print to help debugging


[ROCm/rccl commit: f54dc58113]
2020-02-21 10:09:49 -08:00
Wenkai Du cf4bce4ad3 Merge pull request #172 from wenkaidu/topo_expl
Add topology explorer

[ROCm/rccl commit: 5b3856f2ed]
2020-02-20 15:16:55 -08:00
Wenkai Du 00f421ccbd Add topology explorer
[ROCm/rccl commit: 55f8e2dec7]
2020-02-19 14:42:06 -08:00
Wenkai Du 9dad3e0a90 Merge pull request #167 from wenkaidu/cr8
Generate 8G6L chordal ring from reference

[ROCm/rccl commit: 9110820470]
2020-02-18 14:59:23 -08:00
Eiden Yoshida d6d1f700f6 Fix hipclang argument in CI (#171)
[ROCm/rccl commit: 428f1f1555]
2020-02-18 13:17:52 -07:00
Eiden Yoshida eb823a7621 Refactor Jenkinsfiles to allow use of new docker containers (#170)
[ROCm/rccl commit: edb863de62]
2020-02-18 11:25:29 -07:00
Sylvain Jeaugey 6034c27655 Fix Allgather operations above 4G with multiple GPUs per process.
Fixes nccl-tests#37.
Direct offsets were still on 32 bits in the low-level primitives.


[ROCm/rccl commit: c38f174bd4]
2020-02-12 11:11:55 -08:00
Wenkai Du 8432e8a921 Generate 8G6L chordal ring from reference
[ROCm/rccl commit: abcfbf1231]
2020-02-11 22:01:12 +00:00
Wenkai Du 062bc75d9b Merge pull request #166 from wenkaidu/hcc_ver
Bump up HCC version for -hc-function-calls switch

[ROCm/rccl commit: f4ba41e80a]
2020-02-11 13:20:50 -08:00
Wenkai Du ded8d0d389 Bump up HCC version for -hc-function-calls switch
[ROCm/rccl commit: 3d092f32b8]
2020-02-11 19:37:13 +00:00
Wenkai Du 6b2d7de200 Add ring bandwidth correction factor
[ROCm/rccl commit: d1dae2721d]
2020-01-30 09:52:27 -08:00
Stanley Tsang e5419407c4 Updating copyright notices for 2020.
[ROCm/rccl commit: 20fa04d9b6]
2020-01-29 15:28:08 -08:00
Wenkai Du e6b5933d7e Merge remote-tracking branch 'remotes/rccl/master' into rccl_2.5.6_cleanup
[ROCm/rccl commit: fe6d012eb0]
2020-01-29 15:28:03 -08:00
Wenkai Du 622b49e80a Split primitive class to smaller structures
[ROCm/rccl commit: 486fd436af]
2020-01-29 15:27:23 -08:00
Wenkai Du d2fbcfea02 Misc fixes and improvements for 2.5.6
1. Fix RCCL unit test
2. Add ROME detection and tuning
3. Change default P2P level
4. Fix search algorithm for XGMI
5. Remove explicit channel duplication with implicit by using half of link speed
6. Add collective trace support
7. Correct Intel Skylake CPU detection and bandwidth
8. Fix topo connect function
9. Disable GDR read and remove unreachable code
10. Disable LL128 kernels
11. Add tuning parameters
12. Use original clock64() implementation which returns RTC counter value
13. Print out timestamp of collective trace
14. Do not use struct ncclColl in kernel launch parameter
15. Fix abort handling and add tracing
17. Add __launch_bounds__ to kernel functions
18. Remove unused abortCount
19. Unset default MIN_NRINGS and MIN_NCHANNELS
20. Do not allocate shared memory when not using LL128 kernels
21. Correct time print out in tuning log


[ROCm/rccl commit: 1e55645d97]
2020-01-29 15:27:05 -08:00
Sylvain Jeaugey 40958b6445 2.5.7-1
[ROCm/rccl commit: 3701130b3c]
2020-01-16 15:40:57 -08:00
Sylvain Jeaugey 8bdd1b487e Merge pull request #283 from lukeyeager/topo-trim-net-links
Topo trim net links

[ROCm/rccl commit: 44c34e5d10]
2020-01-16 15:40:36 -08:00
paulfreddy bbb0c59cd4 Changes for multiple ROCm installation (#164)
* Changes for multiple ROCm installation

   1. Set version to 2.10.1
   2. Add CMAKE_INSTALL_PREFIX to neccessary places
   3. Cleanup, fix rpath, use prefix in install.sh

* Changes for multiple ROCm installation

   1. Set soversion to match release version
   2. Add CMAKE_INSTALL_PREFIX to neccessary places
   3. Cleanup, fix rpath, use prefix in install.sh

* Changes for multiple ROCm installation

1. Set soversion to match release version
2. Add CMAKE_INSTALL_PREFIX to neccessary places
3. Cleanup, fix rpath, use prefix in install.sh


[ROCm/rccl commit: 15c917244d]
2020-01-08 21:28:16 -08:00
Luke Yeager d91217b16f [topology] remove NET links when trimming system
This fixes a memory leak.


[ROCm/rccl commit: 7a18fe0784]
2020-01-07 13:29:57 -08:00
Luke Yeager 91ff39bedb [build] Allow setting CXXFLAGS on the command line
[ROCm/rccl commit: c7ba70ff90]
2020-01-07 13:29:42 -08:00
Gilbert Lee 5783917a75 Changing single sync mode to time all iterations instead of just last
[ROCm/rccl commit: e5074ce94d]
2019-12-20 17:08:39 -08:00
gilbertlee-amd 71635198b8 Removing OpenMP from unit tests (#163)
[ROCm/rccl commit: 000bce6f27]
2019-12-20 11:41:56 -07:00
gilbertlee-amd a461b6d139 Adding new sleep after sync capability for data fabric profiling (#162)
Fixing missing header include for ROCM 3.0 changes

[ROCm/rccl commit: 2f4269d06d]
2019-12-12 15:20:54 -07:00
Christian Sigg ff74ebdcea Fix clang build (#274)
The attribute is called `optnone`, not `noopt`.

[ROCm/rccl commit: 3899f6e0f2]
2019-12-09 09:31:13 -08:00
Ke Wen 6413a29ce8 Merge branch 'master' into HEAD
[ROCm/rccl commit: 44b5652617]
2019-12-06 18:28:11 -08:00
Ke Wen 8dc42618e4 2.5.6-2
Fix PPC64 Debian packaging


[ROCm/rccl commit: 6bb953d4e6]
2019-12-06 18:26:39 -08:00
saadrahim 26e161a7a2 Package fix (#161)
* Fixing RHEL dependency on rocm-dev


[ROCm/rccl commit: 0092b35132]
2019-12-06 16:06:50 -07:00
saadrahim 13de181fbc Changing package dependency to rocm-dev (#160)
[ROCm/rccl commit: bd59b6f880]
2019-12-06 14:00:25 -07:00
Sylvain Jeaugey e5a17ee58d Fix clang compilation
[ROCm/rccl commit: aa15dfb29c]
2019-12-06 09:55:54 -08:00
Christian Sigg 4984d5ce0b Fix clang build (#271)
Clang doesn't understand `optimize("O0")`. It has `noopt`, which GCC doesn't understand. Wrap the difference in a macro.

[ROCm/rccl commit: 8c564e9b57]
2019-12-06 09:14:55 -08:00
Wenkai Du 35ad901dfe Merge pull request #158 from wenkaidu/p2p
Change default P2P level

[ROCm/rccl commit: 9e10cde644]
2019-12-04 16:30:58 -08:00
Wenkai Du b25dd83e7e Merge pull request #157 from wenkaidu/readme
Change manual build instructions to fit most common usage

[ROCm/rccl commit: e9ca3a8029]
2019-12-04 14:50:41 -08:00
Wenkai Du 272d22fbe3 Change default P2P level
[ROCm/rccl commit: 90e928bcd5]
2019-12-04 21:05:10 +00:00
Wenkai Du d7d4175df0 Merge remote-tracking branch 'remotes/nccl/master' into rccl_2.5.6
[ROCm/rccl commit: 6648c81dc6]
2019-12-03 15:42:04 -08:00