69ee68b6d3
H800/H100 fixes and tuning.
Re-enable intra-process direct pointer buffer access when CUMEM is
enabled.
[ROCm/rccl commit: 8c6c595185]