H800/H100 fixes and tuning. Re-enable intra-process direct pointer buffer access when CUMEM is enabled. [ROCm/rccl commit: 8c6c595185]
8c6c595185