6421a1e79e
+ in hipMemcpyDtoDAsync: cuMemcpyDtoD -> cuMemcpyDtoDAsync + in hipMemcpyDtoHAsync: cuMemcpyDtoH -> cuMemcpyDtoHAsync P.S. "The types CUstream and cudaStream_t are identical and may be used interchangeably", thus explicit c-like type cast is not needed, aka CUstream(stream).