From d741a88f396f6a17ce317f4f80e5c4e46e125a51 Mon Sep 17 00:00:00 2001
From: Julia Jiang <56359287+jujiang-del@users.noreply.github.com>
Date: Thu, 21 Aug 2025 15:59:54 -0400
Subject: [PATCH] SWDEV-525231 - Update changelog for 7.0 on develop (#647)

---
 projects/clr/CHANGELOG.md | 7 +++++--
 1 file changed, 5 insertions(+), 2 deletions(-)

diff --git a/projects/clr/CHANGELOG.md b/projects/clr/CHANGELOG.md
index 5e15c9f0b6..d15f3f12ef 100644
--- a/projects/clr/CHANGELOG.md
+++ b/projects/clr/CHANGELOG.md
@@ -62,8 +62,10 @@ Full documentation for HIP is available at [rocm.docs.amd.com](https://rocm.docs
 ### Changed
 
 * Deprecated GPUs. Some unsupported GPUs such as gfx9, gfx8 and gfx7 are deprecated on Microsoft Windows.
+* Removal of Beta warnings in HIP Graph APIs
+All Beta warnings in usage of HIP Graph APIs are removed, they are now officially and fully supported.
 * Behavior changes
- - `hipGetLastError` now gets the error code returned by `hipGetLastError` which should be the last actual error caught in the current thread during the application execution.
+ - `hipGetLastError` now returns the error code which is the last actual error caught in the current thread during the application execution.
  - Cooperative groups in `hipLaunchCooperativeKernelMultiDevice` and `hipLaunchCooperativeKernel` functions, additional input parameter validation checks are added.
  - `hipPointerGetAttributes` returns `hipSuccess` instead of an error with invalid value `hipErrorInvalidValue`, in case `NULL` host or attribute pointer is passed as input parameter. It now matches the functionality of `cudaPointerGetAttributes` which changed with CUDA 11 and above releases.
  - `hipFree` previously there was an implicit wait which was applicable for all memory allocations, for synchronization purpose. This wait is now disabled for allocations made with `hipMallocAsync` and `hipMallocFromPoolAsync`, to match the behavior of CUDA API `cudaFree`
@@ -197,7 +199,8 @@ HIP runtime has the following functional improvements which greatly improve runt
 * Failures in roc-obj tools. HIP runtime now makes `DEPRECATED` message in roc-obj tools as `STDERR`.
 * Support of `hipDeviceMallocContiguous` flags in `hipExtMallocWithFlags()`. It now enables `HSA_AMD_MEMORY_POOL_CONTIGUOUS_FLAG` in the memory pool allocation on GPU device.
 * Compilation failure, HIP runtime refactored the vector type alignment with `__hip_vec_align_v`
-
+* A numerical error/corruption found in Pytorch during graph replay. HIP runtime fixed the input sizes of kernel launch dimensions in hipExtModuleLaunchKernel for the execution of hipGraph capture.
+* A crash during kernel execution in a customer application. The structure of kernel arguments was updated via adding the size of kernel arguments, and HIP runtime does validation before launch kernel with the structured arguments.
 
 ## HIP 6.4.2 for ROCm 6.4.2
 