From 9aaad2017b497a5f3be7c175cd5ccc066719356a Mon Sep 17 00:00:00 2001 From: Julia Jiang <56359287+jujiang-del@users.noreply.github.com> Date: Wed, 27 Aug 2025 16:10:31 -0400 Subject: [PATCH] SWDEV-525231 - Update changelog for 7.0 (#768) --- projects/clr/CHANGELOG.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/projects/clr/CHANGELOG.md b/projects/clr/CHANGELOG.md index 61c801155c..28d32ac9b3 100644 --- a/projects/clr/CHANGELOG.md +++ b/projects/clr/CHANGELOG.md @@ -53,11 +53,12 @@ Full documentation for HIP is available at [rocm.docs.amd.com](https://rocm.docs - HIP APIs for `FP4`/`FP6`/`FP8`, which are compatible with corresponding CUDA APIs. - HIP Extensions APIs for microscaling formats, which are supported on AMD GPUs. * New `wptr` and `rptr` values in `ClPrint`, for better logging in dispatch barrier methods. -* New debug mask, to print precise code object information for logging. * The `_sync()` version of crosslane builtins such as `shfl_sync()` are enabled by default. These can be disabled by setting the preprocessor macro `HIP_DISABLE_WARP_SYNC_BUILTINS`. * Added `constexpr` operators for `fp16`/`bf16`. * Added warp level primitives: `__syncwarp` and reduce intrinsics (e.g. `__reduce_add_sync()`) -* Extended fine grained system memory pool. +* Support for the flags in APIs as following, now allows uncached memory allocation. + - `hipExtHostRegisterUncached`, used in `hipHostRegister`. + - `hipHostMallocUncached` and `hipHostAllocUncached`, used in `hipHostMalloc` and `hipHostAlloc`. * `num_threads` total number of threads in the group. The legacy API size is alias. * Added PCI CHIP ID information as the device attribute. * Added new tests applications for OCP data types `FP4`/`FP6`/`FP8`.