SWDEV-525231 - clarify that reduce sync operations are new intrinsics in the 7.0 CHANGELOG (#876)

SWDEV-525231 - clarify that reduce sync operations are new intrinsics in 7.0, not an existing one
Этот коммит содержится в:
Hernandez, Gerardo
2025-08-15 20:40:05 +01:00
коммит произвёл GitHub
родитель 54e592beec
Коммит a5be0f5346
+2 -2
Просмотреть файл
@@ -31,9 +31,9 @@ Full documentation for HIP is available at [rocm.docs.amd.com](https://rocm.docs
- HIP Extensions APIs for microscaling formats, which are supported on AMD GPUs.
* New `wptr` and `rptr` values in `ClPrint`, for better logging in dispatch barrier methods.
* New debug mask, to print precise code object information for logging.
* The `_sync()` version of crosslane builtins such as `shfl_sync()` and `__reduce_add_sync` are enabled by default. These can be disabled by setting the preprocessor macro `HIP_DISABLE_WARP_SYNC_BUILTINS`.
* The `_sync()` version of crosslane builtins such as `shfl_sync()` are enabled by default. These can be disabled by setting the preprocessor macro `HIP_DISABLE_WARP_SYNC_BUILTINS`.
* Added `constexpr` operators for `fp16`/`bf16`.
* Added `__syncwarp` operation.
* Added warp level primitives: `__syncwarp` and reduce intrinsics (e.g. `__reduce_add_sync()`)
* Extended fine grained system memory pool.
* `num_threads` total number of threads in the group. The legacy API size is alias.
* Added PCI CHIP ID information as the device attribute.