Commit graph

  • 57f81914d8 gfx950: restrict maxChannels to 48 for multi-node collectives (#2116) Nusrat Islam 2025-12-31 09:28:19 -06:00
  • f756aa9add gfx950: restrict maxChannels to 48 for multi-node collectives (#2116) Nusrat Islam 2025-12-31 09:28:19 -06:00
  • 03f714dd25 [SWDEV-567254] Sync Unified and Linux header (#2220) Joseph Narlo 2025-12-30 13:27:55 -06:00
  • ca32193c84 Fix test cases (#2462) vedithal-amd 2025-12-30 11:39:20 -05:00
  • 7d25ecc65c Add an environment variable to allow user explicitly turn off direct AllGather (#2119) amd-jiali 2025-12-29 16:43:40 -08:00
  • 935208ad09 Add an environment variable to allow user explicitly turn off direct AllGather (#2119) amd-jiali 2025-12-29 16:43:40 -08:00
  • a59d46ffbf SWDEV-567545 - Implement block_rank in co-op grid groups (#2182) Jimbo 2025-12-29 11:39:23 -05:00
  • 5bf6e366dd [SWDEV-548460] Add RDC Policy Reset Message (#2180) Adam Pryor 2025-12-29 10:31:13 -06:00
  • 741b4b9fdf SWDEV-558849 - Fix Windows build for ROCR backend (#2368) German Andryeyev 2025-12-29 08:35:22 -05:00
  • ea3fb1b810 Remove SMFMAC functionality in rocflop sample since its not supported in MI100 (#2456) vedithal-amd 2025-12-27 09:47:54 -05:00
  • 9c1560b8bb [rocprofiler-compute] Fix merging logic for multi process (#2445) vedithal-amd 2025-12-27 09:47:42 -05:00
  • 983386e40b [rocprofiler-compute] Write raw counter and metric values (#2314) abchoudh-amd 2025-12-26 14:06:57 +05:30
  • 2585ae8815 Virtual device enablement ( Minimal changes ) (#2110) Avinash 2025-12-25 15:06:33 -06:00
  • 6f62165369 Virtual device enablement ( Minimal changes ) (#2110) Avinash 2025-12-25 15:06:33 -06:00
  • bb83791b17 Remove redundant ROCPROFSYS_TRACE_CACHED variable from the code (#2434) marantic-amd 2025-12-25 13:36:04 +01:00
  • c3132773c8 Fix agent device ID in the cached kernel_dispatch trace (#2452) marantic-amd 2025-12-25 10:23:16 +01:00
  • c33dcd2d07 wsl/libhsakmt: fix reserved local help size calc Flora Cui 2025-11-24 15:22:21 +08:00
  • 91df8f84da wsl/libhsakmt: implement hsaKmtSetMemoryUserData Flora Cui 2025-11-21 17:23:01 +08:00
  • 4fb2ed2c5a librocdxg: fix vgpr count Flora Cui 2025-11-20 10:14:19 +08:00
  • 9bf8eb8c1e librocdxg: correct atomic info for APU Longlong Yao 2025-10-27 11:22:44 +08:00
  • e616b3e65e librocdxg: use shared GPU memory as vram on small APU Longlong Yao 2025-10-27 11:20:26 +08:00
  • 56eeaf26f8 librocdxg: query total shared GPU memory Longlong Yao 2025-10-27 10:59:00 +08:00
  • 5ebe95d5b2 librocdxg: query total shared GPU memory Longlong Yao 2025-10-27 10:59:00 +08:00
  • 6652313128 librocdxg: Add Strix and Strix Halo support Longlong Yao 2025-10-27 10:52:52 +08:00
  • a2c5e19624 librocdxg: add interface to query segment info Longlong Yao 2025-11-06 17:31:47 +08:00
  • 26cf8c8298 librocdxg: add interface to query segment info Longlong Yao 2025-11-06 17:31:47 +08:00
  • 641fa27699 [SWDEV-566543] Fix param validation in FrequenciesRead test (#2430) Bindhiya Kanangot Balakrishnan 2025-12-23 17:38:25 -06:00
  • 49b8900158 SWDEV-558849 - keep the lastEnqueueCommand_ when PAL backend is enabled (#2320) Ioannis Assiouras 2025-12-23 21:24:09 +00:00
  • c2c4d4c1f5 Revert "Adding full build capability to theROCK for HIP changes (#2003)" (#2441) ammallya 2025-12-23 13:01:08 -08:00
  • 61fd728fdb [rocprofiler-compute] Faster counter accuracy testing (#2420) vedithal-amd 2025-12-23 13:13:53 -05:00
  • d7302d6c1c [rocprofiler-compute] Test env. vars. in rocprofiler-sdk backend (#2414) vedithal-amd 2025-12-23 13:13:28 -05:00
  • 588773f9bf [rocprofiler-compute] Fix for multi process workload profiling (#2418) vedithal-amd 2025-12-23 13:12:18 -05:00
  • f221a1ae08 Updated troubleshooting-rccl.rst to change rocm-smi to amd-smi (#2028) Corey Derochie 2025-12-23 08:52:11 -07:00
  • f942810959 Updated troubleshooting-rccl.rst to change rocm-smi to amd-smi (#2028) Corey Derochie 2025-12-23 08:52:11 -07:00
  • bb599d8ed7 Add support for AMD AINIC within RCCL default internal network plugin. (#2078) Karthikeyan Arumugam 2025-12-23 07:33:10 -08:00
  • 9f4651f20f Add support for AMD AINIC within RCCL default internal network plugin. (#2078) Karthikeyan Arumugam 2025-12-23 07:33:10 -08:00
  • 3e49440495 SWDEV-555178 - Calculate phys mem offset for remap range (#1879) marandje 2025-12-23 10:27:42 +01:00
  • 719556fbba [rocprofiler-systems] Add SIGKILL delay option (#2384) Milan Radosavljevic 2025-12-23 03:17:57 +01:00
  • 37e3b8a3db [rocpd] Write rocpd yaml files as a list, even when only 1 file (#2288) Young Hui - AMD 2025-12-22 17:56:59 -05:00
  • 56bfb13644 QueuePair: prefix bnxt functions and variables (#373) Omri Mor 2025-12-22 14:46:17 -08:00
  • f5940f6b9a QueuePair: prefix bnxt functions and variables (#373) Omri Mor 2025-12-22 14:46:17 -08:00
  • c43dc136f3 [Bugfix] GDA/bnxt: release SQ lock before return (#372) Omri Mor 2025-12-22 12:05:00 -08:00
  • 016e08120a [Bugfix] GDA/bnxt: release SQ lock before return (#372) Omri Mor 2025-12-22 12:05:00 -08:00
  • 447025011a [Rocprof-Sys] Resolve crash when profiling TensorFlow GPU application (#2381) habajpai-amd 2025-12-23 00:30:55 +05:30
  • 1f8e8e3fbf Add CODEOWNERS for rocprofiler-sdk project (#2427) Ammar ELWazir 2025-12-22 11:16:09 -06:00
  • 9141f26905 [Documentaion] updating roctx library linkage documentation (#2251) Gopesh Bhardwaj 2025-12-22 21:06:13 +05:30
  • 0a52f5c101 Adding full build capability to theROCK for HIP changes (#2003) ammallya 2025-12-22 05:31:32 -08:00
  • ba1380a75d Put cached perfetto traces as default one (#2138) marantic-amd 2025-12-22 12:47:35 +01:00
  • 7da3275b42 [rocprofiler-systems] Improve metadata parsing (#2238) Aleksandar Djordjevic 2025-12-22 12:30:51 +01:00
  • a4b99485a9 gda/ro: validate and exit cleanly when forced GDA config is invalid (#354) Kutovoi, Vadim 2025-12-22 10:54:33 +00:00
  • 80a710ac0a gda/ro: validate and exit cleanly when forced GDA config is invalid (#354) Kutovoi, Vadim 2025-12-22 10:54:33 +00:00
  • 5b241f3e61 Fixed ctests (#2406) abchoudh-amd 2025-12-22 13:12:58 +05:30
  • ed38201b90 gda: fix incorrect casts from void* to uintptr_t (#369) Omri Mor 2025-12-19 16:18:49 -08:00
  • e8fc5e67c4 gda: fix incorrect casts from void* to uintptr_t (#369) Omri Mor 2025-12-19 16:18:49 -08:00
  • 3635953cd8 Revert "Adding org var and dynamic selection of targets (#2317)" (#2416) Geo Min 2025-12-19 14:51:53 -08:00
  • 14c949a827 SWDEV-572676 - adjust tile size to 32 in Unit_hipCGThreadBlockTileType for Navi4x (#2379) cadolphe-amd 2025-12-19 16:43:34 -05:00
  • d552491985 SWDEV-572329 - Remove barrier packet (#2304) Sourabh U Betigeri 2025-12-19 13:37:48 -08:00
  • fdc1660dfa SWDEV-565304 - Pass numa node to migrate pages correctly (#1729) Sourabh U Betigeri 2025-12-19 13:36:53 -08:00
  • c199df6b96 Revert "Adding org var and dynamic runner selection (#2106)" (#2114) Geo Min 2025-12-19 12:53:09 -08:00
  • 4f474a7389 Revert "Adding org var and dynamic runner selection (#2106)" (#2114) Geo Min 2025-12-19 12:53:09 -08:00
  • 0c0d8dc974 SWDEV-548892 - Stop using __ockl_lane_id (#2186) Matt Arsenault 2025-12-19 20:34:55 +01:00
  • 7c989ac022 [SWDEV-525635] Updated output file handling options (#1896) systems-assistant[bot] 2025-12-19 13:10:42 -06:00
  • e21c087f2a [BugFix] Fix rocshmem_get_device_ctx to return ctx_opaque pointer (#359) Dimple Prajapati 2025-12-19 07:01:02 -08:00
  • cf6a53e81c [BugFix] Fix rocshmem_get_device_ctx to return ctx_opaque pointer (#359) Dimple Prajapati 2025-12-19 07:01:02 -08:00
  • dde4902844 Fix driver.sh script for system where neither amd-smi or rocm-smi are (#370) Aurelien Bouteiller 2025-12-19 10:00:11 -05:00
  • 5eaa152010 Fix driver.sh script for system where neither amd-smi or rocm-smi are (#370) Aurelien Bouteiller 2025-12-19 10:00:11 -05:00
  • 750d3f8b2e Bump urllib3 from 2.5.0 to 2.6.0 in /docs/sphinx (#365) dependabot[bot] 2025-12-19 09:55:42 -05:00
  • 166a591216 Bump urllib3 from 2.5.0 to 2.6.0 in /docs/sphinx (#365) dependabot[bot] 2025-12-19 09:55:42 -05:00
  • 7b00d3a89b fix: prevent double-free crash during process exit in amd-smi (#2213) habajpai-amd 2025-12-19 11:56:40 +05:30
  • 883fdfb820 Revert "clr: Minor fixes for error return" (#2399) Sourabh U Betigeri 2025-12-18 15:40:13 -08:00
  • 8bc2e81e9a Tuning: use constant value for CorrectionFactor tables alexander-sannikov 2025-12-08 18:32:08 +00:00
  • 50568dc93d Tuning: use constant value for CorrectionFactor tables alexander-sannikov 2025-12-08 18:32:08 +00:00
  • 1b00f1a895 Tuning: fixed out-of-bound access alexander-sannikov 2025-12-08 16:24:37 +00:00
  • dea50b5e11 Tuning: fixed out-of-bound access alexander-sannikov 2025-12-08 16:24:37 +00:00
  • 4ef22f973e Revert: Restore default symbol visibility for tests in debug mode (#2111) Atul Kulkarni 2025-12-18 09:20:12 -08:00
  • 313b98281c Revert: Restore default symbol visibility for tests in debug mode (#2111) Atul Kulkarni 2025-12-18 09:20:12 -08:00
  • 112b4fd413 [rocprofiler-compute] Add SDK dependency to rocprofiler-compute-tarball.yml workflow (#2329) Jason Bonnell 2025-12-18 11:56:23 -05:00
  • e4abee4f7d [rocprofiler-compute] Improve iteration multiplexing code and documentation (#2080) vedithal-amd 2025-12-18 11:51:21 -05:00
  • bd6c6852fc [SWDEV-566924] Update KFD_ID metric to use amd-smi instead of rocprof (#2355) Adam Pryor 2025-12-18 08:39:19 -06:00
  • fdf73116d5 Do not allocate code objects when we map a static code object (#2332) Jatin Chaudhary 2025-12-18 09:22:02 +00:00
  • b4e04b07ed test: add unit tests for common utilities from PR #1249 (#2237) habajpai-amd 2025-12-18 11:03:14 +05:30
  • 4a9833e70e Revert "Add HasExpertSchedMode device prop (#2241)" (#2371) Maneesh Gupta 2025-12-18 10:56:44 +05:30
  • 5ebd50c0b4 rocr: Fix asyncHandler segfault (#2261) David Yat Sin 2025-12-17 23:52:20 -05:00
  • bed6070e12 Adding tuning conf file for CU reduction for AR, AG, and RS with under-subscribed number of GPUs per node (#2102) Pedram Alizadeh 2025-12-17 16:58:54 -05:00
  • f0e7e8745f Adding tuning conf file for CU reduction for AR, AG, and RS with under-subscribed number of GPUs per node (#2102) Pedram Alizadeh 2025-12-17 16:58:54 -05:00
  • c64c23fbee Removes default visibility in debug mode and updates unit tests for alt_rsmi impl (#2091) Atul Kulkarni 2025-12-17 10:27:00 -08:00
  • 74690ea705 Removes default visibility in debug mode and updates unit tests for alt_rsmi impl (#2091) Atul Kulkarni 2025-12-17 10:27:00 -08:00
  • 96f6b6e251 SWDEV-571304 : Fix the constructor for __half (#2240) Shadi Dashmiz 2025-12-17 11:15:20 -05:00
  • c0b4aef5ad Add HasExpertSchedMode device prop (#2241) Filip Jankovic 2025-12-17 17:06:08 +01:00
  • 79ad00fb15 Scale down memory usage data when the actual data is stored to cache (#2343) marantic-amd 2025-12-17 14:57:41 +01:00
  • e3c051d9b8 [RDC] Optimize RDC counter sampling with greedy packing algorithm (#1590) Benjamin Welton 2025-12-17 05:56:33 -08:00
  • 6d9d880d31 [rocprofiler-compute] Counter accuracy tests and improvements for iteration multiplexing (#2011) abchoudh-amd 2025-12-17 18:26:39 +05:30
  • 30dcc1a977 Revert "wsl/librocdxg: Change hsaKmtQueueRingDoorbell interface" yangsu13 2025-12-17 10:47:06 +08:00
  • c738e73d99 [rocprofiler-compute][tui] menu bar lag fix (#1942) xuchen-amd 2025-12-16 17:02:27 -05:00
  • 3a3738ad98 Added AMDSMI CI to rocm-systems(#2074) amd-juwillia 2025-12-16 12:52:42 -07:00
  • c9ac018395 Adding org var and dynamic selection of targets (#2317) Geo Min 2025-12-16 10:46:59 -08:00
  • 4f7698c27e Adding org var and dynamic runner selection (#2106) Geo Min 2025-12-16 10:41:57 -08:00
  • 2e193aed68 Adding org var and dynamic runner selection (#2106) Geo Min 2025-12-16 10:41:57 -08:00
  • 2e8786d0b2 Bump rocm-docs-core[api_reference] from 1.31.0 to 1.31.1 in /docs/sphinx (#216) dependabot[bot] 2025-12-16 09:18:22 -08:00
  • f7b3567d4c Bump rocm-docs-core[api_reference] from 1.31.0 to 1.31.1 in /docs/sphinx (#216) dependabot[bot] 2025-12-16 09:18:22 -08:00