Correct rocprofv3 usage instructions (#2925)

* Correct rocprofv3 usage

* Apply suggestion from @SwRaw

* Apply suggestion from @SwRaw

* Update .gitignore
This commit is contained in:
Swati Rawat
2026-01-28 22:46:19 +05:30
committed by GitHub
parent d917259953
commit 9de4a2ebb1
2 changed files with 100 additions and 99 deletions
+1
View File
@@ -1 +1,2 @@
.cline_storage
/projects/hip/_build
@@ -27,10 +27,10 @@ Follow this systematic approach to optimize GPU performance:
.. code-block:: bash
rocprofv3 --stats ./your_application
rocprofv3 --stats --<tracing_option> -- <application_path>
Collect metrics on kernel execution time, memory bandwidth, occupancy, and
CU utilization.
CU utilization. For more details on using ``rocprofv3`` for application tracing and profiling, see :doc:`rocprofv3 documentation <rocprofiler-sdk:how-to/using-rocprofv3>`.
2. **Analyze metrics to identify bottlenecks**