Commit graph

693 commits

Author SHA1 Message Date
Jatin Chaudhary 77ef46fb63 SWDEV-412457 - <thread> header needs to be included
Change-Id: Ia291e5e4cc28050e05e6d7e5d7904698db56dce5


[ROCm/roctracer commit: b2671813eb]
2023-09-26 14:52:25 +01:00
Ammar ELWazir c5d35de808 SWDEV-407953: Adding Patch Version from ROCm Standard Option
ROCm packaging uses ROCM_PATCH_VERSION as a standard option to add the patch version, which determines the ROCm release for the different library versions.

Change-Id: I1edce84d2963d495c55c83cc0697761d7f696c92


[ROCm/roctracer commit: 421febd4bf]
2023-08-21 13:25:40 -04:00
Ammar ELWazir 2b78928d60 SWDEV-415259: Replacing HCC with AMD
Replace __HIP_PLATFORM_HCC__ with __HIP_PLATFORM_AMD__.

Change-Id: I2e79cff5a0216125b4ce2aa90104d4a1696bad08


[ROCm/roctracer commit: 67b3717da9]
2023-08-17 09:23:45 -04:00
Ranjith Ramakrishnan 5c6492d1ec SWDEV-366827 - Disable file reorg backward compatibility support by default
Change-Id: Ibf3c409c1741211ea86d52a90227c64ef47ee54b


[ROCm/roctracer commit: 6ab944c009]
2023-08-07 11:01:11 -07:00
Ranjith Ramakrishnan 2d06c8c312 SWDEV-409575 - Append additional RPATH to libraries installed in /opt/rocm-ver/lib/roctracer
The RPATH in libraries installed in /opt/rocm-ver/lib/roctracer should be: $ORIGIN:$ORIGIN/..
The CMake shared linker flags provide the rpath $ORIGIN.
This patch appends the rpath $ORIGIN/.. to the component-specific libraries.

Change-Id: Ied2bcb57bf0dd38ee3d1a946a5afc1bb182ff619


[ROCm/roctracer commit: 6fbf7673aa]
2023-07-13 16:43:46 -07:00
gobhardw ff0c5c2862 SWDEV-406619: enabling symbol lookup in .dynsym of code object
Change-Id: I35e6d85d59cbe1350db4734dcee33a71f0d4f87f


[ROCm/roctracer commit: 9bda77f171]
2023-07-12 14:10:28 +05:30
gobhardw 86beea665a Fixing RPM build errors for roctracer
Change-Id: I55e67c5c31e266fe187bb28712a0c616ebe68107


[ROCm/roctracer commit: 23695fbd79]
2023-07-01 00:22:09 +05:30
Ammar ELWazir e3a756d8f7 SWDEV-288134: Workaround to enable roctracer early
Change-Id: Iae21dbfd9f50dcf2e317130c6dc8328f17eba553


[ROCm/roctracer commit: 04d0bd4874]
2023-06-20 21:59:19 +00:00
Ammar ELWazir a113218672 SWDEV-340925 & ROCMOPS-4997: Fixing Tests package for roctracer
Change-Id: I9913e1df86398ef79a5afcf4b89b5a132d61422c


[ROCm/roctracer commit: 2a9595bc8f]
2023-06-08 14:48:28 +00:00
gobhardw 56a2989ffa SWDEV-398161, SWDEV-398764 Fixed hsa-trace failures reporting "Profiling data corrupted"
Change-Id: I3d8dbb2a40d948cd06cb1278acc50dc5be4ca0ef


[ROCm/roctracer commit: ee713682a1]
2023-05-15 22:38:23 +05:30
Ranjith Ramakrishnan a53f6af29e SWDEV-383221 - Set the default value of ROCM_HEADER_WRAPPER_WERROR to OFF
Using wrapper header files results in a #warning message by default.

Change-Id: Ib8a05d11f2391dfcdac8601da26e1096821cd555


[ROCm/roctracer commit: 245eafea4c]
2023-05-08 16:28:19 -07:00
Ranjith Ramakrishnan 70eb1706af SWDEV-366831 - Compile time flag to switch between #warning and #error message
Using backward compatibility paths produces an #error message. A compile-time option was added to enable or disable the #error message.
Disabling it produces a #warning message instead.

Change-Id: I6abc236e810ccc38d3636074e0e8f5a9657c2e9a


[ROCm/roctracer commit: ea061be2d1]
2023-03-10 09:12:20 -08:00
AravindanC 5019a364c8 SWDEV-351540 - ASAN packaging for roctracer
Change-Id: If23c92e785fbbeb27034a8958568dd86b847beed


[ROCm/roctracer commit: 4edc9727bd]
2023-02-27 11:32:39 -08:00
Ranjith Ramakrishnan f3abbba3b0 SWDEV-366831 - File reorg backward compatibility message changed to #error
Change-Id: I403c34722e6516c60c217ed1b883066e248c598b


[ROCm/roctracer commit: ee3321f2dc]
2023-02-08 17:25:57 -08:00
Kiumars Sabeti ac660e2c92 SWDEV-360818: Reserved fields of HIP structures are undefined. Because of that, reserved fields may contain special characters that the post-processing Python script cannot read. To fix this issue, modify the script that generates hip_ostream_ops.h so that it sets the reserved fields of HIP structures to 0.
Change-Id: I0cda5cedfa30b140f121803dd8c066107ae0756e


[ROCm/roctracer commit: 397b3b10af]
2023-01-26 17:44:55 -05:00
Ranjith Ramakrishnan 9ee831c09f Split roctracer to runtime and dev/devel package
SWDEV-356024 - Development package name will have suffix dev or devel based on OS
Devel package contents - Header files, name link of public library files, html files and roctracer manual file
Runtime package contents - Versioned public library files, private library files and license file

Change-Id: I8ced3eab5d8824a66be39b9e777368506516b155


[ROCm/roctracer commit: 9acba8b4a1]
2023-01-17 14:10:58 -08:00
Laurent Morichetti 20cc405de7 SWDEV-361912 - Fix kernel demangling
Older GNU C++ runtimes cannot demangle symbol names generated by recent
versions of LLVM. To work around this issue, use the LLVM demangler to
process kernel names.

Change-Id: I595f900d06360bb5acce542955cf1f5aed81f00e


[ROCm/roctracer commit: 91b449d0d5]
2022-12-05 20:36:04 -08:00
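
The failure mode behind SWDEV-361912 can be illustrated with the C++ runtime demangler that the fix moves away from. This is a minimal sketch, assuming a GCC or Clang toolchain that provides `<cxxabi.h>`; `DemangleOrKeep` is a hypothetical helper and not roctracer code. When the runtime does not understand a mangling, `abi::__cxa_demangle` reports a non-zero status and the caller is left with the mangled name.

```cpp
#include <cxxabi.h>

#include <cstdlib>
#include <string>

// Hypothetical helper: demangle a symbol with the C++ runtime's demangler.
// Returns the input unchanged when the runtime cannot demangle it, which is
// what happens when an older runtime meets a newer mangling scheme.
std::string DemangleOrKeep(const std::string& mangled) {
  int status = 0;
  char* demangled =
      abi::__cxa_demangle(mangled.c_str(), nullptr, nullptr, &status);
  if (status != 0 || demangled == nullptr) return mangled;
  std::string result(demangled);
  std::free(demangled);  // The buffer is allocated with malloc.
  return result;
}
```

A demangler that ships with the compiler that produced the symbols avoids this fallback, which is the direction the commit message describes.
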
Ranjith Ramakrishnan f3417a617f SWDEV-366823 - Change pragma message to warning
The file reorganization feature was implemented with backward compatibility.
The backward compatibility support will be deprecated in a future release.
Changed the #pragma message to #warning for a smooth transition.

Change-Id: I85e14470cce0f3d7c14ecb40e0e9e8b29c977c9f


[ROCm/roctracer commit: ca1726f80d]
2022-11-22 14:10:34 -05:00
Ranjith Ramakrishnan 1d4e650841 SWDEV-369226 - Correct the usage of include file
In the generated header file hsa_prof_str.h, the header file hsa_ostream_ops.h was included using angle brackets.
This results in compilation with the include path /opt/rocm-ver/include. Corrected the usage by using double quotes.

Change-Id: Ie9f1fff78d16a6953a2c99056b2acef42e577204


[ROCm/roctracer commit: b1585c983d]
2022-11-21 16:38:39 -08:00
Ammar ELWazir e4435b0e97 Fixing Readme.txt
API Spec Documentation link fixed

Change-Id: I0d0ef52eaa0e66b161cb042704c083e29b0db06f


[ROCm/roctracer commit: d835435b0d]
2022-11-16 00:39:07 -06:00
Laurent Morichetti 83402eae9c SWDEV-362165 - Escape strings in the API function's arguments
Also escape '\', '"', '\b', '\f', '\n', '\r' and '\t'.

Change-Id: I02cb1a0a511156661ff40700a81b0989a450aacc


[ROCm/roctracer commit: 0d6e5b1cea]
2022-11-07 15:41:51 -08:00
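
The quoting and escaping described in the SWDEV-362165 commits can be sketched as follows. This is an illustration only: `QuoteAndEscape` is a hypothetical name, and the sketch handles just the characters listed in the commit message ('\\', '"', '\b', '\f', '\n', '\r' and '\t').

```cpp
#include <string>

// Hypothetical helper: wrap a string argument in double quotes and escape
// the characters named in the commit message so that the trace log stays
// on one line and can be parsed unambiguously.
std::string QuoteAndEscape(const std::string& in) {
  std::string out = "\"";
  for (char c : in) {
    switch (c) {
      case '\\': out += "\\\\"; break;
      case '"':  out += "\\\""; break;
      case '\b': out += "\\b";  break;
      case '\f': out += "\\f";  break;
      case '\n': out += "\\n";  break;
      case '\r': out += "\\r";  break;
      case '\t': out += "\\t";  break;
      default:   out += c;      break;
    }
  }
  out += '"';
  return out;
}
```
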
Laurent Morichetti 644e3b5253 Report HSA_OPS activities using the ROCr driver_node_id instead of the device's index
When multiple ranks are used, each rank's first logical device always
has GPU ID 0, regardless of which physical device is selected with
CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from
multiple ranks, GPU IDs from different processes may overlap.

The long term solution is to use the KFD's gpu_id which is stable
across APIs and processes. Unfortunately the gpu_id is not yet exposed
by the ROCr, so for now use the driver's node id.

Change-Id: I2f5af8d2a7e8a89efeb5e0a1b86bdfa547b25fc8


[ROCm/roctracer commit: 799f0323cd]
2022-10-18 20:08:09 -07:00
Laurent Morichetti 4f0a4afd95 Fix a build error when compiling with clang
Fix the following error:

roctx.cpp:91:25: error: reinterpret_cast from 'const void *' to 'decltype(report_activity.load())' (aka 'int (*)(activity_domain_t, unsigned int, void *)') casts away qualifiers
  report_activity.store(reinterpret_cast<decltype(report_activity.load())>(function),
                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

by replacing the 'const void *function' argument with the correct type.

Change-Id: I912239daf6f4a3f00fc753306b84833e5c75f74b


[ROCm/roctracer commit: c95d5dd96f]
2022-10-18 18:20:58 -07:00
Laurent Morichetti 348bc1afaf SWDEV-362165 - Escape strings in the API function's arguments
Strings ([const] char *, [const] char[]) passed as arguments to API
functions may not always contain printable characters. All string
arguments should be quoted and escaped in the trace logs.

Change-Id: Ie39058f2190048b1a0090df16d9ac6bc6507e28a


[ROCm/roctracer commit: b556f8681e]
2022-10-15 19:03:36 -07:00
Laurent Morichetti aaf512c9ad Remove the thread local begin_timestamp stack
Using a thread_local object is problematic as the thread local
destructors are called first before any global destructor, making
the object invalid while tearing down the process.

rocblas uses a global destructor to clean up the loaded HIP modules
and ends up calling hip_executable_destroy after the timestamp stack
is destructed. As a result the begin timestamp for that API function
is 0.

The solution is to store the phase_enter timestamp in the phase_data.

Change-Id: If143f4d123dfb111c72fb20365431d07e73fc570


[ROCm/roctracer commit: 8a575d8d6e]
2022-09-30 15:01:08 -07:00
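
The idea of keeping the enter timestamp in the per-call data instead of a thread_local stack can be sketched as below. All names here (`PhaseData`, `ApiRecord`, `OnPhaseEnter`, `OnPhaseExit`, `NowNs`) are hypothetical stand-ins and not the roctracer types; the point is only that the timestamp lives in storage owned by the caller, so it does not depend on the lifetime of a thread_local object.

```cpp
#include <chrono>
#include <cstdint>

// Hypothetical per-call data owned by the caller of the API callback.
struct PhaseData {
  uint64_t phase_enter_timestamp = 0;
};

// Hypothetical trace record produced when the API call returns.
struct ApiRecord {
  uint64_t begin_ns;
  uint64_t end_ns;
};

uint64_t NowNs() {
  using namespace std::chrono;
  return duration_cast<nanoseconds>(steady_clock::now().time_since_epoch())
      .count();
}

// Enter phase: store the timestamp in the call's own data.
void OnPhaseEnter(PhaseData& data) { data.phase_enter_timestamp = NowNs(); }

// Exit phase: read the begin timestamp back from the same data.
ApiRecord OnPhaseExit(const PhaseData& data) {
  return ApiRecord{data.phase_enter_timestamp, NowNs()};
}
```
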
Laurent Morichetti 549d6a5ec5 Fix a profiling data corrupted error
Using rocprof with ROCP_MCOPY_DATA=1 while tracing HSA produces the
following error:

tblextr.py: Memcpy args "(0x7feb16a00000, 123handle=28593376125, 0x7feb12a00010, 123handle=27558560125, 4194304, 0, 0, 123handle=140661639440000125) = 1" cannot be identified
Profiling data corrupted: ' ./out/rpl_data_220930_143009_1826700/input_results_220930_143009/results.txt'

There are two issues:

1) The hsa_agent_t handle argument is misprinted: "123handle=...125"
  Instead of printing '{' and '}', it prints '123' and '125'. The wrong
  operator<<(unsigned char) is used and an integer value is printed
  instead of a char.

  Use std::operator<< instead of hsa_support::detail::operator<< to
  print '{' and '}'

2) The result value is uninitialized and in some cases printed as a
  negative integer value. The leading '-' is not matched by the
  mem_manager regular expression for HSA API calls.

  Correctly capture the HSA function's return value.

Change-Id: If13a1e62eeb4e598447c4b90d53d1b2e3b408696


[ROCm/roctracer commit: 6416434d3b]
2022-09-30 13:36:48 -07:00
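
The first issue in the commit above ("123handle=...125") can be reproduced in isolation. The sketch below assumes a namespace that declares its own character overloads of operator<< which print the numeric value; `demo::detail`, `BuggyHandle` and `FixedHandle` are hypothetical names, not the generated roctracer code. Inside that namespace, the non-template overload wins over the standard one, so '{' is printed as 123 unless std::operator<< is called explicitly.

```cpp
#include <ostream>
#include <sstream>
#include <string>

namespace demo {
namespace detail {

// Namespace-local overloads that print character types as integers.
inline std::ostream& operator<<(std::ostream& out, const unsigned char& v) {
  return out << static_cast<unsigned int>(v);
}
inline std::ostream& operator<<(std::ostream& out, const char& v) {
  return out << static_cast<unsigned char>(v);
}

// Picks detail::operator<< for '{' and '}', so the braces become 123/125.
inline std::string BuggyHandle(unsigned long handle) {
  std::ostringstream out;
  out << '{' << "handle=" << handle << '}';
  return out.str();
}

// Calls std::operator<< explicitly for the brace characters.
inline std::string FixedHandle(unsigned long handle) {
  std::ostringstream out;
  std::operator<<(out, '{');
  out << "handle=" << handle;
  std::operator<<(out, '}');
  return out.str();
}

}  // namespace detail
}  // namespace demo
```
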
Laurent Morichetti f36f7e1b3e Fix an issue with async copy timestamps
The timestamps coming from the HIP runtime for asynchronous memory
copies are corrupted (begin > end) because the HSA setting to record
timestamps is turned off by the tracer's HSA intercept.

The solution is to intercept hsa_amd_profiling_async_copy_enable and
remember the application/runtime's request so that it can be ORed with
IsEnabled(ACTIVITY_DOMAIN_HSA_OPS, HSA_OP_ID_COPY).

Change-Id: Ib687cbf36711563e86c2bb8bc934c7c51572bfde


[ROCm/roctracer commit: 329c0467cb]
2022-09-28 15:44:40 -07:00
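
The "remember the request and OR it" logic from the async copy fix can be sketched without any HSA dependency. Everything below is a hypothetical stand-in: the two flags model the application/runtime's last request and the tracer's own interest in copy activities, and `InterceptAsyncCopyEnable` models the intercepted call by returning the value that would be forwarded to the runtime.

```cpp
#include <atomic>

// Hypothetical state: last value requested by the application or HIP runtime.
std::atomic<bool> g_app_requested_profiling{false};
// Hypothetical state: whether the tracer itself wants copy timestamps.
std::atomic<bool> g_tracer_copy_tracing{false};

// Timestamps must stay on if either side needs them.
bool EffectiveAsyncCopyProfiling() {
  return g_app_requested_profiling.load() || g_tracer_copy_tracing.load();
}

// Stand-in for the intercepted enable call: remember the caller's request
// and return the setting that should actually reach the runtime.
bool InterceptAsyncCopyEnable(bool enable) {
  g_app_requested_profiling.store(enable);
  return EffectiveAsyncCopyProfiling();
}
```

With this shape, the tracer turning its own copy tracing off no longer overrides a request made by the runtime.
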
Laurent Morichetti 6ea76c581b Use the "safe" Stack for begin_timestamp
The tracer tool needs to remember the begin timestamps for API
callbacks, and uses a thread_local std::stack for that purpose.

The issue with thread_local objects is that they are destructed
before anything else when the main thread exits. To work around
that issue, we use a "safe" stack in the roctracer API.

Use the same "safe" stack in the tracer tool.

Change-Id: I0d69d4eb44f0205f4102d0d5ef9803a1ec1800a5


[ROCm/roctracer commit: b664937ebd]
2022-09-26 13:44:55 -07:00
Laurent Morichetti e8dc8de195 Fix a typo in HipLoader
rocprof errors out with the following message:
symbol lookup 'KernelNameRef' failed: libamdhip64.so.5: undefined \
  symbol: KernelNameRef

The HipLoader is incorrectly looking for a KernelNameRef symbol
instead of hipKernelNameRef.

Fixed the typo: KernelNameRef -> hipKernelNameRef.

Change-Id: Ia4860e1669707b0c83d67e71b78d362b07a6aaa7


[ROCm/roctracer commit: a287f20961]
2022-09-26 09:31:14 -07:00
Laurent Morichetti f262501ed3 Clean up logger.h
Change-Id: Ibcb58d2236b012d00c3fc421a425c03093de5d50


[ROCm/roctracer commit: bb98bc7d85]
2022-09-16 09:55:09 -07:00
Laurent Morichetti 6dea5c5e3d Fix an array subscript out-of-bounds error
Starting with gcc-11 (verified with gcc-12 as well), an array
out-of-bounds subscript error is reported for accessing the registration
table element at the operation ID index. Validating the index in the
function calling Register/Unregister does not quiet the warning/error
in release builds, so, for gcc-11 and gcc-12, we disable that warning
just for the RegistrationTable class.

Change-Id: I6bc4a02aa072cfa8905ecde5e3960aebf32fc912


[ROCm/roctracer commit: 67ce5fae13]
2022-09-16 09:54:40 -07:00
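
Scoping a warning suppression to a single class, as the out-of-bounds fix describes, can be sketched with GCC diagnostic pragmas. `SmallTable` is a hypothetical stand-in for the registration table, and the version check is an assumption based on the gcc-11 and gcc-12 versions named in the commit message.

```cpp
#include <cstddef>

// Suppress -Warray-bounds only for gcc-11 and gcc-12, and only around
// the class below.
#if defined(__GNUC__) && !defined(__clang__) && \
    (__GNUC__ == 11 || __GNUC__ == 12)
#pragma GCC diagnostic push
#pragma GCC diagnostic ignored "-Warray-bounds"
#define DEMO_RESTORE_DIAGNOSTICS 1
#endif

// Hypothetical fixed-size table indexed by an operation ID.
template <typename T, std::size_t N>
class SmallTable {
 public:
  bool Set(std::size_t id, T value) {
    if (id >= N) return false;  // Index is validated before the access.
    slots_[id] = value;
    return true;
  }
  const T* Get(std::size_t id) const {
    return id < N ? &slots_[id] : nullptr;
  }

 private:
  T slots_[N] = {};
};

#ifdef DEMO_RESTORE_DIAGNOSTICS
#pragma GCC diagnostic pop  // Restore the previous diagnostic state.
#undef DEMO_RESTORE_DIAGNOSTICS
#endif
```
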
Laurent Morichetti e8cb732660 Cleanup the include files
Use #include "header" instead of #include <header> so that the header
files are found when the application #includes <roctracer/roctracer.h>
with -I /opt/rocm/include.

Change-Id: I24feac9a5030d3600aee98084340e246c3990db5


[ROCm/roctracer commit: 05ee3ff973]
2022-09-13 08:23:40 -07:00
Laurent Morichetti 14c153601d SWDEV-355896 - Fix a data corruption error in post processing
The post-processing script cannot handle HIP ops without a correlation
ID. The correlation ID is needed to connect the record to a HIP stream
and originating thread.

This issue was exposed by a change to the tracer API to report
asynchronous activities even if their originating synchronous API
activity (callback) is not enabled. This was a flaw in the API.

Also fix an issue with the API filtering. Undefined API names should
not cause an exception, they should be ignored.

Change-Id: Iab2221af6180ade2b9c2eb10c256c3a73d872e9f


[ROCm/roctracer commit: 4856d33959]
2022-09-13 08:21:13 -07:00
Laurent Morichetti 1d0f27e2c8 Fix the symbol name for deprecated functions
Change-Id: I53c0af1d1f6a3998992bdaa737e9b10829e5abc3


[ROCm/roctracer commit: 900d5e0a64]
2022-09-08 21:04:41 -07:00
Laurent Morichetti cb7d599829 Fix hsa_support::timestamp_ns if HSA is not yet initialized
Default to the HSA runtime's hsa_system_get_info if the saved HSA
functions table is not yet initialized.

Change-Id: I3659095a5ad662f7ca8b0d92bd035901c6d66bb0


[ROCm/roctracer commit: 87ffbd27f4]
2022-09-08 21:02:31 -07:00
Laurent Morichetti fe526e0c36 Fix the Loader
Instead of dlopen'ing a library with RTLD_NOLOAD (for example libamdhip64.so)
and relying on the dynamic linker search path, search through the already
loaded shared objects for a library with a matching name.

Change-Id: I3e74d432bd7ca68df8927ca435b290e86aaaf9e9


[ROCm/roctracer commit: db69cc1c9f]
2022-09-08 20:58:07 -07:00
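
Searching the already loaded shared objects by name, as the Loader fix describes, can be sketched on Linux with `dl_iterate_phdr` from `<link.h>`. This is an assumption about one way to do it, not a copy of the roctracer Loader; `FindLoadedLibrary` and `Search` are hypothetical names, and the substring match is a simplification.

```cpp
#include <link.h>

#include <cstddef>
#include <cstring>
#include <string>

namespace {

// Hypothetical search state passed through the callback's data pointer.
struct Search {
  const char* name;
  std::string found;
};

// Called once per loaded object; a non-zero return stops the iteration.
int Visit(struct dl_phdr_info* info, std::size_t /*size*/, void* data) {
  auto* search = static_cast<Search*>(data);
  if (info->dlpi_name != nullptr &&
      std::strstr(info->dlpi_name, search->name) != nullptr) {
    search->found = info->dlpi_name;
    return 1;
  }
  return 0;
}

}  // namespace

// Returns the path of the first loaded shared object whose path contains
// `name`, or an empty string if no loaded object matches.
std::string FindLoadedLibrary(const char* name) {
  Search search{name, {}};
  dl_iterate_phdr(Visit, &search);
  return search.found;
}
```

The returned path could then be passed to dlopen to obtain a handle to the copy that is already mapped, without depending on the dynamic linker search path.
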
Laurent Morichetti 7ea1dbafd2 SWDEV-351980 - Remove the ROCtracer private interface from the public header
Change-Id: Ib3183e87d0c2bd1679926a4da9bbb6e46d70fb9f


[ROCm/roctracer commit: ab3f361f61]
2022-09-08 20:58:07 -07:00
Laurent Morichetti 3d46d2d5cb SWDEV-351980 - Consolidate registration tables in the roctracer
Change-Id: I44cd1cc81cf6a529aed89ee8db1377c0aa67f0dc


[ROCm/roctracer commit: 2673bf5e2c]
2022-09-08 20:58:05 -07:00
Laurent Morichetti a7700afbf6 Use fatal() and warning() for logging errors
Change-Id: I4d525ed2a7dba72beff6fbe43383015e55465fcd


[ROCm/roctracer commit: 57867e4803]
2022-09-06 19:38:16 -07:00
Laurent Morichetti f4ef972b10 Remove tracker.h
Change-Id: I74860431c5f4c4954ddb79fb7e2a613fecc8793b


[ROCm/roctracer commit: 9d69e7d49a]
2022-09-06 19:38:16 -07:00
Laurent Morichetti 4b82df9748 Fix nested timestamps
Change-Id: I6385d52cc858670a116f5c2eb65e4f19be73190f


[ROCm/roctracer commit: 61c232bc69]
2022-09-06 19:38:16 -07:00
Laurent Morichetti 981bf5f75a Remove the ROCprofiler loader
It was used for the HSA_EVT activities, so it is no longer needed.

Change-Id: I7729fb4519f2e3cee73776264647381cb5826067


[ROCm/roctracer commit: 9c57b150af]
2022-09-06 19:38:16 -07:00
Laurent Morichetti 8548060d8d Bring the HSA_EVT callbacks back to the roctracer
Change-Id: I26080b264d7989880ba7e9f00502cc680b2256d7


[ROCm/roctracer commit: c2b87b1fd7]
2022-09-06 19:38:12 -07:00
Laurent Morichetti 0197ed4253 Use a global correlation_id for all records
Change-Id: I87fe16fefb52a95242bc64b7007b71c9d8978d44


[ROCm/roctracer commit: ac3214d32a]
2022-09-06 19:20:54 -07:00
Laurent Morichetti f7c8382e33 SWDEV-351980 - Use the new hipRegister/RemoveAsyncActivityCallback
Remove the hipInitActivityCallback and use the new hipRegister/
RemoveActivityCallback which allows distinct memory pools to be used
for HIP_OPS activities.

Enable the multi_pool_activities test.

Change-Id: I6f6feaedecc9c36285bea975caf24dbf8f5f624b


[ROCm/roctracer commit: 340c7cb553]
2022-09-06 19:11:11 -07:00
Laurent Morichetti 9528bf6128 SWDEV-351980 - Remove HipApi{Callback|Activity}{Enable|Disable}Check
The code is easier to read if calling HIPActivityCallbackTracker
enable/disable_check directly. Both enable/disable_check return the
new mask, and the check whether a callback is already installed is
clearer.

Change-Id: Ic90d34489b5b4d9929dc08b4d9e93cc974b136b1


[ROCm/roctracer commit: f0e082feb1]
2022-09-06 19:11:11 -07:00
Laurent Morichetti f50c9d4149 SWDEV-351980 - Don't allocate hip_api_data and record
The HIP runtime is now allocating the hip_api_data and record on its
stack so we don't need the thread local record_data_pair stack anymore.

Refactor the API callback function to handle both the case where
synchronous user callbacks are requested and the case where asynchronous
records are requested (enable_callback & enable_activity respectively).
If the callback argument (memory pool) is not null, then activity
records are requested.

Remove CorrelationIdRegister and CorrelationIdLookup. These were used
by the HIP runtime to associate a HIP record id to a ROCtracer
correlation id. Instead, the HIP runtime is now using the correlation
ID returned in the hip_api_data_t.

Added a test to check enabling/disabling concurrent callbacks and
activities.

Change-Id: I5850cfead9861eb3602a3e8fcb7b22580d5fc979


[ROCm/roctracer commit: 88c6e0a700]
2022-09-06 19:11:11 -07:00
Laurent Morichetti 9674c2b11a Deprecate enable/disable_callback/activity[_expl]
These functions have little value as it is very unlikely an application
would want to enable all the domains.

Change-Id: I4743e8ddf6743e60c95c7ba5240950d2ef734301


[ROCm/roctracer commit: ad01ba513a]
2022-09-06 19:11:09 -07:00
Laurent Morichetti 179e03c172 Add multi_pool_activities test
This test checks that asynchronous activities can be enabled in distinct
memory pools. It enables activity reporting for HIP kernel dispatches in
one memory pool, and memory copy reporting in another memory pool.

The output of this test to stdout should be a series of kernel dispatch
records (10) followed by a series of memory copy records (10). The
records should not be interleaved.

Change-Id: Idb5cca7e650b2312a1955909932364f914737856


[ROCm/roctracer commit: cfdfa2a2d4]
2022-08-26 19:28:19 -07:00
Laurent Morichetti d2e1384c40 Remove global variables from the file plugin
The destructors of the plugin's file-scope global variables could run
before roctracer_plugin_finalize is called, leaving those variables in an
undefined state by the time roctracer_plugin_finalize runs.

To avoid this issue, remove all non-POD global variables from the file
plugin.

Change-Id: I4b620d67d460d9c99adfd81cbf46b0e64540c503


[ROCm/roctracer commit: 006ce7b65d]
2022-08-23 16:34:08 -07:00