00c46fd5e5
* Ability to select alternative compiler per file
Implementation of ompt interface to rocprofiler SDK. task_create and task_schedule are not supported.
Misc updates
Update OpenMP target sample
- samples/ompt -> samples/openmp_target
- fix sample test of openmp-target
- reorganize files
Rework OpenMP implementation
Minor OpenMP implementation cleanup
Rename samples/openmp_target CMake targets
Add tests/bin/openmp
- OpenMP target test app in tests/bin/openmp/target
Format samples/openmp_target CMakeLists.txt
Misc lib/rocprofiler-sdk/openmp cleanup
- fix includes
- convert_arg
Update openmp.def.cpp
- tweak includes
- remove lots of temporary variables
Update samples
- common::get_callback_id_names() -> common::get_callback_tracing_names()
- add kernel dispatch, memory copy, scratch memory buffered tracing to openmp target sample
Fix code object operation names
- add "CODE_OBJECT_" prefix
Update include/rocprofiler-sdk/openmp/api_id.h
- remove spurious comment
Miscellaneous openmp updates
- similar API for openmp_begin and openmp_end
- move implementations of ompt callbacks to openmp.cpp
- ompt_{thread_begin,thread_end,parallel_begin,parallel_end}_callbacks are openmp_events
[SWDEV-484495] Fix int truncation in CSV output (#1098)
CSV output truncates doubles to ints when it shouldn't. Derived metrics
are (mostly) doubles and lose precision (or become worthless) if treated
as an int. Converted these to double to match the format we return from
rocprof-sdk.
Co-authored-by: Benjamin Welton <ben@amd.com>
Update limit for max counter records in rocprof-tool (#1073)
A fixed sized std::array is used to store counter records in rocprofiler SDK. This limit was breached in SWDEV-484742. Upping the limit to 512 to be less likely to reach this limit again.
adding proxy ompt_data_t * arguments
fixes for proxy pointers
- Implement proxy ompt_data_t* pointers for clients
- Add ompt_data_t* arguments back to callback API
- Modify openmp sample to illustrate use of proxy pointers
formatting
SWDEV-467350: Skipping tool counter iteration for unsupported hardware (#1083)
Fixing some accumulate metrics (#1089)
* Fixing some accumulate metrics
* Fixing some more accumulate metrics
---------
Co-authored-by: Benjamin Welton <bewelton@amd.com>
updating rocprofv3 help options (#1113)
* updating rocprofv3 help options
* updating CHANGELOG
Fixing installed pacakge tests in CI (#1119)
* Fixing installed pacakge tests in CI
* Formatted rocprofv3.py with black formatter
SWDEV-488948: PC Sampling - Correlation class to provide some thread safety. Adding multithread tests. (#1112)
* SWDEV-488948: PC Sampling - Correlation class to provide some thread safety. Adding multithread tests.
* Update source/lib/rocprofiler-sdk/pc_sampling/parser/correlation.hpp
Co-authored-by: Vladimir Indic <139573562+vlaindic@users.noreply.github.com>
* Update source/lib/rocprofiler-sdk/pc_sampling/parser/correlation.hpp
Co-authored-by: Vladimir Indic <139573562+vlaindic@users.noreply.github.com>
* Adding backlog for codeobj changes
* Formatting
* Update source/lib/rocprofiler-sdk/pc_sampling/code_object.hpp
Co-authored-by: Vladimir Indic <139573562+vlaindic@users.noreply.github.com>
* Update source/lib/rocprofiler-sdk/pc_sampling/code_object.hpp
Co-authored-by: Vladimir Indic <139573562+vlaindic@users.noreply.github.com>
---------
Co-authored-by: Vladimir Indic <139573562+vlaindic@users.noreply.github.com>
SWDEV-487621: Fixes for metric definitions (#1118)
* Fixes for metric definitions
* Removing gfx8
* Update changelog
* Fixing unit tests
* Small fixes
* Fix for write size
Fix PSDB change (#1120)
Reverts change to `source/include/rocprofiler-sdk/callback_tracing.h`
from commit 9b2ece76c3
clang-18 build fix for RCCL (#1123)
Removes ambiguity on const usage, which clang-18 complains about
(preventing build with warn error).
mem copy direction field update (#1124)
Adding Node-id for debugging with log level trace (#1090)
fix botched rebase
Per Jonathan to remove -rdynamic warning so CI will continue
pedantic formatting
Correct the package name of rocprofiler-sdk (#1126)
* Correct the package name of rocprofiler-sdk
ROCM VERSION(for ex: 60300) was missing in the package name.
Added the same
* Use cmake cache string while setting the variable for ROCm Version
* correct the cmake-format
---------
Co-authored-by: Ranjith Ramakrishnan <Ranjith.Ramakrishnan@amd.com>
Fixing kokkosp tool library packaging (#1121)
* Fixing kokkosp tool library packaging
* Update source/lib/rocprofiler-sdk-tool/kokkosp/CMakeLists.txt
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* Update CMakeLists.txt
* Update CMakeLists.txt
* Component Requirement in CPack
* Adding package dependency
* Update CMakeLists.txt
* Update rocprofiler_config_packaging.cmake
* Fix rocprofiler-sdk-tool-kokkosp BUILD/INSTALL RPATH
- CMAKE_INSTALL_LIBDIR doesn't help
* Add BUILD/INSTALL RPATH to rocprofv3-trigger-list-metrics
- fixes packaging issues
* Update packaging
- core depends on rocprofiler-sdk-roctx
- add CPACK_DEBIAN_PACKAGE_SHLIBDEPS_PRIVATE_DIRS to resolve inter-package dependencies
* Fix package depends version format
* Improve tests/rocprofv3/summary/validate logging
* Update CI workflow
- prioritize roctx package in Install Packages step
* Remove setting <package-name>_VERSION in config.cmake.in
- this is automatically handled by existence of <package-name>-config-version.cmake
* Update rocprofiler-sdk-config.cmake
- relax find_package versioning requirements to same major and minor version
* Update rocprofiler-sdk-config.cmake
- relax find_package versioning requirements (remove EXACT, specify range)
* Tweak CI workflow
* Update perfetto_reader.py
- better handle failure to load trace processor
* Misc cleanup for config packaging
* Update config packaging
* Update config packaging
* Revert perfetto for core-rpm packages
* Revert perfetto for core-rpm packages
- perfetto < 0.9.0
* Tweak tests/rocprofv3/summary/validate.py
- reorder some checks
---------
Co-authored-by: Ammar Elwazir <aelwazir@useocpm2m-387-013.amd.com>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: Jonathan R. Madsen <jonathanrmadsen@gmail.com>
Clang Warning Fixes (#1131)
Builds prevented on clang-18
Adding start and end timestamp columns in csv (#1128)
* Adding start and end timestamp columns in csv
* Adding assert check for the counter timestamps
---------
Co-authored-by: Gopesh Bhardwaj <gopesh.bhardwaj@amd.com>
rocprofv3: docs and help menu updates (#1129)
* doc updates
* Correcting ROCtx information
* Making ROCTx string consistent
* missing occurence
Renamed agent profiling service to device counting service (#1132)
* Renamed agent profiling service to device counting service
Name more aptly represents what agent profiling did (device wide
counter collection). Conversion of existing user code can be
performed by the following find/sed command:
find . -type f -exec sed -i 's/rocprofiler_agent_profile_callback_t/rocprofiler_device_counting_service_callback_t/g; s/rocprofiler_configure_agent_profile_counting_service/rocprofiler_configure_device_counting_service/g; s/agent_profile.h/device_counting_service.h/g; s/rocprofiler_sample_agent_profile_counting_service/rocprofiler_sample_device_counting_service/g' {} +
* Converted dispatch profile to dispatch counting service
* Debug for functioal counters test
* Minor changes for CI
* Minor fix
* More fixes for CI
* Update evaluate_ast.cpp
---------
Co-authored-by: Benjamin Welton <ben@amd.com>
Testing updated RPM dockers (#1136)
* Testing updated RPM dockers
* Trying to fix PSDB for test package dependency
Agent Profiling Fixes for Broken/Improper API Usage (#1122)
Prevent's multiple setups of agent profiling on the same agent.
Fixes agent read context to only read agents that were setup.
Prevent copy of agent profiling internal data struct and reset
hsa_signal on move to prevent inadvertant delete.
Simplifying PR template (#1139)
Implementation of ompt interface to rocprofiler SDK. task_create and task_schedule are not supported.
Fixing installed pacakge tests in CI (#1119)
* Fixing installed pacakge tests in CI
* Formatted rocprofv3.py with black formatter
Fix PSDB change (#1120)
Reverts change to `source/include/rocprofiler-sdk/callback_tracing.h`
from commit 9b2ece76c3
delete unused files
added arguments to some OMPT buffter records
* Fix cmake issues
Remove rocprofiler_ompt_finalize_tool
- a public API function is not necessary: should just finalize rocprofiler-sdk
Fix duplicate ROCPROFILER_{BUFFER,CALLBACK}_TRACING_KIND_STRING
Add lib/rocprofiler-sdk/ompt.hpp
- declares rocprofiler::sdk::finalize_ompt
Remove change to tests/rocprofv3/summary/conftest.py
Add set_fini_status(1) back to registration.cpp
Deleted uneeded files
Incoporate OpenMP code and sample
Fix merge issues with amd-staging
Add push_correlation_id for OpenMP tasking; improve debugability
fixup bad merge
* Suppress OpenMP data race
* Fix openmp_target sample
* Enum and struct name changes + source code reorg
- remove mix of ompt and openmp
- opted for ompt
- changes made for consistency
- ompt_api -> ompt
- openmp_api -> ompt
- OPENMP -> OMPT
* Update tests and more renaming
- dest_device_num -> dst_device_num
- src_addr -> src_address
- dest_addr -> dst_address
- remove info_type::begin
- require OMP_TARGET_OFFLOAD
* Update openmp-target test/sample env and labels
* Formatting
* Tweaks to cmake for openmp target
- Disable for thread sanitizers due to preloading issue
* OpenMP target cmake updates
- remove gfx1010 (fails on mi300)
- OPENMP_GPU_TARGETS
* Remove device_unload and target_map_emi support
- these are never supported by AMD OpenMP compilers
* Update CI workflow
- exclude openmp-target tests from navi3 and vega20
---------
Co-authored-by: Larry Meadows <Lawrence.Meadows@amd.com>
Co-authored-by: Jonathan R. Madsen <jonathanrmadsen@gmail.com>
434 라인
16 KiB
C++
434 라인
16 KiB
C++
// MIT License
|
|
//
|
|
// Copyright (c) 2023 Advanced Micro Devices, Inc. All rights reserved.
|
|
//
|
|
// Permission is hereby granted, free of charge, to any person obtaining a copy
|
|
// of this software and associated documentation files (the "Software"), to deal
|
|
// in the Software without restriction, including without limitation the rights
|
|
// to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
|
// copies of the Software, and to permit persons to whom the Software is
|
|
// furnished to do so, subject to the following conditions:
|
|
//
|
|
// The above copyright notice and this permission notice shall be included in all
|
|
// copies or substantial portions of the Software.
|
|
//
|
|
// THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
|
// IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
// FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
|
// AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
// LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
|
// OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
// SOFTWARE.
|
|
|
|
#include "lib/common/logging.hpp"
|
|
#include "lib/rocprofiler-sdk/context/context.hpp"
|
|
#include "lib/rocprofiler-sdk/context/domain.hpp"
|
|
#include "lib/rocprofiler-sdk/hip/hip.hpp"
|
|
#include "lib/rocprofiler-sdk/hsa/async_copy.hpp"
|
|
#include "lib/rocprofiler-sdk/hsa/hsa.hpp"
|
|
#include "lib/rocprofiler-sdk/hsa/memory_allocation.hpp"
|
|
#include "lib/rocprofiler-sdk/hsa/scratch_memory.hpp"
|
|
#include "lib/rocprofiler-sdk/kernel_dispatch/kernel_dispatch.hpp"
|
|
#include "lib/rocprofiler-sdk/marker/marker.hpp"
|
|
#include "lib/rocprofiler-sdk/ompt/ompt.hpp"
|
|
#include "lib/rocprofiler-sdk/page_migration/page_migration.hpp"
|
|
#include "lib/rocprofiler-sdk/rccl/rccl.hpp"
|
|
#include "lib/rocprofiler-sdk/registration.hpp"
|
|
#include "lib/rocprofiler-sdk/runtime_initialization.hpp"
|
|
|
|
#include <rocprofiler-sdk/fwd.h>
|
|
#include <rocprofiler-sdk/hip/table_id.h>
|
|
#include <rocprofiler-sdk/hsa/table_id.h>
|
|
#include <rocprofiler-sdk/marker/table_id.h>
|
|
#include <rocprofiler-sdk/rccl/table_id.h>
|
|
#include <rocprofiler-sdk/rocprofiler.h>
|
|
|
|
#include <atomic>
|
|
#include <limits>
|
|
#include <stdexcept>
|
|
#include <string_view>
|
|
#include <vector>
|
|
|
|
#define RETURN_STATUS_ON_FAIL(...) \
|
|
if(rocprofiler_status_t _status; (_status = __VA_ARGS__) != ROCPROFILER_STATUS_SUCCESS) \
|
|
{ \
|
|
return _status; \
|
|
}
|
|
|
|
namespace rocprofiler
|
|
{
|
|
namespace buffer_tracing
|
|
{
|
|
namespace
|
|
{
|
|
#define ROCPROFILER_BUFFER_TRACING_KIND_STRING(CODE) \
|
|
template <> \
|
|
struct buffer_tracing_kind_string<ROCPROFILER_BUFFER_TRACING_##CODE> \
|
|
{ \
|
|
static constexpr auto value = \
|
|
std::pair<const char*, size_t>{#CODE, std::string_view{#CODE}.length()}; \
|
|
};
|
|
|
|
template <size_t Idx>
|
|
struct buffer_tracing_kind_string;
|
|
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(NONE)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(HSA_CORE_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(HSA_AMD_EXT_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(HSA_IMAGE_EXT_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(HSA_FINALIZE_EXT_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(HIP_RUNTIME_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(HIP_COMPILER_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(MARKER_CORE_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(MARKER_CONTROL_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(MARKER_NAME_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(MEMORY_COPY)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(MEMORY_ALLOCATION)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(KERNEL_DISPATCH)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(PAGE_MIGRATION)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(SCRATCH_MEMORY)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(CORRELATION_ID_RETIREMENT)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(RCCL_API)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(OMPT)
|
|
ROCPROFILER_BUFFER_TRACING_KIND_STRING(RUNTIME_INITIALIZATION)
|
|
|
|
template <size_t Idx, size_t... Tail>
|
|
std::pair<const char*, size_t>
|
|
get_kind_name(rocprofiler_buffer_tracing_kind_t kind, std::index_sequence<Idx, Tail...>)
|
|
{
|
|
if(kind == Idx) return buffer_tracing_kind_string<Idx>::value;
|
|
// recursion until tail empty
|
|
if constexpr(sizeof...(Tail) > 0) return get_kind_name(kind, std::index_sequence<Tail...>{});
|
|
return {nullptr, 0};
|
|
}
|
|
|
|
auto
|
|
get_unsupported()
|
|
{
|
|
auto unsupported = std::unordered_set<rocprofiler_buffer_tracing_kind_t>{};
|
|
|
|
// #if ROCPROFILER_SDK_USE_SYSTEM_RCCL == 0
|
|
// // Built against RCCL which does not support API tracing
|
|
// unsupported.emplace(ROCPROFILER_BUFFER_TRACING_RCCL_API);
|
|
// #endif
|
|
|
|
return unsupported;
|
|
}
|
|
} // namespace
|
|
} // namespace buffer_tracing
|
|
} // namespace rocprofiler
|
|
|
|
extern "C" {
|
|
rocprofiler_status_t
|
|
rocprofiler_configure_buffer_tracing_service(rocprofiler_context_id_t context_id,
|
|
rocprofiler_buffer_tracing_kind_t kind,
|
|
const rocprofiler_tracing_operation_t* operations,
|
|
size_t operations_count,
|
|
rocprofiler_buffer_id_t buffer_id)
|
|
{
|
|
static auto unsupported = ::rocprofiler::buffer_tracing::get_unsupported();
|
|
|
|
if(rocprofiler::registration::get_init_status() > -1)
|
|
return ROCPROFILER_STATUS_ERROR_CONFIGURATION_LOCKED;
|
|
|
|
if(unsupported.count(kind) > 0) return ROCPROFILER_STATUS_ERROR_NOT_IMPLEMENTED;
|
|
|
|
auto* ctx = rocprofiler::context::get_mutable_registered_context(context_id);
|
|
|
|
if(!ctx) return ROCPROFILER_STATUS_ERROR_CONTEXT_NOT_FOUND;
|
|
|
|
if(buffer_id.handle == 0) return ROCPROFILER_STATUS_ERROR_BUFFER_NOT_FOUND;
|
|
|
|
constexpr auto invalid_buffer_id =
|
|
rocprofiler_buffer_id_t{std::numeric_limits<uint64_t>::max()};
|
|
|
|
if(!ctx->buffered_tracer)
|
|
{
|
|
ctx->buffered_tracer = std::make_unique<rocprofiler::context::buffer_tracing_service>();
|
|
ctx->buffered_tracer->buffer_data.fill(invalid_buffer_id);
|
|
}
|
|
|
|
if(ctx->buffered_tracer->buffer_data.at(kind).handle != invalid_buffer_id.handle)
|
|
return ROCPROFILER_STATUS_ERROR_SERVICE_ALREADY_CONFIGURED;
|
|
|
|
RETURN_STATUS_ON_FAIL(rocprofiler::context::add_domain(ctx->buffered_tracer->domains, kind));
|
|
|
|
ctx->buffered_tracer->buffer_data.at(kind) = buffer_id;
|
|
|
|
for(size_t i = 0; i < operations_count; ++i)
|
|
{
|
|
RETURN_STATUS_ON_FAIL(rocprofiler::context::add_domain_op(
|
|
ctx->buffered_tracer->domains, kind, operations[i]));
|
|
}
|
|
|
|
if(kind == ROCPROFILER_BUFFER_TRACING_PAGE_MIGRATION)
|
|
RETURN_STATUS_ON_FAIL(rocprofiler::page_migration::init());
|
|
|
|
return ROCPROFILER_STATUS_SUCCESS;
|
|
}
|
|
|
|
rocprofiler_status_t
|
|
rocprofiler_query_buffer_tracing_kind_name(rocprofiler_buffer_tracing_kind_t kind,
|
|
const char** name,
|
|
uint64_t* name_len)
|
|
{
|
|
auto&& val = rocprofiler::buffer_tracing::get_kind_name(
|
|
kind, std::make_index_sequence<ROCPROFILER_BUFFER_TRACING_LAST>{});
|
|
|
|
if(name) *name = val.first;
|
|
if(name_len) *name_len = val.second;
|
|
|
|
return (val.first) ? ROCPROFILER_STATUS_SUCCESS : ROCPROFILER_STATUS_ERROR_KIND_NOT_FOUND;
|
|
}
|
|
|
|
rocprofiler_status_t
|
|
rocprofiler_query_buffer_tracing_kind_operation_name(rocprofiler_buffer_tracing_kind_t kind,
|
|
rocprofiler_tracing_operation_t operation,
|
|
const char** name,
|
|
uint64_t* name_len)
|
|
{
|
|
if(kind < ROCPROFILER_BUFFER_TRACING_NONE || kind >= ROCPROFILER_BUFFER_TRACING_LAST)
|
|
return ROCPROFILER_STATUS_ERROR_KIND_NOT_FOUND;
|
|
|
|
const char* val = nullptr;
|
|
switch(kind)
|
|
{
|
|
case ROCPROFILER_BUFFER_TRACING_NONE:
|
|
case ROCPROFILER_BUFFER_TRACING_LAST:
|
|
{
|
|
return ROCPROFILER_STATUS_ERROR_INVALID_ARGUMENT;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_CORE_API:
|
|
{
|
|
val = rocprofiler::hsa::name_by_id<ROCPROFILER_HSA_TABLE_ID_Core>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_AMD_EXT_API:
|
|
{
|
|
val = rocprofiler::hsa::name_by_id<ROCPROFILER_HSA_TABLE_ID_AmdExt>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_IMAGE_EXT_API:
|
|
{
|
|
val = rocprofiler::hsa::name_by_id<ROCPROFILER_HSA_TABLE_ID_ImageExt>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_FINALIZE_EXT_API:
|
|
{
|
|
val = rocprofiler::hsa::name_by_id<ROCPROFILER_HSA_TABLE_ID_FinalizeExt>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MEMORY_COPY:
|
|
{
|
|
val = rocprofiler::hsa::async_copy::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MEMORY_ALLOCATION:
|
|
{
|
|
val = rocprofiler::hsa::memory_allocation::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_SCRATCH_MEMORY:
|
|
{
|
|
val = rocprofiler::hsa::scratch_memory::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MARKER_CORE_API:
|
|
{
|
|
val = rocprofiler::marker::name_by_id<ROCPROFILER_MARKER_TABLE_ID_RoctxCore>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MARKER_CONTROL_API:
|
|
{
|
|
val = rocprofiler::marker::name_by_id<ROCPROFILER_MARKER_TABLE_ID_RoctxControl>(
|
|
operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MARKER_NAME_API:
|
|
{
|
|
val = rocprofiler::marker::name_by_id<ROCPROFILER_MARKER_TABLE_ID_RoctxName>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_RCCL_API:
|
|
{
|
|
val = rocprofiler::rccl::name_by_id<ROCPROFILER_RCCL_TABLE_ID>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HIP_RUNTIME_API:
|
|
{
|
|
val = rocprofiler::hip::name_by_id<ROCPROFILER_HIP_TABLE_ID_Runtime>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HIP_COMPILER_API:
|
|
{
|
|
val = rocprofiler::hip::name_by_id<ROCPROFILER_HIP_TABLE_ID_Compiler>(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_KERNEL_DISPATCH:
|
|
{
|
|
val = rocprofiler::kernel_dispatch::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_PAGE_MIGRATION:
|
|
{
|
|
val = rocprofiler::page_migration::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_OMPT:
|
|
{
|
|
val = rocprofiler::ompt::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_RUNTIME_INITIALIZATION:
|
|
{
|
|
val = rocprofiler::runtime_init::name_by_id(operation);
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_CORRELATION_ID_RETIREMENT:
|
|
{
|
|
return ROCPROFILER_STATUS_ERROR_NOT_IMPLEMENTED;
|
|
}
|
|
};
|
|
|
|
if(!val)
|
|
{
|
|
if(name) *name = nullptr;
|
|
if(name_len) *name_len = 0;
|
|
|
|
return ROCPROFILER_STATUS_ERROR_OPERATION_NOT_FOUND;
|
|
}
|
|
|
|
if(name) *name = val;
|
|
if(name_len) *name_len = strnlen(val, 4096);
|
|
|
|
return ROCPROFILER_STATUS_SUCCESS;
|
|
}
|
|
|
|
rocprofiler_status_t
|
|
rocprofiler_iterate_buffer_tracing_kinds(rocprofiler_buffer_tracing_kind_cb_t callback, void* data)
|
|
{
|
|
for(uint32_t i = 0; i < ROCPROFILER_BUFFER_TRACING_LAST; ++i)
|
|
{
|
|
auto _success = callback(static_cast<rocprofiler_buffer_tracing_kind_t>(i), data);
|
|
if(_success != 0) break;
|
|
}
|
|
|
|
return ROCPROFILER_STATUS_SUCCESS;
|
|
}
|
|
|
|
rocprofiler_status_t
|
|
rocprofiler_iterate_buffer_tracing_kind_operations(
|
|
rocprofiler_buffer_tracing_kind_t kind,
|
|
rocprofiler_buffer_tracing_kind_operation_cb_t callback,
|
|
void* data)
|
|
{
|
|
auto ops = std::vector<uint32_t>{};
|
|
switch(kind)
|
|
{
|
|
case ROCPROFILER_BUFFER_TRACING_NONE:
|
|
case ROCPROFILER_BUFFER_TRACING_LAST:
|
|
{
|
|
return ROCPROFILER_STATUS_ERROR_INVALID_ARGUMENT;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_CORE_API:
|
|
{
|
|
ops = rocprofiler::hsa::get_ids<ROCPROFILER_HSA_TABLE_ID_Core>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_AMD_EXT_API:
|
|
{
|
|
ops = rocprofiler::hsa::get_ids<ROCPROFILER_HSA_TABLE_ID_AmdExt>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_IMAGE_EXT_API:
|
|
{
|
|
ops = rocprofiler::hsa::get_ids<ROCPROFILER_HSA_TABLE_ID_ImageExt>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HSA_FINALIZE_EXT_API:
|
|
{
|
|
ops = rocprofiler::hsa::get_ids<ROCPROFILER_HSA_TABLE_ID_FinalizeExt>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MEMORY_COPY:
|
|
{
|
|
ops = rocprofiler::hsa::async_copy::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MEMORY_ALLOCATION:
|
|
{
|
|
ops = rocprofiler::hsa::memory_allocation::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_SCRATCH_MEMORY:
|
|
{
|
|
ops = rocprofiler::hsa::scratch_memory::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MARKER_CORE_API:
|
|
{
|
|
ops = rocprofiler::marker::get_ids<ROCPROFILER_MARKER_TABLE_ID_RoctxCore>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MARKER_CONTROL_API:
|
|
{
|
|
ops = rocprofiler::marker::get_ids<ROCPROFILER_MARKER_TABLE_ID_RoctxControl>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_MARKER_NAME_API:
|
|
{
|
|
ops = rocprofiler::marker::get_ids<ROCPROFILER_MARKER_TABLE_ID_RoctxName>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_RCCL_API:
|
|
{
|
|
ops = rocprofiler::rccl::get_ids<ROCPROFILER_RCCL_TABLE_ID>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HIP_RUNTIME_API:
|
|
{
|
|
ops = rocprofiler::hip::get_ids<ROCPROFILER_HIP_TABLE_ID_Runtime>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_HIP_COMPILER_API:
|
|
{
|
|
ops = rocprofiler::hip::get_ids<ROCPROFILER_HIP_TABLE_ID_Compiler>();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_KERNEL_DISPATCH:
|
|
{
|
|
ops = rocprofiler::kernel_dispatch::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_PAGE_MIGRATION:
|
|
{
|
|
ops = rocprofiler::page_migration::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_OMPT:
|
|
{
|
|
ops = rocprofiler::ompt::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_RUNTIME_INITIALIZATION:
|
|
{
|
|
ops = rocprofiler::runtime_init::get_ids();
|
|
break;
|
|
}
|
|
case ROCPROFILER_BUFFER_TRACING_CORRELATION_ID_RETIREMENT:
|
|
{
|
|
return ROCPROFILER_STATUS_ERROR_NOT_IMPLEMENTED;
|
|
}
|
|
}
|
|
|
|
for(const auto& itr : ops)
|
|
{
|
|
auto _success = callback(kind, itr, data);
|
|
if(_success != 0) break;
|
|
}
|
|
return ROCPROFILER_STATUS_SUCCESS;
|
|
}
|
|
}
|
|
|
|
#undef RETURN_STATUS_ON_FAIL
|