9de3a6b0b4
* causal backtrace updates
- fix initial causal sampling period value
* causal delay updates
- tweak handling of sleep_for_overhead
* Fix experiment global scaling for prog pts
- results in drastically improved predictions
* pthread_mutex_gotcha updates
- disable all wrappers during causal profiling
* validate-causal-json.py updates
- support decimal stddev
- fix setting stddev from command-line
* causal perform_experiment_impl update
- handle start failing because finalizing
* deprecate causal::component::sample_rate
- appears to not help at all
* Rework sample info
* Increase causal unwind_depth
- use OMNITRACE_MAX_UNWIND_DEPTH
* validate-causal-json updates
- min experiments
- exclude reporting predictions with less than X experiments at a given speedup
- percent samples
- only print samples within X% of the peak (default: 95%)
* Update timemory submodule
- extensions to sampling for signals delivered via non-timer method
- e.g. via HW counter overflow
* dwarf_entry::operator< updates
- sort via file
* causal profiling docs updates
- info about backends
- info about installing/enabling perf
* config updates: causal backend
- CausalBackend enum
- OMNITRACE_CAUSAL_BACKEND: perf, timer, auto
- omnitrace-causal option: --backend
* debug update
- use spin_mutex instead of std::mutex
* address_range::contains update
- range from 0-100 contains range from 10-100 but was returning false because high was == 100 not < 100
* symbol::operator< update
- handle load address differences
* sampling updates (non-causal)
- update get_timer to get_trigger + dynamic_cast
* container::static_vector updates
- support construction from container::c_array
- update_size private member func for handling atomic m_size
* Move perf files
- moved library/causal/perf.{hpp,cpp} to library/perf.{hpp,cpp}
* causal example update
- created impl.hpp (forward decls)
- renamed {cpu,rng}_func_impl to {cpu,rng}_impl_func
- only create two threads which run N iterations instead of two threads each iteration
* Update timemory submodule
- updates to unwind::processed_entry
- updates to procfs::maps
* Updated causal documentation
- fixed line numbers changed by modifications to causal example
* omnitrace-causal exe updates
- set OMNITRACE_THREAD_POOL_SIZE to zero by default
* core/containers updates
- static_vector: provide data() member function
- c_array pop_front() and pop_back() member functions
* core: config and argparse updates + perf
- core/perf.{hpp,cpp}
- forward decl of enums
- config-related capabilities
- argparse: --sample-overflow
- renamed some config functions
- e.g. get_sampling_cpu_freq -> get_sampling_cputime_freq
- added config settings related to overflow sampling via perf
- added timer_sampling and overflow_sampling categories
* Update timemory submodule
- sampling allocator flushing
* binary updates
- lookup_ipaddr_entry
- use bfd_find_nearest_line instead of bfd_find_nearest_line_discriminator
- discriminators are not used
- explicit instantiations of inlined_symbol::serialize
* Bump VERSION to 1.10.0
* sampling and perf updates
- support overflow sampling via Linux Perf
- update perf namespace
- update perf::perf_event
- update record ctor: pointer instead of const ref
- update open member func: return optional string
- add m_batch_size member variable
- sampling updates
- support overflow sampling
- flush allocators
- increase buffer size from 1024 to 2048
- restructure post-processing in light of perf overflow supports
- improve offload memory usage only load buffers for thread
- load_offload_buffer(tid) uses thread-specific filepos
- component updates
- backtrace_metrics::operator-=
- backtrace_metrics::operator-
- backtrace::sample does not record for overflow signal
- callchain: perf overflow sample
* core updates
- component::sampling_percent does not report self + uses_percent_units
* causal updates
- tweak get_line_info
- overloads for set_current_selection (uint64_t, c_array, std::array)
- delay
- use sampling::pause/sampling::resume
- experiment
- experiment::sample derives from unwind::processed_entry
- experiment::samples is vector instead of set
- fixed samples
- overloads for is_selected (uint64_t, c_array, std::array)
- scaling factor defaults to 100 instead of 50
- serialize updates follow change to experiment::sample
- modify algorithm for increasing/decreasing experiment length
- sample_data
- use map<uintptr, uint64_t> instead of set<sample_data>
- get_samples returns vector<sample_data> instead of set<sample_data>
- sampling
- support overflow via Linux Perf
- update causal_offload_buffer
- flush sampling allocator
- backtrace
- overflow component
* libomnitrace-dl updates
- handle dl::InstrumentMode::PythonProfile
* testing updates (causal)
- causal line 155 -> causal line 100
- causal line 165 -> causal line 110
* formatting
* exit_gotcha updates
- exit_info for abort()
- message about non-zero exit code
* testing updates
- fail regex for causal tests
- validate-causal-json: >= min_experiments instead of > min_experiments
- handle OMNITRACE_DEBUG_SETTINGS in omnitrace_write_test_config
* causal sampling updates
- add new lines where appropriate
* causal data updates
- reorder diagnostic info when experiment fails to start
* binary updates
- symbol address range from address to address + symsize + 1
- add 1 based on debug info
* causal data updates
- sample_selection wait_ns defaults to 1,000 instead of 10,000
- sample_selection wait scaled by iteration number
- save_line_info_impl verbosity
- print latest_eligible_pc when experiment does not start
* causal sampling + component updates
- perf backend disables component::backtrace
- ensure get_sampling_(realtime|cputime|overflow)_signal do not malloc
* causal: remove period stats
* validate-causal-json update
- fix --help
* causal data updates
- improve eligible pc history reporting when experiment fails to start
* causal data updates
- fix compute_eligible_lines_impl
- eligible address ranges returning too many ranges
- occasionally, overwrite all *true* eligible address ranges
* causal data updates
- reduce scoped ranges to symbol ranges
- is_eligible_address() returns true contains (not just coarse)
- revert some sample_selection behavior
* binary address_multirange updates
- make coarse_range private
- fix operator+=(pair<coarse, uintptr_t>)
* causal example update
- fix nsync to default to once per iteration
* binary analysis updates
- tweak header file includes
* causal updates
- remove factoring in sleep_for_overhead
- invoke delay::process() even if experiment is not active
* causal data updates
- update latest_eligible_pc structure
* update omnitrace-install.py.in
- fix support for fedora
- /etc/os-release does not have ID_LIKE
- fallback to RHEL 8.7 if version not specified
* update omnitrace-install.py.in
- fix support for debian
- /etc/os-release does not have ID_LIKE
- version mapping
* Update documentation
- update docs on installation
* causal data and experiment updates
- data: reset_sample_selection
* causal set_current_selection debugging
- debug messages for failed e2e runs
* causal data and backtrace component updates
- data: set_current_selection returns the number of eligible addresses added
- backtrace: if cputime signal has selected zero IPs > 5x, then realtime signal starts contributing call-stacks
* core library updates
- move config::parse_numeric_range to utility namespace
- add core/utility.cpp
- support range:increment, e.g. 5-25:10 expands to '5 15 25' instead of '5 10 15 20 25'
* omnitrace-causal update
- end-to-end expands all speedups
- support range:increment in speedups
* causal backtrace updates
- remove select_ival (realtime signal always contributes when select_count == 0)
* containers: static_vector update
- explicit c_array constructor
- explicit std::array constructor
* causal data updates
- remove set_current_selection(uint64_t)
- remove set_current_selection(std::array)
- sample_selection increase default wait time
- report eligible PC candidates
- move reset_sample_selection to perform_experiment_impl
- decrease latest_eligible_pc array size
- set_current_selection does not guard for experiment::active
* core debug updates
- OMNITRACE_PRINT_COLOR macros
* causal data updates
- tweak to experiment never started message
* causal gotcha updates
- remove unused code
* critical trace updates
- remove unused code
* omnitrace-causal
- OMNITRACE_LAUNCHER
* causal data updates
- don't fail on end-to-end + omnitrace-causal
* causal backtrace updates
- reintroduce select_ival behavior
* causal data updates
- tweak verbose messages about number of PC candidates
* core mproc updates
- utilities for waiting on child PID and diagnosing status
- omnitrace::mproc::wait_pid
- omnitrace::mproc::diagnose_status
* omnitrace-run updates
- support --fork argument for executing via fork in current process + execvpe on child instead of execvpe in current process
* omnitrace-causal updates
- wait_pid and diagnose_status just call equivalent functions in omnitrace::mproc
* ubuntu-focal workflow update
- attempt to launch ubuntu-focal-codecov job with CAP_SYS_ADMIN and use perf backend
* tests reorg and updates
- remove binary-rewrite-sampling and runtime-instrument-sampling tests
- rename *-preload tests (which use omnitrace-sample exe) to *-sampling
- split tests/CMakeLists.txt into several tests/omnitrace-<category>-tests.cmake files
- tweak to causal-both-omni-func test
- add args: -n 2 -b timer
* update validate-causal-json.py
- better reasoning info for adjusting tolerance
- always apply tolerance adjustments in CI mode
* causal e2e tests update
- add label "causal-e2e" label
- tweak params
- old: 80 12 432525 500000000
- new: 80 50 432525 100000000
- disable processor affinity for slow-func/line-100 tests
- artificially inflates some speedups with perf
* unblocking_gotcha updates
- overload operator() according to gotcha function index
* blocking_gotcha updates
- overload operator() according to gotcha function index
- fix bug where potentially post block functors (e.g. pthread_mutex_trylock) throw error if lock is not acquired.
* parse_numeric_range update
- support unordered_set
* config update
- OMNITRACE_DEBUG_{TIDS,PIDS} use parse_numeric_range
1449 líneas
58 KiB
C++
1449 líneas
58 KiB
C++
// MIT License
|
|
//
|
|
// Copyright (c) 2022 Advanced Micro Devices, Inc. All Rights Reserved.
|
|
//
|
|
// Permission is hereby granted, free of charge, to any person obtaining a copy
|
|
// of this software and associated documentation files (the "Software"), to deal
|
|
// in the Software without restriction, including without limitation the rights
|
|
// to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
|
|
// copies of the Software, and to permit persons to whom the Software is
|
|
// furnished to do so, subject to the following conditions:
|
|
//
|
|
// The above copyright notice and this permission notice shall be included in all
|
|
// copies or substantial portions of the Software.
|
|
//
|
|
// THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
|
// IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
// FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
|
// AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
// LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
|
// OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
// SOFTWARE.
|
|
|
|
#if !defined(OMNITRACE_DL_SOURCE)
|
|
# define OMNITRACE_DL_SOURCE 1
|
|
#endif
|
|
|
|
#define OMNITRACE_COMMON_LIBRARY_NAME "dl"
|
|
|
|
#include <timemory/log/color.hpp>
|
|
|
|
#define OMNITRACE_COMMON_LIBRARY_LOG_START \
|
|
fprintf(stderr, "%s", ::tim::log::color::info());
|
|
#define OMNITRACE_COMMON_LIBRARY_LOG_END fprintf(stderr, "%s", ::tim::log::color::end());
|
|
|
|
#include "common/defines.h"
|
|
#include "common/delimit.hpp"
|
|
#include "common/environment.hpp"
|
|
#include "common/invoke.hpp"
|
|
#include "common/join.hpp"
|
|
#include "common/setup.hpp"
|
|
#include "dl/dl.hpp"
|
|
#include "omnitrace/categories.h"
|
|
#include "omnitrace/types.h"
|
|
|
|
#include <timemory/utility/filepath.hpp>
|
|
|
|
#include <cassert>
|
|
#include <chrono>
|
|
#include <gnu/libc-version.h>
|
|
#include <link.h>
|
|
#include <linux/limits.h>
|
|
#include <sys/types.h>
|
|
#include <sys/wait.h>
|
|
#include <thread>
|
|
#include <unistd.h>
|
|
|
|
//--------------------------------------------------------------------------------------//
|
|
|
|
#define OMNITRACE_DLSYM(VARNAME, HANDLE, FUNCNAME) \
|
|
if(HANDLE) \
|
|
{ \
|
|
*(void**) (&VARNAME) = dlsym(HANDLE, FUNCNAME); \
|
|
if(VARNAME == nullptr && _omnitrace_dl_verbose >= _warn_verbose) \
|
|
{ \
|
|
OMNITRACE_COMMON_LIBRARY_LOG_START \
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i]> %s :: %s\n", getpid(), FUNCNAME, \
|
|
dlerror()); \
|
|
OMNITRACE_COMMON_LIBRARY_LOG_END \
|
|
} \
|
|
else if(_omnitrace_dl_verbose > _info_verbose) \
|
|
{ \
|
|
OMNITRACE_COMMON_LIBRARY_LOG_START \
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i]> %s :: success\n", getpid(), \
|
|
FUNCNAME); \
|
|
OMNITRACE_COMMON_LIBRARY_LOG_END \
|
|
} \
|
|
}
|
|
|
|
//--------------------------------------------------------------------------------------//
|
|
|
|
using main_func_t = int (*)(int, char**, char**);
|
|
|
|
std::ostream&
|
|
operator<<(std::ostream& _os, const SpaceHandle& _handle)
|
|
{
|
|
_os << _handle.name;
|
|
return _os;
|
|
}
|
|
|
|
namespace omnitrace
|
|
{
|
|
namespace dl
|
|
{
|
|
namespace
|
|
{
|
|
inline int
|
|
get_omnitrace_env()
|
|
{
|
|
auto&& _debug = get_env("OMNITRACE_DEBUG", false);
|
|
return get_env("OMNITRACE_VERBOSE", (_debug) ? 100 : 0);
|
|
}
|
|
|
|
inline int
|
|
get_omnitrace_dl_env()
|
|
{
|
|
return get_env("OMNITRACE_DL_DEBUG", false)
|
|
? 100
|
|
: get_env("OMNITRACE_DL_VERBOSE", get_omnitrace_env());
|
|
}
|
|
|
|
inline bool&
|
|
get_omnitrace_is_preloaded()
|
|
{
|
|
static bool _v = []() {
|
|
auto&& _preload_libs = get_env("LD_PRELOAD", std::string{});
|
|
return (_preload_libs.find("libomnitrace-dl.so") != std::string::npos);
|
|
}();
|
|
return _v;
|
|
}
|
|
|
|
inline bool
|
|
get_omnitrace_preload()
|
|
{
|
|
static bool _v = []() {
|
|
auto&& _preload = get_env("OMNITRACE_PRELOAD", true);
|
|
auto&& _preload_libs = get_env("LD_PRELOAD", std::string{});
|
|
return (_preload &&
|
|
_preload_libs.find("libomnitrace-dl.so") != std::string::npos);
|
|
}();
|
|
return _v;
|
|
}
|
|
|
|
inline void
|
|
reset_omnitrace_preload()
|
|
{
|
|
auto&& _preload_libs = get_env("LD_PRELOAD", std::string{});
|
|
if(_preload_libs.find("libomnitrace-dl.so") != std::string::npos)
|
|
{
|
|
(void) get_omnitrace_is_preloaded();
|
|
(void) get_omnitrace_preload();
|
|
auto _modified_preload = std::string{};
|
|
for(const auto& itr : delimit(_preload_libs, ":"))
|
|
{
|
|
if(itr.find("libomnitrace") != std::string::npos) continue;
|
|
_modified_preload += common::join("", ":", itr);
|
|
}
|
|
if(!_modified_preload.empty() && _modified_preload.find(':') == 0)
|
|
_modified_preload = _modified_preload.substr(1);
|
|
|
|
setenv("LD_PRELOAD", _modified_preload.c_str(), 1);
|
|
}
|
|
}
|
|
|
|
inline pid_t
|
|
get_omnitrace_root_pid()
|
|
{
|
|
auto _pid = getpid();
|
|
setenv("OMNITRACE_ROOT_PROCESS", std::to_string(_pid).c_str(), 0);
|
|
return get_env("OMNITRACE_ROOT_PROCESS", _pid);
|
|
}
|
|
|
|
void
|
|
omnitrace_preinit() OMNITRACE_INTERNAL_API;
|
|
|
|
void
|
|
omnitrace_postinit(std::string exe = {}) OMNITRACE_INTERNAL_API;
|
|
|
|
pid_t _omnitrace_root_pid = get_omnitrace_root_pid();
|
|
|
|
// environment priority:
|
|
// - OMNITRACE_DL_DEBUG
|
|
// - OMNITRACE_DL_VERBOSE
|
|
// - OMNITRACE_DEBUG
|
|
// - OMNITRACE_VERBOSE
|
|
int _omnitrace_dl_verbose = get_omnitrace_dl_env();
|
|
|
|
// The docs for dlopen suggest that the combination of RTLD_LOCAL + RTLD_DEEPBIND
|
|
// (when available) helps ensure that the symbols in the instrumentation library
|
|
// libomnitrace.so will use it's own symbols... not symbols that are potentially
|
|
// instrumented. However, this only applies to the symbols in libomnitrace.so,
|
|
// which is NOT self-contained, i.e. symbols in timemory and the libs it links to
|
|
// (such as libpapi.so) are not protected by the deep-bind option. Additionally,
|
|
// it should be noted that DynInst does *NOT* add instrumentation by manipulating the
|
|
// dynamic linker (otherwise it would only be limited to shared libs) -- it manipulates
|
|
// the instructions in the binary so that a call to a function such as "main" actually
|
|
// calls "main_dyninst", which executes the instrumentation snippets around the actual
|
|
// "main" (this is the reason you need the dyninstAPI_RT library).
|
|
//
|
|
// UPDATE:
|
|
// Use of RTLD_DEEPBIND has been removed because it causes the dyninst
|
|
// ProcControlAPI to segfault within pthread_cond_wait on certain executables.
|
|
//
|
|
// Here are the docs on the dlopen options used:
|
|
//
|
|
// RTLD_LAZY
|
|
// Perform lazy binding. Only resolve symbols as the code that references them is
|
|
// executed. If the symbol is never referenced, then it is never resolved. (Lazy
|
|
// binding is only performed for function references; references to variables are
|
|
// always immediately bound when the library is loaded.)
|
|
//
|
|
// RTLD_LOCAL
|
|
// This is the converse of RTLD_GLOBAL, and the default if neither flag is specified.
|
|
// Symbols defined in this library are not made available to resolve references in
|
|
// subsequently loaded libraries.
|
|
//
|
|
// RTLD_DEEPBIND (since glibc 2.3.4)
|
|
// Place the lookup scope of the symbols in this library ahead of the global scope.
|
|
// This means that a self-contained library will use its own symbols in preference to
|
|
// global symbols with the same name contained in libraries that have already been
|
|
// loaded. This flag is not specified in POSIX.1-2001.
|
|
//
|
|
#if __GLIBC__ >= 2 && __GLIBC_MINOR__ >= 4
|
|
auto _omnitrace_dl_dlopen_flags = RTLD_LAZY | RTLD_LOCAL;
|
|
const char* _omnitrace_dl_dlopen_descr = "RTLD_LAZY | RTLD_LOCAL";
|
|
#else
|
|
auto _omnitrace_dl_dlopen_flags = RTLD_LAZY | RTLD_LOCAL;
|
|
const char* _omnitrace_dl_dlopen_descr = "RTLD_LAZY | RTLD_LOCAL";
|
|
#endif
|
|
|
|
/// This class contains function pointers for omnitrace's instrumentation functions
|
|
struct OMNITRACE_INTERNAL_API indirect
|
|
{
|
|
OMNITRACE_INLINE indirect(const std::string& _omnilib, const std::string& _userlib,
|
|
const std::string& _dllib)
|
|
: m_omnilib{ common::path::find_path(_omnilib, _omnitrace_dl_verbose) }
|
|
, m_dllib{ common::path::find_path(_dllib, _omnitrace_dl_verbose) }
|
|
, m_userlib{ common::path::find_path(_userlib, _omnitrace_dl_verbose) }
|
|
{
|
|
if(_omnitrace_dl_verbose >= 1)
|
|
{
|
|
OMNITRACE_COMMON_LIBRARY_LOG_START
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i] %s resolved to '%s'\n", getpid(),
|
|
::basename(_omnilib.c_str()), m_omnilib.c_str());
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i] %s resolved to '%s'\n", getpid(),
|
|
::basename(_dllib.c_str()), m_dllib.c_str());
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i] %s resolved to '%s'\n", getpid(),
|
|
::basename(_userlib.c_str()), m_userlib.c_str());
|
|
OMNITRACE_COMMON_LIBRARY_LOG_END
|
|
}
|
|
|
|
auto _search_paths = common::join(':', common::path::dirname(_omnilib),
|
|
common::path::dirname(_dllib));
|
|
common::setup_environ(_omnitrace_dl_verbose, _search_paths, _omnilib, _dllib);
|
|
|
|
m_omnihandle = open(m_omnilib);
|
|
m_userhandle = open(m_userlib);
|
|
init();
|
|
}
|
|
|
|
OMNITRACE_INLINE ~indirect() { dlclose(m_omnihandle); }
|
|
|
|
static OMNITRACE_INLINE void* open(const std::string& _lib)
|
|
{
|
|
auto* libhandle = dlopen(_lib.c_str(), _omnitrace_dl_dlopen_flags);
|
|
|
|
if(libhandle)
|
|
{
|
|
if(_omnitrace_dl_verbose >= 2)
|
|
{
|
|
OMNITRACE_COMMON_LIBRARY_LOG_START
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i] dlopen(\"%s\", %s) :: success\n",
|
|
getpid(), _lib.c_str(), _omnitrace_dl_dlopen_descr);
|
|
OMNITRACE_COMMON_LIBRARY_LOG_END
|
|
}
|
|
}
|
|
else
|
|
{
|
|
if(_omnitrace_dl_verbose >= 0)
|
|
{
|
|
perror("dlopen");
|
|
OMNITRACE_COMMON_LIBRARY_LOG_START
|
|
fprintf(stderr, "[omnitrace][dl][pid=%i] dlopen(\"%s\", %s) :: %s\n",
|
|
getpid(), _lib.c_str(), _omnitrace_dl_dlopen_descr, dlerror());
|
|
OMNITRACE_COMMON_LIBRARY_LOG_END
|
|
}
|
|
}
|
|
|
|
dlerror(); // Clear any existing error
|
|
|
|
return libhandle;
|
|
}
|
|
|
|
OMNITRACE_INLINE void init()
|
|
{
|
|
if(!m_omnihandle) m_omnihandle = open(m_omnilib);
|
|
|
|
int _warn_verbose = 0;
|
|
int _info_verbose = 2;
|
|
// Initialize all pointers
|
|
OMNITRACE_DLSYM(omnitrace_init_library_f, m_omnihandle, "omnitrace_init_library");
|
|
OMNITRACE_DLSYM(omnitrace_init_tooling_f, m_omnihandle, "omnitrace_init_tooling");
|
|
OMNITRACE_DLSYM(omnitrace_init_f, m_omnihandle, "omnitrace_init");
|
|
OMNITRACE_DLSYM(omnitrace_finalize_f, m_omnihandle, "omnitrace_finalize");
|
|
OMNITRACE_DLSYM(omnitrace_set_env_f, m_omnihandle, "omnitrace_set_env");
|
|
OMNITRACE_DLSYM(omnitrace_set_mpi_f, m_omnihandle, "omnitrace_set_mpi");
|
|
OMNITRACE_DLSYM(omnitrace_push_trace_f, m_omnihandle, "omnitrace_push_trace");
|
|
OMNITRACE_DLSYM(omnitrace_pop_trace_f, m_omnihandle, "omnitrace_pop_trace");
|
|
OMNITRACE_DLSYM(omnitrace_push_region_f, m_omnihandle, "omnitrace_push_region");
|
|
OMNITRACE_DLSYM(omnitrace_pop_region_f, m_omnihandle, "omnitrace_pop_region");
|
|
OMNITRACE_DLSYM(omnitrace_push_category_region_f, m_omnihandle,
|
|
"omnitrace_push_category_region");
|
|
OMNITRACE_DLSYM(omnitrace_pop_category_region_f, m_omnihandle,
|
|
"omnitrace_pop_category_region");
|
|
OMNITRACE_DLSYM(omnitrace_register_source_f, m_omnihandle,
|
|
"omnitrace_register_source");
|
|
OMNITRACE_DLSYM(omnitrace_register_coverage_f, m_omnihandle,
|
|
"omnitrace_register_coverage");
|
|
OMNITRACE_DLSYM(omnitrace_progress_f, m_omnihandle, "omnitrace_progress");
|
|
OMNITRACE_DLSYM(omnitrace_annotated_progress_f, m_omnihandle,
|
|
"omnitrace_annotated_progress");
|
|
|
|
OMNITRACE_DLSYM(kokkosp_print_help_f, m_omnihandle, "kokkosp_print_help");
|
|
OMNITRACE_DLSYM(kokkosp_parse_args_f, m_omnihandle, "kokkosp_parse_args");
|
|
OMNITRACE_DLSYM(kokkosp_declare_metadata_f, m_omnihandle,
|
|
"kokkosp_declare_metadata");
|
|
OMNITRACE_DLSYM(kokkosp_request_tool_settings_f, m_omnihandle,
|
|
"kokkosp_request_tool_settings");
|
|
OMNITRACE_DLSYM(kokkosp_init_library_f, m_omnihandle, "kokkosp_init_library");
|
|
OMNITRACE_DLSYM(kokkosp_finalize_library_f, m_omnihandle,
|
|
"kokkosp_finalize_library");
|
|
OMNITRACE_DLSYM(kokkosp_begin_parallel_for_f, m_omnihandle,
|
|
"kokkosp_begin_parallel_for");
|
|
OMNITRACE_DLSYM(kokkosp_end_parallel_for_f, m_omnihandle,
|
|
"kokkosp_end_parallel_for");
|
|
OMNITRACE_DLSYM(kokkosp_begin_parallel_reduce_f, m_omnihandle,
|
|
"kokkosp_begin_parallel_reduce");
|
|
OMNITRACE_DLSYM(kokkosp_end_parallel_reduce_f, m_omnihandle,
|
|
"kokkosp_end_parallel_reduce");
|
|
OMNITRACE_DLSYM(kokkosp_begin_parallel_scan_f, m_omnihandle,
|
|
"kokkosp_begin_parallel_scan");
|
|
OMNITRACE_DLSYM(kokkosp_end_parallel_scan_f, m_omnihandle,
|
|
"kokkosp_end_parallel_scan");
|
|
OMNITRACE_DLSYM(kokkosp_begin_fence_f, m_omnihandle, "kokkosp_begin_fence");
|
|
OMNITRACE_DLSYM(kokkosp_end_fence_f, m_omnihandle, "kokkosp_end_fence");
|
|
OMNITRACE_DLSYM(kokkosp_push_profile_region_f, m_omnihandle,
|
|
"kokkosp_push_profile_region");
|
|
OMNITRACE_DLSYM(kokkosp_pop_profile_region_f, m_omnihandle,
|
|
"kokkosp_pop_profile_region");
|
|
OMNITRACE_DLSYM(kokkosp_create_profile_section_f, m_omnihandle,
|
|
"kokkosp_create_profile_section");
|
|
OMNITRACE_DLSYM(kokkosp_destroy_profile_section_f, m_omnihandle,
|
|
"kokkosp_destroy_profile_section");
|
|
OMNITRACE_DLSYM(kokkosp_start_profile_section_f, m_omnihandle,
|
|
"kokkosp_start_profile_section");
|
|
OMNITRACE_DLSYM(kokkosp_stop_profile_section_f, m_omnihandle,
|
|
"kokkosp_stop_profile_section");
|
|
OMNITRACE_DLSYM(kokkosp_allocate_data_f, m_omnihandle, "kokkosp_allocate_data");
|
|
OMNITRACE_DLSYM(kokkosp_deallocate_data_f, m_omnihandle,
|
|
"kokkosp_deallocate_data");
|
|
OMNITRACE_DLSYM(kokkosp_begin_deep_copy_f, m_omnihandle,
|
|
"kokkosp_begin_deep_copy");
|
|
OMNITRACE_DLSYM(kokkosp_end_deep_copy_f, m_omnihandle, "kokkosp_end_deep_copy");
|
|
OMNITRACE_DLSYM(kokkosp_profile_event_f, m_omnihandle, "kokkosp_profile_event");
|
|
OMNITRACE_DLSYM(kokkosp_dual_view_sync_f, m_omnihandle, "kokkosp_dual_view_sync");
|
|
OMNITRACE_DLSYM(kokkosp_dual_view_modify_f, m_omnihandle,
|
|
"kokkosp_dual_view_modify");
|
|
|
|
#if OMNITRACE_USE_ROCTRACER > 0
|
|
OMNITRACE_DLSYM(hsa_on_load_f, m_omnihandle, "OnLoad");
|
|
OMNITRACE_DLSYM(hsa_on_unload_f, m_omnihandle, "OnUnload");
|
|
#endif
|
|
|
|
#if OMNITRACE_USE_ROCPROFILER > 0
|
|
OMNITRACE_DLSYM(rocp_on_load_tool_prop_f, m_omnihandle, "OnLoadToolProp");
|
|
OMNITRACE_DLSYM(rocp_on_unload_tool_f, m_omnihandle, "OnUnloadTool");
|
|
#endif
|
|
|
|
#if OMNITRACE_USE_OMPT == 0
|
|
_warn_verbose = 5;
|
|
#else
|
|
OMNITRACE_DLSYM(ompt_start_tool_f, m_omnihandle, "ompt_start_tool");
|
|
#endif
|
|
|
|
if(!m_userhandle) m_userhandle = open(m_userlib);
|
|
_warn_verbose = 0;
|
|
OMNITRACE_DLSYM(omnitrace_user_configure_f, m_userhandle,
|
|
"omnitrace_user_configure");
|
|
|
|
if(omnitrace_user_configure_f)
|
|
{
|
|
omnitrace_user_callbacks_t _cb = {};
|
|
_cb.start_trace = &omnitrace_user_start_trace_dl;
|
|
_cb.stop_trace = &omnitrace_user_stop_trace_dl;
|
|
_cb.start_thread_trace = &omnitrace_user_start_thread_trace_dl;
|
|
_cb.stop_thread_trace = &omnitrace_user_stop_thread_trace_dl;
|
|
_cb.push_region = &omnitrace_user_push_region_dl;
|
|
_cb.pop_region = &omnitrace_user_pop_region_dl;
|
|
_cb.progress = &omnitrace_user_progress_dl;
|
|
_cb.push_annotated_region = &omnitrace_user_push_annotated_region_dl;
|
|
_cb.pop_annotated_region = &omnitrace_user_pop_annotated_region_dl;
|
|
_cb.annotated_progress = &omnitrace_user_annotated_progress_dl;
|
|
(*omnitrace_user_configure_f)(OMNITRACE_USER_REPLACE_CONFIG, _cb, nullptr);
|
|
}
|
|
}
|
|
|
|
public:
|
|
using user_cb_t = omnitrace_user_callbacks_t;
|
|
|
|
// libomnitrace functions
|
|
void (*omnitrace_init_library_f)(void) = nullptr;
|
|
void (*omnitrace_init_tooling_f)(void) = nullptr;
|
|
void (*omnitrace_init_f)(const char*, bool, const char*) = nullptr;
|
|
void (*omnitrace_finalize_f)(void) = nullptr;
|
|
void (*omnitrace_set_env_f)(const char*, const char*) = nullptr;
|
|
void (*omnitrace_set_mpi_f)(bool, bool) = nullptr;
|
|
void (*omnitrace_register_source_f)(const char*, const char*, size_t, size_t,
|
|
const char*) = nullptr;
|
|
void (*omnitrace_register_coverage_f)(const char*, const char*, size_t) = nullptr;
|
|
void (*omnitrace_push_trace_f)(const char*) = nullptr;
|
|
void (*omnitrace_pop_trace_f)(const char*) = nullptr;
|
|
int (*omnitrace_push_region_f)(const char*) = nullptr;
|
|
int (*omnitrace_pop_region_f)(const char*) = nullptr;
|
|
int (*omnitrace_push_category_region_f)(omnitrace_category_t, const char*,
|
|
omnitrace_annotation_t*, size_t) = nullptr;
|
|
int (*omnitrace_pop_category_region_f)(omnitrace_category_t, const char*,
|
|
omnitrace_annotation_t*, size_t) = nullptr;
|
|
void (*omnitrace_progress_f)(const char*) = nullptr;
|
|
void (*omnitrace_annotated_progress_f)(const char*, omnitrace_annotation_t*,
|
|
size_t) = nullptr;
|
|
|
|
// libomnitrace-user functions
|
|
int (*omnitrace_user_configure_f)(int, user_cb_t, user_cb_t*) = nullptr;
|
|
|
|
// KokkosP functions
|
|
void (*kokkosp_print_help_f)(char*) = nullptr;
|
|
void (*kokkosp_parse_args_f)(int, char**) = nullptr;
|
|
void (*kokkosp_declare_metadata_f)(const char*, const char*) = nullptr;
|
|
void (*kokkosp_request_tool_settings_f)(const uint32_t,
|
|
Kokkos_Tools_ToolSettings*) = nullptr;
|
|
void (*kokkosp_init_library_f)(const int, const uint64_t, const uint32_t,
|
|
void*) = nullptr;
|
|
void (*kokkosp_finalize_library_f)() = nullptr;
|
|
void (*kokkosp_begin_parallel_for_f)(const char*, uint32_t, uint64_t*) = nullptr;
|
|
void (*kokkosp_end_parallel_for_f)(uint64_t) = nullptr;
|
|
void (*kokkosp_begin_parallel_reduce_f)(const char*, uint32_t, uint64_t*) = nullptr;
|
|
void (*kokkosp_end_parallel_reduce_f)(uint64_t) = nullptr;
|
|
void (*kokkosp_begin_parallel_scan_f)(const char*, uint32_t, uint64_t*) = nullptr;
|
|
void (*kokkosp_end_parallel_scan_f)(uint64_t) = nullptr;
|
|
void (*kokkosp_begin_fence_f)(const char*, uint32_t, uint64_t*) = nullptr;
|
|
void (*kokkosp_end_fence_f)(uint64_t) = nullptr;
|
|
void (*kokkosp_push_profile_region_f)(const char*) = nullptr;
|
|
void (*kokkosp_pop_profile_region_f)() = nullptr;
|
|
void (*kokkosp_create_profile_section_f)(const char*, uint32_t*) = nullptr;
|
|
void (*kokkosp_destroy_profile_section_f)(uint32_t) = nullptr;
|
|
void (*kokkosp_start_profile_section_f)(uint32_t) = nullptr;
|
|
void (*kokkosp_stop_profile_section_f)(uint32_t) = nullptr;
|
|
void (*kokkosp_allocate_data_f)(const SpaceHandle, const char*, const void* const,
|
|
const uint64_t) = nullptr;
|
|
void (*kokkosp_deallocate_data_f)(const SpaceHandle, const char*, const void* const,
|
|
const uint64_t) = nullptr;
|
|
void (*kokkosp_begin_deep_copy_f)(SpaceHandle, const char*, const void*, SpaceHandle,
|
|
const char*, const void*, uint64_t) = nullptr;
|
|
void (*kokkosp_end_deep_copy_f)() = nullptr;
|
|
void (*kokkosp_profile_event_f)(const char*) = nullptr;
|
|
void (*kokkosp_dual_view_sync_f)(const char*, const void* const, bool) = nullptr;
|
|
void (*kokkosp_dual_view_modify_f)(const char*, const void* const, bool) = nullptr;
|
|
|
|
// HSA functions
|
|
#if OMNITRACE_USE_ROCTRACER > 0
|
|
bool (*hsa_on_load_f)(HsaApiTable*, uint64_t, uint64_t, const char* const*) = nullptr;
|
|
void (*hsa_on_unload_f)() = nullptr;
|
|
#endif
|
|
|
|
// ROCP functions
|
|
#if OMNITRACE_USE_ROCPROFILER > 0
|
|
void (*rocp_on_load_tool_prop_f)(void* settings) = nullptr;
|
|
void (*rocp_on_unload_tool_f)() = nullptr;
|
|
#endif
|
|
|
|
// OpenMP functions
|
|
#if defined(OMNITRACE_USE_OMPT) && OMNITRACE_USE_OMPT > 0
|
|
ompt_start_tool_result_t* (*ompt_start_tool_f)(unsigned int, const char*);
|
|
#endif
|
|
|
|
auto get_omni_library() const { return m_omnilib; }
|
|
auto get_user_library() const { return m_userlib; }
|
|
auto get_dl_library() const { return m_dllib; }
|
|
|
|
private:
|
|
void* m_omnihandle = nullptr;
|
|
void* m_userhandle = nullptr;
|
|
std::string m_omnilib = {};
|
|
std::string m_dllib = {};
|
|
std::string m_userlib = {};
|
|
};
|
|
|
|
inline indirect&
|
|
get_indirect() OMNITRACE_INTERNAL_API;
|
|
|
|
indirect&
|
|
get_indirect()
|
|
{
|
|
omnitrace_preinit_library();
|
|
|
|
static auto _libomni = get_env("OMNITRACE_LIBRARY", "libomnitrace.so");
|
|
static auto _libuser = get_env("OMNITRACE_USER_LIBRARY", "libomnitrace-user.so");
|
|
static auto _libdlib = get_env("OMNITRACE_DL_LIBRARY", "libomnitrace-dl.so");
|
|
static auto* _v = new indirect{ _libomni, _libuser, _libdlib };
|
|
return *_v;
|
|
}
|
|
|
|
auto&
|
|
get_inited()
|
|
{
|
|
static bool* _v = new bool{ false };
|
|
return *_v;
|
|
}
|
|
|
|
auto&
|
|
get_finied()
|
|
{
|
|
static bool* _v = new bool{ false };
|
|
return *_v;
|
|
}
|
|
|
|
auto&
|
|
get_active()
|
|
{
|
|
static bool* _v = new bool{ false };
|
|
return *_v;
|
|
}
|
|
|
|
auto&
|
|
get_enabled()
|
|
{
|
|
static auto* _v = new std::atomic<bool>{ get_env("OMNITRACE_INIT_ENABLED", true) };
|
|
return *_v;
|
|
}
|
|
|
|
auto&
|
|
get_thread_enabled()
|
|
{
|
|
static thread_local bool _v = get_enabled();
|
|
return _v;
|
|
}
|
|
|
|
auto&
|
|
get_thread_count()
|
|
{
|
|
static thread_local int64_t _v = 0;
|
|
return _v;
|
|
}
|
|
|
|
auto&
|
|
get_thread_status()
|
|
{
|
|
static thread_local bool _v = false;
|
|
return _v;
|
|
}
|
|
|
|
InstrumentMode&
|
|
get_instrumented()
|
|
{
|
|
static auto _v = get_env("OMNITRACE_INSTRUMENT_MODE", InstrumentMode::None);
|
|
return _v;
|
|
}
|
|
|
|
// ensure finalization is called
|
|
bool _omnitrace_dl_fini = (std::atexit([]() {
|
|
if(get_active()) omnitrace_finalize();
|
|
}),
|
|
true);
|
|
} // namespace
|
|
} // namespace dl
|
|
} // namespace omnitrace
|
|
|
|
//--------------------------------------------------------------------------------------//
|
|
|
|
#define OMNITRACE_DL_INVOKE(...) \
|
|
::omnitrace::common::invoke(__FUNCTION__, ::omnitrace::dl::_omnitrace_dl_verbose, \
|
|
(::omnitrace::dl::get_thread_status() = false), \
|
|
__VA_ARGS__)
|
|
|
|
#define OMNITRACE_DL_IGNORE(...) \
|
|
::omnitrace::common::ignore(__FUNCTION__, ::omnitrace::dl::_omnitrace_dl_verbose, \
|
|
__VA_ARGS__)
|
|
|
|
#define OMNITRACE_DL_INVOKE_STATUS(STATUS, ...) \
|
|
::omnitrace::common::invoke(__FUNCTION__, ::omnitrace::dl::_omnitrace_dl_verbose, \
|
|
STATUS, __VA_ARGS__)
|
|
|
|
#define OMNITRACE_DL_LOG(LEVEL, ...) \
|
|
if(::omnitrace::dl::_omnitrace_dl_verbose >= LEVEL) \
|
|
{ \
|
|
fflush(stderr); \
|
|
OMNITRACE_COMMON_LIBRARY_LOG_START \
|
|
fprintf(stderr, "[omnitrace][" OMNITRACE_COMMON_LIBRARY_NAME "][%i] ", \
|
|
getpid()); \
|
|
fprintf(stderr, __VA_ARGS__); \
|
|
OMNITRACE_COMMON_LIBRARY_LOG_END \
|
|
fflush(stderr); \
|
|
}
|
|
|
|
using omnitrace::dl::get_indirect;
|
|
namespace dl = omnitrace::dl;
|
|
|
|
extern "C"
|
|
{
|
|
void omnitrace_preinit_library(void)
|
|
{
|
|
if(omnitrace::common::get_env("OMNITRACE_MONOCHROME", tim::log::monochrome()))
|
|
tim::log::monochrome() = true;
|
|
}
|
|
|
|
int omnitrace_preload_library(void)
|
|
{
|
|
return (::omnitrace::dl::get_omnitrace_preload()) ? 1 : 0;
|
|
}
|
|
|
|
void omnitrace_init_library(void)
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_init_library_f);
|
|
}
|
|
|
|
void omnitrace_init_tooling(void)
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_init_tooling_f);
|
|
}
|
|
|
|
void omnitrace_init(const char* a, bool b, const char* c)
|
|
{
|
|
if(dl::get_inited() && dl::get_finied())
|
|
{
|
|
OMNITRACE_DL_LOG(
|
|
2, "%s(%s) ignored :: already initialized and finalized\n", __FUNCTION__,
|
|
::omnitrace::join(::omnitrace::QuoteStrings{}, ", ", a, b, c).c_str());
|
|
return;
|
|
}
|
|
else if(dl::get_inited() && dl::get_active())
|
|
{
|
|
OMNITRACE_DL_LOG(
|
|
2, "%s(%s) ignored :: already initialized and active\n", __FUNCTION__,
|
|
::omnitrace::join(::omnitrace::QuoteStrings{}, ", ", a, b, c).c_str());
|
|
return;
|
|
}
|
|
|
|
if(dl::get_instrumented() < dl::InstrumentMode::PythonProfile)
|
|
dl::omnitrace_preinit();
|
|
|
|
bool _invoked = false;
|
|
OMNITRACE_DL_INVOKE_STATUS(_invoked, get_indirect().omnitrace_init_f, a, b, c);
|
|
if(_invoked)
|
|
{
|
|
dl::get_active() = true;
|
|
dl::get_inited() = true;
|
|
dl::_omnitrace_dl_verbose = dl::get_omnitrace_dl_env();
|
|
if(dl::get_instrumented() < dl::InstrumentMode::PythonProfile)
|
|
dl::omnitrace_postinit((c) ? std::string{ c } : std::string{});
|
|
}
|
|
}
|
|
|
|
void omnitrace_finalize(void)
|
|
{
|
|
if(dl::get_inited() && dl::get_finied())
|
|
{
|
|
OMNITRACE_DL_LOG(2, "%s() ignored :: already initialized and finalized\n",
|
|
__FUNCTION__);
|
|
return;
|
|
}
|
|
else if(dl::get_finied() && !dl::get_active())
|
|
{
|
|
OMNITRACE_DL_LOG(2, "%s() ignored :: already finalized but not active\n",
|
|
__FUNCTION__);
|
|
return;
|
|
}
|
|
|
|
bool _invoked = false;
|
|
OMNITRACE_DL_INVOKE_STATUS(_invoked, get_indirect().omnitrace_finalize_f);
|
|
if(_invoked)
|
|
{
|
|
dl::get_active() = false;
|
|
dl::get_finied() = true;
|
|
}
|
|
}
|
|
|
|
void omnitrace_push_trace(const char* name)
|
|
{
|
|
if(!dl::get_active()) return;
|
|
if(dl::get_thread_enabled())
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_push_trace_f, name);
|
|
}
|
|
else
|
|
{
|
|
++dl::get_thread_count();
|
|
}
|
|
}
|
|
|
|
void omnitrace_pop_trace(const char* name)
|
|
{
|
|
if(!dl::get_active()) return;
|
|
if(dl::get_thread_enabled())
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_pop_trace_f, name);
|
|
}
|
|
else
|
|
{
|
|
if(dl::get_thread_count()-- == 0) omnitrace_user_start_thread_trace_dl();
|
|
}
|
|
}
|
|
|
|
int omnitrace_push_region(const char* name)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
if(dl::get_thread_enabled())
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_push_region_f, name);
|
|
}
|
|
else
|
|
{
|
|
++dl::get_thread_count();
|
|
}
|
|
return 0;
|
|
}
|
|
|
|
int omnitrace_pop_region(const char* name)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
if(dl::get_thread_enabled())
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_pop_region_f, name);
|
|
}
|
|
else
|
|
{
|
|
if(dl::get_thread_count()-- == 0) omnitrace_user_start_thread_trace_dl();
|
|
}
|
|
return 0;
|
|
}
|
|
|
|
int omnitrace_push_category_region(omnitrace_category_t _category, const char* name,
|
|
omnitrace_annotation_t* _annotations,
|
|
size_t _annotation_count)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
if(dl::get_thread_enabled())
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_push_category_region_f,
|
|
_category, name, _annotations, _annotation_count);
|
|
}
|
|
else
|
|
{
|
|
++dl::get_thread_count();
|
|
}
|
|
return 0;
|
|
}
|
|
|
|
int omnitrace_pop_category_region(omnitrace_category_t _category, const char* name,
|
|
omnitrace_annotation_t* _annotations,
|
|
size_t _annotation_count)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
if(dl::get_thread_enabled())
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_pop_category_region_f,
|
|
_category, name, _annotations, _annotation_count);
|
|
}
|
|
else
|
|
{
|
|
++dl::get_thread_count();
|
|
}
|
|
return 0;
|
|
}
|
|
|
|
void omnitrace_set_env(const char* a, const char* b)
|
|
{
|
|
if(dl::get_inited() && dl::get_active())
|
|
{
|
|
OMNITRACE_DL_IGNORE(2, "already initialized and active", a, b);
|
|
return;
|
|
}
|
|
OMNITRACE_DL_LOG(2, "%s(%s, %s)\n", __FUNCTION__, a, b);
|
|
setenv(a, b, 0);
|
|
// OMNITRACE_DL_INVOKE(get_indirect().omnitrace_set_env_f, a, b);
|
|
}
|
|
|
|
void omnitrace_set_mpi(bool a, bool b)
|
|
{
|
|
if(dl::get_inited() && dl::get_active())
|
|
{
|
|
OMNITRACE_DL_IGNORE(2, "already initialized and active", a, b);
|
|
return;
|
|
}
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_set_mpi_f, a, b);
|
|
}
|
|
|
|
void omnitrace_register_source(const char* file, const char* func, size_t line,
|
|
size_t address, const char* source)
|
|
{
|
|
OMNITRACE_DL_LOG(3, "%s(\"%s\", \"%s\", %zu, %zu, \"%s\")\n", __FUNCTION__, file,
|
|
func, line, address, source);
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_register_source_f, file, func, line,
|
|
address, source);
|
|
}
|
|
|
|
void omnitrace_register_coverage(const char* file, const char* func, size_t address)
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_register_coverage_f, file, func,
|
|
address);
|
|
}
|
|
|
|
int omnitrace_user_start_trace_dl(void)
|
|
{
|
|
dl::get_enabled().store(true);
|
|
return omnitrace_user_start_thread_trace_dl();
|
|
}
|
|
|
|
int omnitrace_user_stop_trace_dl(void)
|
|
{
|
|
dl::get_enabled().store(false);
|
|
return omnitrace_user_stop_thread_trace_dl();
|
|
}
|
|
|
|
int omnitrace_user_start_thread_trace_dl(void)
|
|
{
|
|
dl::get_thread_enabled() = true;
|
|
return 0;
|
|
}
|
|
|
|
int omnitrace_user_stop_thread_trace_dl(void)
|
|
{
|
|
dl::get_thread_enabled() = false;
|
|
return 0;
|
|
}
|
|
|
|
int omnitrace_user_push_region_dl(const char* name)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_push_region_f, name);
|
|
}
|
|
|
|
int omnitrace_user_pop_region_dl(const char* name)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_pop_region_f, name);
|
|
}
|
|
|
|
int omnitrace_user_progress_dl(const char* name)
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_progress_f, name);
|
|
return 0;
|
|
}
|
|
|
|
int omnitrace_user_push_annotated_region_dl(const char* name,
|
|
omnitrace_annotation_t* _annotations,
|
|
size_t _annotation_count)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_push_category_region_f,
|
|
OMNITRACE_CATEGORY_USER, name, _annotations,
|
|
_annotation_count);
|
|
}
|
|
|
|
int omnitrace_user_pop_annotated_region_dl(const char* name,
|
|
omnitrace_annotation_t* _annotations,
|
|
size_t _annotation_count)
|
|
{
|
|
if(!dl::get_active()) return 0;
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_pop_category_region_f,
|
|
OMNITRACE_CATEGORY_USER, name, _annotations,
|
|
_annotation_count);
|
|
}
|
|
|
|
int omnitrace_user_annotated_progress_dl(const char* name,
|
|
omnitrace_annotation_t* _annotations,
|
|
size_t _annotation_count)
|
|
{
|
|
OMNITRACE_DL_INVOKE(get_indirect().omnitrace_annotated_progress_f, name,
|
|
_annotations, _annotation_count);
|
|
return 0;
|
|
}
|
|
|
|
void omnitrace_progress(const char* _name)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_progress_f, _name);
|
|
}
|
|
|
|
void omnitrace_annotated_progress(const char* _name,
|
|
omnitrace_annotation_t* _annotations,
|
|
size_t _annotation_count)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().omnitrace_annotated_progress_f, _name,
|
|
_annotations, _annotation_count);
|
|
}
|
|
|
|
void omnitrace_set_instrumented(int _mode)
|
|
{
|
|
OMNITRACE_DL_LOG(2, "%s(%i)\n", __FUNCTION__, _mode);
|
|
auto _mode_v = static_cast<dl::InstrumentMode>(_mode);
|
|
if(_mode_v < dl::InstrumentMode::None || _mode_v >= dl::InstrumentMode::Last)
|
|
{
|
|
OMNITRACE_DL_LOG(-127,
|
|
"%s(mode=%i) invoked with invalid instrumentation mode. "
|
|
"mode should be %i >= mode < %i\n",
|
|
__FUNCTION__, _mode,
|
|
static_cast<int>(dl::InstrumentMode::None),
|
|
static_cast<int>(dl::InstrumentMode::Last));
|
|
}
|
|
dl::get_instrumented() = _mode_v;
|
|
}
|
|
|
|
//----------------------------------------------------------------------------------//
|
|
//
|
|
// KokkosP
|
|
//
|
|
//----------------------------------------------------------------------------------//
|
|
|
|
void kokkosp_print_help(char* argv0)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_print_help_f, argv0);
|
|
}
|
|
|
|
void kokkosp_parse_args(int argc, char** argv)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_parse_args_f, argc, argv);
|
|
}
|
|
|
|
void kokkosp_declare_metadata(const char* key, const char* value)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_declare_metadata_f, key, value);
|
|
}
|
|
|
|
void kokkosp_request_tool_settings(const uint32_t version,
|
|
Kokkos_Tools_ToolSettings* settings)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_request_tool_settings_f,
|
|
version, settings);
|
|
}
|
|
|
|
void kokkosp_init_library(const int loadSeq, const uint64_t interfaceVer,
|
|
const uint32_t devInfoCount, void* deviceInfo)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_init_library_f, loadSeq,
|
|
interfaceVer, devInfoCount, deviceInfo);
|
|
}
|
|
|
|
void kokkosp_finalize_library()
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_finalize_library_f);
|
|
}
|
|
|
|
void kokkosp_begin_parallel_for(const char* name, uint32_t devid, uint64_t* kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_begin_parallel_for_f, name,
|
|
devid, kernid);
|
|
}
|
|
|
|
void kokkosp_end_parallel_for(uint64_t kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_end_parallel_for_f, kernid);
|
|
}
|
|
|
|
void kokkosp_begin_parallel_reduce(const char* name, uint32_t devid, uint64_t* kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_begin_parallel_reduce_f, name,
|
|
devid, kernid);
|
|
}
|
|
|
|
void kokkosp_end_parallel_reduce(uint64_t kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_end_parallel_reduce_f, kernid);
|
|
}
|
|
|
|
void kokkosp_begin_parallel_scan(const char* name, uint32_t devid, uint64_t* kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_begin_parallel_scan_f, name,
|
|
devid, kernid);
|
|
}
|
|
|
|
void kokkosp_end_parallel_scan(uint64_t kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_end_parallel_scan_f, kernid);
|
|
}
|
|
|
|
void kokkosp_begin_fence(const char* name, uint32_t devid, uint64_t* kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_begin_fence_f, name, devid,
|
|
kernid);
|
|
}
|
|
|
|
void kokkosp_end_fence(uint64_t kernid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_end_fence_f, kernid);
|
|
}
|
|
|
|
void kokkosp_push_profile_region(const char* name)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_push_profile_region_f, name);
|
|
}
|
|
|
|
void kokkosp_pop_profile_region()
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_pop_profile_region_f);
|
|
}
|
|
|
|
void kokkosp_create_profile_section(const char* name, uint32_t* secid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_create_profile_section_f, name,
|
|
secid);
|
|
}
|
|
|
|
void kokkosp_destroy_profile_section(uint32_t secid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_destroy_profile_section_f,
|
|
secid);
|
|
}
|
|
|
|
void kokkosp_start_profile_section(uint32_t secid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_start_profile_section_f, secid);
|
|
}
|
|
|
|
void kokkosp_stop_profile_section(uint32_t secid)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_stop_profile_section_f, secid);
|
|
}
|
|
|
|
void kokkosp_allocate_data(const SpaceHandle space, const char* label,
|
|
const void* const ptr, const uint64_t size)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_allocate_data_f, space, label,
|
|
ptr, size);
|
|
}
|
|
|
|
void kokkosp_deallocate_data(const SpaceHandle space, const char* label,
|
|
const void* const ptr, const uint64_t size)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_deallocate_data_f, space, label,
|
|
ptr, size);
|
|
}
|
|
|
|
void kokkosp_begin_deep_copy(SpaceHandle dst_handle, const char* dst_name,
|
|
const void* dst_ptr, SpaceHandle src_handle,
|
|
const char* src_name, const void* src_ptr, uint64_t size)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_begin_deep_copy_f, dst_handle,
|
|
dst_name, dst_ptr, src_handle, src_name, src_ptr,
|
|
size);
|
|
}
|
|
|
|
void kokkosp_end_deep_copy()
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_end_deep_copy_f);
|
|
}
|
|
|
|
void kokkosp_profile_event(const char* name)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_profile_event_f, name);
|
|
}
|
|
|
|
void kokkosp_dual_view_sync(const char* label, const void* const data, bool is_device)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_dual_view_sync_f, label, data,
|
|
is_device);
|
|
}
|
|
|
|
void kokkosp_dual_view_modify(const char* label, const void* const data,
|
|
bool is_device)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().kokkosp_dual_view_modify_f, label, data,
|
|
is_device);
|
|
}
|
|
|
|
//----------------------------------------------------------------------------------//
|
|
//
|
|
// HSA
|
|
//
|
|
//----------------------------------------------------------------------------------//
|
|
|
|
#if OMNITRACE_USE_ROCTRACER > 0
|
|
bool OnLoad(HsaApiTable* table, uint64_t runtime_version, uint64_t failed_tool_count,
|
|
const char* const* failed_tool_names)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().hsa_on_load_f, table, runtime_version,
|
|
failed_tool_count, failed_tool_names);
|
|
}
|
|
|
|
void OnUnload() { return OMNITRACE_DL_INVOKE(get_indirect().hsa_on_unload_f); }
|
|
#endif
|
|
|
|
//----------------------------------------------------------------------------------//
|
|
//
|
|
// ROCP
|
|
//
|
|
//----------------------------------------------------------------------------------//
|
|
|
|
#if OMNITRACE_USE_ROCPROFILER > 0
|
|
void OnLoadToolProp(void* settings)
|
|
{
|
|
OMNITRACE_DL_LOG(-16,
|
|
"invoking %s(rocprofiler_settings_t*) within omnitrace-dl.so "
|
|
"will cause a silent failure for rocprofiler. ROCP_TOOL_LIB "
|
|
"should be set to libomnitrace.so\n",
|
|
__FUNCTION__);
|
|
abort();
|
|
return OMNITRACE_DL_INVOKE(get_indirect().rocp_on_load_tool_prop_f, settings);
|
|
}
|
|
|
|
void OnUnloadTool()
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().rocp_on_unload_tool_f);
|
|
}
|
|
#endif
|
|
|
|
//----------------------------------------------------------------------------------//
|
|
//
|
|
// OMPT
|
|
//
|
|
//----------------------------------------------------------------------------------//
|
|
#if OMNITRACE_USE_OMPT > 0
|
|
ompt_start_tool_result_t* ompt_start_tool(unsigned int omp_version,
|
|
const char* runtime_version)
|
|
{
|
|
return OMNITRACE_DL_INVOKE(get_indirect().ompt_start_tool_f, omp_version,
|
|
runtime_version);
|
|
}
|
|
#endif
|
|
}
|
|
|
|
namespace omnitrace
|
|
{
|
|
namespace dl
|
|
{
|
|
namespace
|
|
{
|
|
bool
|
|
omnitrace_preload() OMNITRACE_INTERNAL_API;
|
|
|
|
std::vector<std::string>
|
|
get_link_map(const char*,
|
|
std::vector<int>&& = { (RTLD_LAZY | RTLD_NOLOAD) }) OMNITRACE_INTERNAL_API;
|
|
|
|
const char*
|
|
get_default_mode() OMNITRACE_INTERNAL_API;
|
|
|
|
void
|
|
verify_instrumented_preloaded() OMNITRACE_INTERNAL_API;
|
|
|
|
std::vector<std::string>
|
|
get_link_map(const char* _name, std::vector<int>&& _open_modes)
|
|
{
|
|
void* _handle = nullptr;
|
|
bool _noload = false;
|
|
for(auto _mode : _open_modes)
|
|
{
|
|
_handle = dlopen(_name, _mode);
|
|
_noload = (_mode & RTLD_NOLOAD) == RTLD_NOLOAD;
|
|
if(_handle) break;
|
|
}
|
|
|
|
auto _chain = std::vector<std::string>{};
|
|
if(_handle)
|
|
{
|
|
struct link_map* _link_map = nullptr;
|
|
dlinfo(_handle, RTLD_DI_LINKMAP, &_link_map);
|
|
struct link_map* _next = _link_map->l_next;
|
|
while(_next)
|
|
{
|
|
if(_next->l_name != nullptr && !std::string_view{ _next->l_name }.empty())
|
|
{
|
|
_chain.emplace_back(_next->l_name);
|
|
}
|
|
_next = _next->l_next;
|
|
}
|
|
|
|
if(_noload == false) dlclose(_handle);
|
|
}
|
|
return _chain;
|
|
}
|
|
|
|
const char*
|
|
get_default_mode()
|
|
{
|
|
if(get_env("OMNITRACE_USE_CAUSAL", false)) return "causal";
|
|
|
|
auto _link_map = get_link_map(nullptr);
|
|
for(const auto& itr : _link_map)
|
|
{
|
|
if(itr.find("libomnitrace-rt.so") != std::string::npos ||
|
|
itr.find("libdyninstAPI_RT.so") != std::string::npos)
|
|
return "trace";
|
|
}
|
|
|
|
return "sampling";
|
|
}
|
|
|
|
void
|
|
omnitrace_preinit()
|
|
{
|
|
switch(get_instrumented())
|
|
{
|
|
case InstrumentMode::None:
|
|
case InstrumentMode::BinaryRewrite:
|
|
case InstrumentMode::ProcessCreate:
|
|
case InstrumentMode::ProcessAttach:
|
|
{
|
|
auto _use_mpip = get_env("OMNITRACE_USE_MPIP", false);
|
|
auto _use_mpi = get_env("OMNITRACE_USE_MPI", _use_mpip);
|
|
auto _causal = get_env("OMNITRACE_USE_CAUSAL", false);
|
|
auto _mode = get_env("OMNITRACE_MODE", get_default_mode());
|
|
|
|
if(_use_mpi && !(_causal && _mode == "causal"))
|
|
{
|
|
// only make this call if true bc otherwise, if
|
|
// false, it will disable the MPIP component and
|
|
// we may intercept the MPI init call later.
|
|
// If _use_mpi defaults to true above, calling this
|
|
// will override can current env or config value for
|
|
// OMNITRACE_USE_PID.
|
|
omnitrace_set_mpi(_use_mpi, dl::get_instrumented() ==
|
|
dl::InstrumentMode::ProcessAttach);
|
|
}
|
|
break;
|
|
}
|
|
case InstrumentMode::PythonProfile:
|
|
case InstrumentMode::Last: break;
|
|
}
|
|
}
|
|
|
|
void
|
|
omnitrace_postinit(std::string _exe)
|
|
{
|
|
switch(get_instrumented())
|
|
{
|
|
case InstrumentMode::None:
|
|
case InstrumentMode::BinaryRewrite:
|
|
case InstrumentMode::ProcessCreate:
|
|
case InstrumentMode::ProcessAttach:
|
|
{
|
|
if(_exe.empty())
|
|
_exe = tim::filepath::readlink(join('/', "/proc", getpid(), "exe"));
|
|
|
|
omnitrace_init_tooling();
|
|
if(_exe.empty())
|
|
omnitrace_push_trace("main");
|
|
else
|
|
omnitrace_push_trace(basename(_exe.c_str()));
|
|
break;
|
|
}
|
|
case InstrumentMode::PythonProfile:
|
|
{
|
|
omnitrace_init_tooling();
|
|
break;
|
|
}
|
|
case InstrumentMode::Last: break;
|
|
}
|
|
}
|
|
|
|
bool
|
|
omnitrace_preload()
|
|
{
|
|
auto _preload = get_omnitrace_is_preloaded() && get_omnitrace_preload() &&
|
|
get_env("OMNITRACE_ENABLED", true);
|
|
|
|
auto _link_map = get_link_map(nullptr);
|
|
auto _instr_mode =
|
|
get_env("OMNITRACE_INSTRUMENT_MODE", dl::InstrumentMode::BinaryRewrite);
|
|
for(const auto& itr : _link_map)
|
|
{
|
|
if(itr.find("libomnitrace-rt.so") != std::string::npos ||
|
|
itr.find("libdyninstAPI_RT.so") != std::string::npos)
|
|
{
|
|
omnitrace_set_instrumented(static_cast<int>(_instr_mode));
|
|
break;
|
|
}
|
|
}
|
|
|
|
verify_instrumented_preloaded();
|
|
|
|
static bool _once = false;
|
|
if(_once) return _preload;
|
|
_once = true;
|
|
|
|
if(_preload)
|
|
{
|
|
reset_omnitrace_preload();
|
|
omnitrace_preinit_library();
|
|
}
|
|
|
|
return _preload;
|
|
}
|
|
|
|
void
|
|
verify_instrumented_preloaded()
|
|
{
|
|
// if preloaded then we are fine
|
|
if(get_omnitrace_is_preloaded()) return;
|
|
|
|
// value returned by get_instrumented is set by either:
|
|
// - the search of the linked libraries
|
|
// - via the instrumenter
|
|
// if binary rewrite or runtime instrumentation, there is an opportunity for
|
|
// LD_PRELOAD
|
|
switch(dl::get_instrumented())
|
|
{
|
|
case dl::InstrumentMode::None:
|
|
case dl::InstrumentMode::ProcessAttach:
|
|
case dl::InstrumentMode::ProcessCreate:
|
|
case dl::InstrumentMode::PythonProfile:
|
|
{
|
|
return;
|
|
}
|
|
case dl::InstrumentMode::BinaryRewrite:
|
|
{
|
|
break;
|
|
}
|
|
case dl::InstrumentMode::Last:
|
|
{
|
|
throw std::runtime_error(
|
|
"Invalid instrumentation type: InstrumentMode::Last");
|
|
}
|
|
}
|
|
|
|
static const char* _notice = R"notice(
|
|
|
|
NNNNNNNN NNNNNNNN OOOOOOOOO TTTTTTTTTTTTTTTTTTTTTTTIIIIIIIIII CCCCCCCCCCCCCEEEEEEEEEEEEEEEEEEEEEE
|
|
N:::::::N N::::::N OO:::::::::OO T:::::::::::::::::::::TI::::::::I CCC::::::::::::CE::::::::::::::::::::E
|
|
N::::::::N N::::::N OO:::::::::::::OO T:::::::::::::::::::::TI::::::::I CC:::::::::::::::CE::::::::::::::::::::E
|
|
N:::::::::N N::::::NO:::::::OOO:::::::OT:::::TT:::::::TT:::::TII::::::IIC:::::CCCCCCCC::::CEE::::::EEEEEEEEE::::E
|
|
N::::::::::N N::::::NO::::::O O::::::OTTTTTT T:::::T TTTTTT I::::I C:::::C CCCCCC E:::::E EEEEEE
|
|
N:::::::::::N N::::::NO:::::O O:::::O T:::::T I::::IC:::::C E:::::E
|
|
N:::::::N::::N N::::::NO:::::O O:::::O T:::::T I::::IC:::::C E::::::EEEEEEEEEE
|
|
N::::::N N::::N N::::::NO:::::O O:::::O T:::::T I::::IC:::::C E:::::::::::::::E
|
|
N::::::N N::::N:::::::NO:::::O O:::::O T:::::T I::::IC:::::C E:::::::::::::::E
|
|
N::::::N N:::::::::::NO:::::O O:::::O T:::::T I::::IC:::::C E::::::EEEEEEEEEE
|
|
N::::::N N::::::::::NO:::::O O:::::O T:::::T I::::IC:::::C E:::::E
|
|
N::::::N N:::::::::NO::::::O O::::::O T:::::T I::::I C:::::C CCCCCC E:::::E EEEEEE
|
|
N::::::N N::::::::NO:::::::OOO:::::::O TT:::::::TT II::::::IIC:::::CCCCCCCC::::CEE::::::EEEEEEEE:::::E
|
|
N::::::N N:::::::N OO:::::::::::::OO T:::::::::T I::::::::I CC:::::::::::::::CE::::::::::::::::::::E
|
|
N::::::N N::::::N OO:::::::::OO T:::::::::T I::::::::I CCC::::::::::::CE::::::::::::::::::::E
|
|
NNNNNNNN NNNNNNN OOOOOOOOO TTTTTTTTTTT IIIIIIIIII CCCCCCCCCCCCCEEEEEEEEEEEEEEEEEEEEEE
|
|
|
|
_ _ _____ ______
|
|
| | | |/ ____| ____|
|
|
| | | | (___ | |__
|
|
| | | |\___ \| __|
|
|
| |__| |____) | |____
|
|
\____/|_____/|______|
|
|
|
|
____ __ __ _ _ _____ _______ _____ _____ ______ _____ _ _ _ _
|
|
/ __ \| \/ | \ | |_ _|__ __| __ \ /\ / ____| ____| | __ \| | | | \ | |
|
|
| | | | \ / | \| | | | | | | |__) | / \ | | | |__ ______| |__) | | | | \| |
|
|
| | | | |\/| | . ` | | | | | | _ / / /\ \| | | __|______| _ /| | | | . ` |
|
|
| |__| | | | | |\ |_| |_ | | | | \ \ / ____ \ |____| |____ | | \ \| |__| | |\ |
|
|
\____/|_| |_|_| \_|_____| |_| |_| \_\/_/ \_\_____|______| |_| \_\\____/|_| \_|
|
|
|
|
|
|
Due to a variety of edge cases we've encountered, OmniTrace now requires that binary rewritten executables and libraries be launched
|
|
with the 'omnitrace-run' executable.
|
|
|
|
In order to launch the executable with 'omnitrace-run', prefix the current command with 'omnitrace-run' and a standalone double hyphen ('--').
|
|
For MPI applications, place 'omnitrace-run --' after the MPI command.
|
|
E.g.:
|
|
|
|
<EXECUTABLE> <ARGS...>
|
|
mpirun -n 2 <EXECUTABLE> <ARGS...>
|
|
|
|
should be:
|
|
|
|
omnitrace-run -- <EXECUTABLE> <ARGS...>
|
|
mpirun -n 2 omnitrace-run -- <EXECUTABLE> <ARGS...>
|
|
|
|
Note: the command-line arguments passed to 'omnitrace-run' (which are specified before the double hyphen) will override configuration variables
|
|
and/or any configuration values specified to 'omnitrace-instrument' via the '--config' or '--env' options.
|
|
E.g.:
|
|
|
|
$ omnitrace-instrument -o ./sleep.inst --env OMNITRACE_SAMPLING_DELAY=5.0 -- sleep
|
|
$ echo "OMNITRACE_SAMPLING_FREQ = 500" > omnitrace.cfg
|
|
$ export OMNITRACE_CONFIG_FILE=omnitrace.cfg
|
|
$ omnitrace-run --sampling-freq=100 --sampling-delay=1.0 -- ./sleep.inst 10
|
|
|
|
In the first command, a default sampling delay of 5 seconds in embedded into the instrumented 'sleep.inst'.
|
|
In the second command, the sampling frequency will be set to 500 interrupts per second when OmniTrace reads the config file
|
|
In the fourth command, the sampling frequency and sampling delay are overridden to 100 interrupts per second and 1 second, respectively, when sleep.inst runs
|
|
|
|
Thanks for using OmniTrace and happy optimizing!
|
|
)notice";
|
|
|
|
// emit notice
|
|
std::cerr << _notice << std::endl;
|
|
|
|
std::quick_exit(EXIT_FAILURE);
|
|
}
|
|
|
|
bool _handle_preload = omnitrace_preload();
|
|
main_func_t main_real = nullptr;
|
|
} // namespace
|
|
} // namespace dl
|
|
} // namespace omnitrace
|
|
|
|
extern "C"
|
|
{
|
|
int omnitrace_main(int argc, char** argv, char** envp) OMNITRACE_INTERNAL_API;
|
|
void omnitrace_set_main(main_func_t) OMNITRACE_INTERNAL_API;
|
|
|
|
void omnitrace_set_main(main_func_t _main_real)
|
|
{
|
|
::omnitrace::dl::main_real = _main_real;
|
|
}
|
|
|
|
int omnitrace_main(int argc, char** argv, char** envp)
|
|
{
|
|
OMNITRACE_DL_LOG(0, "%s\n", __FUNCTION__);
|
|
using ::omnitrace::common::get_env;
|
|
using ::omnitrace::dl::get_default_mode;
|
|
|
|
// prevent re-entry
|
|
static int _reentry = 0;
|
|
if(_reentry > 0) return -1;
|
|
_reentry = 1;
|
|
|
|
if(!::omnitrace::dl::main_real)
|
|
throw std::runtime_error("[omnitrace][dl] Unsuccessful wrapping of main: "
|
|
"nullptr to real main function");
|
|
|
|
if(envp)
|
|
{
|
|
size_t _idx = 0;
|
|
while(envp[_idx] != nullptr)
|
|
{
|
|
auto _env_v = std::string_view{ envp[_idx++] };
|
|
if(_env_v.find("OMNITRACE") != 0 &&
|
|
_env_v.find("libomnitrace") == std::string_view::npos)
|
|
continue;
|
|
auto _pos = _env_v.find('=');
|
|
if(_pos < _env_v.length())
|
|
{
|
|
auto _var = std::string{ _env_v }.substr(0, _pos);
|
|
auto _val = std::string{ _env_v }.substr(_pos + 1);
|
|
OMNITRACE_DL_LOG(1, "%s(%s, %s)\n", "omnitrace_set_env", _var.c_str(),
|
|
_val.c_str());
|
|
setenv(_var.c_str(), _val.c_str(), 0);
|
|
}
|
|
}
|
|
}
|
|
|
|
auto _mode = get_env("OMNITRACE_MODE", get_default_mode());
|
|
omnitrace_init(_mode.c_str(),
|
|
dl::get_instrumented() == dl::InstrumentMode::BinaryRewrite,
|
|
argv[0]);
|
|
|
|
int ret = (*::omnitrace::dl::main_real)(argc, argv, envp);
|
|
|
|
omnitrace_pop_trace(basename(argv[0]));
|
|
omnitrace_finalize();
|
|
|
|
return ret;
|
|
}
|
|
}
|