Update mentions of OpenMP to reflect newer implementation (#2701)
Update timemory examples in docs to use the `rocprofiler-sdk` API.
Этот коммит содержится в:
коммит произвёл
GitHub
родитель
0590a72d4b
Коммит
28b2ade7d2
+207
-184
@@ -430,194 +430,217 @@ The truncation settings be changed through the ``ROCPROFSYS_MAX_WIDTH`` setting.
|
||||
Timemory text output example
|
||||
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
||||
|
||||
In the following example, the ``NN`` field in ``|NN>>>`` is the thread ID. If MPI support is enabled,
|
||||
this becomes ``|MM|NN>>>`` where ``MM`` is the rank.
|
||||
In the following example, the ``N`` field in ``|N>>>`` is the thread ID. If MPI support is enabled,
|
||||
this becomes ``|M|N>>>`` where ``M`` is the rank.
|
||||
If ``ROCPROFSYS_COLLAPSE_THREADS=ON`` and ``ROCPROFSYS_COLLAPSE_PROCESSES=ON`` are configured,
|
||||
neither the ``MM`` nor the ``NN`` are present unless the
|
||||
neither the ``M`` nor the ``N`` are present unless the
|
||||
component explicitly sets type traits. Type traits specify that the data is only
|
||||
relevant per-thread or per-process, such as the ``thread_cpu_clock`` clock component.
|
||||
|
||||
.. code-block:: shell
|
||||
|
||||
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
| REAL-CLOCK TIMER (I.E. WALL-CLOCK TIMER) |
|
||||
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
| LABEL | COUNT | DEPTH | METRIC | UNITS | SUM | MEAN | MIN | MAX | VAR | STDDEV | % SELF |
|
||||
|--------------------------------------------------------------|--------|--------|------------|--------|-----------|-----------|-----------|-----------|----------|----------|--------|
|
||||
| |00>>> main | 1 | 0 | wall_clock | sec | 13.360265 | 13.360265 | 13.360265 | 13.360265 | 0.000000 | 0.000000 | 18.2 |
|
||||
| |00>>> |_ompt_thread_initial | 1 | 1 | wall_clock | sec | 10.924161 | 10.924161 | 10.924161 | 10.924161 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |00>>> |_ompt_implicit_task | 1 | 2 | wall_clock | sec | 10.923050 | 10.923050 | 10.923050 | 10.923050 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |00>>> |_ompt_parallel [parallelism=12] | 1 | 3 | wall_clock | sec | 10.915026 | 10.915026 | 10.915026 | 10.915026 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |00>>> |_ompt_implicit_task | 1 | 4 | wall_clock | sec | 10.647951 | 10.647951 | 10.647951 | 10.647951 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |00>>> |_ompt_work_loop | 156 | 5 | wall_clock | sec | 0.000812 | 0.000005 | 0.000001 | 0.000212 | 0.000000 | 0.000018 | 100.0 |
|
||||
| |00>>> |_ompt_work_single_executor | 40 | 5 | wall_clock | sec | 0.000016 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_ompt_sync_region_barrier_implicit | 308 | 5 | wall_clock | sec | 0.000629 | 0.000002 | 0.000001 | 0.000017 | 0.000000 | 0.000002 | 100.0 |
|
||||
| |00>>> |_conj_grad | 76 | 5 | wall_clock | sec | 10.641165 | 0.140015 | 0.131894 | 0.155099 | 0.000017 | 0.004080 | 1.0 |
|
||||
| |00>>> |_ompt_work_single_executor | 803 | 6 | wall_clock | sec | 0.000292 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_ompt_work_loop | 7904 | 6 | wall_clock | sec | 7.420265 | 0.000939 | 0.000005 | 0.006974 | 0.000003 | 0.001613 | 100.0 |
|
||||
| |00>>> |_ompt_sync_region_barrier_implicit | 6004 | 6 | wall_clock | sec | 0.283160 | 0.000047 | 0.000001 | 0.004087 | 0.000000 | 0.000303 | 100.0 |
|
||||
| |00>>> |_ompt_sync_region_barrier_implementation | 3952 | 6 | wall_clock | sec | 2.829252 | 0.000716 | 0.000007 | 0.009005 | 0.000001 | 0.000985 | 99.7 |
|
||||
| |00>>> |_ompt_sync_region_reduction | 15808 | 7 | wall_clock | sec | 0.009142 | 0.000001 | 0.000000 | 0.000007 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_ompt_work_single_other | 1249 | 6 | wall_clock | sec | 0.000270 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_ompt_work_single_other | 114 | 5 | wall_clock | sec | 0.000024 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_ompt_sync_region_barrier_implementation | 76 | 5 | wall_clock | sec | 0.000876 | 0.000012 | 0.000008 | 0.000025 | 0.000000 | 0.000003 | 84.4 |
|
||||
| |00>>> |_ompt_sync_region_reduction | 304 | 6 | wall_clock | sec | 0.000136 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_ompt_master | 226 | 5 | wall_clock | sec | 0.001978 | 0.000009 | 0.000000 | 0.000038 | 0.000000 | 0.000012 | 100.0 |
|
||||
| |11>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.656145 | 10.656145 | 10.656145 | 10.656145 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |11>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649183 | 10.649183 | 10.649183 | 10.649183 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |11>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000852 | 0.000005 | 0.000002 | 0.000230 | 0.000000 | 0.000019 | 100.0 |
|
||||
| |11>>> |_ompt_work_single_other | 149 | 6 | wall_clock | sec | 0.000035 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |11>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004135 | 0.000013 | 0.000001 | 0.001233 | 0.000000 | 0.000070 | 100.0 |
|
||||
| |11>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641302 | 0.140017 | 0.131896 | 0.155102 | 0.000017 | 0.004080 | 0.6 |
|
||||
| |11>>> |_ompt_work_single_other | 2023 | 7 | wall_clock | sec | 0.000458 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |11>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 8.253555 | 0.001044 | 0.000005 | 0.008021 | 0.000003 | 0.001790 | 100.0 |
|
||||
| |11>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.263840 | 0.000044 | 0.000001 | 0.004087 | 0.000000 | 0.000297 | 100.0 |
|
||||
| |11>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.059823 | 0.000521 | 0.000007 | 0.009508 | 0.000001 | 0.000863 | 100.0 |
|
||||
| |11>>> |_ompt_work_single_executor | 29 | 7 | wall_clock | sec | 0.000011 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |11>>> |_ompt_work_single_executor | 5 | 6 | wall_clock | sec | 0.000002 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |11>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000975 | 0.000013 | 0.000008 | 0.000024 | 0.000000 | 0.000003 | 100.0 |
|
||||
| |10>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.681664 | 10.681664 | 10.681664 | 10.681664 | 0.000000 | 0.000000 | 0.3 |
|
||||
| |10>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649158 | 10.649158 | 10.649158 | 10.649158 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |10>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000863 | 0.000006 | 0.000002 | 0.000231 | 0.000000 | 0.000019 | 100.0 |
|
||||
| |10>>> |_ompt_work_single_other | 140 | 6 | wall_clock | sec | 0.000037 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |10>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004149 | 0.000013 | 0.000001 | 0.001221 | 0.000000 | 0.000070 | 100.0 |
|
||||
| |10>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641288 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 0.7 |
|
||||
| |10>>> |_ompt_work_single_other | 1883 | 7 | wall_clock | sec | 0.000487 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |10>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 8.174545 | 0.001034 | 0.000005 | 0.006899 | 0.000003 | 0.001766 | 100.0 |
|
||||
| |10>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.268808 | 0.000045 | 0.000001 | 0.004087 | 0.000000 | 0.000299 | 100.0 |
|
||||
| |10>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.126988 | 0.000538 | 0.000007 | 0.009843 | 0.000001 | 0.000872 | 99.9 |
|
||||
| |10>>> |_ompt_sync_region_reduction | 3952 | 8 | wall_clock | sec | 0.002574 | 0.000001 | 0.000000 | 0.000014 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |10>>> |_ompt_work_single_executor | 169 | 7 | wall_clock | sec | 0.000072 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |10>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000954 | 0.000013 | 0.000009 | 0.000023 | 0.000000 | 0.000003 | 95.9 |
|
||||
| |10>>> |_ompt_sync_region_reduction | 76 | 7 | wall_clock | sec | 0.000039 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |10>>> |_ompt_work_single_executor | 14 | 6 | wall_clock | sec | 0.000006 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |09>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.686552 | 10.686552 | 10.686552 | 10.686552 | 0.000000 | 0.000000 | 0.3 |
|
||||
| |09>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649151 | 10.649151 | 10.649151 | 10.649151 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |09>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000880 | 0.000006 | 0.000002 | 0.000258 | 0.000000 | 0.000021 | 100.0 |
|
||||
| |09>>> |_ompt_work_single_other | 148 | 6 | wall_clock | sec | 0.000034 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |09>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004129 | 0.000013 | 0.000001 | 0.001210 | 0.000000 | 0.000069 | 100.0 |
|
||||
| |09>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641308 | 0.140017 | 0.131895 | 0.155102 | 0.000017 | 0.004080 | 0.7 |
|
||||
| |09>>> |_ompt_work_single_other | 2043 | 7 | wall_clock | sec | 0.000473 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |09>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 7.977001 | 0.001009 | 0.000005 | 0.007325 | 0.000003 | 0.001732 | 100.0 |
|
||||
| |09>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.242996 | 0.000040 | 0.000001 | 0.004087 | 0.000000 | 0.000284 | 100.0 |
|
||||
| |09>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.350895 | 0.000595 | 0.000007 | 0.008689 | 0.000001 | 0.000926 | 100.0 |
|
||||
| |09>>> |_ompt_work_single_executor | 9 | 7 | wall_clock | sec | 0.000004 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |09>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000973 | 0.000013 | 0.000008 | 0.000025 | 0.000000 | 0.000003 | 100.0 |
|
||||
| |09>>> |_ompt_work_single_executor | 6 | 6 | wall_clock | sec | 0.000002 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |08>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.721622 | 10.721622 | 10.721622 | 10.721622 | 0.000000 | 0.000000 | 0.7 |
|
||||
| |08>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649135 | 10.649135 | 10.649135 | 10.649135 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |08>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000839 | 0.000005 | 0.000001 | 0.000231 | 0.000000 | 0.000019 | 100.0 |
|
||||
| |08>>> |_ompt_work_single_other | 141 | 6 | wall_clock | sec | 0.000030 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |08>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004114 | 0.000013 | 0.000001 | 0.001198 | 0.000000 | 0.000069 | 100.0 |
|
||||
| |08>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641294 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 0.6 |
|
||||
| |08>>> |_ompt_work_single_other | 1742 | 7 | wall_clock | sec | 0.000392 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |08>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 8.306388 | 0.001051 | 0.000005 | 0.007886 | 0.000003 | 0.001795 | 100.0 |
|
||||
| |08>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.274358 | 0.000046 | 0.000001 | 0.004090 | 0.000000 | 0.000302 | 100.0 |
|
||||
| |08>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 1.991251 | 0.000504 | 0.000007 | 0.008694 | 0.000001 | 0.000844 | 99.8 |
|
||||
| |08>>> |_ompt_sync_region_reduction | 7904 | 8 | wall_clock | sec | 0.003816 | 0.000000 | 0.000000 | 0.000017 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |08>>> |_ompt_work_single_executor | 310 | 7 | wall_clock | sec | 0.000112 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |08>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000955 | 0.000013 | 0.000009 | 0.000026 | 0.000000 | 0.000003 | 93.7 |
|
||||
| |08>>> |_ompt_sync_region_reduction | 152 | 7 | wall_clock | sec | 0.000060 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |08>>> |_ompt_work_single_executor | 13 | 6 | wall_clock | sec | 0.000005 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |07>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.747282 | 10.747282 | 10.747282 | 10.747282 | 0.000000 | 0.000000 | 0.9 |
|
||||
| |07>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649093 | 10.649093 | 10.649093 | 10.649093 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |07>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000923 | 0.000006 | 0.000002 | 0.000231 | 0.000000 | 0.000019 | 100.0 |
|
||||
| |07>>> |_ompt_work_single_other | 152 | 6 | wall_clock | sec | 0.000048 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |07>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.003981 | 0.000013 | 0.000001 | 0.001186 | 0.000000 | 0.000068 | 100.0 |
|
||||
| |07>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641295 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 0.7 |
|
||||
| |07>>> |_ompt_work_single_other | 2043 | 7 | wall_clock | sec | 0.000648 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |07>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 7.978811 | 0.001009 | 0.000005 | 0.006728 | 0.000003 | 0.001732 | 100.0 |
|
||||
| |07>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.199939 | 0.000033 | 0.000001 | 0.004086 | 0.000000 | 0.000255 | 100.0 |
|
||||
| |07>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.385843 | 0.000604 | 0.000009 | 0.009039 | 0.000001 | 0.000938 | 100.0 |
|
||||
| |07>>> |_ompt_work_single_executor | 9 | 7 | wall_clock | sec | 0.000004 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |07>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000905 | 0.000012 | 0.000010 | 0.000025 | 0.000000 | 0.000003 | 100.0 |
|
||||
| |07>>> |_ompt_work_single_executor | 2 | 6 | wall_clock | sec | 0.000001 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |06>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.772278 | 10.772278 | 10.772278 | 10.772278 | 0.000000 | 0.000000 | 1.1 |
|
||||
| |06>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649092 | 10.649092 | 10.649092 | 10.649092 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |06>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000888 | 0.000006 | 0.000002 | 0.000236 | 0.000000 | 0.000020 | 100.0 |
|
||||
| |06>>> |_ompt_work_single_other | 153 | 6 | wall_clock | sec | 0.000037 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |06>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004090 | 0.000013 | 0.000001 | 0.001175 | 0.000000 | 0.000067 | 100.0 |
|
||||
| |06>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641317 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 0.8 |
|
||||
| |06>>> |_ompt_work_single_other | 2041 | 7 | wall_clock | sec | 0.000476 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |06>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 7.467961 | 0.000945 | 0.000005 | 0.010712 | 0.000003 | 0.001627 | 100.0 |
|
||||
| |06>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.250883 | 0.000042 | 0.000001 | 0.004087 | 0.000000 | 0.000285 | 100.0 |
|
||||
| |06>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.838733 | 0.000718 | 0.000009 | 0.009015 | 0.000001 | 0.001015 | 99.9 |
|
||||
| |06>>> |_ompt_sync_region_reduction | 3952 | 8 | wall_clock | sec | 0.003334 | 0.000001 | 0.000000 | 0.000025 | 0.000000 | 0.000001 | 100.0 |
|
||||
| |06>>> |_ompt_work_single_executor | 11 | 7 | wall_clock | sec | 0.000005 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |06>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000940 | 0.000012 | 0.000009 | 0.000025 | 0.000000 | 0.000003 | 95.4 |
|
||||
| |06>>> |_ompt_sync_region_reduction | 76 | 7 | wall_clock | sec | 0.000044 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |06>>> |_ompt_work_single_executor | 1 | 6 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |05>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.797950 | 10.797950 | 10.797950 | 10.797950 | 0.000000 | 0.000000 | 1.4 |
|
||||
| |05>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649072 | 10.649072 | 10.649072 | 10.649072 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |05>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000879 | 0.000006 | 0.000001 | 0.000248 | 0.000000 | 0.000021 | 100.0 |
|
||||
| |05>>> |_ompt_work_single_other | 142 | 6 | wall_clock | sec | 0.000034 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |05>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004062 | 0.000013 | 0.000002 | 0.001163 | 0.000000 | 0.000067 | 100.0 |
|
||||
| |05>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641291 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 0.7 |
|
||||
| |05>>> |_ompt_work_single_other | 2038 | 7 | wall_clock | sec | 0.000500 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |05>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 8.279191 | 0.001047 | 0.000005 | 0.006596 | 0.000003 | 0.001792 | 100.0 |
|
||||
| |05>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.250939 | 0.000042 | 0.000001 | 0.004090 | 0.000000 | 0.000286 | 100.0 |
|
||||
| |05>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.039013 | 0.000516 | 0.000009 | 0.008689 | 0.000001 | 0.000855 | 100.0 |
|
||||
| |05>>> |_ompt_work_single_executor | 14 | 7 | wall_clock | sec | 0.000005 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |05>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000926 | 0.000012 | 0.000009 | 0.000023 | 0.000000 | 0.000003 | 100.0 |
|
||||
| |05>>> |_ompt_work_single_executor | 12 | 6 | wall_clock | sec | 0.000005 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |04>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.825935 | 10.825935 | 10.825935 | 10.825935 | 0.000000 | 0.000000 | 1.6 |
|
||||
| |04>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649068 | 10.649068 | 10.649068 | 10.649068 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |04>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000884 | 0.000006 | 0.000002 | 0.000245 | 0.000000 | 0.000020 | 100.0 |
|
||||
| |04>>> |_ompt_work_single_other | 150 | 6 | wall_clock | sec | 0.000034 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |04>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004069 | 0.000013 | 0.000001 | 0.001151 | 0.000000 | 0.000066 | 100.0 |
|
||||
| |04>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641300 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 1.1 |
|
||||
| |04>>> |_ompt_work_single_other | 2041 | 7 | wall_clock | sec | 0.000448 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |04>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 7.438393 | 0.000941 | 0.000005 | 0.007090 | 0.000003 | 0.001624 | 100.0 |
|
||||
| |04>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.270654 | 0.000045 | 0.000001 | 0.004090 | 0.000000 | 0.000295 | 100.0 |
|
||||
| |04>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.819165 | 0.000713 | 0.000009 | 0.008379 | 0.000001 | 0.001013 | 99.9 |
|
||||
| |04>>> |_ompt_sync_region_reduction | 7904 | 8 | wall_clock | sec | 0.003932 | 0.000000 | 0.000000 | 0.000015 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |04>>> |_ompt_work_single_executor | 11 | 7 | wall_clock | sec | 0.000005 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |04>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000936 | 0.000012 | 0.000009 | 0.000025 | 0.000000 | 0.000003 | 93.2 |
|
||||
| |04>>> |_ompt_sync_region_reduction | 152 | 7 | wall_clock | sec | 0.000064 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |04>>> |_ompt_work_single_executor | 4 | 6 | wall_clock | sec | 0.000001 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |03>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.849322 | 10.849322 | 10.849322 | 10.849322 | 0.000000 | 0.000000 | 1.8 |
|
||||
| |03>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649075 | 10.649075 | 10.649075 | 10.649075 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |03>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000861 | 0.000006 | 0.000002 | 0.000238 | 0.000000 | 0.000020 | 100.0 |
|
||||
| |03>>> |_ompt_work_single_other | 120 | 6 | wall_clock | sec | 0.000028 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |03>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.003993 | 0.000013 | 0.000001 | 0.001138 | 0.000000 | 0.000065 | 100.0 |
|
||||
| |03>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641302 | 0.140017 | 0.131896 | 0.155101 | 0.000017 | 0.004080 | 0.8 |
|
||||
| |03>>> |_ompt_work_single_other | 1756 | 7 | wall_clock | sec | 0.000426 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |03>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 8.005617 | 0.001013 | 0.000005 | 0.011500 | 0.000003 | 0.001741 | 100.0 |
|
||||
| |03>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.231485 | 0.000039 | 0.000001 | 0.004086 | 0.000000 | 0.000277 | 100.0 |
|
||||
| |03>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.320428 | 0.000587 | 0.000009 | 0.010868 | 0.000001 | 0.000912 | 100.0 |
|
||||
| |03>>> |_ompt_work_single_executor | 296 | 7 | wall_clock | sec | 0.000120 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |03>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000967 | 0.000013 | 0.000010 | 0.000023 | 0.000000 | 0.000003 | 100.0 |
|
||||
| |03>>> |_ompt_work_single_executor | 34 | 6 | wall_clock | sec | 0.000013 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |02>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.876387 | 10.876387 | 10.876387 | 10.876387 | 0.000000 | 0.000000 | 2.1 |
|
||||
| |02>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649050 | 10.649050 | 10.649050 | 10.649050 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |02>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000924 | 0.000006 | 0.000001 | 0.000241 | 0.000000 | 0.000020 | 100.0 |
|
||||
| |02>>> |_ompt_work_single_other | 139 | 6 | wall_clock | sec | 0.000040 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |02>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.003972 | 0.000013 | 0.000001 | 0.001127 | 0.000000 | 0.000064 | 100.0 |
|
||||
| |02>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641287 | 0.140017 | 0.131895 | 0.155101 | 0.000017 | 0.004080 | 0.7 |
|
||||
| |02>>> |_ompt_work_single_other | 1902 | 7 | wall_clock | sec | 0.000553 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |02>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 7.906688 | 0.001000 | 0.000005 | 0.007068 | 0.000003 | 0.001713 | 100.0 |
|
||||
| |02>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.261367 | 0.000044 | 0.000001 | 0.004088 | 0.000000 | 0.000295 | 100.0 |
|
||||
| |02>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.402362 | 0.000608 | 0.000009 | 0.010399 | 0.000001 | 0.000944 | 99.9 |
|
||||
| |02>>> |_ompt_sync_region_reduction | 3952 | 8 | wall_clock | sec | 0.002937 | 0.000001 | 0.000000 | 0.000021 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |02>>> |_ompt_work_single_executor | 150 | 7 | wall_clock | sec | 0.000073 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |02>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000895 | 0.000012 | 0.000009 | 0.000026 | 0.000000 | 0.000003 | 95.2 |
|
||||
| |02>>> |_ompt_sync_region_reduction | 76 | 7 | wall_clock | sec | 0.000043 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |02>>> |_ompt_work_single_executor | 15 | 6 | wall_clock | sec | 0.000007 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |01>>> |_ompt_thread_worker | 1 | 4 | wall_clock | sec | 10.901650 | 10.901650 | 10.901650 | 10.901650 | 0.000000 | 0.000000 | 2.3 |
|
||||
| |01>>> |_ompt_implicit_task | 1 | 5 | wall_clock | sec | 10.649017 | 10.649017 | 10.649017 | 10.649017 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |01>>> |_ompt_work_loop | 156 | 6 | wall_clock | sec | 0.000863 | 0.000006 | 0.000001 | 0.000231 | 0.000000 | 0.000019 | 100.0 |
|
||||
| |01>>> |_ompt_work_single_other | 146 | 6 | wall_clock | sec | 0.000033 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |01>>> |_ompt_sync_region_barrier_implicit | 308 | 6 | wall_clock | sec | 0.004012 | 0.000013 | 0.000001 | 0.001115 | 0.000000 | 0.000064 | 100.0 |
|
||||
| |01>>> |_conj_grad | 76 | 6 | wall_clock | sec | 10.641316 | 0.140017 | 0.131895 | 0.155101 | 0.000017 | 0.004080 | 0.8 |
|
||||
| |01>>> |_ompt_work_single_other | 1811 | 7 | wall_clock | sec | 0.000403 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |01>>> |_ompt_work_loop | 7904 | 7 | wall_clock | sec | 7.410337 | 0.000938 | 0.000005 | 0.010556 | 0.000003 | 0.001610 | 100.0 |
|
||||
| |01>>> |_ompt_sync_region_barrier_implicit | 6004 | 7 | wall_clock | sec | 0.202494 | 0.000034 | 0.000001 | 0.003521 | 0.000000 | 0.000256 | 100.0 |
|
||||
| |01>>> |_ompt_sync_region_barrier_implementation | 3952 | 7 | wall_clock | sec | 2.943604 | 0.000745 | 0.000008 | 0.009033 | 0.000001 | 0.001024 | 100.0 |
|
||||
| |01>>> |_ompt_work_single_executor | 241 | 7 | wall_clock | sec | 0.000093 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |01>>> |_ompt_sync_region_barrier_implementation | 76 | 6 | wall_clock | sec | 0.000917 | 0.000012 | 0.000009 | 0.000026 | 0.000000 | 0.000003 | 100.0 |
|
||||
| |01>>> |_ompt_work_single_executor | 8 | 6 | wall_clock | sec | 0.000004 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |00>>> |_c_print_results | 1 | 2 | wall_clock | sec | 0.000049 | 0.000049 | 0.000049 | 0.000049 | 0.000000 | 0.000000 | 100.0 |
|
||||
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
| REAL-CLOCK TIMER (I.E. WALL-CLOCK TIMER) |
|
||||
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
| LABEL | COUNT | DEPTH | METRIC | UNITS | SUM | MEAN | MIN | MAX | VAR | STDDEV | % SELF |
|
||||
|-------------------------------------------------|--------|--------|------------|--------|----------|----------|----------|----------|----------|----------|--------|
|
||||
| |0>>> openmp-cg.inst | 1 | 0 | wall_clock | sec | 6.100107 | 6.100107 | 6.100107 | 6.100107 | 0.000000 | 0.000000 | 16.7 |
|
||||
| |0>>> |_omp_parallel | 1 | 1 | wall_clock | sec | 5.079807 | 5.079807 | 5.079807 | 5.079807 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |0>>> |_pthread_create | 2 | 2 | wall_clock | sec | 0.004790 | 0.002395 | 0.002356 | 0.002434 | 0.000000 | 0.000054 | 0.0 |
|
||||
| |2>>> |_start_thread | 1 | 3 | wall_clock | sec | 5.077748 | 5.077748 | 5.077748 | 5.077748 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |2>>> |_omp_thread_begin | 1 | 4 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_implicit_task | 1 | 4 | wall_clock | sec | 5.077417 | 5.077417 | 5.077417 | 5.077417 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |2>>> |_main.omp_outlined | 1 | 5 | wall_clock | sec | 5.074730 | 5.074730 | 5.074730 | 5.074730 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |2>>> |_omp_work | 10 | 6 | wall_clock | sec | 0.000453 | 0.000045 | 0.000004 | 0.000213 | 0.000000 | 0.000064 | 41.6 |
|
||||
| |2>>> |_omp_dispatch | 6 | 7 | wall_clock | sec | 0.000003 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#0] | 1 | 7 | wall_clock | sec | 0.000050 | 0.000050 | 0.000050 | 0.000050 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#12] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#1] | 1 | 7 | wall_clock | sec | 0.000183 | 0.000183 | 0.000183 | 0.000183 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#2] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#3] | 1 | 7 | wall_clock | sec | 0.000014 | 0.000014 | 0.000014 | 0.000014 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#6] | 1 | 7 | wall_clock | sec | 0.000005 | 0.000005 | 0.000005 | 0.000005 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#11] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#7] | 1 | 7 | wall_clock | sec | 0.000005 | 0.000005 | 0.000005 | 0.000005 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#9] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 8 | 6 | wall_clock | sec | 0.000212 | 0.000027 | 0.000023 | 0.000046 | 0.000000 | 0.000008 | 55.7 |
|
||||
| |2>>> |_omp_sync_region_wait | 8 | 7 | wall_clock | sec | 0.000094 | 0.000012 | 0.000010 | 0.000024 | 0.000000 | 0.000005 | 100.0 |
|
||||
| |2>>> |_conj_grad | 1 | 6 | wall_clock | sec | 0.101549 | 0.101549 | 0.101549 | 0.101549 | 0.000000 | 0.000000 | 0.2 |
|
||||
| |2>>> |_omp_work | 6 | 7 | wall_clock | sec | 0.003838 | 0.000640 | 0.000004 | 0.003671 | 0.000002 | 0.001485 | 3.6 |
|
||||
| |2>>> |_omp_dispatch | 4 | 8 | wall_clock | sec | 0.000002 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#0] | 1 | 8 | wall_clock | sec | 0.000018 | 0.000018 | 0.000018 | 0.000018 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#1] | 1 | 8 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#9] | 1 | 8 | wall_clock | sec | 0.000002 | 0.000002 | 0.000002 | 0.000002 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#8] | 1 | 8 | wall_clock | sec | 0.000013 | 0.000013 | 0.000013 | 0.000013 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#3] | 1 | 8 | wall_clock | sec | 0.003641 | 0.003641 | 0.003641 | 0.003641 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#6] | 1 | 8 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#5] | 1 | 8 | wall_clock | sec | 0.000023 | 0.000023 | 0.000023 | 0.000023 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 6 | 7 | wall_clock | sec | 0.000386 | 0.000064 | 0.000024 | 0.000152 | 0.000000 | 0.000053 | 29.9 |
|
||||
| |2>>> |_omp_sync_region_wait | 6 | 8 | wall_clock | sec | 0.000271 | 0.000045 | 0.000010 | 0.000131 | 0.000000 | 0.000052 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#2] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#7] | 1 | 7 | wall_clock | sec | 0.097136 | 0.097136 | 0.097136 | 0.097136 | 0.000000 | 0.000000 | 2.9 |
|
||||
| |2>>> |_omp_work | 125 | 8 | wall_clock | sec | 0.090726 | 0.000726 | 0.000004 | 0.004120 | 0.000002 | 0.001406 | 99.9 |
|
||||
| |2>>> |_omp_dispatch | 100 | 9 | wall_clock | sec | 0.000047 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 125 | 8 | wall_clock | sec | 0.003605 | 0.000029 | 0.000023 | 0.000124 | 0.000000 | 0.000013 | 51.3 |
|
||||
| |2>>> |_omp_sync_region_wait | 125 | 9 | wall_clock | sec | 0.001755 | 0.000014 | 0.000009 | 0.000094 | 0.000000 | 0.000011 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#4] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#5] | 1 | 6 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#4] | 1 | 6 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_main.omp_outlined [loop#8] | 1 | 6 | wall_clock | sec | 4.972281 | 4.972281 | 4.972281 | 4.972281 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |2>>> |_conj_grad | 50 | 7 | wall_clock | sec | 4.958576 | 0.099172 | 0.098079 | 0.101557 | 0.000000 | 0.000660 | 0.2 |
|
||||
| |2>>> |_omp_work | 300 | 8 | wall_clock | sec | 0.182672 | 0.000609 | 0.000004 | 0.003721 | 0.000002 | 0.001290 | 3.3 |
|
||||
| |2>>> |_omp_dispatch | 200 | 9 | wall_clock | sec | 0.000092 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#0] | 50 | 9 | wall_clock | sec | 0.001560 | 0.000031 | 0.000026 | 0.000037 | 0.000000 | 0.000002 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#1] | 50 | 9 | wall_clock | sec | 0.000053 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#9] | 50 | 9 | wall_clock | sec | 0.000067 | 0.000001 | 0.000001 | 0.000002 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#8] | 50 | 9 | wall_clock | sec | 0.000630 | 0.000013 | 0.000012 | 0.000016 | 0.000000 | 0.000001 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#3] | 50 | 9 | wall_clock | sec | 0.172947 | 0.003459 | 0.003372 | 0.003690 | 0.000000 | 0.000068 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#6] | 50 | 9 | wall_clock | sec | 0.000070 | 0.000001 | 0.000001 | 0.000002 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#5] | 50 | 9 | wall_clock | sec | 0.001137 | 0.000023 | 0.000017 | 0.000036 | 0.000000 | 0.000002 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 300 | 8 | wall_clock | sec | 0.008423 | 0.000028 | 0.000023 | 0.000068 | 0.000000 | 0.000009 | 54.2 |
|
||||
| |2>>> |_omp_sync_region_wait | 300 | 9 | wall_clock | sec | 0.003860 | 0.000013 | 0.000009 | 0.000053 | 0.000000 | 0.000007 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#2] | 50 | 8 | wall_clock | sec | 0.000057 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#7] | 50 | 8 | wall_clock | sec | 4.759600 | 0.095192 | 0.094130 | 0.097386 | 0.000000 | 0.000630 | 2.9 |
|
||||
| |2>>> |_omp_work | 6250 | 9 | wall_clock | sec | 4.442086 | 0.000711 | 0.000004 | 0.003788 | 0.000002 | 0.001370 | 99.9 |
|
||||
| |2>>> |_omp_dispatch | 5000 | 10 | wall_clock | sec | 0.002305 | 0.000000 | 0.000000 | 0.000005 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 6250 | 9 | wall_clock | sec | 0.177916 | 0.000028 | 0.000018 | 0.000262 | 0.000000 | 0.000010 | 53.8 |
|
||||
| |2>>> |_omp_sync_region_wait | 6250 | 10 | wall_clock | sec | 0.082161 | 0.000013 | 0.000004 | 0.000248 | 0.000000 | 0.000008 | 100.0 |
|
||||
| |2>>> |_conj_grad [loop#4] | 50 | 8 | wall_clock | sec | 0.000058 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_work | 200 | 7 | wall_clock | sec | 0.002753 | 0.000014 | 0.000004 | 0.000032 | 0.000000 | 0.000010 | 98.3 |
|
||||
| |2>>> |_omp_dispatch | 100 | 8 | wall_clock | sec | 0.000046 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 250 | 7 | wall_clock | sec | 0.006061 | 0.000024 | 0.000023 | 0.000042 | 0.000000 | 0.000003 | 60.0 |
|
||||
| |2>>> |_omp_sync_region_wait | 250 | 8 | wall_clock | sec | 0.002423 | 0.000010 | 0.000009 | 0.000014 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |2>>> |_omp_sync_region | 1 | 5 | wall_clock | sec | 0.002650 | 0.002650 | 0.002650 | 0.002650 | 0.000000 | 0.000000 | 0.3 |
|
||||
| |2>>> |_omp_sync_region_wait | 1 | 6 | wall_clock | sec | 0.002641 | 0.002641 | 0.002641 | 0.002641 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_start_thread | 1 | 3 | wall_clock | sec | 5.080178 | 5.080178 | 5.080178 | 5.080178 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |1>>> |_omp_thread_begin | 1 | 4 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_implicit_task | 1 | 4 | wall_clock | sec | 5.077434 | 5.077434 | 5.077434 | 5.077434 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |1>>> |_main.omp_outlined | 1 | 5 | wall_clock | sec | 5.074749 | 5.074749 | 5.074749 | 5.074749 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |1>>> |_omp_work | 10 | 6 | wall_clock | sec | 0.000252 | 0.000025 | 0.000002 | 0.000124 | 0.000000 | 0.000038 | 33.4 |
|
||||
| |1>>> |_omp_dispatch | 6 | 7 | wall_clock | sec | 0.000001 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#0] | 1 | 7 | wall_clock | sec | 0.000035 | 0.000035 | 0.000035 | 0.000035 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#12] | 1 | 7 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#1] | 1 | 7 | wall_clock | sec | 0.000113 | 0.000113 | 0.000113 | 0.000113 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#2] | 1 | 7 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#3] | 1 | 7 | wall_clock | sec | 0.000010 | 0.000010 | 0.000010 | 0.000010 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#6] | 1 | 7 | wall_clock | sec | 0.000003 | 0.000003 | 0.000003 | 0.000003 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#11] | 1 | 7 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#7] | 1 | 7 | wall_clock | sec | 0.000003 | 0.000003 | 0.000003 | 0.000003 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#9] | 1 | 7 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 8 | 6 | wall_clock | sec | 0.000582 | 0.000073 | 0.000028 | 0.000274 | 0.000000 | 0.000082 | 9.5 |
|
||||
| |1>>> |_omp_sync_region_wait | 8 | 7 | wall_clock | sec | 0.000526 | 0.000066 | 0.000022 | 0.000260 | 0.000000 | 0.000080 | 100.0 |
|
||||
| |1>>> |_conj_grad | 1 | 6 | wall_clock | sec | 0.101518 | 0.101518 | 0.101518 | 0.101518 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |1>>> |_omp_work | 6 | 7 | wall_clock | sec | 0.002260 | 0.000377 | 0.000002 | 0.002143 | 0.000001 | 0.000865 | 3.1 |
|
||||
| |1>>> |_omp_dispatch | 4 | 8 | wall_clock | sec | 0.000001 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#0] | 1 | 8 | wall_clock | sec | 0.000039 | 0.000039 | 0.000039 | 0.000039 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#1] | 1 | 8 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#9] | 1 | 8 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#8] | 1 | 8 | wall_clock | sec | 0.000006 | 0.000006 | 0.000006 | 0.000006 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#3] | 1 | 8 | wall_clock | sec | 0.002126 | 0.002126 | 0.002126 | 0.002126 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#6] | 1 | 8 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#5] | 1 | 8 | wall_clock | sec | 0.000014 | 0.000014 | 0.000014 | 0.000014 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 6 | 7 | wall_clock | sec | 0.001972 | 0.000329 | 0.000017 | 0.001609 | 0.000000 | 0.000633 | 3.0 |
|
||||
| |1>>> |_omp_sync_region_wait | 6 | 8 | wall_clock | sec | 0.001913 | 0.000319 | 0.000009 | 0.001602 | 0.000000 | 0.000634 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#2] | 1 | 7 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#7] | 1 | 7 | wall_clock | sec | 0.097191 | 0.097191 | 0.097191 | 0.097191 | 0.000000 | 0.000000 | 1.5 |
|
||||
| |1>>> |_omp_work | 125 | 8 | wall_clock | sec | 0.054511 | 0.000436 | 0.000002 | 0.002273 | 0.000001 | 0.000849 | 100.0 |
|
||||
| |1>>> |_omp_dispatch | 100 | 9 | wall_clock | sec | 0.000019 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 125 | 8 | wall_clock | sec | 0.041210 | 0.000330 | 0.000011 | 0.002052 | 0.000000 | 0.000584 | 2.6 |
|
||||
| |1>>> |_omp_sync_region_wait | 125 | 9 | wall_clock | sec | 0.040135 | 0.000321 | 0.000005 | 0.002033 | 0.000000 | 0.000580 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#4] | 1 | 7 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#5] | 1 | 6 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#4] | 1 | 6 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_main.omp_outlined [loop#8] | 1 | 6 | wall_clock | sec | 4.972283 | 4.972283 | 4.972283 | 4.972283 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |1>>> |_conj_grad | 50 | 7 | wall_clock | sec | 4.958649 | 0.099173 | 0.098081 | 0.101558 | 0.000000 | 0.000661 | 0.1 |
|
||||
| |1>>> |_omp_work | 300 | 8 | wall_clock | sec | 0.110605 | 0.000369 | 0.000002 | 0.002215 | 0.000001 | 0.000786 | 2.7 |
|
||||
| |1>>> |_omp_dispatch | 200 | 9 | wall_clock | sec | 0.000038 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#0] | 50 | 9 | wall_clock | sec | 0.001071 | 0.000021 | 0.000020 | 0.000023 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#1] | 50 | 9 | wall_clock | sec | 0.000022 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#9] | 50 | 9 | wall_clock | sec | 0.000025 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#8] | 50 | 9 | wall_clock | sec | 0.000326 | 0.000007 | 0.000006 | 0.000007 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#3] | 50 | 9 | wall_clock | sec | 0.105352 | 0.002107 | 0.002074 | 0.002198 | 0.000000 | 0.000021 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#6] | 50 | 9 | wall_clock | sec | 0.000039 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#5] | 50 | 9 | wall_clock | sec | 0.000719 | 0.000014 | 0.000013 | 0.000017 | 0.000000 | 0.000001 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 300 | 8 | wall_clock | sec | 0.084256 | 0.000281 | 0.000011 | 0.001618 | 0.000000 | 0.000527 | 3.0 |
|
||||
| |1>>> |_omp_sync_region_wait | 300 | 9 | wall_clock | sec | 0.081757 | 0.000273 | 0.000005 | 0.001589 | 0.000000 | 0.000523 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#2] | 50 | 8 | wall_clock | sec | 0.000026 | 0.000001 | 0.000000 | 0.000002 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#7] | 50 | 8 | wall_clock | sec | 4.759698 | 0.095194 | 0.094132 | 0.097391 | 0.000000 | 0.000630 | 1.6 |
|
||||
| |1>>> |_omp_work | 6250 | 9 | wall_clock | sec | 2.699342 | 0.000432 | 0.000002 | 0.002232 | 0.000001 | 0.000837 | 100.0 |
|
||||
| |1>>> |_omp_dispatch | 5000 | 10 | wall_clock | sec | 0.000934 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 6250 | 9 | wall_clock | sec | 1.983999 | 0.000317 | 0.000010 | 0.001662 | 0.000000 | 0.000554 | 2.7 |
|
||||
| |1>>> |_omp_sync_region_wait | 6250 | 10 | wall_clock | sec | 1.929991 | 0.000309 | 0.000004 | 0.001648 | 0.000000 | 0.000550 | 100.0 |
|
||||
| |1>>> |_conj_grad [loop#4] | 50 | 8 | wall_clock | sec | 0.000026 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_work | 200 | 7 | wall_clock | sec | 0.001422 | 0.000007 | 0.000002 | 0.000017 | 0.000000 | 0.000006 | 98.7 |
|
||||
| |1>>> |_omp_dispatch | 100 | 8 | wall_clock | sec | 0.000018 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 250 | 7 | wall_clock | sec | 0.010015 | 0.000040 | 0.000020 | 0.000055 | 0.000000 | 0.000007 | 16.9 |
|
||||
| |1>>> |_omp_sync_region_wait | 250 | 8 | wall_clock | sec | 0.008318 | 0.000033 | 0.000013 | 0.000049 | 0.000000 | 0.000007 | 100.0 |
|
||||
| |1>>> |_omp_sync_region | 1 | 5 | wall_clock | sec | 0.002662 | 0.002662 | 0.002662 | 0.002662 | 0.000000 | 0.000000 | 0.2 |
|
||||
| |1>>> |_omp_sync_region_wait | 1 | 6 | wall_clock | sec | 0.002657 | 0.002657 | 0.002657 | 0.002657 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_implicit_task | 1 | 2 | wall_clock | sec | 5.074911 | 5.074911 | 5.074911 | 5.074911 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |0>>> |_main.omp_outlined | 1 | 3 | wall_clock | sec | 5.074855 | 5.074855 | 5.074855 | 5.074855 | 0.000000 | 0.000000 | 0.0 |
|
||||
| |0>>> |_omp_work | 10 | 4 | wall_clock | sec | 0.000249 | 0.000025 | 0.000002 | 0.000124 | 0.000000 | 0.000037 | 35.1 |
|
||||
| |0>>> |_omp_dispatch | 6 | 5 | wall_clock | sec | 0.000002 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#0] | 1 | 5 | wall_clock | sec | 0.000031 | 0.000031 | 0.000031 | 0.000031 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#12] | 1 | 5 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#1] | 1 | 5 | wall_clock | sec | 0.000111 | 0.000111 | 0.000111 | 0.000111 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#2] | 1 | 5 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#3] | 1 | 5 | wall_clock | sec | 0.000011 | 0.000011 | 0.000011 | 0.000011 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#6] | 1 | 5 | wall_clock | sec | 0.000003 | 0.000003 | 0.000003 | 0.000003 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#11] | 1 | 5 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#7] | 1 | 5 | wall_clock | sec | 0.000002 | 0.000002 | 0.000002 | 0.000002 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#9] | 1 | 5 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 8 | 4 | wall_clock | sec | 0.000650 | 0.000081 | 0.000029 | 0.000332 | 0.000000 | 0.000102 | 7.9 |
|
||||
| |0>>> |_omp_sync_region_wait | 8 | 5 | wall_clock | sec | 0.000599 | 0.000075 | 0.000022 | 0.000323 | 0.000000 | 0.000101 | 100.0 |
|
||||
| |0>>> |_conj_grad | 1 | 4 | wall_clock | sec | 0.101564 | 0.101564 | 0.101564 | 0.101564 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |0>>> |_omp_work | 6 | 5 | wall_clock | sec | 0.002244 | 0.000374 | 0.000002 | 0.002137 | 0.000001 | 0.000864 | 3.0 |
|
||||
| |0>>> |_omp_dispatch | 4 | 6 | wall_clock | sec | 0.000001 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#0] | 1 | 6 | wall_clock | sec | 0.000036 | 0.000036 | 0.000036 | 0.000036 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#1] | 1 | 6 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#9] | 1 | 6 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#8] | 1 | 6 | wall_clock | sec | 0.000008 | 0.000008 | 0.000008 | 0.000008 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#3] | 1 | 6 | wall_clock | sec | 0.002118 | 0.002118 | 0.002118 | 0.002118 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#6] | 1 | 6 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#5] | 1 | 6 | wall_clock | sec | 0.000013 | 0.000013 | 0.000013 | 0.000013 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 6 | 5 | wall_clock | sec | 0.002043 | 0.000340 | 0.000029 | 0.001609 | 0.000000 | 0.000623 | 2.0 |
|
||||
| |0>>> |_omp_sync_region_wait | 6 | 6 | wall_clock | sec | 0.002002 | 0.000334 | 0.000023 | 0.001601 | 0.000000 | 0.000622 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#2] | 1 | 5 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#7] | 1 | 5 | wall_clock | sec | 0.097195 | 0.097195 | 0.097195 | 0.097195 | 0.000000 | 0.000000 | 1.5 |
|
||||
| |0>>> |_omp_work | 125 | 6 | wall_clock | sec | 0.054617 | 0.000437 | 0.000002 | 0.002348 | 0.000001 | 0.000850 | 100.0 |
|
||||
| |0>>> |_omp_dispatch | 100 | 7 | wall_clock | sec | 0.000020 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 125 | 6 | wall_clock | sec | 0.041158 | 0.000329 | 0.000019 | 0.001930 | 0.000000 | 0.000577 | 2.3 |
|
||||
| |0>>> |_omp_sync_region_wait | 125 | 7 | wall_clock | sec | 0.040230 | 0.000322 | 0.000012 | 0.001922 | 0.000000 | 0.000574 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#4] | 1 | 5 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#5] | 1 | 4 | wall_clock | sec | 0.000001 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#4] | 1 | 4 | wall_clock | sec | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_masked | 1 | 4 | wall_clock | sec | 0.000005 | 0.000005 | 0.000005 | 0.000005 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_main.omp_outlined [loop#8] | 1 | 4 | wall_clock | sec | 4.972270 | 4.972270 | 4.972270 | 4.972270 | 0.000000 | 0.000000 | 0.1 |
|
||||
| |0>>> |_omp_masked | 150 | 5 | wall_clock | sec | 0.000450 | 0.000003 | 0.000001 | 0.000007 | 0.000000 | 0.000001 | 100.0 |
|
||||
| |0>>> |_conj_grad | 50 | 5 | wall_clock | sec | 4.958396 | 0.099168 | 0.098075 | 0.101553 | 0.000000 | 0.000660 | 0.1 |
|
||||
| |0>>> |_omp_work | 300 | 6 | wall_clock | sec | 0.110576 | 0.000369 | 0.000001 | 0.002217 | 0.000001 | 0.000787 | 2.7 |
|
||||
| |0>>> |_omp_dispatch | 200 | 7 | wall_clock | sec | 0.000039 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#0] | 50 | 7 | wall_clock | sec | 0.000950 | 0.000019 | 0.000018 | 0.000020 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#1] | 50 | 7 | wall_clock | sec | 0.000022 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#9] | 50 | 7 | wall_clock | sec | 0.000028 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#8] | 50 | 7 | wall_clock | sec | 0.000332 | 0.000007 | 0.000006 | 0.000011 | 0.000000 | 0.000001 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#3] | 50 | 7 | wall_clock | sec | 0.105495 | 0.002110 | 0.002082 | 0.002202 | 0.000000 | 0.000022 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#6] | 50 | 7 | wall_clock | sec | 0.000044 | 0.000001 | 0.000001 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#5] | 50 | 7 | wall_clock | sec | 0.000699 | 0.000014 | 0.000013 | 0.000017 | 0.000000 | 0.000001 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 300 | 6 | wall_clock | sec | 0.084068 | 0.000280 | 0.000009 | 0.001608 | 0.000000 | 0.000525 | 2.9 |
|
||||
| |0>>> |_omp_sync_region_wait | 300 | 7 | wall_clock | sec | 0.081660 | 0.000272 | 0.000002 | 0.001581 | 0.000000 | 0.000522 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#2] | 50 | 6 | wall_clock | sec | 0.000024 | 0.000000 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#7] | 50 | 6 | wall_clock | sec | 4.759733 | 0.095195 | 0.094133 | 0.097391 | 0.000000 | 0.000630 | 1.5 |
|
||||
| |0>>> |_omp_work | 6250 | 7 | wall_clock | sec | 2.704389 | 0.000433 | 0.000002 | 0.002239 | 0.000001 | 0.000838 | 100.0 |
|
||||
| |0>>> |_omp_dispatch | 5000 | 8 | wall_clock | sec | 0.000990 | 0.000000 | 0.000000 | 0.000003 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 6250 | 7 | wall_clock | sec | 1.983312 | 0.000317 | 0.000008 | 0.001660 | 0.000000 | 0.000551 | 2.5 |
|
||||
| |0>>> |_omp_sync_region_wait | 6250 | 8 | wall_clock | sec | 1.932873 | 0.000309 | 0.000002 | 0.001652 | 0.000000 | 0.000548 | 100.0 |
|
||||
| |0>>> |_conj_grad [loop#4] | 50 | 6 | wall_clock | sec | 0.000025 | 0.000001 | 0.000000 | 0.000001 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_work | 200 | 5 | wall_clock | sec | 0.001475 | 0.000007 | 0.000002 | 0.000018 | 0.000000 | 0.000006 | 98.7 |
|
||||
| |0>>> |_omp_dispatch | 100 | 6 | wall_clock | sec | 0.000019 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 250 | 5 | wall_clock | sec | 0.009059 | 0.000036 | 0.000028 | 0.000055 | 0.000000 | 0.000006 | 17.7 |
|
||||
| |0>>> |_omp_sync_region_wait | 250 | 6 | wall_clock | sec | 0.007460 | 0.000030 | 0.000021 | 0.000049 | 0.000000 | 0.000006 | 100.0 |
|
||||
| |0>>> |_omp_sync_region | 1 | 3 | wall_clock | sec | 0.000033 | 0.000033 | 0.000033 | 0.000033 | 0.000000 | 0.000000 | 23.0 |
|
||||
| |0>>> |_omp_sync_region_wait | 1 | 4 | wall_clock | sec | 0.000025 | 0.000025 | 0.000025 | 0.000025 | 0.000000 | 0.000000 | 100.0 |
|
||||
| |0>>> |_c_print_results | 1 | 1 | wall_clock | sec | 0.000012 | 0.000012 | 0.000012 | 0.000012 | 0.000000 | 0.000000 | 100.0 |
|
||||
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
||||
|
||||
Timemory JSON output
|
||||
-------------------------------------------------------------------------
|
||||
@@ -754,7 +777,7 @@ root node, which has the name ``unknown-hash=0``.
|
||||
},
|
||||
{
|
||||
"hash": 6107876127803219007,
|
||||
"prefix": "|0>>> |_ompt_thread_initial",
|
||||
"prefix": "|0>>> |_omp_thread_begin",
|
||||
"depth": 1,
|
||||
"entry": {
|
||||
"laps": 1,
|
||||
@@ -776,7 +799,7 @@ root node, which has the name ``unknown-hash=0``.
|
||||
},
|
||||
{
|
||||
"hash": 15402802091993617561,
|
||||
"prefix": "|0>>> |_ompt_implicit_task",
|
||||
"prefix": "|0>>> |_omp_implicit_task",
|
||||
"depth": 2,
|
||||
"entry": {
|
||||
"laps": 1,
|
||||
|
||||
Ссылка в новой задаче
Block a user