gobhardw
5d390717b5
SWDEV-427554 Fixing mainline ASAN build
...
Change-Id: I63cd047ceb75dea5f8ed6f84946e1ec209c7d812
2023-10-18 21:42:47 +05:30
Ammar ELWazir
d816f133d1
SWDEV-302415: Fixing Kernel Dispatches with trace-start off option
...
Change-Id: I225b88cb769d994f1007e7bc66f176e7fa40db05
2023-10-16 09:39:42 -04:00
Giovanni LB
7418c52cc8
SWDEV-423659: Disabling HIP_ACTIVITY when HSA_ACTIVITY is enabled.
...
Change-Id: If64fabdcd0d8a718dd0017c2bc821a94c999e87e
2023-09-26 01:13:21 -04:00
Ammar ELWazir
d4a33cf33a
Pull from Github
...
Squashed commit of the following:
commit f029195705a15700380c6f832ba5d15d46fd6de7
Author: Jonathan R. Madsen <jrmadsen@users.noreply.github.com>
Date: Thu Jul 13 14:38:56 2023 -0500
Formatting workflows for source (clang-format) and cmake (cmake-format) (#4)
* Add .cmake-format.yaml file
* Add formatting workflow
* provide base input for creating PR
* Update scheme for extracting branch name
- disable running formatting on push to amd-staging branch
* patch .cmake-format.yaml for find_package signature
- apparently cmake-format doesn't format the full signature of find_package
* run formatting (clang-format v11) (#7)
Co-authored-by: jrmadsen <jrmadsen@users.noreply.github.com>
* run cmake formatting (cmake-format) (#6)
Co-authored-by: jrmadsen <jrmadsen@users.noreply.github.com>
---------
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
commit bc4d135fdd8a1a9e51235f18a5d575fd2b3735e6
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Thu Jul 13 12:55:17 2023 -0500
Removing Build cache for potential issues with auto-generated header files (#5)
Change-Id: I9e2319f4335e2f88585ffa6fac2bd88a1c952e6e
commit ce86dea6a311d44d880fa684eb78f3329295e2a4
Author: Jonathan R. Madsen <jrmadsen@users.noreply.github.com>
Date: Thu Jul 13 11:08:58 2023 -0500
Fix decltype(<hsa-function>) function pointer usage (#3)
- the following is done in several places:
decltype(hsa_memory_allocate)* hsa_memory_allocate
- above can cause compiler errors
- replace decltype(<hsa-function>) with decltype(::<hsa-function>)
- this ensures that the type within the decltype is recognized as the global scope HSA function, not the variable
- in many places, the variable has a "_fn" suffix to prevent this issue but added '::' anyway for consistency
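The decltype change described above can be illustrated with a minimal sketch. `demo_memory_allocate` and `ApiTable` are hypothetical stand-ins (not names from the repository or from HSA), assumed here only so the example compiles without the HSA headers:

```cpp
#include <cstddef>
#include <cstdlib>

// Hypothetical stand-in for a global-scope C API function such as an HSA call.
int demo_memory_allocate(std::size_t size, void** ptr) {
  *ptr = std::malloc(size);
  return *ptr != nullptr ? 0 : 1;
}

// Hypothetical dispatch table whose member has the same name as the function.
struct ApiTable {
  // Unqualified form; some compilers reject it because the name changes
  // meaning inside the class once the member is declared:
  //   decltype(demo_memory_allocate)* demo_memory_allocate;

  // Qualified form: '::' pins the operand of decltype to the global function.
  decltype(::demo_memory_allocate)* demo_memory_allocate;
};

// Builds a table that points at the global function.
inline ApiTable make_table() { return ApiTable{&::demo_memory_allocate}; }
```

Calling `make_table().demo_memory_allocate(16, &p)` then goes through the stored pointer to the global function.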
commit ac49fdd92a72e9c99394253a02da413a6c2e3b3a
Merge: a07946a 03a0855
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Wed Jul 12 11:36:24 2023 -0500
Merge pull request #2 from ROCm-Developer-Tools/gerrit-amd-staging
Pull from gerrit
commit 03a085588cffe863e8f466de67be1cfb205b675a
Merge: e88cad2 a07946a
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Wed Jul 12 10:57:30 2023 -0500
Merge branch 'amd-staging' into gerrit-amd-staging
commit a07946a5cd4c670c83c27ad1a076a9d4567ce6d7
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 15:46:04 2023 +0000
Enabling Cached Builds
commit 525e494a7f13941077a8fd4ad6840904db4d27d4
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 04:53:54 2023 +0000
Updating missed GPU Targets
commit 42c75862f628c9bee7cfb7dc04dff2619430efbc
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 04:43:02 2023 +0000
Adding V1 Testing
commit 9d72fd4aee85e4b0c12e717060d2730fa5b73be1
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 03:34:31 2023 +0000
Fixing Artifacts directory path
commit f4000cc558b3b2e4676f7994f7ce8c8e6f94518e
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 03:27:26 2023 +0000
Fixing CMake for test build job
commit 2ce8115d4c33948c3c8f957f545a95a04e1d6cd2
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 03:16:18 2023 +0000
Fixing Ubuntu CMake for ubuntu test build
commit 6d0ed439191be900748d0c025157f9d689a73ec7
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 01:28:41 2023 +0000
Removing Navi21
commit e349a7642e5ae5eb03ab9fcd0a0f74f09f78cab5
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 01:14:14 2023 +0000
Removing Navi21
commit fefd02fe68d2a4bca7ec2e381960ad004ee9fc5b
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 00:42:48 2023 +0000
Fixing CMake Job
commit 2ea46abf7bf92643efa8c549fa70346ffbd79d65
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 00:35:13 2023 +0000
Fixing CMake Job
commit d99d681ed1999c5fcf291dc678b11a77205fb0f3
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Wed Jul 12 00:32:13 2023 +0000
Fixing Pull Latest Dockers and CMake Jobs
commit dfc4498072d13b4a1df3a63047d34c682c3d9a29
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Tue Jul 11 23:54:21 2023 +0000
Fixing CMake job
commit 919efe04de707f7c702031be15c3e2c5f8442cbb
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Tue Jul 11 23:52:13 2023 +0000
Adding Pull Last dockers job
commit be1b1256e8b0e05308e8f7e7e69bee3acca55281
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Tue Jul 11 18:25:40 2023 -0500
Update cmake.yml
commit 212299fa4355ae6ec18f9aaacbb79c51ea6c6f97
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Tue Jul 11 18:23:35 2023 -0500
Update cmake.yml
commit 7c2c1327086a61466cc6cac39f70865c051a8bc7
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Tue Jul 11 18:18:53 2023 -0500
Update cmake.yml
commit 191b5ce007e612e814c1d7a3afb4ad398f3852e1
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Tue Jul 11 16:03:22 2023 -0500
Update cmake.yml
commit 8824113d95f3e13c7ce4d0af8e0d9d8f522a6c4a
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Tue Jul 11 16:28:09 2023 +0000
Fixing Pull from Gerrit job name
Change-Id: I9e7ed9a27a13ca49d62c93bdadb30f0057e4d385
commit cc3d5e4b02ffb439e8cc2b3efa53527c376f9982
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Tue Jul 11 16:21:43 2023 +0000
Adding Staging sync job
Change-Id: I0551f43878b0678ce4b3e74e27d62357cf95ad95
commit b9be2eee71380a2e6dd34d520e92d0c4209277a0
Author: Ammar ELWazir <Ammar.ELWazir@amd.com>
Date: Tue Jul 11 15:57:11 2023 +0000
Fixing build.sh
Change-Id: Ia987b0244f0875370d5fe69907b3f5e9cea914de
commit 9eee33a95a1abd656a7ac5ca10a9f245e9825431
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 21:39:46 2023 -0500
Update cmake.yml
commit 7093b85a78497140e8b52632ca2a002bdaeacd62
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 21:33:29 2023 -0500
Update cmake.yml
commit f54697172c72a67740f9fdfa0c217b6ea6931576
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 21:01:26 2023 -0500
Update cmake.yml
commit 1b6620e16f8940386b0f4f04e69e2410d21c0e26
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 20:21:02 2023 -0500
Update cmake.yml
commit a94bec740c6b42c4b79c87bca20fa87b99bf060d
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 19:46:35 2023 -0500
Update cmake.yml
commit 85d6b29d4375a69d575c18ece8542c50f2ddfcc3
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 19:34:39 2023 -0500
Update cmake.yml
commit 8c004887cf1435f1a6214c3d2455299a8a27bd4c
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 19:31:17 2023 -0500
Update cmake.yml
commit a14a9168e17d9348a53c6e9c9a47ba1edb4c4509
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 19:25:46 2023 -0500
Update cmake.yml
commit 000f2f40b84e6a2f7d4becdbf5aed01436ca4c83
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 19:08:18 2023 -0500
Update cmake.yml
commit a28a53d56731cad848fa9133d1c4dbaa8fc7afa7
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 19:03:39 2023 -0500
Update cmake.yml
commit a6a2db01027f0b01fdfbb5997ddb772c7f51b649
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 18:21:53 2023 -0500
Update cmake.yml
commit 118ef2a88b2d44e3207c31c343da3e5e5ec6f176
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 17:55:57 2023 -0500
Update cmake.yml
commit 03c4c232396440cd0be6d2dd7baf4ceea1c2589d
Author: Ammar ELWazir <aelwazir@amd.com>
Date: Mon Jul 10 17:48:49 2023 -0500
Create cmake.yml
Change-Id: I77992f15694e77cbae49c56f9ff02f4f9079235d
2023-07-13 20:54:30 -04:00
Ammar ELWazir
472624e3bd
SWDEV-374256: GPU Kernel Dispatch Trace Period Support
...
Change-Id: Idaabe82a30013e3aba4bcb65bd0a89ce2d14ad97
2023-06-21 12:46:33 -04:00
Giovanni LB
e1285e3fd4
SWDEV-405575: Added gfx941 and gfx942
...
Change-Id: I45a49cd64a76d3ae32c209497c70fe27b5be212b
2023-06-19 11:11:37 -04:00
Ammar ELWazir
9e62e066fe
V1/V2 API Library Separation
...
The V1 library is provided as librocprofiler64.so and the V2 library as librocprofiler64v2.so. The headers are rocprofiler.h for V1 and v2/rocprofiler.h for V2.
Change-Id: Ibe5bdbf2f79f0175342c648e917ae77918186604
2023-05-02 22:44:43 -04:00
gobhardw
14977e4dc1
SWDEV-374072 : rocprof gpu selector fix
...
Change-Id: I155e63a5dc1ecbacd76d80b0df76da99b645ed9f
2023-03-29 15:55:06 +00:00
Ammar ELWazir
8032adb64f
Adding rocprofilerv2
...
Change-Id: Ic0cc280ba207d2b8f6ccae1cd4ac3184152fc1ad
2023-03-09 13:20:33 +00:00
Laurent Morichetti
5fd1c7e8e3
Fix vgpr count calculation for gfx90a and gfx940
...
Read accum_offset from compute_pgm_rsrc3 to report both the arch vgprs
and the accum vgprs
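A rough sketch of how the two counts might be derived from the kernel descriptor registers. The field positions and granularities used here (6-bit fields in bits 5:0, VGPR granule of 8, accum_offset granule of 4) are assumptions taken from the public LLVM AMDGPU kernel descriptor documentation for gfx90a, not from this repository, and `split_vgprs` is a hypothetical helper name:

```cpp
#include <cstdint>

// Hypothetical result type: architectural and accumulation VGPR counts.
struct VgprCounts {
  std::uint32_t arch;
  std::uint32_t accum;
};

// Assumed layout: compute_pgm_rsrc1[5:0] holds the granulated total VGPR
// count (granule of 8), and compute_pgm_rsrc3[5:0] holds accum_offset
// encoded as (offset / 4 - 1).
inline VgprCounts split_vgprs(std::uint32_t rsrc1, std::uint32_t rsrc3) {
  const std::uint32_t total = ((rsrc1 & 0x3Fu) + 1u) * 8u;
  const std::uint32_t accum_offset = ((rsrc3 & 0x3Fu) + 1u) * 4u;
  // Arch VGPRs occupy [0, accum_offset); accum VGPRs take the remainder.
  const std::uint32_t arch = accum_offset < total ? accum_offset : total;
  return VgprCounts{arch, total - arch};
}
```

With both fields set to 15, this yields 64 arch VGPRs and 64 accum VGPRs out of 128 allocated.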
Change-Id: I99e746d54a6a1671e343da5658cc6ce970f79939
2022-08-03 14:02:36 -07:00
Ranjith Ramakrishnan
e7eb195924
SWDEV-345870 - Correct include paths for new directory layout
...
Use hsa header files from /opt/rocm-ver/include rather than using wrapper files from /opt/rocm-ver/hsa/include/hsa
Change-Id: Id7a9bde19447cd2a0fd6e03b11c08471f09c2a46
2022-07-14 16:08:41 -07:00
Chun Yang
f9017cbdc5
SWDEV-296922 : Incorrect rounding due to integer division in rocprofiler metrics
...
Changed derived metrics from int64 to double.
Updated the standalone test for the int64-to-double change.
Updated the intercept test for the int64-to-double change.
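The rounding problem can be reproduced with a small sketch. The metric below (a busy percentage) and both function names are hypothetical, chosen only to contrast integer and floating-point evaluation of a derived value:

```cpp
#include <cstdint>

// Integer evaluation: the division truncates toward zero, so any
// fractional part of the derived metric is lost.
inline std::int64_t busy_percent_int(std::int64_t busy, std::int64_t total) {
  return 100 * busy / total;
}

// Floating-point evaluation: operands are converted to double first,
// so the fractional part is preserved.
inline double busy_percent_double(std::int64_t busy, std::int64_t total) {
  return 100.0 * static_cast<double>(busy) / static_cast<double>(total);
}
```

For `busy = 1` and `total = 3`, the integer version returns 33 while the double version returns roughly 33.33. A plain ratio such as `busy / total` is worse still under integer arithmetic, since it collapses to 0.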
Change-Id: I49631c187406ae9dd94a869b3bb13772012e8cdf
2021-09-23 14:52:35 -07:00
Evgeny
780dfa37d4
cleanup after separating for staging and npi branches
...
Change-Id: Iadd624df21b85f1590e901a8125680743e3281a3
2021-04-08 20:37:47 +00:00
Evgeny
82d7bb2145
SWDEV-265287 : integration spmltgen.py script
...
Change-Id: Ief3e93225fb6660e72a04e4bd4b379262b73c914
2021-04-08 10:04:39 -04:00
Evgeny
e2c9d13e5b
SWDEV-274821 SPM initialization fix
...
Change-Id: I5e27928a60083eff328bab3e79937ce11bce11bd
2021-03-22 09:18:36 +00:00
Evgeny
7e60bf163e
SWDEV-255662 : spm kfd mode support
...
Change-Id: I840c7e92d3d5a59d8e5402c4d8ef86bc123dd07c
2020-12-02 13:02:45 -06:00
Evgeny
f2c9980647
fixing sqtt trace for zero size case
...
Change-Id: I75712485f518725af46a3b419339a212d1e762a0
2020-12-01 18:19:51 -05:00
Evgeny
ccc6005c25
fixing c_str() as strdup
...
Change-Id: Ib5cb68d16ce66fd2ae072168de4c16895f32b57f
2020-10-27 14:45:51 -05:00
Evgeny
0d164ba672
enable contexts wait
...
Change-Id: Ie2adf04662fddc8051fb5418904c9c659e264d78
2020-09-21 21:06:03 -04:00
Evgeny
8850e46071
kernel objects dumping
...
Change-Id: I5a16e05b7df438efa903948701b65a9ced99e5f3
initial codeobj event implementation
Change-Id: Ia7fac3c2b9897a004cfe88c4de82ba8c18284196
update - codeobj event implementation
Change-Id: I2b91b6e689875af03f0086f5a0872a97a629fd83
update2 - codeobj event implementation
Change-Id: Icff75f14fd21963e40db95373fa74880957a9e32
fix - codeobj event implementation
Change-Id: I76c33c875cb429fb12a974bb408b217f187b4536
URI buffer fix - codeobj event implementation
Change-Id: I7ce1a758e021455da3fe5b8a6e4ae3ab46e9760e
HSA events exposing
Change-Id: I3664ab4e5111c4ccedaf068dcb19f48055f0ef9b
HSA events data struct normalizing
Change-Id: I365ef0db45e0a9314bd2a1a4d29dd4eb4e91297d
2020-09-11 10:01:54 -05:00
Xianwei Zhang
b445610cd1
concurrent: enable/fix the related settings
...
Concurrent profiling relies on the aqlprofile read_api
and tracker. This patch sets those options to enable
concurrent profiling.
Change-Id: Ib97d4d8facfbc11f2684d83109397cd13f117d5e
2020-08-26 16:04:57 -04:00
Evgeny
80747de208
optimization mechanism fix: correct tracker handler; kernel name query on completion;
...
Change-Id: I14da152b4ac3c7d8fd1af2f54e9d71f834071622
2020-08-03 23:34:49 -05:00
Evgeny
7364edcc5b
kernel name filtering fix - handling [] brackets
...
Change-Id: I46a62d991a52045694640837393df229cf7a3133
2020-07-29 18:47:31 -05:00
Xianwei Zhang
61c9df4631
pmc: add support of concurrent kernel profiling
...
Profiling was previously enabled only in serial mode, i.e., kernels
are serialized in execution, and counters are reset at each
kernel start and read at kernel completion. This patch adds
a concurrent mode that issues a process-level start
packet to reset the counters and then reads them twice, at kernel
start and at kernel end, to obtain the counter value difference.
The new concurrent profiling mode requires the corresponding
changes on the aqlprofile side.
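The read-twice scheme can be sketched as a per-counter subtraction. This is only an illustration under stated assumptions: `counter_delta` is a hypothetical helper, and the two vectors stand in for whatever the start and end reads actually return:

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Returns end-minus-start for each counter. The counters are assumed to be
// free-running since the process-level start packet, so the per-kernel value
// is the difference between the two reads.
inline std::vector<std::uint64_t> counter_delta(
    const std::vector<std::uint64_t>& at_start,
    const std::vector<std::uint64_t>& at_end) {
  const std::size_t n = std::min(at_start.size(), at_end.size());
  std::vector<std::uint64_t> delta(n);
  for (std::size_t i = 0; i < n; ++i) {
    // Unsigned subtraction stays well defined even if a counter wrapped.
    delta[i] = at_end[i] - at_start[i];
  }
  return delta;
}
```

For start reads `{100, 5}` and end reads `{150, 9}`, the result is `{50, 4}`.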
Change-Id: I94b4442eadc8c64b8fba51b1e4916fc8b895ad21
2020-07-16 14:39:46 -05:00
Evgeny
2a7f77b290
counters dumping optimization
...
Change-Id: I8c694e5380e15179453148dd9ab3a3e51b6db861
2020-07-15 09:57:41 -05:00
Evgeny
f214238880
cleanup
...
Change-Id: I518f741d3d3d0bb514936c4db8b727bdc79d9fdc
2020-06-24 12:22:22 -05:00
Evgeny
9f7e936d70
concurrent sqtt support
...
Change-Id: I91391fafabc93aefa5d244d870ef82b96a59dc52
2020-06-23 20:00:49 -04:00
Evgeny
9950b97567
disabling destruction
...
Change-Id: I2a7d05a8f597b3bc8bd07bffe7181f9dcace1cbf
2020-05-07 03:34:30 -05:00
Evgeny
3af87a7423
adding pid for kernel results to support multi-process profiling
...
Change-Id: I283228a4b4145599c5e637dd6faa771b9f4b6345
2020-05-05 05:35:32 -05:00
Evgeny
fe70682184
tool destruction fix
...
Change-Id: If069c820526e21a0a4b80ac516f9669a81f34cab
2020-04-28 03:16:15 -05:00
Evgeny
7be9a42ab3
fixing hsa intercept test
...
Change-Id: I2671dfc6a9bd3e01a0c926aa3ea367b8c7a0279e
2020-03-28 17:24:16 -05:00
Evgeny
9df9fddcfb
PC sampling bringup
...
Change-Id: I0d041c4c8c3778f2c328cde38432bc72223706a3
pc sampling integration fix
Change-Id: Ia66ff876d2d99ec4d561daf8320b65d75f5cd2fe
2020-03-28 13:07:45 -05:00
Evgeny
7ecf9ef1e3
dynamic kernel lds size
...
Change-Id: I051b85550d12ca8662127615be7adfb12c5f9585
2020-02-27 17:44:22 -06:00
Evgeny
40730e34e4
adding AgentInfo::lds_block_size
...
Change-Id: I186893add96dc92570e710ae78b475897ebfe531
2020-02-18 14:00:19 -06:00
Evgeny
d2324e780f
hsa-intercept mode enabling in head
...
Change-Id: I048a975deaae691868034ee361fea71ffb5bdd83
2020-02-14 00:22:36 -06:00
Evgeny
3d4c4d1cce
device id/type for submit callback
...
Change-Id: I69b31bd28850198c8ca44e55bf8f2ff9540d92fc
2020-02-13 22:00:56 -06:00
Evgeny
4edc6c98cf
adding agent to device callback data
...
Change-Id: I7d631e4faae685ce23704ac65308bf2aae0c157a
2020-02-13 22:00:56 -06:00
Evgeny
304568c1a3
kernel free on executable_destroy
...
Change-Id: I062e08fd5df94f4d21b443b74494dc6e95201f3d
2020-02-13 22:00:56 -06:00
Evgeny
086c29fff1
adding code object freeze handling, kernel code allocate/device callbacks
...
Change-Id: I6ec32a0c4e945260cd11f88c1d2a0a7f22d6e10c
documenting comments
Change-Id: I1b7af70d51042c48aef0474d0b6b54275b42c456
2020-02-13 22:00:56 -06:00
Evgeny
f95ed067ba
adding allocate/device/memcopy/submit callbacks
...
Change-Id: Ie83ae3cd57cdf5038e83db70891b103439c37d55
2020-02-13 22:00:08 -06:00
Evgeny
389cb50854
trace parameters fix
...
Change-Id: I5cd8c897b9982f9abc9e36c18a4a488bd65fbd30
2019-12-10 04:33:29 -06:00
Evgeny
c8eb0641bf
sgpr/vgpr reported sizes in terms of registers; lds size in bytes;
...
Change-Id: Id527687cc075a4e85ea4d0063ed8f336e34d2454
2019-11-27 15:19:14 -06:00
Evgeny
60043d198b
code object tracking and v3 code object support
...
Change-Id: I081ada13f6364ea4401a97a485dedfa9bf8a45fe
2019-11-19 20:18:09 -06:00
Evgeny
dc05345a78
setting of granulated lds, sgpr, vgpr reserved by HW for the kernel
...
Change-Id: Ib3e60c0b97787f56f769136480a46a8560e00cc8
2019-11-11 18:11:16 -05:00
Xianwei Zhang
ee917344ff
build: slightly change files to pass compilation
...
Compiling with Clang produced errors such as ambiguous pointer
types, implicit conversion of NULL to bool, and unused functions.
This patch makes small changes to the affected code to fix them.
Change-Id: I4a73d66293313dda36749036f5c73aaf2a445231
2019-08-22 23:29:16 -04:00
Evgeny
988283533f
adding SPM test
...
Change-Id: I55dde52e668206e238ea860dd11e0ed5501a582a
2019-06-04 15:34:21 -05:00
Evgeny
6d6eb2b2d0
adding SPM trace support
...
Change-Id: I07c0d3ec5c4ec51ea00fea539c930c6ed27b1163
2019-06-04 05:07:19 -05:00
Evgeny
8c15ff4bf8
extending context pool API
...
Change-Id: I56809354a9ea991489479117160f3d4f451f82b3
2019-03-29 11:12:40 -05:00
Evgeny
f834529669
improving rocprof tracing usage
...
Change-Id: I4db513c93406a7038b8b681d12a0dc4298975f01
2019-03-12 18:19:17 -05:00
Evgeny
996b64309a
fix of reported completion signal in the case when it is proxied by the tracker
...
Change-Id: Ic1100fde03bc34e0130405285d32d6b97a031967
2019-02-20 19:21:35 -06:00