Anusha GodavarthySurya
d25dc8af85
SWDEV-287027 - Improve launch bounds error log message
...
Change-Id: I3229580daa97f141de89bacddfce45cd43f24b19
2021-05-31 01:51:58 -04:00
Anusha Godavarthy Surya
8cc0e04239
SWDEV-240806 - Initial commit for hipGraph and stream capture infrastructure
...
On StreamBegincapture captures the parameters passed to APIs and respective node will be created and added to graph
All parameters are passed to STREAM_CAPTURE macro, it checks if stream in capture mode and redirects the call to the capture function and returns
Updated hipStream and hipEvent with capture parameters
Added handling for hipStreamBeginCapture & hipStreamEndCapture
Change-Id: Ic8926a7b4336c2cc81f0b3a9a224aa392c474134
2021-05-07 17:38:16 -04:00
Maneesh Gupta
055a460f12
SWDEV-252801 - Fail if launch params exceeds launch_bounds
...
Change-Id: I90c8d0f1897ca63f00c3eec7c99d649a5c43416d
2021-05-06 22:38:50 -07:00
Christophe Paquot
f665936fe1
SWDEV-270599 - Refactored the way we get time from start/stop events in all cases
...
Change-Id: Id3f9bcad45d643e493daf9d5f47b3a032a427177
2021-05-03 13:16:36 -07:00
Tao Sang
b99efb9d67
SWDEV-283515 - Fix crashing in kernel launch on MGPUS
...
Fix wrong mixing of current device and stream device in
ihipModuleLaunchKernel() and hipLaunchCooperativeKernel().
Fix missing hipSetDevice() in hipMemcpyWithStream* tests.
Change-Id: I09333bb40d239bb42c832df5ea16d17eeaeff5e7
2021-04-29 14:46:47 -04:00
Rahul Garg
47fe205223
SWDEV-266973 - Add missing HIP_RETURN
...
Change-Id: Ib569a8cb030179b71406d47fd08c7e1e7ab6eb7c
2021-04-27 19:55:28 -04:00
Anusha GodavarthySurya
94f6202032
SWDEV-240806 - [hip-graph] hip_memory and hip_kernel separate command creation from enqueue
...
hipMemcpy1D all variants, hipMemcpy3D, hipMemset, hipMemset3D and hipLaunchKernel
Change-Id: Ieeb767cf87aa0546bc3b3d5d2e01ab618e0d285c
2021-04-20 11:04:42 -04:00
Maneesh Gupta
dff604f557
SWDEV-187253 - Print warning if launch_params > bounds
...
Change-Id: I138546bf816887576e797105f97ce37245911045
2021-03-24 22:29:19 -07:00
Payam
2e52ceca29
SWDEV-182753 - Throw error if launch params>bounds
...
Change-Id: I5f9931efac37e3149894ca116a1b0c2dcc3e5bc5
2021-03-12 19:17:08 -05:00
cjatin
564417d00f
SWDEV-271274 - Changing DevLogPrintf to LogPrintf for cases where error condition is being returned
...
Change-Id: I153d071f7859e9412ea0c69f53c53d190453d73d
2021-03-01 14:33:46 -05:00
agodavar
d5b4762d96
SWDEV-255979 - Added support of __managed__ static variable
...
Change-Id: I9d5cbbecc8c19ec38a95c94ab4130465ba76c102
2021-02-17 13:01:29 -05:00
agodavar
a16d91a708
SWDEV-267154 - Add checks for grid and block dimensions 0
...
Change-Id: I54799c0a4a42a91f6226f94a1e3ff7bc79046a98
2021-01-27 00:32:31 -05:00
Tony Tye
cd1117d627
SWDEV-268864 - Use new isa class to access isa related information
...
Change-Id: I2122bf85155997d802ca54e456c2dae66b801825
2021-01-18 22:31:01 +00:00
Rahul Garg
584771e7a2
Handle overflow
...
Fixes SWDEV-261712
Change-Id: I730fcdca8c2640fde9cfdfe2f4b4cf4894807fed
2020-12-09 19:09:43 -05:00
Jatin Chaudhary
dfe6f1ef27
Adding checks for function, size and params in ihipModuleLaunchKernel to have common message for hipModuleLaunch and hipExtModuleLaunch
...
Change-Id: I11921ad753930872572571dbfd1c12daba83a725
2020-12-07 03:17:14 -05:00
Sourabh Betigeri
b65314d168
SWDEV-251091: Added null checks and empty file check in hip_module and hip_platform
...
Change-Id: Iad99f996dcf90ffb86f62e79bf4cbd53b35b7e3b
2020-10-20 19:55:01 -04:00
kjayapra-amd
40322a9f1c
SWDEV-254363 - Checks to make sure kernel is launched on the same device as it was launched.
...
Change-Id: I2f273a70b1a5fc0e9fc9c6144eabd32466ff4e59
2020-10-06 08:38:09 -04:00
Vladislav Sytchenko
d5aa0f607d
Revert "Revert "Merge branch 'amd-master-next' into amd-npi-next""
...
This reverts commit 9142674ef0 .
Reason for revert: <INSERT REASONING HERE>
Change-Id: I92ceb171e31026ed1864704cef2fc1497b883ef9
2020-10-05 13:20:58 -04:00
Vladislav Sytchenko
9142674ef0
Revert "Merge branch 'amd-master-next' into amd-npi-next"
...
This reverts commit d7c02619d6 .
Reason for revert: <INSERT REASONING HERE>
Change-Id: I53322718dadde2c98f96140b8e260ec7ee9ef721
2020-10-05 13:02:39 -04:00
Aryan Salmanpour
4817ec4778
fix a bug for calculating the prev_sum of MGSyncInfo struct used in multi-gpu cooperative launch
...
SWDEV-251619
Change-Id: I6566bfc9384bb4677562085991a7a01d3cea3935
2020-10-01 14:05:22 -04:00
Satyanvesh Dittakavi
f80044bd6a
SWDEV-248293 - Fix elapsedTime
...
Fixes the elapsed time for the scenarios where HIP API takes one of the start/stop events
and another one is recorded using hipEventRecord.
Change-Id: I51831b2651fc8e7207ff0e3fcc6dc7c1b4239fa8
2020-09-02 03:22:47 -04:00
Tao Sang
2332174c8a
Replace private libelf with elfio
...
Change-Id: I59ae33f7cc55e73f6519af14be91dd9863b03af3
2020-09-01 10:29:11 -04:00
Todd tiantuo Li
a243a69e98
SWDEV-240803 - add hipFuncSetSharedMemConfig
...
Change-Id: I160b04677b3e7b99b3981ae7ecc84a0e3811d5e8
2020-08-20 18:18:24 -04:00
Todd tiantuo Li
3ac8483124
move hipFuncSetCacheConfig from hip_device to hip_module
...
Change-Id: If5a930e1210e76531f5ba9a0e5f5ec98ad473a19
2020-08-17 17:39:01 -04:00
Aryan Salmanpour
359c0fbb76
SWDEV-248057 - fix the calculation of allGridSize used in multi_grid_group() API
...
Change-Id: Ib470094e28dcacaa4769dc5c7ab08924f5b7fa41
2020-08-17 12:52:40 -04:00
Jatin Chaudhary
628a308475
Adding Anyorder flag to HIP
...
Change-Id: Ie20931541b3febe23fa9ac36ebc0c90de75a5f0a
2020-08-17 08:46:52 -04:00
Todd tiantuo Li
fb43f21044
SWDEV-240803 - add hipFuncSetAttribute and hipFuncAttribute
...
Change-Id: I3f4d67b19d89fd348fa5b884af4a2542ee4aba60
2020-08-14 17:39:29 -04:00
Saleel Kudchadker
dec95e58e3
Enable queue profile only if we attach a profiler
...
Submit explicit profile marker for hipEventRecord to record
timestamps. Enable explicit signal profiling if the API specifies
start and stop events.
Toggle this with env var HIP_FORCE_QUEUE_PROFILING=0
Change-Id: Iae449a63ec3ebf6c2880e65d7b1dd1031a29018f
2020-08-06 11:28:16 -04:00
Anusha Godavarthy Surya
8481524416
SWDEV-243961 Added overflow check for globalWorkSize
...
Change-Id: I153d99398eb4619baad8c0a52466e0d5d5f898ca
2020-08-05 11:18:07 -04:00
kjayapra-amd
0b788c4c67
SWDEV-236178 - Store texture reference metadata for dynamically loaded modules.
...
Change-Id: I99ecc80da7e29c691341a01a09e4532972f1e3e5
2020-06-11 22:34:50 -04:00
kjayapra-amd
840347f0d0
SWDEV-236178 - Reorganizing Platform/Modules code for easy access.
...
Change-Id: Ie8920260ffc4ff01e44b48af8cec9ea5aed1aa9b
2020-06-11 10:11:20 -04:00
kjayapra-amd
9261a35be9
SWDEV-234295 - Pass flag to ROCclr to not clear device programs during program::build()
...
Change-Id: I50b9fa1a96da6895f73fdf4a7c0d3f096b1188da
2020-06-05 09:53:11 -04:00
Jatin
126573df4c
Adding changes for hipExtLaunchKernel for rocCLR
...
Change-Id: Iba52bc3bde7c37f3fb375a55ba0947e87b3cdc9b
2020-06-02 14:16:41 -04:00
Joseph Greathouse
90453b68d3
Fix occupancy calculation functions in ROCclr path
...
The hipOccupancyMaxPotentialBlockSize API is meant to return the
number of threads for the highest-occupancy workgroup, and the number
of those workgroups. It was previously calculating the number of
maximum-sized workgroups that would fit on a single CU. This is
a mixture of the API we wanted (to calculate max potential block size)
and the MaxBlocksPerMultiprocessor function.
This patch fixes it up so that the internal occupancy calculation
function works for two uses: the traditional function that calculates
the maximum blocks per multiprocessor when a user passes in a fixed
block size (used for hipMaxBlocksPerMultiprocessor style functions)
and a function that calculates the size of a block that would lead
to maximum occupancy, and how many blocks of that size would be
needed to fill the whole GPU (for hipOccupancyMaxPotentialBlockSize
style functions).
This also updates the occupancy calculation function to prepare for
gfx10, which does not have SGPR-based occupancy limits.
Change-Id: Ie007b3f9d5ebc4e166b50a3a051498af35650f35
2020-05-28 10:22:10 -05:00
Saleel Kudchadker
facb05495f
Fix elapsed time calculation for null stream
...
SWDEV-237377 - This fixes time calculation where the event may
be recorded on Null stream and work submitted on other streams
Change-Id: Ie36310dea5cee2fed4a514ed01f04db4b47e571c
2020-05-27 18:42:07 -04:00
Evgeny
10cb7645dc
API tracing instrumentation
...
Change-Id: I257409b9fe299b009ded3e3a43287322d5f93a70
2020-05-14 11:03:09 -05:00
Vlad Sytchenko
a88e52ba80
Correct HIP_FUNC_ATTRIBUTE_NUM_REGS query
...
Change-Id: I526cc7871c690260df0fa8c1b3b4b15fbc5af219
2020-05-09 12:42:30 -04:00
Vlad Sytchenko
b5f9d2f818
Correct HIP_FUNC_ATTRIBUTE_MAX_THREADS_PER_BLOCK query
...
We should be returning the max workgroup size calculated by the compiler.
Change-Id: If86590efbb9b291f470bdbe87e5df992e661c539
2020-05-08 14:36:47 -04:00
Vlad Sytchenko
276bfc9667
Fix confusion in hipFuncGetAttribute()
...
Cuda shared == OpenCL local
Cuda local == OpenCL private
Change-Id: I5a204945ecde35919b9e9def20bbb2662fffea2b
2020-05-08 14:36:36 -04:00
kjayapra-amd
8f38c3260e
SWDEV-232464 - Need to initialize image with ptr passed since they can pass image not of type __ClangOffloadBundler.
...
Change-Id: I2c50042220a0230bc445ed21728f114a229c53e1
2020-05-06 14:25:43 -04:00
Payam
dba0e72de2
name change vdi to rocclr
...
Change-Id: I06d198bbb4a499e153b290b73a92afed3553b252
2020-05-06 09:14:30 -04:00