kjayapra-amd
e9bd41bf1a
SWDEV-234295 - Dont clear device programs during amd::program::build()
...
Change-Id: I87bc7e2c830edee783ee490bbb087492467f2704
2020-06-03 12:18:25 -04:00
German Andryeyev
dc4e09a63a
Avoid lock for last queued command
...
Use atomics for last queued command update
Change-Id: I759e9d78ea72f23c0d45dbede6250b231e122276
2020-05-29 11:06:55 -04:00
Christophe Paquot
0782acabb5
Use a dedicated lock for last queued command set/get
...
Change-Id: If3d2144841c7863cf7afe2ca85aea62e0a3a33c7
2020-05-28 12:49:39 -07:00
Evgeny
4ffe859cee
fixing a typo
...
Change-Id: I2fc113e1f52ae9356636ddd4597c36c822bed49b
2020-05-26 14:49:27 -04:00
Aryan Salmanpour
fed94b8604
Add support for setting CU mask on ROCclr for ROCm backend
...
Change-Id: I0dbe2eeb33467fc0f24b26929119c10e9b455da7
2020-05-15 14:23:43 -04:00
Vlad Sytchenko
6e985845b3
Take into account dynamic LDS size when validating the launch parameters.
...
Bottom layers don't error check this value, so we might and up writing a bad value to a register and cause the SPI to hang.
Change-Id: I6da4ae71c66a25c63ebb804da4afe4ca7fb831b7
2020-05-08 09:37:06 -04:00
Michael LIAO
12fcfee41d
Fix build failure.
...
- Also fix `-Wreorder` warning. NFC.
Change-Id: I766fdc622c9107f901a55498bdc8fef3d821d1b7
2020-05-07 10:39:10 -04:00
Michael LIAO
503ef06555
Clear executable permission.
...
Change-Id: Ia0d363b1ba89d7947e5b5a55cb67edba86f0515e
2020-05-07 10:38:58 -04:00
kjayapra-amd
347e36e31b
SWDEV-232464 - Memory Map modules loaded via file from hipModuleLoad
...
Change-Id: I0e644a161c8000abe1b07fbec72de09f1c0a4b18
2020-05-04 12:40:16 -04:00
Alex Xie
6c5a42b33c
SWDEV-232894 Port hipMemcpy optimizations from HCC to VDI
...
Apply the optimization to change for OpenCL too.
Clean up some unnecessary checks.
Change-Id: I840261fe35baeeadeba7388e86779d482f509aad
2020-04-30 11:06:28 -04:00
Laurent Morichetti
9e1964ddaa
Make the device binary copy optional
...
Device binaries that are embedded inside the host binary do not
require a copy. Their lifetime is guaranteed to exceed that of the
loaded executable.
Add a 'make_copy' parameter to amd::Program::addDeviceProgram. If
make_copy is false the original image will be used and will not
get freed when the amd::Program is destroyed.
Change-Id: I7973bb0243f5a2d1b639b8a88445cfe6af919dd7
2020-04-29 18:39:57 -04:00
Alex Xie
009d0b5f55
SWDEV-232894 Port hipMemcpy optimizations from HCC to VDI
...
Change-Id: I6bebe9ac503a9f80d067aeea8a848409ad210338
2020-04-27 14:53:58 -04:00
kjayapra-amd
7458bf9964
SWDEV-229840 - Improve error messages on ROCCLR Layer.
...
Change-Id: Iab7d9156cdc206db86385aa05023a0095ed40f92
2020-04-19 20:01:49 -04:00
Saleel Kudchadker
b4d852de7f
Fix env var name for CL_AGENT
...
This got incorrectly renamed to VDI_AGENT
Change-Id: Ifd7f7c89c0a4e4bfc29e0bceae2c22bf6ba01379
2020-04-08 14:07:11 -04:00
Saleel Kudchadker
a97f206ab1
Change for OpenCL2.2 header promotion
...
Change-Id: Icd579e5c94af6e8d984a3f6c2606be44dec72e91
2020-04-01 16:32:03 -04:00
German Andryeyev
9f795d750d
SWDEV-224639 - Device memory is still occupied
...
- Add cache free on OCL context destroy
- Remove std::mem_fun() usage, since it was removed in c++17
Change-Id: If6acd08f13a2298912ecd78fc025dcf0b32aee54
2020-03-30 17:39:21 -04:00
German Andryeyev
85c09043f3
SWDEV-79445 - Remove cl_amd_object_metadata extension
...
Change-Id: I35502d5e3bd4be4a087f4b49b68098f8aa31fdb4
2020-03-24 19:12:52 -04:00
Payam
1b6f21ad9a
removing AMD emails per palamida scan
...
Change-Id: If7307f5b1f81a43f2725ec5abd3b8989cbddbcc5
2020-03-11 21:26:55 -04:00
German Andryeyev
288967eff4
SWDEV-193956 - Fix a regression in OCL for user events
...
- Check the queue for nullptr, since the user events may not have
a queue, associated with them
Change-Id: Ib969a052acc9108ca3fd0c063157fe4d47c5b244
2020-03-09 11:10:23 -04:00
German Andryeyev
0fc433e076
SWDEV-193956 - [hipclang-vdi-rocm][perf]
...
~45% to 50% of Performance drop on rocBLAS_int8 test
Add support for active waits without blocking the host thread.
Change-Id: Ie7bb48dcafcb4c93d448bf74749b829b626c3578
2020-03-04 17:02:15 -05:00
German Andryeyev
77ff161324
SWDEV-193956 - [hipclang-vdi-rocm][perf]
...
Retain command before a wait.
Change-Id: I0ab8f1352af32c96567dad9e31b3687bdf407bf6
2020-02-21 16:43:21 -05:00
German Andryeyev
a66d09f5a3
SWDEV-193956 - [hipclang-vdi-rocm][perf]
...
~45% to 50% of Performance drop on rocBLAS_int8 test
Use the last command in the queue for a wait.
Add extra print information about processed commands.
Add an option to disable file location printing.
Change-Id: I4187883e1a90e571fde3128af98368108fda8785
2020-02-21 15:21:15 -05:00
Christophe Paquot
566144edb2
Append before setting last command to avoid corner case
...
Change-Id: Iafe5f899427f0119e7f43e96af38e6e3a1dbfc93
2020-02-13 22:23:20 -05:00
Laurent Morichetti
d9d9c69399
Replace cl_* integral types with standard types.
...
cl_bool -> bool
cl_int -> int32_t
cl_uint -> uint32_t
cl_long -> int64_t
cl_ulong -> uint64_t
cl_float -> float
cl_double -> double
cl_bitfield -> uint64_t
Change-Id: I840c8993b55f98f5b745d21e27f5f28233647a58
2020-02-12 13:16:06 -08:00
German Andryeyev
da5f4c554e
Add CMake support for PAL backend
...
Change-Id: Id0673d43217be305aa4ca7a612c416baee3fb23c
2020-02-10 11:18:24 -05:00
Laurent Morichetti
b4c6143a2f
Update copyright info
...
Change-Id: Ia4f9ff0f5f873b4223a8cca154188bb0d2f1abba
2020-02-04 09:26:14 -08:00
Laurent Morichetti
20c7173849
Merge branch 'origin/pghafari/vdi-prototype' into lmoriche/amd-master
...
Change-Id: Id3b833d405596735becb3346f3b08c6da57033fe
2020-01-30 20:12:13 -08:00