AravindanC
2228383fcb
SWDEV-352878 - Remove CLANG include dependencies
...
Change-Id: I46bba0fdf21a3541e08ff0c465ff076d478094f9
2023-01-13 07:41:10 -08:00
Satyanvesh Dittakavi
c193503722
SWDEV-376386 - Support __hip_move_dpp with hipRTC
...
Change-Id: If9a6fe23d6df143a7f522823fca7183416048c6a
2023-01-13 00:05:31 -05:00
Anusha GodavarthySurya
33eef6d95e
SWDEV-364576 - use hipLimitMallocHeapSize to set initial heap buffer size
...
Change-Id: I2f1d3b229bc28571ce6c55f93f8ef73dda516b50
2023-01-12 23:34:29 -05:00
Matt Arsenault
3e54bad5b3
SWDEV-1 - Directly call __builtin_amdgcn_fence
...
The device libraries provide hip.bc, which only contains
__atomic_work_item_fence. This is identical to OpenCL's
atomic_work_item_fence, except prefixed with __. This only existed to
switch over the unknown values the OpenCL function provides. There's
no need to go through the wrapper here since it's using constant
values, and without it we can drop one of the bitcode libraries. I'm
guessing this was a holdover from before the fence builtin existed.
Change-Id: I8b04614f96194eafddeba295215c12c3856e6c42
2023-01-12 12:04:09 -05:00
Matt Arsenault
91b851fad7
SWDEV-1 - Remove unused intrinsic declaration
...
The uses have moved to using the proper __builtin_amdgcn_ds_swizzle.
Change-Id: I82159ebf41fb91ac6a9d07b33311e569dce5e754
2023-01-12 12:04:09 -05:00
Ioannis Assiouras
e97dcfbe48
SWDEV-374405 - Added OpenGL interoperability APIs for nvidia
...
Change-Id: I1d6ad5c610ab6ee805bd07dacea316b3242ed129
2023-01-12 05:22:14 -05:00
Jaydeep Patel
bb486b04c6
SWDEV-374383 - Check from and to if no of dependencies are more than 0, Check from and to node are belongs to graph pass in.
...
Change-Id: I0cdafb497147f9ac7f84d40203a21e9eefd595a3
2023-01-11 11:43:22 -05:00
Rakesh Roy
b321e2afc6
SWDEV-368881 - Implement hipModuleLaunchCooperative* APIs
...
- Add implementation for hipModuleLaunchCooperativeKernel, hipModuleLaunchCooperativeKernelMultiDevice APIs
Change-Id: I2a80264e458549211c34b413397a08b57d021147
2023-01-10 13:50:20 -05:00
Ajay
c6a7ce66bb
SWDEV-369556 - Unit_hipMemGetAddressRange_Negative cases
...
- return value for invalid device ptr
- offset greater than size
Change-Id: I8bcf878c3c1c0a29836ea664bfb9048e1114b74e
2023-01-10 06:57:54 -05:00
Maneesh Gupta
6c53a670c1
SWDEV-345571 - Revert changes for warp shuffle functions
...
This reverts commit ad1fae7df4 and 695533d435 .
Change-Id: I548f15055417453a7fd323a481ed410598830521
2023-01-09 22:56:52 -05:00
Jiabao Xie
03200953e5
SWDEV-374375 - moving nullptr check into hipGraphNode::isValidNode
...
Change-Id: Ic9264521d6f0b36ffc948ca9feeaa70eb6e98eb6
2023-01-09 16:27:38 -05:00
Jacob Lambert
7dd54c746e
SWDEV-356171 - Adding support for name lowering for bitcodes (-fgpu-rdc) via
...
Comgr mangled name fetching APIs
Change-Id: I10ce2b1288c459e3e61218d68a270aba2148376e
2023-01-09 14:49:51 -05:00
kjayapra-amd
0c8c08cf51
SWDEV-361374 - Adding support for hipPointerSetAttributes
...
Change-Id: I56710b808854acefca07b3b5898deebee38063e9
2023-01-09 12:36:57 -05:00
Juan Manuel MARTINEZ CAAMAÑO
c391af49dc
SWDEV-290637 - Specify a working directory when calling hip_embed_pch.sh
...
Consider the case where cmake is invoked as follows:
cd <parent_dir>
cmake -S<source_dir> -B<build_dir> ...
If `WORKING_DIRECTORY` parameter is not specified in `execute_command`,
`hip_pch.o` is created outside `<build_dir>`, in `<parent_dir>/hip_pch.o`.
Change-Id: I751f4390628315ad70c36148e09824c19c93ae9e
2023-01-09 03:17:07 -05:00
German
f39655c0c8
SWDEV-372757 - Don't destroy null queue in MT
...
Skip destruction of the null queue for MT. Windows can destroy worker
thread on exit during the stream destruction, causing a race condition.
Change-Id: I3e4bc3ab2b477d38c0aec1fc59e6f1af0f8f6b0f
2023-01-09 00:03:09 -05:00
Jaydeep Patel
695533d435
SWDEV-375137 - Bypass defination of half and half2 alias for internal include of hip_fp16.h
...
Change-Id: I5258a487d4c2d93e889a523fde81f1b3906d9318
2023-01-06 21:34:51 -05:00
Jiabao Xie
e5490697cc
SWDEV-374375 - Added checks to see if hip graph nodes are valid
...
Change-Id: I9e6eabf0e4a9100de6f9d26bf5b815957f12d4f2
2023-01-06 16:58:56 -05:00
pghafari
7615fba2d5
SWDEV-375135 - adding error string
...
Change-Id: I73276462627a47d405c8439d8a843ac266fc5d38
2023-01-06 16:31:35 -05:00
pghafari
335a24d1a0
SWDEV-369042 - updating to GetError to match cuda
...
Change-Id: I0ec2330443b26cb1c8cedba942f31fb1267cd09d
2023-01-06 05:33:41 -05:00
Juan Manuel MARTINEZ CAAMAÑO
537df06c4f
SWDEV-290384 - Missing hiprtcResult case in hiprtcGetErrorString
...
Change-Id: I11d18ff19f6f6f1b36febf87a586bc9ffb95af10
2023-01-06 03:38:44 -05:00
Satyanvesh Dittakavi
516acd4e60
SWDEV-375020 - Revert "SWDEV-370552 - Improve hiprtc performance by passing the right flags to comgr"
...
This reverts commit 627e689ad7 .
Reason: Adding "-Xclang" and "-disable-llvm-passes" in hiprtc
compilation seem to cause perf issues and memory fault in few apps.
Change-Id: I12ff46293c7b7afb24d570351736d884d2881b77
2023-01-06 00:09:06 -05:00
Ajay
3e4ed8d917
SWDEV-369618 - hipOccupancyMaxPotentialBlockSizeVariableSMemWithFlags invalid flag scenario
...
Change-Id: I07df2640be71a6b0aed160d54e80ffdeb85ec554
2023-01-05 23:40:29 -05:00
Jaydeep Patel
7c32f66e67
SWDEV-374372 - Return invalid value if pitch is less than width in bytes.
...
Change-Id: I33806d747cd344250d02e217de8e9b6d5a7f83c1
2023-01-05 10:40:14 -05:00
Jaydeep Patel
67792dd858
SWDEV-374373 - Occupied size should be within range.
...
Change-Id: I20e24c628c51154c2f8f36dca0a6fa4ead1ee6d0
2023-01-05 08:02:20 -05:00
Anusha GodavarthySurya
067c7316ba
SWDEV-373173 - Correct max VGPRs and VGPR Granularity in Occupancy calculation for different asics
...
Change-Id: Ib213ed8767a793a832776fba7c0811c2f023a8ad
2023-01-04 01:34:40 -05:00
German Andryeyev
38ad6234f9
SWDEV-360031 - Erase destroyed stream from capture
...
Parallel streams can be destroyed during capture. Make sure runtime
removes the destoryed streams from the list of parallel captured streams
Change-Id: I46b6cbb951e8711f0bf02f2826c3b890b2541ffe
2023-01-03 13:34:07 -05:00
Satyanvesh Dittakavi
409f23d905
SWDEV-374371 - hipGraphAddMemsetNode should return error if width is 0
...
Change-Id: I1cd4be271f65c9cade8dddb6d00191e1b74c96bc
2023-01-03 10:29:16 -05:00
sdashmiz
ce60d47b61
SWDEV-370309 - GetNodes should do levelorder
...
- GetNode should retrun nodes based on the dependency between them
Signed-off-by: sdashmiz <shadi.dashmiz@amd.com >
Change-Id: Ie8ecdd976f8dc9d9b5dc2adc6ae8f879912c8169
2023-01-03 04:54:55 -05:00
Jaydeep Patel
f2b0542442
SWDEV-374370 - Propogate element size to blit kernel.
...
Change-Id: I8a3355c62947265be34e6a2befd26f86b3dc6cd2
2023-01-01 23:54:09 -05:00
pghafari
a643c199de
SWDEV-356549 - del Null strm in destroyAllStreams
...
Change-Id: Ie04172ee39e6333247d07791dffda753a8b9ef41
2022-12-29 15:45:54 -05:00
Rakesh Roy
95b2441cbc
SWDEV-374789 - Validate HostQueue before accessing it
...
- hip::getQueue will return nullptr when comgr fails to load blit kernels.
Change-Id: I3bea1f5c13301d4a35e40ab7c6faa73f9c6aecf4
2022-12-28 01:04:10 -05:00
pghafari
2c906460cb
SWDEV-369580 - hipModuleGetTexRef matching cuda
...
Change-Id: I3f071fe0b289c0ba8316c691f066f47d12b156b5
2022-12-27 13:58:26 -05:00
Anusha GodavarthySurya
ad1fae7df4
SWDEV-345571 - Added support for half/half2 data types for warp shuffle functions
...
Change-Id: Ia0791760c1b0a35bbbf0b1f8435aa924af205169
2022-12-26 01:24:06 -05:00
Ioannis Assiouras
1278cf555e
SWDEV-374393, SWDEV-373631 - Added input checks for hipMemcpy2D/3D
...
Change-Id: I5ae57326c35b599d9ecec05e2a22bc6086027322
2022-12-22 11:21:55 -05:00
Ioannis Assiouras
99d2d016c1
SWDEV-374355, SWDEV-374356, SWDEV-374357, SWDEV-374358, SWDEV-374359 - Added input sanity checks to hipExternalMemoryGetMappedBuffer
...
Change-Id: I61e43dece3fb9bd961e92db8e015f08170327538
2022-12-22 10:46:10 +00:00
Jaydeep Patel
c4003ef62a
SWDEV-374398 - Pitch should not be less than width for both src and dst.
...
Change-Id: Ib8fa8e3bd0c655f3de213b3798d369ba866b2f01
2022-12-21 23:08:55 -05:00
Anusha GodavarthySurya
707ca92a80
SWDEV-373147 - Return error when getDeviceKernel returns nullptr
...
Change-Id: Ie9fcaa2bff87cb8ef363dc7e8362b74b8725b3b6
2022-12-21 22:14:10 -05:00
Ioannis Assiouras
d29a2e545d
SWDEV-374392 - Make hipMemCpy2/3DAsync synchronous for any HostToHost and DeviceToPageableHost
...
Change-Id: Iaa411681a49427a9e9b3a06e90abb0b32f5a6851
2022-12-21 14:59:39 -05:00
Ioannis Assiouras
b83271b4ca
SWDEV-374402 - Added check for invalid events in hipExtLaunchKernel
...
Change-Id: I8dfd69aa1ef38971a72b98c723d2099347a66d11
2022-12-21 12:37:12 -05:00
Jaydeep Patel
845bf0adb4
SWDEV-374382 - Change hip error type to context is destroyed instead for invalid stream.
...
Change-Id: I5ed22463204d7bff3741599eeced3d030da78fc4
2022-12-21 12:54:39 +00:00
Jaydeep Patel
148f67a36d
SWDEV-374394 - Correct hip error for invalid stream.
...
Change-Id: I96949c58e050cb3149ab8434aebe0f1a97a6e857
2022-12-20 21:27:56 -05:00
Ioannis Assiouras
f4f377bcaa
SWDEV-374402 - Added input stream check for hipExtLaunchKernel
...
Change-Id: I42915c3b9b10b4a4a5b3ae74b6a2dccc8fddbf7f
2022-12-20 14:29:58 -05:00
Ioannis Assiouras
97c5f772c6
SWDEV-374400, SWDEV-374404 - Change error message in hipFuncGetAttribute(s) to match nvidia.
...
Change-Id: I95b69a49ced24115f0aebc288ea067a49b770aca
2022-12-20 12:49:17 -05:00
pghafari
3b143cf3a2
SWDEV-369575, SWDEV-369568, SWDEV-369574 - matching
...
matching cuda return value
hipModuleLaunchKernel and hipExtModuleLaunchKernel matching with cuda in
negative cases
Change-Id: I636070924c81116dccbfa74f693061c60530de45
2022-12-20 12:24:23 -05:00
Jaydeep Patel
027f71ccfb
SWDEV-374395 - Correct rhs.
...
Change-Id: I433d60344a6d23bc48f4db58d5501af89bc1989c
2022-12-20 11:38:11 +00:00
Satyanvesh Dittakavi
627e689ad7
SWDEV-370552 - Improve hiprtc performance by passing the right flags to comgr
...
Change-Id: I99674b033d1248d9b34ac598b4780bd84a38c279
2022-12-19 06:16:56 -05:00
Satyanvesh Dittakavi
e90721435b
SWDEV-372767 - Fix hipCUB and rocWMMA build failures
...
Change-Id: If3672fd50075c67cfce4d94e73b1376c4aa8d2ae
2022-12-16 03:08:18 -05:00
pghafari
c0f779fdf1
SWDEV-369576, SWDEV-369578, SWDEV-369577, SWDEV-369579, SWDEV-369563 - matching cuda return value
...
SWDEV-369576, SWDEV-369578, SWDEV-369579, SWDEV-369563 - matching cuda return value
Change-Id: I21b598e959571979307238c485492eb7fe2388cc
2022-12-16 01:12:49 -05:00
kjayapra-amd
2f8847bfbd
SWDEV-363865 - Use COMGR path to unbundle code object in HIPRTC.
...
Change-Id: Ifca29ea0c6ff319c3f584397e542a1658d220c4a
2022-12-15 07:16:55 -05:00
Jacob Lambert
2835ff1a63
SWDEV-322000 - Removing scoped locks in code regions with Comgr calls
...
Comgr now internally locks to avoid multi-threading issues with LLVM
Change-Id: I74b903d373af55c2efade17a9caf31fda3ca1795
2022-12-14 12:40:45 -05:00