rocm-systems

Автор	SHA1	Сообщение	Дата
Vladislav Sytchenko	5429b40afe	Rework the texture C++ API Currently the texture C++ API is forwarded to the ihip*Impl() calls, which are not even a part of Cuda. These should be forwarded to their respective Cuda C APIs instead. This change also fixes a bug with hipUnbindTexture() creating a dangling pointer. Change-Id: Ifafc9d106855a11bec84a18ea214b3d89e39990d	2020-03-18 18:14:53 -04:00
Vladislav Sytchenko	3e460ab514	Correct the declaration of hipBindTexture2D() The texture reference needs to be passed as a constant pointer. Change-Id: Idde461f0f328ac87ce677b6bab3203161b514cbf	2020-03-18 18:08:23 -04:00
Vladislav Sytchenko	2d77399747	Correct the declaration of hipBindTextureToArray() The texture reference needs to be passed as a constant pointer. Change-Id: Iff171626536071fb2020cfff7132ec930577b1b9	2020-03-18 18:08:13 -04:00
Vladislav Sytchenko	7190fa518e	Correct the declaration of hipBindTexture() The texture reference needs to be passed as a constant pointer. Change-Id: I36ca0bddaba30becfc2ce70dd9e5b7db66c57f27	2020-03-18 18:08:01 -04:00
Vladislav Sytchenko	551bcc6293	Add missing mipmap API entries Introduce hipFreeMipmappedArray(), hipMallocMipmappedArray() and hipGetMipmappedArrayLevel() APIs. Change-Id: I878228c79fa1c54536c17d6baf45f83d51d2b1c7	2020-03-18 18:07:45 -04:00
Vladislav Sytchenko	99e744ab4a	Don't hardcode the texture read mode The readmode needs to be inferred from the template arguments. Change-Id: I067037035e2492a24eac47e16d4015f879be0ea7	2020-03-18 18:07:33 -04:00
Vladislav Sytchenko	117f0ab102	Add constraints to texture indirect functions Similar to the previous patch, this change adds type constraints to texture indirect functions. Since we don't have to deduce the return type for these, we simply just have to check if the user provided a valid channel type. Change-Id: Ia094bd6126e01df2ea90902c9aa59cb6cfe85773	2020-03-18 12:24:40 -04:00
Vladislav Sytchenko	ef2415edc7	Add constraints to texture fetch functions When sampling a pixel the hw always returns a float4. The type in the texture reference controls the bitcast that we perform before returning the sampled pixel. Creating a texture with an unsupported will lead to potential UB. This change makes it so that it's only possible to use textures with a type that makes sense. Using something like texture<int, hipTextureType1D, hipReadModeNormalizedFloat> will now lead to a compilation error with a message "Invalid channel type!". Change-Id: I7fde44cb1d4b9737e0c48c28cb59c018c59ccaa2	2020-03-18 12:24:40 -04:00
Yaxun (Sam) Liu	08d9759eba	Workaround for libc++ include path for HIP-Clang (#1917 ) HIP-Clang cuda_wrapper headers require clang include path before standard C++ include path. However libc++ include path requires to be before clang include path. To workaround this, we pass -isystem with the parent directory of clang include path instead of the clang include path itself.	2020-03-18 11:20:21 +05:30
Sarbojit Sarkar	82926666c4	[hip-vdi]Fix for TF build failure [SWDEV-225827] Change-Id: I8478779bef92bad8353b8d066b28c220bb59b98d	2020-03-17 22:52:01 -04:00
Vladislav Sytchenko	a0751402d8	Rework device texture headers This change addresses three things. First the available APIs are brought up to par with Cuda (missing ones are added and incorrect ones removed). Second the size of hip/hcc_detail/texture_functions.h. Using some template magic we can bring down the code size down from ~11k lines to only ~900 lines in total. Third this change fixes some bugs in the declaration of the texture fetch funcitons. Currently the return type for textures with read mode set to hipReadModeNormalizedFloat is not float. This causes pixel data to be lost during the bitcast when the texture pixel element size is less than the size of float. The new headers will only be enabled for VDI to avoid breaking HCC. Change-Id: I77cb29293fb79e55681be094c37702a48d80b64c	2020-03-17 17:04:37 -04:00
Jatin Chaudhary	16a6a94fbf	Adding Half Abs APIs (#1902 )	2020-03-17 14:13:19 +05:30
Sameer Sahasrabuddhe	899c878703	enable HCC printf when using hip-clang (#1947 ) This allows printf to work with hip-clang and HCC runtime. See comments under #1919 for a reported bug and feature request.	2020-03-17 14:03:27 +05:30
Joseph Greathouse	f7e85649f4	Fix compiler warning on NVCC path (#1942 ) GCC emits a warning about using static functions like hipCUDAErrorTohipError inside this function, because it has an inline directive, but it's not static. Adding static to this function to silence warnings (and prevent potential problems in the future).	2020-03-17 14:02:59 +05:30
Joseph Greathouse	4128d68ed7	Fix occupancy calculations API on NVCC (#1941 ) NVCC warned if you tried to use hipOccupancyMaxActiveBlocksPerMultiprocessor because when passing in a device function pointer, "const void* func" was insufficient to describe it accurately. Adding a C++ templated class type definition for this function.	2020-03-17 14:02:48 +05:30
Sarbojit2019	320742e8a0	Fix __sad signature match with Cuda (#1936 ) Fix for issue #1930	2020-03-17 14:02:00 +05:30
Aryan Salmanpour	015895a265	[HIP] add cooperative kernel launch APIs on NVCC (#1929 )	2020-03-17 14:01:11 +05:30
Maneesh Gupta	eee5cc8621	Annotate `__constant__` (#1901 )	2020-03-17 13:59:44 +05:30
mhbliao	774035d869	[hip] Improve the portability of the header for vector type support. (#1873 ) - Need to check the availability of `__has_attribute` builtin macro instead of compiler versions. That's more reliable and portable among various compilers. - Provides a very basic support of vectors for unknown compilers.	2020-03-17 13:59:24 +05:30
Sameer Sahasrabuddhe	64cd527335	SWDEV-204784: separate printf declaration for vdi/clang There are now two implementations of printf in HIP: 1. The implemenation for HCC is controlled by the HC_FEATURE_PRINTF macro, and it works only with the HCC compiler used in combination with the HCC runtime. 2. The implementation for hip-clang requires the VDI runtime, and is always enabled with that combination. Change-Id: Ibaeda7900ffe2ce602ca0094aafed0f1147ac2b6	2020-03-16 04:00:39 -04:00
Evgeny Mankov	70f5646f8a	Merge pull request #1908 from asalmanp/prop_mulit_coop [HIP] add hip specific properties for cooperative kernel multi device	2020-03-12 19:12:11 +03:00
Alex Voicu	1c5f526e6b	Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_robust_constant	2020-03-12 14:20:26 +00:00
Maneesh Gupta	0726abf424	Expose support for non-returning atomic FADD (#1909 ) Change-Id: If5359488324477315a9bd4f308a75f606c065b39	2020-03-11 14:33:15 +05:30
Vladislav Sytchenko	4ca9cda372	Fix typo in device __shfl_xor function Change-Id: I8bcdd53ced00c596a0af013a0c34e37aa67c93ae	2020-03-10 13:23:08 -04:00
Nick Curtis	09edc7e49c	Fix incorrect shfl_xor for Windows copy/paste error, need __shfl_xor w/ lane_mask	2020-03-10 12:04:05 -05:00
Vladislav Sytchenko	ecd7c99b49	Add hipDrvMemcpy3D. This is the equivalent of cuMemcpy3D. Change-Id: Ib2e06dbd6f5093c931cdfd36c87617f32acffc2d	2020-03-09 16:11:25 -04:00
Sameer Sahasrabuddhe	09130b3b92	separate printf declaration for vdi/clang There are now two implementations of printf in HIP: 1. The implemenation for HCC is controlled by the HC_FEATURE_PRINTF macro, and it works only with the HCC compiler used in combination with the HCC runtime. 2. The implementation for hip-clang requires the VDI runtime, and is always enabled with that combination.	2020-03-09 09:40:05 +05:30
Lad, Aditya	d80edf9541	Merge branch 'master' into amd-master-next Conflicts: CMakeLists.txt tests/src/texture/simpleTexture2DLayered.cpp tests/src/texture/simpleTexture3D.cpp Change-Id: I4aa4754d391b5f37ddf15fa0bcfc84d9da020119	2020-03-06 14:10:44 -05:00
Aryan Salmanpour	7e45c54ea6	move new enums to the end to maintain compatibility	2020-03-06 11:38:44 -05:00
Maneesh Gupta	4a40010ac6	Expose support for non-returning atomic FADD Change-Id: If5359488324477315a9bd4f308a75f606c065b39	2020-03-05 10:30:52 +05:30
agodavar	6a5d04209c	Fix hipExtLaunchMultiKernelMultiDevice compilation issue Fix compilation error on hip-hcc+clang , hip-vdi+clang Enabled hipExtLaunchMultiKernelMultiDevice test on hip-vdi path hipExtLaunchMultiKernelMultiDevice common declaration for all paths Change-Id: I76031840614fce8e12a8e845548fa43a389a741a	2020-03-04 15:38:14 -05:00
Aryan Salmanpour	03797ae986	[HIP] add hip specific properties for cooperative kernel multi device	2020-03-03 13:25:36 -05:00
Alex Voicu	27480ff5a2	Annotate `__constant__`	2020-02-28 22:54:00 +02:00
saleelk	3e1f41c165	Fix HIPRTC headers to export C style symbols (#1879 )	2020-02-28 16:47:29 +05:30
Rahul Garg	6c5fa32815	Remove deprecated HIP markers (#1876 )	2020-02-28 16:47:15 +05:30
Rahul Garg	edc97f3073	Add hipDrvOccupancyMaxActiveBlocksPerMultiprocessor[WithFlags] (#1854 ) Equivalent to cuOccupancyMaxActiveBlocksPerMultiprocessor[WithFlags].	2020-02-28 16:46:55 +05:30
Nick Curtis	b7dd073d93	fix long shuffle implementations for windows (#1895 ) Fixes for SWDEV-223694	2020-02-26 15:53:56 +05:30
Rahul Garg	8c5e5e435b	Fix hipMemcpy3D (#1798 ) Fixes #1790 and #1791. hipMemcpy3D still requires further refactoring for different input and output combinations.	2020-02-17 19:35:35 +05:30
Tao Sang	b3f445c0f5	Temporarily comment out Hcc-specific APIs for CLang compiler Temporarily comment out Hcc-specific template functions hipExtLaunchKernelGGL and hipOccupancyMaxPotentialBlockSize for CLang compiler so that all test cases under hip/samples can be built successfully for Clang + Hip/Hcc runtime. Change-Id: Iafc761257be4a7b34eafa6759a01f369570cd6ce	2020-02-16 22:26:47 -05:00
Nick Curtis	797a929a65	Implement long / long long shuffles (#1829 ) Implement additional data-types for shuffles (long and long long). Based upon the double implementation.	2020-02-15 09:51:09 +05:30
ansurya	8c6934223b	Reduce GPU copying based on arch it runs on (#1751 ) Implements SWDEV-213230.	2020-02-13 14:21:51 +05:30
Aryan Salmanpour	959f1b0f0e	fix build error in nvcc path	2020-02-11 12:16:51 -05:00
Aryan Salmanpour	5a29f27455	Fix a typo causing a build error	2020-02-10 11:44:40 -05:00
Aryan Salmanpour	874b201ee2	resolve merge conflict	2020-02-10 10:30:55 -05:00
Maneesh Gupta	f8e1c01900	Revert "Match Occupancy APIs syntax with CUDA (#1625 )" (#1857 ) Reverting this for now till we figure out how to avoid the build breakage. This reverts commit `fa98798b63`.	2020-02-10 10:45:28 +05:30
Alex Voicu	dd34ea95d6	(Maybe) Match alignment between Clang and GCC. (#1789 ) Should fix #1740 and the related internal bug.	2020-02-10 10:44:49 +05:30
vsytch	ef514eef71	Device texture functions should not normalize the sampled pixel (#1826 ) * Device texture functions should not normalize the sampled pixel. This is already done by HW. * Add support to use h/w capability for normalized float data convertion for driver API's Co-authored-by: ansurya <50609411+ansurya@users.noreply.github.com>	2020-02-05 20:56:17 +05:30
Aryan Salmanpour	c8137263d6	code clean up	2020-01-31 13:08:25 -05:00
Aryan Salmanpour	6e867eacb6	[HIP][HIPIFY] Add some missing flags for cooperative launch and occupancy APIs	2020-01-30 15:05:53 -05:00
satyanveshd	fa98798b63	Match Occupancy APIs syntax with CUDA (#1625 ) * Match Occupancy APIs syntax with CUDA and fix tests using these APIs	2020-01-29 13:05:53 -08:00

... 6 7 8 9 10 ...

1557 Коммитов