rocm-systems

Author	SHA1	Message	Date
Evgeny	ef7ff69ff0	adding hipKernelNameRefByPtr function Change-Id: Iefc18967b10394b85a207ffdb5bbfe5e3601474d	2020-05-28 10:59:48 -04:00
Michael LIAO	f6addba699	[hip] Those texture interfaces are C interfaces should be always exposed. Change-Id: Ie34f1420839b17486346149b1672e70ec0088b54	2020-05-27 15:03:59 -04:00
Sarbojit Sarkar	83b11f9a61	[doc]shflsync update 1. Updated FAQ with shftsync not supported hip_faq.md 2. Corrected some of input parameter description in hcc_details/hip_runtime_api.h 3. Redirect shfl() to shfl__sync() for nvcc path where CUDA > 9.0 Change-Id: I3d8184db5fcc622852c9bad96b706348e8dfc16c	2020-05-27 02:17:40 -04:00
Mahesha Shivamallappa	01dae52d64	Add support for cooperative group type - thread_block Change-Id: If3770b6d6718a638b70f527ae2533d9ef3267ff4	2020-05-22 23:08:42 -04:00
Aryan Salmanpour	7dd5b19290	Add support for hipExtStreamCreateWithCUMask API Change-Id: I369d0eaca493821c4badc6b18ac02daa2fddc95f	2020-05-22 11:34:06 -04:00
Evgeny	5abb8e1a68	API tracing instrumentation Change-Id: I257409b9fe299b009ded3e3a43287322d5f93a70	2020-05-14 11:03:09 -05:00
Matt Arsenault	d2dd307c7d	Remove some asm declarations for intrinsics This technique should never be used, and only accessed through __builtins. There's currently no builtin for groupstaticsize. I left ds_swizzle since for some reason it switches to the builtin based on __HCC__ or not. Change-Id: If1e1394221dba83ea4add6db5e94d6b715552044	2020-05-11 15:20:58 -04:00
Michael LIAO	a2dbcc075c	[hip] Fix `-Wduplicate-decl-specifier` warning. NFC. Change-Id: Iae48bbb7805c39f1005c920df8e76504426f2d3b	2020-05-11 10:12:33 -04:00
Sarbojit Sarkar	3612851809	Enabling hipGetDeviceFlags required in [SWDEV-229170] Change-Id: I998d37e5847f9651345554bada86df6fce86d1eb	2020-05-08 01:37:23 -04:00
Payam	c5f76c3de3	name change vdi to rocclr Change-Id: I06d198bbb4a499e153b290b73a92afed3553b252	2020-05-06 09:14:30 -04:00
Rahul Garg	60c34fbd4d	Make HIP C compliant Change-Id: Ic2fa650675e68200c841ce3db622da836b169f33	2020-05-05 12:49:40 -04:00
Vlad Sytchenko	bfad8d2833	Fix even more typos from `5429b40afe` Change-Id: I4f44261547b321a214348943ff5117eb5bd55b06	2020-05-04 15:26:56 -04:00
Alex Xie	d890d77da4	SWDEV-221166 - Detect support for large bar access through HIP runtime API Change-Id: Iaa9756c1b5e40c1ab5afb38e44a6699fa5f6c13f	2020-05-01 20:39:52 -04:00
Michael LIAO	64507de694	Fix more typos from `5429b40afe`. Change-Id: I75ed28a5862daffc0778910d7ba3b97f51a87949	2020-05-01 12:19:30 -04:00
root	2689246de6	Merge master into amd-master-next Change-Id: I3fc1dc0c860d627053537581e75561e8a7efe327	2020-04-26 22:19:37 +00:00
Yaxun (Sam) Liu	808dae6813	Enable template max and min for HIP-Clang (#2028 ) It was for HCC only. HIP-Clang also needs it for __fp16 since AMDMIGraphX uses it. Change-Id: Id49322b7b89ef799accdf6b47627a6fce51d1ab5	2020-04-24 12:30:28 -07:00
Yaxun (Sam) Liu	4143d81618	Enable template max and min for HIP-Clang This change is required by AMDMIGraphX. It was for HCC only. HIP-Clang also needs it for __fp16 since AMDMIGraphX uses it. Change-Id: Id49322b7b89ef799accdf6b47627a6fce51d1ab5	2020-04-24 09:51:17 -04:00
Vlad Sytchenko	8d6347c6b8	Make sure to zero out all the unset texture fields These might contain garbage causing the runtime to incorrectly parse the state of the texture references. Change-Id: I93c726fa30b580b3e14c50ac939f3c71b0d1c8d9	2020-04-23 16:38:52 -04:00
Maneesh Gupta	a0b5dfd625	Merge in the rocclr based hip runtime (#2032 ) * Merge master-next changes in master (include vdi development in master branch)	2020-04-23 09:12:06 -07:00
Michael LIAO	218044577e	[hip] Fix typos. Change-Id: I9d85d0e70033d144dbd4d61cb434ffbe023af8c0	2020-04-22 16:44:54 -04:00
Michael LIAO	19f793f1cd	[hip] Generate assertion message in assertion. Change-Id: Ie66f6563e8728fd0e21cf22dcc6619e4a0e5c28d	2020-04-21 16:44:40 -04:00
Michael LIAO	16d9fe5e37	[vdi] Refactor texture/surface reference support. Change-Id: I8014d82aae7139ef5f95e4b50c4fc6da200dbc9d	2020-04-21 11:56:48 -04:00
Aryan Salmanpour	386a0e0123	disable printf on hip-clang on Windows (#2021 )	2020-04-17 10:33:24 +05:30
Jeff Daily	ef596cd088	add IPC event support (#1996 )	2020-04-17 10:31:22 +05:30
Yaxun (Sam) Liu	8d83e95457	Disable device side malloc (#2009 ) * Disable device side malloc Currently device side malloc is not working and takes excessive device memory. Disable it for now until a working malloc is implemented. Change-Id: I1ad908c1c53a83752383b4be96688a848642c699	2020-04-14 16:07:14 +05:30
Yaxun (Sam) Liu	88304c15e6	Fix MIOpen build failure This is charrypick of `9ead991784` and https://github.com/ROCm-Developer-Tools/HIP/pull/2009 Fix cmake config file Removed cmake target files under packaging directory. Merged cmake config .in files for HIP-Clang and HCC as one. Use cmake generated target files in both install and packaging. This makes cmake config file consistent for make install and make package. Let device side malloc/free return nullptr and trap Change-Id: I448f3ea2d4934648089bad371debc203f895cba6	2020-04-13 23:01:31 -04:00
Vlad Sytchenko	f311b0062f	Fix Windows build Change-Id: I8c46c8ee82a6e47483d4c0430b483eead3772e5b	2020-04-10 22:25:04 -04:00
Maneesh Gupta	2af31479e2	Merge branch 'amd-master' into amd-master-next Change-Id: I3094c15008093f2072bcd38aca4ea90aeae2d97b	2020-04-09 06:31:00 -04:00
Michael LIAO	a48b312aa9	[hip] Fix volatile-qualified member function declartion. - It should be a volatile-qualified member function instead of returning volatile type. Change-Id: Id7aaa1953d56151b59e469ef22b9f4280f63bebb	2020-04-07 12:49:26 -04:00
Rahul Garg	ba8a556ea9	Rename hipDrvOccupancy to hipModuleOccupancy and match CUDA syntax (#1943 )	2020-04-07 14:02:52 +05:30
German Andryeyev	5fe91ccb1b	SWDEV-184710 Support hipLaunchCooperativeKernelMultiDevice() - Add validation logic for MGPU launches to pass a cuda test Change-Id: Iccca7fde43493fc3bc6685512d39202271ae3e92	2020-04-06 16:38:27 -04:00
lmoriche	9de5e90ab5	Don't duplicate embedded code objects (#1991 ) If the code object is embedded in an already mapped file, and the lifetime of the mapped file exceeds the lifetime of the executable, we do not need to make a copy of the binary. This allows the ROCR to present the code object URI as file:///path/to/file#offset=X&size=Y.	2020-04-06 15:37:35 +05:30
ansurya	770e76e752	Initial support for bfloat16 (#1980 )	2020-04-06 15:35:43 +05:30
Yaxun (Sam) Liu	4af2106d10	Fix ambiguity of fma for _Float16 for libc++ (#1976 ) libc++ defines fma as template function for auto promotion of mixed-type arguments. libc++ does not handle _Float16 as _Float16 is not a supported type by C++ standard. As such, it is unlikely we can commit our fix for _Float16 to libc++ trunk. Therefore we handle _Float16 with a template specialization of __numeric_type in HIP headers. Change-Id: If01960a657ebf1a7a67463cdcf66fab7458dff3c	2020-04-06 15:35:18 +05:30
Vladislav Sytchenko	aea688b79c	Add entry points for hipTexObject*() API Even though the runtime and driver texture object API is one to one, the structs used by these APIs are not. See hipResourceDesc vs HIP_RESOURCE_DESC differences. These differences are not trivial and most likely won't be able to handled by hipify, so we need new API entry points. Change-Id: Id4bcb1ad0ae15378dbdb5a2ed07e5ea30f320082	2020-04-01 14:51:51 -04:00
Michael LIAO	b72196613a	[vdi] Fix hipGetSymbol{Address\|Size} - Use symbol value as the qeury key. Compared to the symbol name, the symbol value is more robust as developers may use unqualified or qualified identifiers. It also removes the mangling and/or demangling requirement for the runtime API. Change-Id: I9d4259f3842612c7cc98551269fc2092d8b5c19e	2020-03-31 00:26:53 -04:00
Maneesh Gupta	cbc3d1713f	Remove address_space(1) typecast and use __ockl_atomic_add_noret_f32 (#1956 ) * Remove address_space(1) typecast for ockl_global_atomic_add_f32 * use __ockl_atomic_add_noret_f32	2020-03-28 17:28:33 +05:30
Sameer Sahasrabuddhe	9a0c5d0653	enable HCC printf when using hip-clang This is cherry-picked from PR#1947 that was committed to the github repo. It allows printf to work with hip-clang and HCC runtime. Change-Id: I754753250ea1e694cf3441722e2d4c9d25fa75bc	2020-03-28 00:18:21 -04:00
Siu Chi Chan	43abf84f54	don't expose symbols from code_object_bundle (#1971 ) Change-Id: I56479485aad42c3d517fe6d9055be1cd846eeb00	2020-03-27 14:09:07 +05:30
Vladislav Sytchenko	e0187ba405	Add initial entry points for mipmapped array API Change-Id: Icd59cc7323ddcb6773da6105260415a1e6f4cdcb	2020-03-26 14:45:20 -04:00
Vladislav Sytchenko	2028b6eb29	Headers need to export C symbols for texture API This also adds declarations of all the missing texture APIs. hipTexRefSet*() functions need to take a textureReference as a ptr for type erasure to work. Runtime has been modified to accomodate this. This change only applies to VDI. Change-Id: Icf43cc5bd44dfc2c39084b7fe56d5a793bf7319f	2020-03-26 14:45:20 -04:00
Vladislav Sytchenko	ced0582a52	Set textureObject to nullptr This avoids dangling pointers for newly initiazlied textures Change-Id: Ia444b91fe17fd756ed583ec595ae1febbdfbd034	2020-03-26 14:45:20 -04:00
Vladislav Sytchenko	b09fe1280e	Correct typos in texture function declarations Change-Id: I492995e984eda2e8a5e806c5d4c9c78da09ac483	2020-03-26 12:43:17 -04:00
Sarbojit2019	5024f9057a	Fix for __usad issue (#1972 ) Fixes #1930	2020-03-26 17:09:44 +05:30
Benjamin Sherman	3d38135ae2	Add const qualifiers to HIP_vector_type unary arithmetic operators (#1965 ) Resolves issue #1960	2020-03-26 17:09:00 +05:30
Joseph Greathouse	f61b79d9a3	Fix cooperative launch APIs to set hipGetLastError (#1935 ) * Fix cooperative launch APIs to set hipGetLastError Previously, the cooperative launch APIs did not properly log their errors in the global hipGetLastError variable before returning back to the user. As such, the APIs would leave hipSuccess in the last error, which would break some use cases. This fixes that problem by making a trampoline function that does the HIP_INIT_API and ihipLogStatus. * Add missing flag to the log of multi-GPU launch	2020-03-25 14:39:24 -07:00
Nick Curtis	b4c69a2e4a	Update hip_runtime_api.h (#1966 ) Correct URL for deprecated api list	2020-03-23 10:16:24 -07:00
Vladislav Sytchenko	4829a7c215	Add support for creating typed buffers What Cuda refers to "linear texture memory" is the OpenCL equivalent of CL_MEM_OBJECT_IMAGE1D_BUFFER. For these types of allocations we should create a typed buffer instead of an image. Currently there is no check in the texture fetch functions as to what kind of SRD is written into the texture object, so any kind of incorrect programming will cause the TA to hang. Fortunately for us, every one writes correct code :) Change-Id: I80dab85a992f2c0754ebf303d40ac6b5e045c7c1	2020-03-18 18:15:17 -04:00
Vladislav Sytchenko	5429b40afe	Rework the texture C++ API Currently the texture C++ API is forwarded to the ihip*Impl() calls, which are not even a part of Cuda. These should be forwarded to their respective Cuda C APIs instead. This change also fixes a bug with hipUnbindTexture() creating a dangling pointer. Change-Id: Ifafc9d106855a11bec84a18ea214b3d89e39990d	2020-03-18 18:14:53 -04:00
Vladislav Sytchenko	3e460ab514	Correct the declaration of hipBindTexture2D() The texture reference needs to be passed as a constant pointer. Change-Id: Idde461f0f328ac87ce677b6bab3203161b514cbf	2020-03-18 18:08:23 -04:00

1 2 3 4 5 ...

921 Commits