rocm-systems

Автор	SHA1	Сообщение	Дата
Maneesh Gupta	203dd6cb70	Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math Switch to using ROCDL directly, as opposed to via HC. Add missing bits.	2018-06-06 16:07:22 +05:30
Maneesh Gupta	02ea7f13b3	Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc Add getTextureResourceDescriptor on NVCC	2018-06-06 15:12:11 +05:30
Maneesh Gupta	de5043c47c	Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc Map hipArray alloc flags on NVCC	2018-06-06 15:11:40 +05:30
Rahul Garg	17bb8dbe86	Add getTextureResourceDescriptor on NVCC	2018-06-05 18:46:25 +05:30
Rahul Garg	1a02bc364f	Add integrated device property	2018-06-02 13:11:16 +05:30
Alex Voicu	8bfcb22c57	Rename for minimal confusion.	2018-06-01 22:55:33 +01:00
Alex Voicu	5e90c6a081	Missing __device__.	2018-06-01 19:48:36 +01:00
Alex Voicu	ebd6d49cd2	Fix typos / address review comments.	2018-06-01 16:20:21 +01:00
Alex Voicu	417869821d	Re-sync with upstream.	2018-06-01 15:49:05 +01:00
Rahul Garg	d3621a2500	Map hipArray alloc flags on NVCC	2018-06-01 17:28:43 +05:30
Maneesh Gupta	68c36f9125	Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types Switch over to using native vector types, for better codegen. Remove noise.	2018-06-01 13:58:07 +05:30
Maneesh Gupta	bdf2645713	Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16 Feature use _Float16 and match CUDA __half behaviour.	2018-06-01 13:50:12 +05:30
Alex Voicu	ab4b2a650b	Re-sync with upstream. Add integer abs.	2018-05-31 16:38:00 +01:00
Alex Voicu	b9b1e0b703	Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16	2018-05-31 15:27:31 +01:00
Alex Voicu	0108819e2a	Add missing interop with volatile. Fix unit tests.	2018-05-31 15:27:12 +01:00
Alex Voicu	59db16fd36	Switch to using ROCDL directly, as opposed to via HC. Add missing bits.	2018-05-31 03:17:26 +01:00
Yaxun (Sam) Liu	b7641cf835	Fix __syncthreads for hip-clang	2018-05-30 16:33:18 -04:00
Yaxun (Sam) Liu	6b8218e5a0	Add more function declarations for hip-clang	2018-05-30 06:51:03 -04:00
Maneesh Gupta	a1fea02c6e	Merge pull request #469 from yxsamliu/fun Add math function declaration for hip-clang	2018-05-30 15:01:27 +05:30
Maneesh Gupta	7fd92d471b	Add signature for hipFuncGetAttributes to nvcc_details Change-Id: I06d5b50028fcfe3ea9d81f749b02be6b094c0977	2018-05-30 11:24:48 +05:30
Rahul Garg	d67b614b22	Fix hipBindTexture on NVCC path	2018-05-30 10:59:07 +05:30
Maneesh Gupta	c8aca45856	Merge pull request #471 from Jorghi12/patch-4 Adding double/long int signatures for abs	2018-05-30 08:31:32 +05:30
Alex Voicu	75390c7083	GCC only supports power-of-2 component counts, so round up to next square.	2018-05-29 01:22:26 +01:00
Rahul Garg	024f77ce61	Add 1d texture types for NVCC path	2018-05-28 15:02:06 +05:30
Jorghi12	6e2b9d0547	Update math_functions.h CUDA also has a function named labs.	2018-05-26 16:22:10 -04:00
Alex Voicu	98395ad38c	Add missing alias half / half2 aliases	2018-05-26 12:10:50 +01:00
Jorghi12	4f6904b5c7	Adding double/long int signatures for abs Adding overloads for abs that are found in cuda's math_functions.	2018-05-26 00:41:24 -04:00
Yaxun (Sam) Liu	b7801f1fe3	Add more function declarations to hip-clang	2018-05-25 16:18:46 -04:00
Alex Voicu	32388cbf28	Missing bits.	2018-05-25 20:12:21 +01:00
Yaxun (Sam) Liu	f9693a9210	Add math function declaration for hip-clang	2018-05-25 12:22:16 -04:00
Alex Voicu	6f819f226b	Missing commit.	2018-05-23 17:57:47 +01:00
Maneesh Gupta	df3bb9fc32	Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes Add support for the hipFuncGetAttributes interface.	2018-05-22 09:37:41 +05:30
Maneesh Gupta	661561eead	Add hipMalloc3D to nvcc detail Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149	2018-05-21 11:33:09 +05:30
founta	1a108ef7f3	defined hipPitchedPtr Added a define for hipPitchedPtr to resolve a compiler error	2018-05-18 09:11:50 -04:00
Maneesh Gupta	03ac8e6a92	Merge pull request #433 from gargrahul/add_hipmemset3d Added hipMemset3D	2018-05-18 14:54:15 +05:30
Maneesh Gupta	dcb4477f9f	Merge pull request #440 from yxsamliu/assert2 Add __assert_fail, __device_trap and hipErrorAssert for clang	2018-05-18 14:13:27 +05:30
Alex Voicu	b8c98d2357	Switch over to using native vector types, for better codegen. Remove noise.	2018-05-12 00:14:17 +01:00
Alex Voicu	13274ce559	Add support for the hipFuncGetAttributes interface.	2018-05-11 03:35:10 +01:00
Yaxun (Sam) Liu	19f3ed6f62	Fix warning about inlined function is not defined	2018-05-08 16:38:50 -04:00
Yaxun (Sam) Liu	7672b44c79	Add __assert_fail, __device_trap and hipErrorAssert for clang	2018-05-08 15:42:27 -04:00
Siu Chi Chan	b285145966	add intrinsics mbcnt_lo, mbcnt_hi, lane_id	2018-05-08 13:43:53 -04:00
Rahul Garg	da302c3e93	Added hipMemset3D	2018-05-07 10:24:30 +05:30
Rahul Garg	b4585a2a90	Cleaned up	2018-05-02 12:54:17 +05:30
Rahul Garg	9de5f23d54	Fix texture 3D for HIP/NVCC	2018-05-02 11:56:37 +05:30
Maneesh Gupta	adcd8553c3	Merge pull request #419 from gargrahul/fix_hip_gcc_host_build Fix gcc build issues for host code	2018-05-01 12:42:50 +05:30
Maneesh Gupta	e17ade3afb	Merge pull request #415 from deven-amd/master Checkin to fix bugs in math functions.	2018-05-01 12:29:03 +05:30
Maneesh Gupta	caa13f98ed	Merge pull request #410 from gargrahul/hipCtx_deprecation_warning Mark hipCtx APIs as deprecated	2018-05-01 12:23:22 +05:30
Maneesh Gupta	365d155774	Merge pull request #406 from gargrahul/fix_tex1D_coordinate_type Resolve tex1D float to int conversion issue	2018-05-01 09:18:11 +05:30
Rahul Garg	639001933c	Fix gcc build issues for host code	2018-04-25 12:30:11 +05:30
Deven Desai	af586bbbf2	Checkin to fix bugs in math functions. This change fixes the following bugs that were discovered while debuggnig TF unit test failures (cwise_ops_test) 1. __hisinf and __hisnan routines Both had incorrect implementations. 2. abs A "long long" (64bit int) version was missing, resulting in the 32bit version being used for 64bit ints (which resulted in incorrect results, when the value passed in was outside the 32bit int range) 3. lgamma We seemed to have a custom version for the 'double' datatype (which was giving incorrect results). Replaced it with a call to the 'double' version of the underlying 'hc::precision_math::lgamma'	2018-04-24 18:10:07 +00:00

1 2 3 4 5 ...

701 Коммитов