Граф коммитов

701 Коммитов

Автор SHA1 Сообщение Дата
Maneesh Gupta 203dd6cb70 Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-06-06 16:07:22 +05:30
Maneesh Gupta 02ea7f13b3 Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
Add getTextureResourceDescriptor on NVCC
2018-06-06 15:12:11 +05:30
Maneesh Gupta de5043c47c Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
Map hipArray alloc flags on NVCC
2018-06-06 15:11:40 +05:30
Rahul Garg 17bb8dbe86 Add getTextureResourceDescriptor on NVCC 2018-06-05 18:46:25 +05:30
Rahul Garg 1a02bc364f Add integrated device property 2018-06-02 13:11:16 +05:30
Alex Voicu 8bfcb22c57 Rename for minimal confusion. 2018-06-01 22:55:33 +01:00
Alex Voicu 5e90c6a081 Missing __device__. 2018-06-01 19:48:36 +01:00
Alex Voicu ebd6d49cd2 Fix typos / address review comments. 2018-06-01 16:20:21 +01:00
Alex Voicu 417869821d Re-sync with upstream. 2018-06-01 15:49:05 +01:00
Rahul Garg d3621a2500 Map hipArray alloc flags on NVCC 2018-06-01 17:28:43 +05:30
Maneesh Gupta 68c36f9125 Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
Switch over to using native vector types, for better codegen. Remove noise.
2018-06-01 13:58:07 +05:30
Maneesh Gupta bdf2645713 Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
Feature use _Float16 and match CUDA __half behaviour.
2018-06-01 13:50:12 +05:30
Alex Voicu ab4b2a650b Re-sync with upstream. Add integer abs. 2018-05-31 16:38:00 +01:00
Alex Voicu b9b1e0b703 Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16 2018-05-31 15:27:31 +01:00
Alex Voicu 0108819e2a Add missing interop with volatile. Fix unit tests. 2018-05-31 15:27:12 +01:00
Alex Voicu 59db16fd36 Switch to using ROCDL directly, as opposed to via HC. Add missing bits. 2018-05-31 03:17:26 +01:00
Yaxun (Sam) Liu b7641cf835 Fix __syncthreads for hip-clang 2018-05-30 16:33:18 -04:00
Yaxun (Sam) Liu 6b8218e5a0 Add more function declarations for hip-clang 2018-05-30 06:51:03 -04:00
Maneesh Gupta a1fea02c6e Merge pull request #469 from yxsamliu/fun
Add math function declaration for hip-clang
2018-05-30 15:01:27 +05:30
Maneesh Gupta 7fd92d471b Add signature for hipFuncGetAttributes to nvcc_details
Change-Id: I06d5b50028fcfe3ea9d81f749b02be6b094c0977
2018-05-30 11:24:48 +05:30
Rahul Garg d67b614b22 Fix hipBindTexture on NVCC path 2018-05-30 10:59:07 +05:30
Maneesh Gupta c8aca45856 Merge pull request #471 from Jorghi12/patch-4
Adding double/long int signatures for abs
2018-05-30 08:31:32 +05:30
Alex Voicu 75390c7083 GCC only supports power-of-2 component counts, so round up to next square. 2018-05-29 01:22:26 +01:00
Rahul Garg 024f77ce61 Add 1d texture types for NVCC path 2018-05-28 15:02:06 +05:30
Jorghi12 6e2b9d0547 Update math_functions.h
CUDA also has a function named labs.
2018-05-26 16:22:10 -04:00
Alex Voicu 98395ad38c Add missing alias half / half2 aliases 2018-05-26 12:10:50 +01:00
Jorghi12 4f6904b5c7 Adding double/long int signatures for abs
Adding overloads for abs that are found in cuda's math_functions.
2018-05-26 00:41:24 -04:00
Yaxun (Sam) Liu b7801f1fe3 Add more function declarations to hip-clang 2018-05-25 16:18:46 -04:00
Alex Voicu 32388cbf28 Missing bits. 2018-05-25 20:12:21 +01:00
Yaxun (Sam) Liu f9693a9210 Add math function declaration for hip-clang 2018-05-25 12:22:16 -04:00
Alex Voicu 6f819f226b Missing commit. 2018-05-23 17:57:47 +01:00
Maneesh Gupta df3bb9fc32 Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes
Add support for the hipFuncGetAttributes interface.
2018-05-22 09:37:41 +05:30
Maneesh Gupta 661561eead Add hipMalloc3D to nvcc detail
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
2018-05-21 11:33:09 +05:30
founta 1a108ef7f3 defined hipPitchedPtr
Added a define for hipPitchedPtr to resolve a compiler error
2018-05-18 09:11:50 -04:00
Maneesh Gupta 03ac8e6a92 Merge pull request #433 from gargrahul/add_hipmemset3d
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Maneesh Gupta dcb4477f9f Merge pull request #440 from yxsamliu/assert2
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-18 14:13:27 +05:30
Alex Voicu b8c98d2357 Switch over to using native vector types, for better codegen. Remove noise. 2018-05-12 00:14:17 +01:00
Alex Voicu 13274ce559 Add support for the hipFuncGetAttributes interface. 2018-05-11 03:35:10 +01:00
Yaxun (Sam) Liu 19f3ed6f62 Fix warning about inlined function is not defined 2018-05-08 16:38:50 -04:00
Yaxun (Sam) Liu 7672b44c79 Add __assert_fail, __device_trap and hipErrorAssert for clang 2018-05-08 15:42:27 -04:00
Siu Chi Chan b285145966 add intrinsics mbcnt_lo, mbcnt_hi, lane_id 2018-05-08 13:43:53 -04:00
Rahul Garg da302c3e93 Added hipMemset3D 2018-05-07 10:24:30 +05:30
Rahul Garg b4585a2a90 Cleaned up 2018-05-02 12:54:17 +05:30
Rahul Garg 9de5f23d54 Fix texture 3D for HIP/NVCC 2018-05-02 11:56:37 +05:30
Maneesh Gupta adcd8553c3 Merge pull request #419 from gargrahul/fix_hip_gcc_host_build
Fix gcc build issues for host code
2018-05-01 12:42:50 +05:30
Maneesh Gupta e17ade3afb Merge pull request #415 from deven-amd/master
Checkin to fix bugs in math functions.
2018-05-01 12:29:03 +05:30
Maneesh Gupta caa13f98ed Merge pull request #410 from gargrahul/hipCtx_deprecation_warning
Mark hipCtx APIs as deprecated
2018-05-01 12:23:22 +05:30
Maneesh Gupta 365d155774 Merge pull request #406 from gargrahul/fix_tex1D_coordinate_type
Resolve tex1D float to int conversion issue
2018-05-01 09:18:11 +05:30
Rahul Garg 639001933c Fix gcc build issues for host code 2018-04-25 12:30:11 +05:30
Deven Desai af586bbbf2 Checkin to fix bugs in math functions.
This change fixes the following bugs that were discovered while debuggnig TF unit test failures (cwise_ops_test)

1. __hisinf and __hisnan routines
   Both had incorrect implementations.

2. abs
   A "long long" (64bit int) version was missing, resulting in the 32bit version being used for 64bit ints (which resulted in incorrect results, when the value passed in was outside the 32bit int range)

3. lgamma
  We seemed to have a custom version for the 'double' datatype (which was giving incorrect results). Replaced it with a call to the 'double' version of the underlying 'hc::precision_math::lgamma'
2018-04-24 18:10:07 +00:00