Maneesh Gupta
203dd6cb70
Merge pull request #482 from ROCm-Developer-Tools/feature_clean_up_hip_math
...
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-06-06 16:07:22 +05:30
Maneesh Gupta
02ea7f13b3
Merge pull request #496 from gargrahul/add_gettexresdesc_nvcc
...
Add getTextureResourceDescriptor on NVCC
2018-06-06 15:12:11 +05:30
Maneesh Gupta
de5043c47c
Merge pull request #487 from gargrahul/fix_hiparray_alloc_flag_nvcc
...
Map hipArray alloc flags on NVCC
2018-06-06 15:11:40 +05:30
Rahul Garg
17bb8dbe86
Add getTextureResourceDescriptor on NVCC
2018-06-05 18:46:25 +05:30
Rahul Garg
1a02bc364f
Add integrated device property
2018-06-02 13:11:16 +05:30
Alex Voicu
8bfcb22c57
Rename for minimal confusion.
2018-06-01 22:55:33 +01:00
Alex Voicu
5e90c6a081
Missing __device__.
2018-06-01 19:48:36 +01:00
Alex Voicu
ebd6d49cd2
Fix typos / address review comments.
2018-06-01 16:20:21 +01:00
Alex Voicu
417869821d
Re-sync with upstream.
2018-06-01 15:49:05 +01:00
Rahul Garg
d3621a2500
Map hipArray alloc flags on NVCC
2018-06-01 17:28:43 +05:30
Maneesh Gupta
68c36f9125
Merge pull request #447 from ROCm-Developer-Tools/feature_native_vector_types
...
Switch over to using native vector types, for better codegen. Remove noise.
2018-06-01 13:58:07 +05:30
Maneesh Gupta
bdf2645713
Merge pull request #466 from ROCm-Developer-Tools/feature_use_Float16
...
Feature use _Float16 and match CUDA __half behaviour.
2018-06-01 13:50:12 +05:30
Alex Voicu
ab4b2a650b
Re-sync with upstream. Add integer abs.
2018-05-31 16:38:00 +01:00
Alex Voicu
b9b1e0b703
Merge branch 'feature_use_Float16' of https://github.com/ROCm-Developer-Tools/HIP into feature_use_Float16
2018-05-31 15:27:31 +01:00
Alex Voicu
0108819e2a
Add missing interop with volatile. Fix unit tests.
2018-05-31 15:27:12 +01:00
Alex Voicu
59db16fd36
Switch to using ROCDL directly, as opposed to via HC. Add missing bits.
2018-05-31 03:17:26 +01:00
Yaxun (Sam) Liu
b7641cf835
Fix __syncthreads for hip-clang
2018-05-30 16:33:18 -04:00
Yaxun (Sam) Liu
6b8218e5a0
Add more function declarations for hip-clang
2018-05-30 06:51:03 -04:00
Maneesh Gupta
a1fea02c6e
Merge pull request #469 from yxsamliu/fun
...
Add math function declaration for hip-clang
2018-05-30 15:01:27 +05:30
Maneesh Gupta
7fd92d471b
Add signature for hipFuncGetAttributes to nvcc_details
...
Change-Id: I06d5b50028fcfe3ea9d81f749b02be6b094c0977
2018-05-30 11:24:48 +05:30
Rahul Garg
d67b614b22
Fix hipBindTexture on NVCC path
2018-05-30 10:59:07 +05:30
Maneesh Gupta
c8aca45856
Merge pull request #471 from Jorghi12/patch-4
...
Adding double/long int signatures for abs
2018-05-30 08:31:32 +05:30
Alex Voicu
75390c7083
GCC only supports power-of-2 component counts, so round up to next square.
2018-05-29 01:22:26 +01:00
Rahul Garg
024f77ce61
Add 1d texture types for NVCC path
2018-05-28 15:02:06 +05:30
Jorghi12
6e2b9d0547
Update math_functions.h
...
CUDA also has a function named labs.
2018-05-26 16:22:10 -04:00
Alex Voicu
98395ad38c
Add missing alias half / half2 aliases
2018-05-26 12:10:50 +01:00
Jorghi12
4f6904b5c7
Adding double/long int signatures for abs
...
Adding overloads for abs that are found in cuda's math_functions.
2018-05-26 00:41:24 -04:00
Yaxun (Sam) Liu
b7801f1fe3
Add more function declarations to hip-clang
2018-05-25 16:18:46 -04:00
Alex Voicu
32388cbf28
Missing bits.
2018-05-25 20:12:21 +01:00
Yaxun (Sam) Liu
f9693a9210
Add math function declaration for hip-clang
2018-05-25 12:22:16 -04:00
Alex Voicu
6f819f226b
Missing commit.
2018-05-23 17:57:47 +01:00
Maneesh Gupta
df3bb9fc32
Merge pull request #445 from ROCm-Developer-Tools/feature_func_attributes
...
Add support for the hipFuncGetAttributes interface.
2018-05-22 09:37:41 +05:30
Maneesh Gupta
661561eead
Add hipMalloc3D to nvcc detail
...
Change-Id: I8a5654066ed1504e3b05eddbbdebf05fd52aa149
2018-05-21 11:33:09 +05:30
founta
1a108ef7f3
defined hipPitchedPtr
...
Added a define for hipPitchedPtr to resolve a compiler error
2018-05-18 09:11:50 -04:00
Maneesh Gupta
03ac8e6a92
Merge pull request #433 from gargrahul/add_hipmemset3d
...
Added hipMemset3D
2018-05-18 14:54:15 +05:30
Maneesh Gupta
dcb4477f9f
Merge pull request #440 from yxsamliu/assert2
...
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-18 14:13:27 +05:30
Alex Voicu
b8c98d2357
Switch over to using native vector types, for better codegen. Remove noise.
2018-05-12 00:14:17 +01:00
Alex Voicu
13274ce559
Add support for the hipFuncGetAttributes interface.
2018-05-11 03:35:10 +01:00
Yaxun (Sam) Liu
19f3ed6f62
Fix warning about inlined function is not defined
2018-05-08 16:38:50 -04:00
Yaxun (Sam) Liu
7672b44c79
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-08 15:42:27 -04:00
Siu Chi Chan
b285145966
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-08 13:43:53 -04:00
Rahul Garg
da302c3e93
Added hipMemset3D
2018-05-07 10:24:30 +05:30
Rahul Garg
b4585a2a90
Cleaned up
2018-05-02 12:54:17 +05:30
Rahul Garg
9de5f23d54
Fix texture 3D for HIP/NVCC
2018-05-02 11:56:37 +05:30
Maneesh Gupta
adcd8553c3
Merge pull request #419 from gargrahul/fix_hip_gcc_host_build
...
Fix gcc build issues for host code
2018-05-01 12:42:50 +05:30
Maneesh Gupta
e17ade3afb
Merge pull request #415 from deven-amd/master
...
Checkin to fix bugs in math functions.
2018-05-01 12:29:03 +05:30
Maneesh Gupta
caa13f98ed
Merge pull request #410 from gargrahul/hipCtx_deprecation_warning
...
Mark hipCtx APIs as deprecated
2018-05-01 12:23:22 +05:30
Maneesh Gupta
365d155774
Merge pull request #406 from gargrahul/fix_tex1D_coordinate_type
...
Resolve tex1D float to int conversion issue
2018-05-01 09:18:11 +05:30
Rahul Garg
639001933c
Fix gcc build issues for host code
2018-04-25 12:30:11 +05:30
Deven Desai
af586bbbf2
Checkin to fix bugs in math functions.
...
This change fixes the following bugs that were discovered while debuggnig TF unit test failures (cwise_ops_test)
1. __hisinf and __hisnan routines
Both had incorrect implementations.
2. abs
A "long long" (64bit int) version was missing, resulting in the 32bit version being used for 64bit ints (which resulted in incorrect results, when the value passed in was outside the 32bit int range)
3. lgamma
We seemed to have a custom version for the 'double' datatype (which was giving incorrect results). Replaced it with a call to the 'double' version of the underlying 'hc::precision_math::lgamma'
2018-04-24 18:10:07 +00:00