Commit Graph

659 Commits

Author SHA1 Message Date
Rahul Garg 976b9ed200 Cleaned up 2018-05-02 12:54:17 +05:30
Rahul Garg 1d76c48e3d Fix texture 3D for HIP/NVCC 2018-05-02 11:56:37 +05:30
Maneesh Gupta 2e6b46f308 Merge pull request #419 from gargrahul/fix_hip_gcc_host_build
Fix gcc build issues for host code
2018-05-01 12:42:50 +05:30
Maneesh Gupta af0c227df4 Merge pull request #415 from deven-amd/master
Checkin to fix bugs in math functions.
2018-05-01 12:29:03 +05:30
Maneesh Gupta f8431772a8 Merge pull request #410 from gargrahul/hipCtx_deprecation_warning
Mark hipCtx APIs as deprecated
2018-05-01 12:23:22 +05:30
Maneesh Gupta 885e51e1a5 Merge pull request #406 from gargrahul/fix_tex1D_coordinate_type
Resolve tex1D float to int conversion issue
2018-05-01 09:18:11 +05:30
Rahul Garg 720e209366 Fix gcc build issues for host code 2018-04-25 12:30:11 +05:30
Deven Desai 65a90c55e7 Checkin to fix bugs in math functions.
This change fixes the following bugs that were discovered while debugging TF unit test failures (cwise_ops_test):

1. __hisinf and __hisnan routines
   Both had incorrect implementations.

2. abs
   A "long long" (64-bit int) version was missing, so the 32-bit version was being used for 64-bit ints. This gave incorrect results whenever the value passed in was outside the 32-bit int range.

3. lgamma
  We seemed to have a custom version for the 'double' datatype (which was giving incorrect results). Replaced it with a call to the 'double' version of the underlying 'hc::precision_math::lgamma'
2018-04-24 18:10:07 +00:00
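The `abs` bug described in item 2 of the commit above can be sketched on the host in plain C++. This is only an illustration of the failure mode, not the HIP implementation: the helper names `truncate_to_i32`, `abs32_only`, `abs_via_32bit` and `abs64` are hypothetical, and the sketch assumes the missing overload caused the 64-bit argument to be narrowed to 32 bits before the absolute value was taken.

```cpp
#include <cstdint>

// Hypothetical helper: keep only the low 32 bits of a 64-bit value and
// reinterpret them as a signed 32-bit integer (two's complement).
constexpr std::int32_t truncate_to_i32(std::int64_t v) {
    return static_cast<std::uint32_t>(v) <= 0x7FFFFFFFu
               ? static_cast<std::int32_t>(static_cast<std::uint32_t>(v))
               : static_cast<std::int32_t>(
                     static_cast<std::int64_t>(static_cast<std::uint32_t>(v)) -
                     4294967296LL);
}

// Stand-in for an abs that exists only for 32-bit ints.
// (Like any such abs, it is not meaningful for INT32_MIN.)
constexpr std::int32_t abs32_only(std::int32_t x) { return x < 0 ? -x : x; }

// What effectively happens when a 64-bit value is routed to the 32-bit version.
constexpr std::int64_t abs_via_32bit(std::int64_t v) {
    return abs32_only(truncate_to_i32(v));
}

// A dedicated 64-bit version, analogous to the "long long" overload that was added.
constexpr std::int64_t abs64(std::int64_t v) { return v < 0 ? -v : v; }

// Inside the 32-bit range both paths agree.
static_assert(abs_via_32bit(-42) == 42, "small values survive narrowing");
static_assert(abs64(-42) == 42, "64-bit abs handles small values");

// Outside the 32-bit range only the 64-bit version is correct.
static_assert(abs64(-5000000000LL) == 5000000000LL, "64-bit abs is exact");
static_assert(abs_via_32bit(-5000000000LL) != 5000000000LL,
              "narrowing to 32 bits loses the high bits");
```

The checks are compile-time `static_assert`s, so the sketch verifies itself when compiled. The explicit modular truncation is used instead of a direct cast so that the behavior does not depend on the language version.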
Rahul Garg bc61260168 Moved deprecated macro to the end 2018-04-20 21:57:23 +05:30
Rahul Garg 89674c54a7 Mark hipCtx APIs as deprecated 2018-04-20 17:34:48 +05:30
Rahul Garg a898ccfb3f Resolve tex1D float to int conversion issue 2018-04-18 14:15:18 +05:30
Rahul Garg fcc0866681 Added hipMemset2DAsync support 2018-04-17 18:27:27 +05:30
Maneesh Gupta 2ab05a2069 Merge pull request #395 from lmoriche/master
Add missing __device__ function attributes.
2018-04-12 14:51:32 +05:30
Laurent Morichetti 8dcfbf5bee Add missing __device__ function attributes. 2018-04-11 09:29:37 -07:00
Maneesh Gupta d945c85fa7 Merge pull request #388 from lmoriche/master
Add support for clang offload bundles and <<<>>> kernel launch.
2018-04-09 12:40:28 +05:30
Siu Chi Chan 5df923cdfa adjust the hcc work week for the texture fix 2018-04-06 15:46:55 -04:00
Laurent Morichetti 7f03ff1012 Don't include __clang_cuda_builtin_vars.h 2018-04-05 16:27:05 -07:00
Laurent Morichetti 323caee8d2 Add support for clang offload bundles and <<<>>> kernel launch. 2018-04-04 13:32:33 -07:00
Rahul Garg 414cb24351 Fix texture driver api TRFS flags 2018-04-02 21:47:20 +05:30
Siu Chi Chan 53d9cce9f9 Change constant address space to 4
when compiling with a newer hcc.  This is
due to a recent change to address space
mapping in the amdgpu compiler backend.
2018-03-26 18:13:59 -04:00
Laurent Morichetti 8f4c368dce Merge remote-tracking branch 'upstream/master' 2018-03-21 11:17:52 -07:00
Laurent Morichetti 84a723b2dc Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename hipLaunch->hipLaunchByPtr. 2018-03-21 11:07:21 -07:00
Maneesh Gupta 9e47fccc89 Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Maneesh Gupta eee7fa6072 Merge pull request #363 from gargrahul/surface_object_api
Added surface object support
2018-03-09 11:50:30 +05:30
Maneesh Gupta 1190a9e5d0 Merge pull request #288 from AlexVlx/feature_purge_genco
Purge hsagenco.sh
2018-03-05 09:59:56 +05:30
Rahul Garg d2426e1b9a Fixed byte offset issue
Added HIP/NVCC support
2018-03-04 19:05:37 +05:30
Laurent Morichetti 49f819ab9e __CUDA__ and __HIP__ are both defined when compiling with 'clang -x hip', so make sure __HIP__ is not defined in the case of __HIP_PLATFORM_NVCC__. 2018-03-02 06:51:51 -08:00
Laurent Morichetti 3251d129ef Use __HIP__ instead of __HIPCC__, and __CUDA__ instead of __CUDACC__ 2018-02-28 14:20:55 -08:00
Laurent Morichetti 70dad80383 Fixes a build error with hcc 2018-02-28 14:12:09 -08:00
Laurent Morichetti 73dd7f0e05 Add initial support for clang 2018-02-28 12:31:26 -08:00
Rahul Garg 92283d24d0 Added surface object support 2018-02-26 11:59:03 +05:30
Alex Voicu 4fadfae944 Missing merge. 2018-02-12 20:21:37 +00:00
Alex Voicu a704bd8b44 Re-sync with upstream. 2018-02-12 20:20:24 +00:00
Maneesh Gupta 647d1ba310 Merge pull request #321 from gargrahul/hipMemcpyArray_Functions
Added support for hipMemcpy Array functions-
2018-02-12 10:36:38 +05:30
Rahul Garg b8c23f979b Fixed host allocated globals address lookup for host usage
Fixed texture driver APIs failure
2018-01-30 18:06:31 +05:30
Maneesh Gupta 5eea5ea227 Merge branch 'master' into feature_purge_genco 2018-01-29 16:02:03 +05:30
Maneesh Gupta 8a98b2d665 Merge pull request #346 from scchan/fix_rhel_build
use assign rather than insert
2018-01-26 06:59:25 +05:30
Siu Chi Chan 6f90e4213f use assign rather than insert 2018-01-25 16:51:29 +00:00
Kent Knox 85284dd48f Fixing rocblas build failure
with ::Bundled_code_header constructor

Disabling hipPrintfKernel test from CI
2018-01-25 10:29:40 -06:00
Alex Voicu 09c704a2d0 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco 2018-01-17 14:02:19 +00:00
Rahul Garg ca5bcb5af4 Added support for -
- hipMemcpyFromArray
- hipMemcpyAtoH
- hipMemcpyHtoA
2018-01-16 11:44:19 +05:30
Maneesh Gupta c124899669 Merge pull request #282 from gargrahul/texture_driver_3d_support
Added support for 3D texture driver apis
2017-12-18 15:39:26 +05:30
Alex Voicu 4d0d4dc701 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco 2017-12-14 13:50:49 +00:00
Maneesh Gupta 574797cdff Merge pull request #286 from gargrahul/fix_hipDeviceGetAttribute_nvcc
Fix hipGetDeviceAttribute dtest for HIP/NVCC
2017-12-12 12:49:23 +05:30
Maneesh Gupta c4192eec7d Merge pull request #285 from aaronenyeshi/fix-ilogb-unreachable
Fix ilogb/ilogbf functions to return int
2017-12-12 10:47:33 +05:30
Alex Voicu 4e0739c68a This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose. 2017-12-08 04:22:57 +00:00
Rahul Garg a62ef42c09 Fix hipGetDeviceAttribute dtest for HIP/NVCC 2017-12-06 15:49:06 +05:30
Aaron En Ye Shi b439b45641 Fix ilogb/ilogbf functions to return int
This patch will fix hipDoublePrecisionMathDevice test on ThinLTO, which uncovered that hip math_function's ilogb/ilogbf should return type int instead of double. This will match rocdl.
2017-12-05 23:14:10 +00:00
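As a host-side analogy for the return-type fix in the commit above, the standard C++ library already draws the same distinction: `std::ilogb` returns `int`, while `std::logb` returns a floating-point value. The sketch below uses only the standard `<cmath>` functions, not the HIP device functions, and the wrapper name `exponent_of` is hypothetical.

```cpp
#include <cmath>
#include <type_traits>

// std::ilogb yields the unbiased exponent as an int, for double and float alike.
static_assert(std::is_same<decltype(std::ilogb(8.0)), int>::value,
              "ilogb(double) returns int");
static_assert(std::is_same<decltype(std::ilogb(8.0f)), int>::value,
              "ilogb(float) returns int");

// std::logb yields the same exponent, but as a floating-point value.
static_assert(std::is_same<decltype(std::logb(8.0)), double>::value,
              "logb(double) returns double");

// Hypothetical wrapper: declaring the result as int matches std::ilogb,
// so no floating-point to integer conversion happens at the call site.
inline int exponent_of(double x) { return std::ilogb(x); }
```

A wrapper declared to return `double` here would still compile, but every caller expecting an `int` would go through an implicit conversion, which is the kind of mismatch the commit removes.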
Rahul Garg 105df94cd0 Added support for
- 3D texture driver APIs
- hipMalloc3D
- hipMemcpy3D for destination other than array
2017-12-05 14:11:13 +05:30
Alex Voicu e186bd9533 This is primarily intended as an additional cleanup of the module functionality, in the aftermath of adopting module based dispatch. The main effort was associated with refactoring the questionable ihipModuleGetSymbol. It was quaintly written and misleading, in that it had little to do with getting symbols, and was in fact retrieving a kernel object. Error handling is modified so as to reduce branching depth. Functions which serve as interfaces to the HSA RT are moved into a separate helper header. Code object readers are properly deleted. Some leftover dead functionality pertaining to associating namespace scope variables with their allocated memory is removed. Executable loading is changed to use a string which holds the ELF image of the code object being loaded, thus avoiding some corner cases where using an istream would fail. 2017-12-03 23:09:06 +00:00