Gráfico de commits

641 Commits

Autor SHA1 Mensaje Fecha
Rahul Garg 414cb24351 Fix texture driver api TRFS flags 2018-04-02 21:47:20 +05:30
Siu Chi Chan 53d9cce9f9 Change constant address space to 4
when compiling with a newer hcc.  This is
due to a recent change to address space
mapping in the amdgpu compiler backend.
2018-03-26 18:13:59 -04:00
Laurent Morichetti 8f4c368dce Merge remote-tracking branch 'upstream/master' 2018-03-21 11:17:52 -07:00
Laurent Morichetti 84a723b2dc Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename hipLaunch->hipLaunchByPtr. 2018-03-21 11:07:21 -07:00
Maneesh Gupta 9e47fccc89 Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Maneesh Gupta eee7fa6072 Merge pull request #363 from gargrahul/surface_object_api
Added surface object support
2018-03-09 11:50:30 +05:30
Maneesh Gupta 1190a9e5d0 Merge pull request #288 from AlexVlx/feature_purge_genco
Purge hsagenco.sh
2018-03-05 09:59:56 +05:30
Rahul Garg d2426e1b9a Fixed byte offset issue
Added HIP/NVCC support
2018-03-04 19:05:37 +05:30
Laurent Morichetti 49f819ab9e __CUDA__ and __HIP__ are both defined when compiling with 'clang -x hip', so make sure __HIP__ is not defined in the case of __HIP_PLATFORM_NVCC__. 2018-03-02 06:51:51 -08:00
Laurent Morichetti 3251d129ef Use __HIP__ instead of __HIPCC__, and __CUDA__ instead of __CUDACC__ 2018-02-28 14:20:55 -08:00
Laurent Morichetti 70dad80383 Fixes a build error with hcc 2018-02-28 14:12:09 -08:00
Laurent Morichetti 73dd7f0e05 Add initial support for clang 2018-02-28 12:31:26 -08:00
Rahul Garg 92283d24d0 Added surface object support 2018-02-26 11:59:03 +05:30
Alex Voicu 4fadfae944 Missing merge. 2018-02-12 20:21:37 +00:00
Alex Voicu a704bd8b44 Re-sync with upstream. 2018-02-12 20:20:24 +00:00
Maneesh Gupta 647d1ba310 Merge pull request #321 from gargrahul/hipMemcpyArray_Functions
Added support for hipMemcpy Array functions-
2018-02-12 10:36:38 +05:30
Rahul Garg b8c23f979b Fixed host allocated globals address lookup for host usage
Fixed texture driver APIs failure
2018-01-30 18:06:31 +05:30
Maneesh Gupta 5eea5ea227 Merge branch 'master' into feature_purge_genco 2018-01-29 16:02:03 +05:30
Maneesh Gupta 8a98b2d665 Merge pull request #346 from scchan/fix_rhel_build
use assign rather than insert
2018-01-26 06:59:25 +05:30
Siu Chi Chan 6f90e4213f use assign rather than insert 2018-01-25 16:51:29 +00:00
Kent Knox 85284dd48f Fixing rocblas build failure
with ::Bundled_code_header constructor

Disabling hipPrintfKernel test from CI
2018-01-25 10:29:40 -06:00
Alex Voicu 09c704a2d0 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco 2018-01-17 14:02:19 +00:00
Rahul Garg ca5bcb5af4 Added support for -
- hipMemcpyFromArray
- hipMemcpyAtoH
- hipMemcpyHtoA
2018-01-16 11:44:19 +05:30
Maneesh Gupta c124899669 Merge pull request #282 from gargrahul/texture_driver_3d_support
Added support for 3D texture driver apis
2017-12-18 15:39:26 +05:30
Alex Voicu 4d0d4dc701 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco 2017-12-14 13:50:49 +00:00
Maneesh Gupta 574797cdff Merge pull request #286 from gargrahul/fix_hipDeviceGetAttribute_nvcc
Fix hipGetDeviceAttribute dtest for HIP/NVCC
2017-12-12 12:49:23 +05:30
Maneesh Gupta c4192eec7d Merge pull request #285 from aaronenyeshi/fix-ilogb-unreachable
Fix ilogb/ilogbf functions to return int
2017-12-12 10:47:33 +05:30
Alex Voicu 4e0739c68a This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose. 2017-12-08 04:22:57 +00:00
Rahul Garg a62ef42c09 Fix hipGetDeviceAttribute dtest for HIP/NVCC 2017-12-06 15:49:06 +05:30
Aaron En Ye Shi b439b45641 Fix ilogb/ilogbf functions to return int
This patch will fix hipDoublePrecisionMathDevice test on ThinLTO, which uncovered that hip math_function's ilogb/ilogbf should return type int instead of double. This will match rocdl.
2017-12-05 23:14:10 +00:00
Rahul Garg 105df94cd0 Added support for
- 3D texture driver APIs
- hipMalloc3D
- hipMemcpy3D for destination other than array
2017-12-05 14:11:13 +05:30
Alex Voicu e186bd9533 This is primarily intended as an additional cleanup of the module functionality, in the aftermath of adopting module based dispatch. The main effort was associated with refactoring the questionable ihipModuleGetSymbol. It was quaintly written and misleading, in that it had little to do with getting symbols, and was exactly retrieving a kernel object. Error handling is modified so as to reduce branching depth. Functions which serve as interfaces to the HSA RT are moved in a separate helper header. Code object readers are properly deleted. Some leftover dead functionality pertaining to associating namespace scope variables with their allocated memory is removed. Executable loading is changed to use a string which holds the ELF image of the code object being loaded, thus avoiding some corner cases where using a istream would fail. 2017-12-03 23:09:06 +00:00
Alex Voicu 33bb425013 Fix legacy mode detection of the address of an agent allocated variable. In this mode, there exist two executables per each code object, one created by HCC and one created by HIP. Since we dispatch through HCC in legacy mode, we should obtain the address for an agent allocated variable from the latter's executable. Also add two omitted validity checks, whose absence could lead to segfaults when the current process had no .kernel section and / or when an invalid or empty blob was extracted from the latter. 2017-11-30 03:29:04 +00:00
Alex Voicu 2e39534377 Add missing space (the final frontier). 2017-11-29 21:50:43 +00:00
Alex Voicu 4966518846 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit 4792475
2017-11-29 21:49:10 +00:00
Alex Voicu 92e80f0943 Use a much simpler guard for version 1.6, which allows for direct CUDA indexing to be used. 2017-11-29 21:47:04 +00:00
Alex Voicu 4792475d01 Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions. 2017-11-29 21:01:28 +00:00
Alex Voicu 93e595c267 Fix compiler version check. 2017-11-29 03:05:53 +00:00
Alex Voicu 37c1811b2a Fix oversight in selection mechanism which led to erroneous code to be compiled for the grid_launch_GGL component. 2017-11-29 01:37:52 +00:00
Alex Voicu c6ab6f292b Choose whether or not to use functional grid_launch based on the version of HCC used to compile. 2017-11-29 00:17:44 +00:00
Alex Voicu d5c8de3f41 Change memset kernel to use memcpy instead of placement new. Simplify indexers. 2017-11-28 19:45:47 +00:00
Alex Voicu 6a0efb7ed2 Re-sync with upstream and re-factor platform global management for texture references. 2017-11-28 19:15:29 +00:00
Alex Voicu d37a5a6008 Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
# Conflicts:
#	src/hip_module.cpp
2017-11-28 17:29:11 +00:00
Ben Sander e93a24bdbe Merge pull request #256 from gargrahul/texture_driver_api_support
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Alex Voicu dfa532db98 Remove leftover comment. 2017-11-22 19:37:03 +00:00
Rahul Garg 38029f2849 Fixed review comments 2017-11-21 21:19:06 +05:30
Rahul Garg 24307fe5c4 Changed function hipMemcpy_2D to hipMemcpyParam2D 2017-11-21 12:36:24 +05:30
Alex Voicu f8c1c1b38e Refactor the __device__ versions of memset and memcpy to be less awkward i.e. not return nullptr as opposed to the destination pointer (it can only be assumed it was done for maximum confusion) and actually unroll as they claim to. Change all of the {to, from}Symbol functions to use hipModuleGetGlobal, as opposed to hc::accelerator::get_symbol_address which is no longer valid with module based dispatch. 2017-11-21 02:40:34 +00:00
Alex Voicu 27bc487635 Correct ill-formed merge in earlier commit and adjust for differences with the new CUDA natural indexing mechanism. 2017-11-20 16:33:52 +00:00
Alex Voicu 30d90dab38 Re-sync with upstream. 2017-11-20 15:34:50 +00:00