Maneesh Gupta
d945c85fa7
Merge pull request #388 from lmoriche/master
...
Add support for clang offload bundles and <<<>>> kernel launch.
2018-04-09 12:40:28 +05:30
Siu Chi Chan
5df923cdfa
adjust the hcc work week for the texture fix
2018-04-06 15:46:55 -04:00
Laurent Morichetti
7f03ff1012
Don't include __clang_cuda_builtin_vars.h
2018-04-05 16:27:05 -07:00
Laurent Morichetti
323caee8d2
Add support for clang offload bundles and <<<>>> kernel launch.
2018-04-04 13:32:33 -07:00
Rahul Garg
414cb24351
Fix texture driver api TRFS flags
2018-04-02 21:47:20 +05:30
Siu Chi Chan
53d9cce9f9
Change constant address space to 4
...
when compiling with a newer hcc. This is
due to a recent change to address space
mapping in the amdgpu compiler backend.
2018-03-26 18:13:59 -04:00
Laurent Morichetti
8f4c368dce
Merge remote-tracking branch 'upstream/master'
2018-03-21 11:17:52 -07:00
Laurent Morichetti
84a723b2dc
Add HIP_KERNEL_NAME/HIP_SYMBOL definitions for HIP-clang, and rename hipLaunch->hipLaunchByPtr.
2018-03-21 11:07:21 -07:00
Maneesh Gupta
9e47fccc89
Apply .clangformat to all repo source files
...
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Maneesh Gupta
eee7fa6072
Merge pull request #363 from gargrahul/surface_object_api
...
Added surface object support
2018-03-09 11:50:30 +05:30
Maneesh Gupta
1190a9e5d0
Merge pull request #288 from AlexVlx/feature_purge_genco
...
Purge hsagenco.sh
2018-03-05 09:59:56 +05:30
Rahul Garg
d2426e1b9a
Fixed byte offset issue
...
Added HIP/NVCC support
2018-03-04 19:05:37 +05:30
Laurent Morichetti
49f819ab9e
__CUDA__ and __HIP__ are both defined when compiling with 'clang -x hip', so make sure __HIP__ is not defined in the case of __HIP_PLATFORM_NVCC__.
2018-03-02 06:51:51 -08:00
Laurent Morichetti
3251d129ef
Use __HIP__ instead of __HIPCC__, and __CUDA__ instead of __CUDACC__
2018-02-28 14:20:55 -08:00
Laurent Morichetti
70dad80383
Fixes a build error with hcc
2018-02-28 14:12:09 -08:00
Laurent Morichetti
73dd7f0e05
Add initial support for clang
2018-02-28 12:31:26 -08:00
Rahul Garg
92283d24d0
Added surface object support
2018-02-26 11:59:03 +05:30
Alex Voicu
4fadfae944
Missing merge.
2018-02-12 20:21:37 +00:00
Alex Voicu
a704bd8b44
Re-sync with upstream.
2018-02-12 20:20:24 +00:00
Maneesh Gupta
647d1ba310
Merge pull request #321 from gargrahul/hipMemcpyArray_Functions
...
Added support for hipMemcpy Array functions-
2018-02-12 10:36:38 +05:30
Rahul Garg
b8c23f979b
Fixed host allocated globals address lookup for host usage
...
Fixed texture driver APIs failure
2018-01-30 18:06:31 +05:30
Maneesh Gupta
5eea5ea227
Merge branch 'master' into feature_purge_genco
2018-01-29 16:02:03 +05:30
Maneesh Gupta
8a98b2d665
Merge pull request #346 from scchan/fix_rhel_build
...
use assign rather than insert
2018-01-26 06:59:25 +05:30
Siu Chi Chan
6f90e4213f
use assign rather than insert
2018-01-25 16:51:29 +00:00
Kent Knox
85284dd48f
Fixing rocblas build failure
...
with ::Bundled_code_header constructor
Disabling hipPrintfKernel test from CI
2018-01-25 10:29:40 -06:00
Alex Voicu
09c704a2d0
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco
2018-01-17 14:02:19 +00:00
Rahul Garg
ca5bcb5af4
Added support for -
...
- hipMemcpyFromArray
- hipMemcpyAtoH
- hipMemcpyHtoA
2018-01-16 11:44:19 +05:30
Maneesh Gupta
c124899669
Merge pull request #282 from gargrahul/texture_driver_3d_support
...
Added support for 3D texture driver apis
2017-12-18 15:39:26 +05:30
Alex Voicu
4d0d4dc701
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco
2017-12-14 13:50:49 +00:00
Maneesh Gupta
574797cdff
Merge pull request #286 from gargrahul/fix_hipDeviceGetAttribute_nvcc
...
Fix hipGetDeviceAttribute dtest for HIP/NVCC
2017-12-12 12:49:23 +05:30
Maneesh Gupta
c4192eec7d
Merge pull request #285 from aaronenyeshi/fix-ilogb-unreachable
...
Fix ilogb/ilogbf functions to return int
2017-12-12 10:47:33 +05:30
Alex Voicu
4e0739c68a
This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose.
2017-12-08 04:22:57 +00:00
Rahul Garg
a62ef42c09
Fix hipGetDeviceAttribute dtest for HIP/NVCC
2017-12-06 15:49:06 +05:30
Aaron En Ye Shi
b439b45641
Fix ilogb/ilogbf functions to return int
...
This patch will fix hipDoublePrecisionMathDevice test on ThinLTO, which uncovered that hip math_function's ilogb/ilogbf should return type int instead of double. This will match rocdl.
2017-12-05 23:14:10 +00:00
Rahul Garg
105df94cd0
Added support for
...
- 3D texture driver APIs
- hipMalloc3D
- hipMemcpy3D for destination other than array
2017-12-05 14:11:13 +05:30
Alex Voicu
e186bd9533
This is primarily intended as an additional cleanup of the module functionality, in the aftermath of adopting module based dispatch. The main effort was associated with refactoring the questionable ihipModuleGetSymbol. It was quaintly written and misleading, in that it had little to do with getting symbols, and was exactly retrieving a kernel object. Error handling is modified so as to reduce branching depth. Functions which serve as interfaces to the HSA RT are moved in a separate helper header. Code object readers are properly deleted. Some leftover dead functionality pertaining to associating namespace scope variables with their allocated memory is removed. Executable loading is changed to use a string which holds the ELF image of the code object being loaded, thus avoiding some corner cases where using a istream would fail.
2017-12-03 23:09:06 +00:00
Alex Voicu
33bb425013
Fix legacy mode detection of the address of an agent allocated variable. In this mode, there exist two executables per each code object, one created by HCC and one created by HIP. Since we dispatch through HCC in legacy mode, we should obtain the address for an agent allocated variable from the latter's executable. Also add two omitted validity checks, whose absence could lead to segfaults when the current process had no .kernel section and / or when an invalid or empty blob was extracted from the latter.
2017-11-30 03:29:04 +00:00
Alex Voicu
2e39534377
Add missing space (the final frontier).
2017-11-29 21:50:43 +00:00
Alex Voicu
4966518846
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit 4792475
2017-11-29 21:49:10 +00:00
Alex Voicu
92e80f0943
Use a much simpler guard for version 1.6, which allows for direct CUDA indexing to be used.
2017-11-29 21:47:04 +00:00
Alex Voicu
4792475d01
Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions.
2017-11-29 21:01:28 +00:00
Alex Voicu
93e595c267
Fix compiler version check.
2017-11-29 03:05:53 +00:00
Alex Voicu
37c1811b2a
Fix oversight in selection mechanism which led to erroneous code to be compiled for the grid_launch_GGL component.
2017-11-29 01:37:52 +00:00
Alex Voicu
c6ab6f292b
Choose whether or not to use functional grid_launch based on the version of HCC used to compile.
2017-11-29 00:17:44 +00:00
Alex Voicu
d5c8de3f41
Change memset kernel to use memcpy instead of placement new. Simplify indexers.
2017-11-28 19:45:47 +00:00
Alex Voicu
6a0efb7ed2
Re-sync with upstream and re-factor platform global management for texture references.
2017-11-28 19:15:29 +00:00
Alex Voicu
d37a5a6008
Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
...
# Conflicts:
# src/hip_module.cpp
2017-11-28 17:29:11 +00:00
Ben Sander
e93a24bdbe
Merge pull request #256 from gargrahul/texture_driver_api_support
...
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Alex Voicu
dfa532db98
Remove leftover comment.
2017-11-22 19:37:03 +00:00
Rahul Garg
38029f2849
Fixed review comments
2017-11-21 21:19:06 +05:30