Maneesh Gupta
2ff55124fa
[dtests] Fix HIT block in hipPeerToPeer_simple.cpp
2018-02-14 09:13:50 +05:30
Phaneendr-kumar Lanka
5642a95dfa
[nvccTests] Enable hipGetDeviceAttribute on nvcc
2018-01-10 10:36:25 +05:30
Phaneendr-kumar Lanka
adb3bfa7fb
Merge branch 'master' into nvccTests3
2018-01-03 11:24:26 +05:30
Maneesh Gupta
acac5f6c6e
Merge pull request #304 from kknox/docker2
...
Upgrade to nvidia-docker2 for CUDA testing
2017-12-22 11:05:52 +05:30
Kent Knox
b1dc27b035
Disable CUDA build/test while upgrading to nvidia-docker2
...
This enables us to remove the driver version from startup string
2017-12-21 11:10:55 -06:00
Phaneendr-kumar Lanka
88374a8a63
[nvccTests] Enable hipPeerToPeer_simple on nvcc
2017-12-20 14:10:47 +05:30
Maneesh Gupta
1026e58ad9
Merge pull request #298 from emankov/master
...
[HIPIFY][cmake] Fix require_program function
2017-12-19 08:42:27 +05:30
Evgeny Mankov
56bffb5999
[HIPIFY][cmake] Fix require_program function
...
The require_program function fails to report "Can't find program" for any missing program except the first one, because the FOUND_PROGRAM value is cached.
Additionally:
+ Do not throw FATAL_ERROR on a missing program, so that the whole list of missing programs (if any) is reported.
+ Also report the location of each found program.
2017-12-18 19:57:04 +03:00
Maneesh Gupta
fae9842931
Merge pull request #297 from phani544/nvccTests3
...
[nvccTests] Resubmit hipMemcpyDtoD & inline_asm_vadd
2017-12-18 15:41:19 +05:30
Maneesh Gupta
814d55f046
Merge pull request #282 from gargrahul/texture_driver_3d_support
...
Added support for 3D texture driver apis
2017-12-18 15:39:26 +05:30
Phaneendr-kumar Lanka
7cceebd9ed
[nvccTests] Resubmit hipMemcpyDtoD & inline_asm_vadd
2017-12-18 14:46:19 +05:30
Maneesh Gupta
80ad20b893
Merge pull request #295 from mangupta/static_check
...
[cmake] Added target static_check for static code analysis
2017-12-15 10:25:58 +05:30
Maneesh Gupta
9af3e027b8
Merge pull request #294 from phani544/nvccWarnings
...
[nvccWarnings] Fix warnings seen with dtests on nvcc path
2017-12-14 15:58:38 +05:30
Phaneendr-kumar Lanka
eea7d495c7
[nvccWarnings] Fix warnings seen with dtests on nvcc path
2017-12-14 14:10:37 +05:30
Maneesh Gupta
7930c03e42
[cmake] Added target static_check for static code analysis
...
- Added static code analysis using cppcheck
- Fixed a couple of warnings reported by cppcheck
Change-Id: Ie3b9b06e95ada322e7dc2ce3d6b91831e870198d
2017-12-14 12:16:44 +05:30
Maneesh Gupta
2cdfef0f19
Merge pull request #293 from phani544/nvccWarnings
...
[nvccWarnings] Fix gpu deprecated targets warnings
2017-12-13 09:58:59 +05:30
Phaneendr-kumar Lanka
8a721b4a36
[nvccWarnings] Fix gpu deprecated targets warnings
2017-12-12 16:00:14 +05:30
Maneesh Gupta
26764971c5
Merge pull request #290 from gargrahul/fix_hipPeerToPeer_simple
...
Fixed hipPeerToPeer_simple test
2017-12-12 12:50:14 +05:30
Maneesh Gupta
6cb5e92d75
Merge pull request #286 from gargrahul/fix_hipDeviceGetAttribute_nvcc
...
Fix hipGetDeviceAttribute dtest for HIP/NVCC
2017-12-12 12:49:23 +05:30
Maneesh Gupta
6293d3f887
Merge pull request #285 from aaronenyeshi/fix-ilogb-unreachable
...
Fix ilogb/ilogbf functions to return int
2017-12-12 10:47:33 +05:30
Rahul Garg
b32489700c
Fixed hipPeerToPeer_simple test
...
- Moved test inside p2p dir
- Updated HIPCHECK to ignore hipErrorPeerAccessAlreadyEnabled
- Added check for mGPUs
2017-12-11 15:23:18 +05:30
Ben Sander
87c5ab8ca5
Merge pull request #283 from emankov/master
...
[HIPIFY] Disable cudaBuiltin matcher.
2017-12-07 12:38:02 -08:00
Evgeny Mankov
5d92a6c252
[HIPIFY] Remove cudaBuiltin matcher
2017-12-06 20:22:14 +03:00
Rahul Garg
0782c52271
Fix hipGetDeviceAttribute dtest for HIP/NVCC
2017-12-06 15:49:06 +05:30
Aaron En Ye Shi
707aaef99d
Fix ilogb/ilogbf functions to return int
...
This patch fixes the hipDoublePrecisionMathDevice test on ThinLTO, which uncovered that hip math_function's ilogb/ilogbf should return int instead of double. This matches rocdl.
2017-12-05 23:14:10 +00:00
Ben Sander
163b0f7978
Merge pull request #281 from mangupta/issue126
...
[samples] Adds a sample that shows using HIP with cmake
2017-12-05 11:42:11 -06:00
Evgeny Mankov
71d2fb20c8
[HIPIFY] Disable cudaBuiltin matcher.
...
Disabled because HIP has started to support vanilla CUDA syntax for threadIdx, blockIdx, blockDim and gridDim.
Other CUDA builtins are not tracked for now.
2017-12-05 20:28:51 +03:00
Evgeny Mankov
f24dfc6f36
[HIPIFY] remove duplicates from CUDA_IDENTIFIER_MAP
2017-12-05 19:46:53 +03:00
Evgeny Mankov
841dd69c90
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP
2017-12-05 18:19:15 +03:00
Maneesh Gupta
df4faf3b26
Merge pull request #278 from bensander/update_docs_and_square
...
Update docs and square
2017-12-05 15:57:36 +05:30
Rahul Garg
115c7f2b79
Added support for
...
- 3D texture driver APIs
- hipMalloc3D
- hipMemcpy3D for destination other than array
2017-12-05 14:11:13 +05:30
Maneesh Gupta
b3fb8ea92f
Simplify square sample's Makefile
...
Change-Id: I44349a880a3c57ca0e833d67d9c380b706655b1e
2017-12-05 11:54:50 +05:30
Maneesh Gupta
6d420faca2
[samples] Adds a sample that shows using HIP with cmake
...
Change-Id: Ief983ea0894d7b5d1ea46a755f9134dda0a1bb8f
2017-12-05 10:48:29 +05:30
Ben Sander
d62074e3fc
Merge pull request #279 from AlexVlx/feature_use_module_based_dispatch_instead_of_pfe
...
Clean up kernel retrieval code / module management
2017-12-03 18:41:27 -06:00
Ben Sander
2ae6882d4f
Merge pull request #276 from bensander/temp_disable_tests
...
Temporarily disable a couple tests pending some HCC work
2017-12-03 18:33:47 -06:00
Alex Voicu
5ca3176a91
Re-sync with upstream.
2017-12-03 23:14:58 +00:00
Alex Voicu
5127ce67e8
This is primarily intended as an additional cleanup of the module functionality, in the aftermath of adopting module-based dispatch. The main effort went into refactoring the questionable ihipModuleGetSymbol. It was quaintly written and misleading, in that it had little to do with getting symbols and was actually retrieving a kernel object. Error handling is modified to reduce branching depth. Functions which serve as interfaces to the HSA RT are moved into a separate helper header. Code object readers are properly deleted. Some leftover dead functionality pertaining to associating namespace scope variables with their allocated memory is removed. Executable loading is changed to use a string which holds the ELF image of the code object being loaded, thus avoiding some corner cases where using an istream would fail.
2017-12-03 23:09:06 +00:00
Ben Sander
6f3ed5e3cf
Merge pull request #271 from bensander/cppcheck
...
Cppcheck
2017-12-02 08:36:17 -06:00
Ben Sander
657e65733f
Remove coordinate conversions (threadIdx.x -> hipThreadIdx_x)
...
Since these are now supported directly.
2017-12-02 07:44:54 -06:00
Ben Sander
b9fa704521
Update square sample for recent HIP ease-of-use improvements
2017-12-02 07:44:27 -06:00
Ben Sander
5a642c1717
Merge pull request #277 from AlexVlx/feature_use_module_based_dispatch_instead_of_pfe
...
Fix hang by purging archaisms
2017-12-02 07:11:55 -06:00
Alex Voicu
02fc25de4c
Remove stray leftover comment.
2017-12-02 00:03:10 +00:00
Alex Voicu
6027d3f332
Replace archaic use of homebrew functionality with calls to the HC maths library. This fixes a hang observed when building hipTestDeviceDouble.
2017-12-02 00:01:47 +00:00
Ben Sander
03d1263a51
Update RELEASE.md and bugs to reflect HIP infra improvements.
2017-12-01 16:09:24 -06:00
Ben Sander
7d2675520a
Merge pull request #267 from AlexVlx/feature_sanitise_conversions
...
Fix curious bitwise conversions.
2017-12-01 15:51:08 -06:00
Ben Sander
62cbe22d50
Temporarily disable a couple tests pending some HCC work
2017-12-01 21:46:28 +00:00
Ben Sander
9bba97fdcc
Fix some cppcheck style issues.
2017-12-01 20:45:34 +00:00
Ben Sander
4313686d6e
Fix warning from default cppcheck.
2017-12-01 20:45:33 +00:00
Ben Sander
dac57ba854
Merge pull request #246 from AlexVlx/feature_use_module_based_dispatch_instead_of_pfe
...
Use natural dispatch syntax
2017-12-01 14:21:25 -06:00
Alex Voicu
7c0b9a005b
Fix legacy mode detection of the address of an agent allocated variable. In this mode, there exist two executables per code object, one created by HCC and one created by HIP. Since we dispatch through HCC in legacy mode, we should obtain the address for an agent allocated variable from the latter's executable. Also add two omitted validity checks, whose absence could lead to segfaults when the current process had no .kernel section and / or when an invalid or empty blob was extracted from the latter.
2017-11-30 03:29:04 +00:00