Gráfico de Commits

1476 Commits

Autor SHA1 Mensagem Data
Evgeny Mankov befa8dbbce Merge pull request #341 from emankov/hipBLAS
[HIPIFY][fix] CUDA and cuBLAS main headers correct handling
2018-01-24 18:09:12 +03:00
Evgeny Mankov 600d5d7c06 [HIPIFY][fix] CUDA and cuBLAS main headers correct handling 2018-01-23 23:43:36 +03:00
Evgeny Mankov c528f4f9c5 [HIPIFY][tests][win] Uncomment @echo off 2018-01-23 21:46:27 +03:00
Evgeny Mankov 368db8bf6c [HIPIFY][tests][win] Fix run_test.bat
All checks should not occur in input file for FileCheck. The issue found on CHECK-NOT.
Change removes all lit checks in the hipified file based on regexp, and the resulted stdout is fed as stdin for FileCheck.
2018-01-23 21:43:18 +03:00
Alex Voicu ed75522ba5 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco 2018-01-17 14:02:19 +00:00
Evgeny Mankov 9c0dc5cfe7 [HIPIFY][tests] remove concurentKernels.cu as it is one of CUDA SDK samples. 2018-01-16 20:41:08 +03:00
Evgeny Mankov 88c5ffee3a Merge pull request #319 from emankov/issue_211
[HIPIFY][fix][#211] Algorithm for explicit insert of hip include directive
2018-01-16 19:47:15 +03:00
Evgeny Mankov 51e0c0a981 Merge pull request #323 from emankov/cudaBuiltins
[HIPIFY][tests] Remove checks on cudaBuiltins
2018-01-16 19:46:41 +03:00
Evgeny Mankov c328e3987c Update headers_test_03.cu 2018-01-16 19:21:59 +03:00
Evgeny Mankov a784a9c824 Update headers_test_04.cu 2018-01-16 19:21:14 +03:00
Evgeny Mankov 74b489b5b2 [HIPIFY][tests] Remove checks on cudaBuiltins
As HIP has started to support vanilla CUDA syntax for threadIdx, blockIdx, blockDim and gridDim.
Other CUDA builtins are not tracked for now.
2018-01-16 17:13:29 +03:00
Evgeny Mankov 58faf1ddc9 [HIPIFY][tests] Add more suffixes to lit config 2018-01-16 16:40:31 +03:00
Maneesh Gupta c46967cdd8 Merge pull request #302 from phani544/nvccWarnings
[nvccWarnings] Fix -Wno-deprecated-declarations in hip_anyall and hip…
2018-01-16 12:16:51 +05:30
Maneesh Gupta 07ea65d2a7 Merge pull request #301 from gargrahul/fix_hipPeerToPeer_simple_singlegpu
Return pass on single gpu in hipPeerToPeer_simple
2018-01-16 12:16:33 +05:30
Maneesh Gupta d12de062af Merge pull request #312 from phani544/nvcctests4
[nvcc] Enable hipGetDeviceAttribute
2018-01-16 11:05:15 +05:30
Evgeny Mankov 09655a0853 [HIPIFY][fix][#211] Algorithm for explicit insert of hip include directive
If in source CUDA file main header (cuda_runtime.h or cuda.h) is not presented, corresponding HIP main header (hip_runtime.h) should be explicitly included in output hipified file.

[Algorithm]
1. If #pragma once is presented, HIP main header should be placed just after it;
2. Otherwise if any other (not CUDA main) header is presented, HIP main header should be placed just before it;
3. Otherwise HIP main header should be placed in the beginning of output file.

P.S.
There might be one more situation when #ifndef #define ... #endif guard for the entire file is presented (make sense for *.h, *.hpp, *.cuh files). In this case HIP main include should be placed just after such #ifdef, or after #pragma once, if it is also presented. This situation will be handled in a separate change.
2018-01-15 21:05:05 +03:00
emankov 6500cce6aa [HIPIFY][#311][fix] Get rid of socat in run_test.sh 2018-01-15 14:20:37 +03:00
Evgeny Mankov 5df3a9bb3b [HIPIFY][tests][win] CUDA samples root env. var is changes
Env. var NVCUDASAMPLES_ROOT is changed to NVCUDASAMPLESX_Y_ROOT where X - major ver, Y - minor ver.

Reason: NVCUDASAMPLES_ROOT contains path to CUDA SDK installed last, while NVCUDASAMPLESX_Y_ROOT contains samples of the same version as of CUDA_TOOLKIT_ROOT_DIR.
2018-01-12 17:15:37 +03:00
Evgeny Mankov f11cb396f0 Merge pull request #310 from emankov/win_testing
[HIPIFY][tests] Add Windows testing support
2018-01-12 16:41:56 +03:00
Evgeny Mankov fdc10f0856 [HIPIFY][tests] Add setlocal to batch script 2018-01-10 21:03:02 +03:00
Phaneendr-kumar Lanka e9f9456166 [nvcc] Enable hipGetDeviceAttribute 2018-01-10 10:51:01 +05:30
Phaneendr-kumar Lanka 5642a95dfa [nvccTests] Enable hipGetDeviceAttribute on nvcc 2018-01-10 10:36:25 +05:30
Evgeny Mankov 3d5b2ef5f6 [HIPIFY][tests] Add Windows testing support 2018-01-09 20:20:28 +03:00
Evgeny Mankov 7e7cfa10cc [HIPIFY][FIX][#306] Eliminate second cuda main include directive
// hipified to #include<hip/hip_runtime.h>
#include<cuda.h> // 1st cuda main include (Driver API)
// to eliminate
#include<cuda_runtime.h> // 2nd cuda main include (Runtime API)

HIP has one header hip_runtime.h for both CUDA APIs, thus second cuda main include directive is eliminated entirely.
2017-12-26 20:54:54 +03:00
Phaneendr-kumar Lanka 88374a8a63 [nvccTests] Enable hipPeerToPeer_simple on nvcc 2017-12-20 14:10:47 +05:30
Phaneendr-kumar Lanka bd60ef9031 [nvccWarnings] Fix -Wno-deprecated-declarations in hip_anyall and hip_ballot 2017-12-20 12:05:21 +05:30
Rahul Garg 9f62283bad Return pass on single gpu in hipPeerToPeer_simple 2017-12-20 09:36:00 +05:30
Maneesh Gupta cebb070d30 Implement hipStreamAddCallback
Change-Id: Ib851e4d86ba9c8406ca37b88162ea483ccbc9d36
2017-12-19 16:06:14 +05:30
Phaneendr-kumar Lanka 7cceebd9ed [nvccTests] Resubmit hipMemcpyDtoD & inline_asm_vadd 2017-12-18 14:46:19 +05:30
Alex Voicu e14ac87474 Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP into feature_purge_genco 2017-12-14 13:50:49 +00:00
Phaneendr-kumar Lanka eea7d495c7 [nvccWarnings] Fix warnings seen with dtests on nvcc path 2017-12-14 14:10:37 +05:30
Maneesh Gupta 26764971c5 Merge pull request #290 from gargrahul/fix_hipPeerToPeer_simple
Fixed hipPeerToPeer_simple test
2017-12-12 12:50:14 +05:30
Rahul Garg b32489700c Fixed hipPeerToPeer_simple test
- Moved test inside p2p dir
- Updated HIPCHECK to ignore hipErrorPeerAccessAlreadyEnabled
- Added check for mGPUs
2017-12-11 15:23:18 +05:30
Alex Voicu b842394957 This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose. 2017-12-08 04:22:57 +00:00
Rahul Garg 0782c52271 Fix hipGetDeviceAttribute dtest for HIP/NVCC 2017-12-06 15:49:06 +05:30
Ben Sander 62cbe22d50 Temporarily disable a couple tests pending some HCC work 2017-12-01 21:46:28 +00:00
Alex Voicu 32e11e7dc6 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
2017-11-29 21:49:10 +00:00
Alex Voicu fbaf729f88 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Alex Voicu dc67ca3feb Merge remote-tracking branch 'origin/master' into feature_use_module_based_dispatch_instead_of_pfe
# Conflicts:
#	src/hip_module.cpp
2017-11-28 17:29:11 +00:00
Ben Sander 0da0426f94 Merge pull request #256 from gargrahul/texture_driver_api_support
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Evgeny Mankov aa05b3d84e Merge pull request #262 from ChrisKitching/frontendaction
[HIPIFY] Mostly fix preprocessor-or-template induced issues
2017-11-27 17:30:11 +03:00
Rahul Garg 04bc5a1d1f Porting guides update for texture APIs usage 2017-11-24 12:00:55 +05:30
Alex Voicu 4131b47134 Modify the set component of the memcpy test (unclear why there is a memset component to begin with). 2017-11-21 17:52:01 +00:00
Alex Voicu c5f2b22d0d Re-sync with upstream. 2017-11-20 15:34:50 +00:00
Maneesh Gupta db378fbc9e Merge pull request #266 from gargrahul/fix_half2_gfx900
Fixed half2 issue on gfx900
2017-11-20 07:28:41 +05:30
Maneesh Gupta 1174534e85 Merge pull request #265 from phani544/nvccTests
[nvccTests]Enabled inline_asm_vadd on nvcc
2017-11-20 07:28:29 +05:30
Ben Sander 59956a57ca Fix test on cuda 2017-11-19 15:31:02 -06:00
Ben Sander 5a7a28ad29 Merge branch 'feature_natural_indexing' of https://github.com/AlexVlx/HIP 2017-11-19 15:25:17 -06:00
Ben Sander e0c3f684ae Temporarily disable P2P on nvidia (fails on dual GPU) 2017-11-19 15:21:37 -06:00
Alex Voicu cffd0e14eb This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility. 2017-11-19 01:54:12 +00:00