Commit graph

3724 Commits

Author SHA1 Message Date
Maneesh Gupta 75abc65e12 [dtests] Fix complex_loading_behavior.cpp build issues on nvcc path (#1242) 2019-07-24 07:49:39 +00:00
Aaron Enye Shi 0de4caa085 Fix hipMemcpy-size test running out of Host Mem (#1224)
* Fix hipMemcpy-size test running out of Host Mem

The hipMemcpy-size test uses a maxElem calculated from the total GPU memory / 8. It then allocates 4 times that amount of host memory. The test begins failing when there is not enough host memory, such as on systems with 32 GB of GPU memory and 16 GB of RAM. This change fixes the test when not enough host memory is available on the system.

* Add windows support to hipMemcpy-size fix

* avoid linking extra libs for windows

* HIPMemcpy-size Remove freeCPU including swap
2019-07-24 07:49:20 +00:00
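The arithmetic described in the commit above can be illustrated with a minimal sketch. It is not the test's actual code: the function name `capped_max_bytes`, its parameters, and the choice to cap the size (rather than skip the test) are assumptions made for illustration only.

```python
GIB = 1024 ** 3

def capped_max_bytes(gpu_total_bytes, host_free_bytes, host_copies=4):
    """Hypothetical helper: largest buffer size that fits both limits."""
    # Size derived from GPU memory, as the commit message describes (total / 8).
    wanted = gpu_total_bytes // 8
    # The host has to hold `host_copies` buffers of that size (4 in the message).
    host_limit = host_free_bytes // host_copies
    return min(wanted, host_limit)

# A 32 GiB GPU asks for 4 GiB buffers, i.e. 16 GiB of host memory in total.
# With only 12 GiB of free host RAM the size is capped at 3 GiB instead.
print(capped_max_bytes(32 * GIB, 12 * GIB) // GIB)
```

On a machine with 16 GB of RAM the uncapped request equals the entire installed memory, which is consistent with the failure the commit message reports.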
Evgeny Mankov cde3d75fa4 Merge pull request #1249 from emankov/cuDNN
[HIPIFY][doc] Update Readme.md
2019-07-23 19:35:19 +03:00
Evgeny Mankov b8e4f93b5c [HIPIFY][doc] Update Readme.md
+ 8.0.1 - is the latest supported clang version on Linux
+ Testing update
2019-07-23 19:30:57 +03:00
Evgeny Mankov 4204c99cc5 Merge pull request #1248 from emankov/cuDNN
[HIPIFY] Clang 8.0.1 support
2019-07-23 19:20:12 +03:00
Evgeny Mankov 91e461fcf2 [HIPIFY] Clang 8.0.1 support
+ Tested on Windows and Linux;
+ Provide patch for clang's bug 38811;
+ Update Readme.md accordingly.

P.S.
With the next 9.0.0 release, patches for Windows won't be needed, because all fixes will be there.
2019-07-23 19:18:17 +03:00
Evgeny Mankov 543da8e8ae Merge pull request #1244 from emankov/cuDNN
[HIPIFY][#1239] Fix an assert
2019-07-22 22:18:58 +03:00
Evgeny Mankov d19fcbcde4 [HIPIFY][#1239] Fix an assert 2019-07-22 22:13:54 +03:00
Evgeny Mankov 866cc4e646 Merge pull request #1241 from emankov/cuDNN
[HIPIFY][#941][fix] Workaround for an assert on forbidden option "--"
2019-07-19 20:25:19 +03:00
Evgeny Mankov f5d6bb8267 [HIPIFY][#941][fix] Workaround for an assert on forbidden option "--"
+ Fixes the following assert in debug version:
Assertion failed: (S.empty() || S[0] != '-') && "Option can't start with '-", file C:\GIT\LLVM\trunk-for-submits\llvm-project\llvm\lib\Support\CommandLine.cpp, line 440
+ DashDash option left declared in order to be listed in help.
2019-07-19 20:23:17 +03:00
Yaxun (Sam) Liu d7aea38d74 Temporarily disable maxTexture1D in hipDeviceProp_t for VDI (#1237) 2019-07-19 15:38:32 +00:00
gilbertlee-amd 8ff7a99a4c Fix Typo HipHostMemory to HipHostMalloc (#1236) 2019-07-19 04:46:07 +00:00
Maneesh Gupta 888ccbc205 [hit] Workaround for %cc and %cxx mappings. (#1233)
* [hit] Workaround for %cc and %cxx mappings.

HIP CMakeLists.txt modifies CMAKE_C_COMPILER and CMAKE_CXX_COMPILER.
This messes up any dtests that want to test against cc/c++.

So hardcode %cc to /usr/bin/cc and %cxx to /usr/bin/c++ for now till
we come up with a better solution.

Change-Id: I7dce93ce8360191e612a94e3a735e5612ac27ab5

* [hit] Add auto-variable %hip-path to syntax for BUILD_CMD

Change-Id: Id097a183fbce2b2c9691d0180d3304dd17a4e016
2019-07-19 04:45:56 +00:00
cdevadas d5dba47804 Increased the number of implicit-kernarg bytes to 56 (#1217) 2019-07-19 04:45:34 +00:00
ansurya fa4d6b353a [HIP][Tests] Added new testcases for Module API (#1150)
* [HIP][tests] New testcases for module api

* [HIP][Tests]Support for CUDA devices

* Updated tests as per latest master & test GetGlobal to work on all platforms
2019-07-19 04:45:20 +00:00
wkwchau 38254caf7a Fixed bug in determining max block size in hipOccupancyMaxPotentialBlockSize (#1235) 2019-07-18 03:19:29 +00:00
ansurya 8e496c09d9 Add Max Texture 1D,2D,3D device properties (#1226)
* Add Max Texture 1D,2D,3D device properties

* Corrected testcase to use enums defined in hipDeviceAttribute_t

* Added texture 1D,2D and 3D support for NVIDIA path
2019-07-18 03:18:50 +00:00
ansurya 6b6258b84f UChar and UShort textures as Normalized Float (#1213)
* UChar and UShort textures as Normalized Float

* UChar and UShort textures as Normalized Float for all float variants

* Handled uninitialized texture format value
2019-07-18 03:18:28 +00:00
ansurya 84f8441146 HIPCommander code cleanup (#1207)
* HIPCommander code cleanup

* Removed unused header files

* Removed empty ifdef
2019-07-18 03:18:15 +00:00
Rahul Garg 1dcf618d20 Fix HIP_VISIBLE_DEVICES order (#1184)
* Fix HIP_VISIBLE_DEVICES order

* Fix device IDs mismatch

* Fix review comments: loop order and device range check

* Handle incomplete VISIBLE device env variable

* Revert "Handle incomplete VISIBLE device env variable"
2019-07-18 03:18:04 +00:00
Aryan Salmanpour 034201f658 [hip] fix a bug where we parse kernel's arguments layout for a given kernel multiple times (#1232) 2019-07-17 07:29:07 +00:00
Evgeny Mankov b1fad7915b [HIPIFY][bash] Make 'find' sorted by extension (#1231)
+ Source files are the first to go. It is needed for in-place hipification in order to avoid errors with included but already hipified header files.
+ More extensions support for batch processing.
2019-07-17 07:28:54 +00:00
Maneesh Gupta 8a28afeb84 [dtests] Fix typo in hipAPIStreamDisable.cpp (#1227)
Change-Id: Ib88a6be49ba192578129bb88ad09817839981398
2019-07-17 07:28:38 +00:00
Maneesh Gupta f7bb4f233c [dtests] Temporarily disable failing tests (#1218)
Change-Id: I7bb36a964c635bfbdf67484f3454d8fff7168eb2
2019-07-17 07:28:21 +00:00
Evgeny Mankov 7a87fc6c04 Merge pull request #1220 from emankov/master
[HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR
2019-07-16 18:15:44 +03:00
Evgeny Mankov ee2332602e Merge pull request #1223 from emankov/cuDNN
[HIPIFY][doc] Update README.md: the latest cuDNN 7.6.1.34 is supported
2019-07-12 18:51:07 +03:00
Evgeny Mankov e5fcb5f964 [HIPIFY][doc] Update README.md: the latest cuDNN 7.6.1.34 is supported 2019-07-12 18:49:54 +03:00
Evgeny Mankov 09162d9a53 [HIP] Fix segfault on uninitialized struct members in hipArrayCreate and hipArray3DCreate 2019-07-12 16:38:26 +03:00
Evgeny Mankov 2b9cf03586 Merge pull request #1222 from emankov/perl
[HIPIFY][#1221] Add regex replacements for include refs with path separators while generating hipify-perl
2019-07-12 16:20:40 +03:00
Evgeny Mankov 6b4a1fe1db [HIPIFY][#1221] Add regex replacements for include refs with path separators while generating hipify-perl 2019-07-12 16:18:00 +03:00
kpyzhov e6a8d1daee Added gfx906 kernel binary to the HIPFB file for hipModule test. (#1215) 2019-07-12 03:13:45 +00:00
Yaxun (Sam) Liu 2b5ff1678a Add amdgpu targets gfx1010 and gfx1012 (#1211) 2019-07-12 03:13:35 +00:00
Evgeny Mankov c7117df91b [HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR
[Reason] To be compatible with CUDA [#1133]

Update HIP code, hipify-clang, tests and docs

[TODO] Add support of the corresponding functions on nvcc fallback path
2019-07-11 14:58:16 +03:00
mhbliao a16cd0c69a Add missing compound assignment operators. (#1216) 2019-07-11 09:16:15 +00:00
Jatin Chaudhary 3e213d94e1 Adding fix to initialize a dim3 variable from blockIdx. (#1212)
Adding gitignore for ctags
2019-07-11 09:16:03 +00:00
Maneesh Gupta 797510614e Revert "[ci] Disable hipLaunchParm.tst for now" (#1087) 2019-07-11 00:24:23 +00:00
Aryan Salmanpour f21601733e [hip][samples] fix a typo in module_api/defaultDriver sample (#1210) 2019-07-10 03:05:19 +00:00
Aryan Salmanpour e87b5a1e8a Fix a compilation error when templated operator<< fails if 'U' type is std::ostream. (#1209) 2019-07-10 03:05:04 +00:00
Maneesh Gupta b5150c8bc7 [ci] Enable tests on ROCm 2.6 (#1208) 2019-07-10 03:04:52 +00:00
darkbuck c8b76a6a63 Fix even more typos. (#1204) 2019-07-08 11:00:51 +00:00
Jatin Chaudhary 5ed16432f8 Adding bounds check before hipMemset (#1190)
* Adding bounds check in ihipMemset

* Adding ihipMemPtrGetInfo to hipMemPtrGetInfo
2019-07-08 11:00:38 +00:00
Evgeny Mankov b5a77a6cd8 Merge pull request #1206 from emankov/master
[HIPIFY] Statistics to CSV file dumping revise
2019-07-07 20:58:16 +03:00
Evgeny Mankov 3b40e53cff [HIPIFY] Statistics to CSV file dumping revise
+ Add option -print-stats-csv to dump statistics to CSV file
+ If -o-dir is specified, CSV file will be dumped there
+ Generate 1 summary file sum_stat.csv in case of multiple sources
2019-07-07 20:55:17 +03:00
searlmc1 a9870f0465 Fix typo ( T in Vec3_cmp = needs to be int ). (#1203)
Typo introduced here:
commit 67abac1365
Author: Alex Voicu <alexandru.voicu@amd.com>
Date:   Mon Jun 24 20:02:09 2019 -0500

    Put 3-wide vector types on a ketogenic diet. (#1180)
2019-07-04 00:44:26 +00:00
kpyzhov b6b749f714 Change the type of hipTextureObject_t to pointer to a struct. (#1199)
This is necessary to allow Runtime to perform required texture buffer handling.
2019-07-04 00:44:06 +00:00
Jannik 2a1b0ba27d Fix call to cuda library (#1197)
* Remove flags parameter from hipOccupancyMaxPotentialBlockSize

This commit makes the hipOccupancyMaxPotentialBlockSize method consistent with hcc path and the CUDA API.
2019-07-04 00:43:22 +00:00
Aryan Salmanpour 999f45fc11 [hip] Move _criticalData of ihipStream_t class to private section and use criticalData() to access it (#1177) 2019-07-04 00:42:19 +00:00
Maneesh Gupta 502a734ebf Fix sample module_api_global for hip-clang (#1201)
module_api_global relies on an HCC-only feature which allows host code
to write to device variables. This feature does not exist in CUDA
or hip-clang, so the sample does not work in CUDA or hip-clang.

This patch fixes the sample by using standard features of CUDA and
hip-clang. The fixed sample works in HCC, CUDA and hip-clang.
2019-07-03 08:52:19 +00:00
Yaxun (Sam) Liu 85c3b903df Add __hip_pinned_shadow__ for hip-clang and fix texture reference tests (#1200) 2019-07-03 08:51:59 +00:00
Maneesh Gupta d57946275b Added missing NULL checks and corrected API return values (#1188)
* Added missing NULL checks and corrected API return values as per validation

* Added missing NULL checks
2019-07-03 08:51:39 +00:00