Граф коммитов

3358 Коммитов

Автор SHA1 Сообщение Дата
Rahul Garg 7629cdd2cf Fix HIP_VISIBLE_DEVICES order (#1184)
* Fix HIP_VISIBLE_DEVICES order

* Fix device IDs mismatch

* Fix review comments- loop order and device range check

* Handle incomplete VISIBLE device env variable

* Revert "Handle incomplete VISIBLE device env variable"


[ROCm/hip commit: 1dcf618d20]
2019-07-18 03:18:04 +00:00
Aryan Salmanpour 774f778352 [hip] fix a bug where we parse kernel's arguments layout for a given kernel multiple times (#1232)
[ROCm/hip commit: 034201f658]
2019-07-17 07:29:07 +00:00
Evgeny Mankov d883d24616 [HIPIFY][bash] Make 'find' sorted by extension (#1231)
+ Source files are the first to go. It is needed for in-place hipification in order to avoid errors with included but already hipified header files.
+ More extensions support for batch processing.

[ROCm/hip commit: b1fad7915b]
2019-07-17 07:28:54 +00:00
Maneesh Gupta 7f41814190 [dtests] Fix typo in hipAPIStreamDisable.cpp (#1227)
Change-Id: Ib88a6be49ba192578129bb88ad09817839981398

[ROCm/hip commit: 8a28afeb84]
2019-07-17 07:28:38 +00:00
Maneesh Gupta d4040e8e9c [dtests] Temporarily disable failing tests (#1218)
Change-Id: I7bb36a964c635bfbdf67484f3454d8fff7168eb2

[ROCm/hip commit: f7bb4f233c]
2019-07-17 07:28:21 +00:00
Evgeny Mankov 1ccb6ea65e Merge pull request #1220 from emankov/master
[HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR

[ROCm/hip commit: 7a87fc6c04]
2019-07-16 18:15:44 +03:00
Evgeny Mankov 4af0161c34 [HIPIFY][doc] Update README.md: the latest cuDNN 7.6.1.34 is supported
[ROCm/hip commit: e5fcb5f964]
2019-07-12 18:49:54 +03:00
Evgeny Mankov d11063e64c [HIP] Fix segfault on uninitialized struct members in hipArrayCreate and hipArray3DCreate
[ROCm/hip commit: 09162d9a53]
2019-07-12 16:38:26 +03:00
Evgeny Mankov 33a5eca319 [HIPIFY][#1221] Add regex replacements for include refs with path separators while generating hipify-perl
[ROCm/hip commit: 6b4a1fe1db]
2019-07-12 16:18:00 +03:00
kpyzhov 5b1910d140 Added gfx906 kernel binary to the HIPFB file for hipModule test. (#1215)
[ROCm/hip commit: e6a8d1daee]
2019-07-12 03:13:45 +00:00
Yaxun (Sam) Liu 296d1606f2 Add amdgpu targets gfx1010 and gfx1012 (#1211)
[ROCm/hip commit: 2b5ff1678a]
2019-07-12 03:13:35 +00:00
Evgeny Mankov 96801f7b3a [HIP][HIPIFY] Split HIP_ARRAY_DESCRIPTOR struct to HIP_ARRAY_DESCRIPTOR and HIP_ARRAY3D_DESCRIPTOR
[Reason] To be compatible with CUDA [#1133]

Update HIP code, hipify-clang, tests and docs

[TODO] Add support of the corresponding functions on nvcc fallback path


[ROCm/hip commit: c7117df91b]
2019-07-11 14:58:16 +03:00
mhbliao ffa0aaaf20 Add missing compound assignment operators. (#1216)
[ROCm/hip commit: a16cd0c69a]
2019-07-11 09:16:15 +00:00
Jatin Chaudhary 047139cc38 Adding fix to initialize a dim3 variable from blockIdx. (#1212)
Adding gitignore for ctags

[ROCm/hip commit: 3e213d94e1]
2019-07-11 09:16:03 +00:00
Maneesh Gupta 4bd481c762 Revert "[ci] Disable hipLaunchParm.tst for now" (#1087)
[ROCm/hip commit: 797510614e]
2019-07-11 00:24:23 +00:00
Aryan Salmanpour 706d958297 [hip][samples] fix a typo in module_api/defaultDriver sample (#1210)
[ROCm/hip commit: f21601733e]
2019-07-10 03:05:19 +00:00
Aryan Salmanpour d46430b450 Fix a compilation error when templated operator<< fails if 'U' type is std::ostream. (#1209)
[ROCm/hip commit: e87b5a1e8a]
2019-07-10 03:05:04 +00:00
Maneesh Gupta 6b29461f65 [ci] Enable tests on ROCm 2.6 (#1208)
[ROCm/hip commit: b5150c8bc7]
2019-07-10 03:04:52 +00:00
darkbuck 8368ca472d Fix even more typo. (#1204)
[ROCm/hip commit: c8b76a6a63]
2019-07-08 11:00:51 +00:00
Jatin Chaudhary c7f8ffe41e Adding bounds check before hipMemset (#1190)
* Adding bounds check in ihipMemset

* Adding ihipMemPtrGetInfo to hipMemPtrGetInfo


[ROCm/hip commit: 5ed16432f8]
2019-07-08 11:00:38 +00:00
Evgeny Mankov 5e8190aa9c [HIPIFY] Statistics to CSV file dumping revise
+ Add option -print-stats-csv to dump statistics to CSV file
+ If -o-dir is specified, CSV file will be dumped there
+ Generate 1 summary file sum_stat.csv in case of multiple sources


[ROCm/hip commit: 3b40e53cff]
2019-07-07 20:55:17 +03:00
searlmc1 a82b834fc4 Fix typo ( T in Vec3_cmp = needs to be int ). (#1203)
Typo introduced here:
commit e474469213
Author: Alex Voicu <alexandru.voicu@amd.com>
Date:   Mon Jun 24 20:02:09 2019 -0500

    Put 3-wide vector types on a ketogenic diet. (#1180)

[ROCm/hip commit: a9870f0465]
2019-07-04 00:44:26 +00:00
kpyzhov e1c47e773c Change the type of hipTextureObject_t to pointer to a struct. (#1199)
This is necessary to allow Runtime to perform required texture buffer handling.

[ROCm/hip commit: b6b749f714]
2019-07-04 00:44:06 +00:00
Jannik caf1f4fbb4 Fix call to cuda library (#1197)
* Remove flags parameter from hipOccupancyMaxPotentialBlockSize

This commit makes the hipOccupancyMaxPotentialBlockSize method consistent with hcc path and the CUDA API.


[ROCm/hip commit: 2a1b0ba27d]
2019-07-04 00:43:22 +00:00
Aryan Salmanpour a2655fd90a [hip] Move _criticalData of ihipStream_t class to private section and use criticalData() to access it (#1177)
[ROCm/hip commit: 999f45fc11]
2019-07-04 00:42:19 +00:00
Maneesh Gupta 01ee58e0c7 Fix sample module_api_global for hip-clang (#1201)
module_api_global relies on a HCC only feature which allows host code
to write to device variables. This feature does not exist in CUDA
or hip-clang, which causes the sample not working in CUDA or hip-clang.

This patch fixes the sample by using standard features of CUDA and
hip-clang. The fixed sample works in HCC, CUDA and hip-clang.

[ROCm/hip commit: 502a734ebf]
2019-07-03 08:52:19 +00:00
Yaxun (Sam) Liu 4d2f0de651 Add __hip_pinned_shadow__ for hip-clang and fix texture reference tests (#1200)
[ROCm/hip commit: 85c3b903df]
2019-07-03 08:51:59 +00:00
Maneesh Gupta b7fe02394f Added missing NULL checks and corrected API return values (#1188)
* Added missing NULL checks and corrected API return values as per validation

* Added missing NULL checks


[ROCm/hip commit: d57946275b]
2019-07-03 08:51:39 +00:00
Jatin Chaudhary 9484eb6831 Adding new unroll example (#1187)
[ROCm/hip commit: 3152af0175]
2019-07-03 08:51:19 +00:00
Yaxun Sam Liu afa71d710c Fix sample module_api_global for hip-clang
module_api_global relies on a HCC only feature which allows host code
to write to device variables. This feature does not exist in CUDA
or hip-clang, which causes the sample not working in CUDA or hip-clang.

This patch fixes the sample by using standard features of CUDA and
hip-clang. The fixed sample works in HCC, CUDA and hip-clang.


[ROCm/hip commit: 60e1733afe]
2019-07-02 16:36:53 -04:00
eshcherb 9a1e96321e hip_api_name fixed to be inline to suppress not used warning (#1194)
[ROCm/hip commit: e72893a771]
2019-06-30 03:38:43 +00:00
Evgeny Mankov eeb66048f3 [HIPIFY] Clang 7.1.0 support
+ Provide patch for clang's bug 38811
+ Update Readme.md accordingly


[ROCm/hip commit: 36cda83066]
2019-06-29 17:23:18 +03:00
Anusha Godavarthy Surya 7eedb4d047 Added missing NULL checks
[ROCm/hip commit: 7a4c3f61c8]
2019-06-27 20:19:30 +05:30
Anusha Godavarthy Surya f1d6b56fc4 Added missing NULL checks and corrected API return values as per validation
[ROCm/hip commit: 3d5f6be1c7]
2019-06-27 00:19:05 +05:30
Maneesh Gupta c1b0d625d7 Add dependency on amd_comgr in hip-config-*.cmake.in (#1179)
Change-Id: Ia12a40cd5c89920a50ec714ceae99f60259a6f66

[ROCm/hip commit: 573978625b]
2019-06-26 15:18:11 +05:30
wkwchau 3c963cc0e1 Fixed bug in hipOccupancyMaxPotentialBlockSize for the SGPRs limitation of gfx8 devices (#1176)
[ROCm/hip commit: 47f16264ed]
2019-06-26 15:18:00 +05:30
Maneesh Gupta 7874e84d3d [packaging] Add dependency on comgr to hip_hcc package (#1182)
[ROCm/hip commit: d4a92519eb]
2019-06-25 06:32:46 +05:30
Aaron Enye Shi 52614c2729 Fix dlpi_name info empty when using GCC on ub18 (#1181)
This fixes a bug where GCC++ on Ubuntu 18.04 creates failing executables compared to GCC++ on 16.04 and clang++. While creating function names on Ubuntu 18.04, dl_phdr_info seems to provide a non-zero value for dlpi_addr on initial iteration, and an empty string in dlpi_name. This is causing failure when linking with g++, since the empty string prevents the kernel function from being loaded. Clang++ and GCC on UB16 provide a zero value for dlpi_addr. To fix this, we need to verify both addr and name exists, so that /proc/self/exe can be properly loaded.

[ROCm/hip commit: 77bef86949]
2019-06-25 06:32:29 +05:30
Alex Voicu e474469213 Put 3-wide vector types on a ketogenic diet. (#1180)
* Put 3-wide vector types on a ketogenic diet.

* Remove needless include.

* Do not be narrow-minded.

* Do not be narrow-minded.

* Put the C people on a diet too.


[ROCm/hip commit: 67abac1365]
2019-06-25 06:32:09 +05:30
Evgeny Mankov 7cb92aeb9a [HIPIFY] Fix multiple input files support
Reported in #1168


[ROCm/hip commit: db6571ae75]
2019-06-20 18:05:57 +03:00
Aryan Salmanpour 45fa752888 [hip] implement the hipExtLaunchMultiKernelMultiDevice API (#1165)
* [hip] implement the hipExtLaunchMultiKernelMultiDevice API

* add a guard to check the HCC version for acquire_locked_hsa_queue() API which was introdued in HCC for ROCm 2.5

* modified code based on the requested changes

* changes to lock all streams before launching kernels for each device and unlock them after the dispatches

* check each stream to be valid before starting to lock all the streams


[ROCm/hip commit: 96dc74897d]
2019-06-20 05:59:05 +05:30
wkwchau 40bd111519 Implement the hipOccupancyMaxPotentialBlockSize function (#1162)
* Implement the hipOccupancyMaxPotentialBlockSize function

* Replaced hipGetDeviceProperties() call by ihipGetDeviceProperties() in ihipOccupancyMaxPotentialBlockSize()

* Add test for hipOccupancyMaxPotentialBlockSize in Module API

* Added extern declaration for ihipGetDeviceProperties() to be accessed inside ihipOccupancyMaxPotentialBlockSize()

* fixed hipOccupancyMaxPotentialBlockSize test build issue

* Fix hipOccupancyMaxPotentialBlockSize dtest

* Add BUILD_CMD in hipOccupancyMaxPotentialBlockSize dtest

* Revert "Add BUILD_CMD in hipOccupancyMaxPotentialBlockSize dtest"

This reverts commit 0480ff56f1441fc515d2c26ce33783e303423938.

* Disable hipOccupancyMaxPotentialBlockSize dtest on NVCC

* move extern declaration of ihipGetDeviceProperties to hip_module.cpp

* Update the limiation of 32 wavefronts per CU and 800/512 SGPRs for VI/pre-VI chips to calculate the occupancy


[ROCm/hip commit: d492f1fd6b]
2019-06-20 05:58:29 +05:30
Maneesh Gupta 57e0c02e28 Merge pull request #1164 from gargrahul/cleanup_module_api_samples
Add cleanup code in module api samples

[ROCm/hip commit: ba323cdef1]
2019-06-20 05:57:46 +05:30
Maneesh Gupta 2238e6671e Merge pull request #1173 from icarus-sparry/icarus-sparry-patch-1
Make hip_prof_gen.py compatible with both python 2 and 3

[ROCm/hip commit: a8f8c8994d]
2019-06-19 13:37:02 +05:30
Maneesh Gupta 2a41dc2a64 Merge pull request #1171 from mhbliao/hliao/master/hipcc
[hipcc] Revise include path calculation.

[ROCm/hip commit: 6b83b6bfca]
2019-06-19 13:36:51 +05:30
Maneesh Gupta cd72288515 Merge pull request #1167 from eshcherb/hip_prof_refactoring_190611
prof layer includes refactoring

[ROCm/hip commit: fc074d96d0]
2019-06-19 13:36:33 +05:30
Maneesh Gupta f16a25dd64 Merge pull request #1163 from gargrahul/fix_module_api_kernarg_sample
Fix sample to use kernelargs for launch

[ROCm/hip commit: 0ad0d06930]
2019-06-19 13:36:17 +05:30
Icarus Sparry b5a26215cc Make hip_prof_gen.py compatible with both python 2 and 3
Convert python 2 constructs to python 3 compatible ones.

In python 3, print is a function, so use write methods (which are always functions) instead.

In python3 keys() returns an iterator, rather than a list. This means you can't change the data structure that is being iterated over. Converting this iterator into a list mimics the python 2 behavior.

[ROCm/hip commit: cc374b2bd3]
2019-06-17 12:03:36 -07:00
Michael LIAO 49fce7ba8a [hipcc] Revise include path calculation.
- Once HIP_VDI_HOME is defined but HIP_CLANG_INCLUDE_PATH is not,
  calculate it directly without HIP_CLANG_PATH is defined or not;
  Otherwise, we may leave HIP_CLANG_INCLUDE_PATH undefined, if clang is
  not installed following the official way (so far, HIP-Clang breaks
  that), we may leave HIP_CLANG_INCLUDE_PATH undefined before its uses.


[ROCm/hip commit: e32940357f]
2019-06-14 13:35:00 -04:00
Rahul Garg 884d0fef76 HACK for SWDEV-173477/SWDEV-190701
[ROCm/hip commit: bc528b1e8b]
2019-06-13 18:15:31 -07:00