Граф коммитов

3490 Коммитов

Автор SHA1 Сообщение Дата
Rahul Garg 33af62d66e Revert "Using HSA API for hipMemsetAsync (#1346)" (#1381)
This reverts commit 4313f00563.

[ROCm/clr commit: d433f6fb58]
2019-09-03 05:13:46 +00:00
Nicholas Malaya 85c3f035e4 Fix Broken Link in hip_porting_guide (#1376)
The math library equivalents between CUDA-HIP are broken. This is a key feature for converting to AMD hardware. This fix corrects the broken link and moves the library equivalents to sit under the "Porting a New Cuda Project" header.

[ROCm/clr commit: 796b8aafd7]
2019-09-03 05:13:35 +00:00
Sarbojit2019 e0f1719c1b Removed hipLaunchKernel macro got missed in Merge (#1374)
[ROCm/clr commit: 4d270775c8]
2019-09-03 05:13:07 +00:00
Evgeny Mankov 88a6eadbb6 [HIPIFY][doc] Update README.md (testing, versions)
[ROCm/clr commit: d9b5e2f74a]
2019-09-02 18:18:43 +03:00
Evgeny Mankov 12efe906e7 [HIPIFY][tests] Add occupancy test
[ROCm/clr commit: 608331250e]
2019-09-02 17:54:06 +03:00
Evgeny Mankov 46373a3631 [HIPIFY][fix] Fix for upcoming LLVM 10.0
[ROCm/clr commit: 8eb7d53cec]
2019-09-01 18:15:03 +03:00
Sameer Sahasrabuddhe 929b6d5f34 remove obsolete test for OCKL Asynchronous Streams
The implementation for OCKL AS was recently removed from the device
library since that feature is now superseded by hostcall.


[ROCm/clr commit: 68c133bd78]
2019-08-30 20:41:29 +05:30
Evgeny Mankov 56b05d61e1 [HIPIFY][DNN] cuDNN 7.6.3 support
[ROCm/clr commit: a0a98790f0]
2019-08-29 19:55:38 +03:00
Evgeny Mankov 2b87d21227 Merge pull request #1372 from emankov/master
[HIPIFY][#207][fix] Translate all preprocessor's conditional blocks

[ROCm/clr commit: 451721a821]
2019-08-29 10:30:05 +03:00
Yaxun (Sam) Liu e5a61a51de Fix sample 11_texture_driver for hip-clang (#1370)
[ROCm/clr commit: 793b18c0b1]
2019-08-29 01:03:20 +00:00
Yaxun (Sam) Liu 0a81b64a19 Do not change opt level for -g for hip-clang (#1369)
Currently hipcc uses -O3 for hip-clang by default but uses -O0 if -g is used. This
causes surprise for users since -g should not affect default opt level.

[ROCm/clr commit: 19a9b15306]
2019-08-29 01:03:11 +00:00
Sarbojit2019 15b80a759d Updated hipErrorString and CUDAErrorTohipError (#1365)
[ROCm/clr commit: b5ff9e71ed]
2019-08-29 01:02:59 +00:00
Sarbojit2019 4d23e35532 [HIP] Reclaiming hipLaunchKernel API (#1353)
* [HIP] Reclaiming hipLaunchKernel API

* Reclaiming hipLaunchKernel : Incorporated review comments

* Incorporated review comments

* Removed hipLaunchKernel Macro from nvcc path


[ROCm/clr commit: 1ae43cbeba]
2019-08-29 01:02:41 +00:00
satyanveshd 0e4701d59f [sample] add new cookbook sample - occupancy (#1352)
* occupancy.cpp with Makefile

* occupancy sample changes according tothe comments

* Changes according to the review comments

* Occupancy Sample Changes

* Changes according to review comments


[ROCm/clr commit: d2df21e58c]
2019-08-29 01:01:49 +00:00
mshivama 757aa67a43 Device side support for Cooperative Group feature (#1202)
* first cut of the header implementation of cooperative group feature

* add diclarations for device library functions

* fixed various compile time issues in the CG headers

* enabled copy construction and copy assignment

* fixed a minor bug related to conditional compilation macro

* fixed few more CG constructor issues and added a unit testcase

* fixed typo

* extended unit testcase

* compute size of partitioned CG from mask

* bit of code refactoring

* removed boilerplate code

* fixed few of the review comments by Brian

* Changes to the sigantures of few grid and multi-grid related OCKL functions

* changes to declarations of OCKL functions related to CG feature

* removed all the block level support as it is not planned for 2.9

* Have taken care of review comments by Brian

* Have taken care of review comments by Brian

* removed unused functions which were initially intended to use in block level cg support


[ROCm/clr commit: f74903851e]
2019-08-29 01:01:25 +00:00
Evgeny Mankov 65ba03dd4c [HIPIFY][#207][fix] Translate all preprocessor's conditional blocks
+ Start to translate preprocessor's false conditional blocks too:
  based on clang's https://reviews.llvm.org/D66597;
  available only starting from LLVM 10.0 or trunk.
+ Option -skip-excluded-preprocessor-conditional-blocks for skipping excluded conditional blocks:
  the default behavior for hipify-clang built with LLVM < 10.0;
  false by default for hipify-clang built with LLVM 10 or trunk.
+ Add 4 preprocessor unit tests, 2 of which are LLVM 10.0 only
+ Update couple of existing tests by setting -skip-excluded-preprocessor-conditional-blocks option:
  update lit testing accordingly


[ROCm/clr commit: 32f22a0117]
2019-08-28 21:17:35 +03:00
Evgeny Mankov 676c4b5bd4 [HIPIFY][cmake] Minor refactoring
1. Fix setting std c++ (11|14)
2. Get rid of WIN32, use MSVC instead
3. Use VERSION_GREATER_EQUAL and VERSION_MAJOR instead of logical expressions


[ROCm/clr commit: e3eb62ac4b]
2019-08-28 20:40:10 +03:00
Maneesh Gupta cb3803e423 [packaging] update soft-links that already exist (#1364)
Change-Id: Ibe9840802b1765094ef998edf2210e95404cb989

[ROCm/clr commit: b92c16f4d9]
2019-08-26 23:41:14 -04:00
Michael LIAO eb1039a1e8 [hcc] Fix previous replacement of result_of_t.
- `result_of_t` is defined as the shortcut of
  ```
  template< class T >
  using result_of_t = typename result_of<T>::type;
  ```


[ROCm/clr commit: c84628f420]
2019-08-26 10:58:38 -04:00
ramcherukuri 4e3b65f10c moving result_of_t to result_of
[ROCm/clr commit: 6ea9e2b249]
2019-08-24 08:59:58 -04:00
Evgeny Mankov bf769d3691 [HIPIFY][cmake] Explicitly set c++14
[Reason] LLVM became c++14 last week due to the following change:

37508d3dd94b0154861a90b1909d17b01400df99
Replace llvm::integer_sequence and friends with the C++14 standard version


[ROCm/clr commit: 7f5635d3ff]
2019-08-23 19:32:24 +03:00
Sarbojit2019 0cf9a5b532 Added missing device prop fields into hipInfo sample (#1357)
* Added prop.integrated into hipInfo sample

* Added missing deviceProp fileds in hipInfo


[ROCm/clr commit: 919bb6c74d]
2019-08-23 09:21:47 +00:00
Rahul Garg 7f71fe3af9 Make Bundled_code_header visible for hipRTC usage (#1359)
[ROCm/clr commit: d367fdf28c]
2019-08-23 09:20:02 +00:00
chrispaquot 4b13efac04 Set device before processing each one (#1358)
[ROCm/clr commit: d0b11deb7f]
2019-08-23 09:19:50 +00:00
Aryan Salmanpour e8f69a8c36 [hip] add initial implementation for hipLaunchCooperativeKernel API (#1339)
* [hip] add initial implementation for hipLaunchCooperativeKernel API

* [hip] use total number of work groups to initialize the GWS resource

* [hip] use only one argument for init_gws kernel

* [hip] use the device associated with the stream for checking the device properties


[ROCm/clr commit: 32ce882d6e]
2019-08-23 09:19:35 +00:00
Rahul Garg 14a510ee74 Fix memcpy with IPC slowness (#1321)
* Fix memcpy with IPC slowness

* Make early erroneous returns

* Real Clean up

* Real Clean up++


[ROCm/clr commit: a786728939]
2019-08-23 09:19:18 +00:00
Evgeny Mankov e7b6fac93e [HIPIFY][fix] Set the correct exit value for hipify-clang if errors.
This fixes a hanging of unit tests.


[ROCm/clr commit: 3d07393003]
2019-08-21 21:36:53 +03:00
Evgeny Mankov 2f3f8b9a98 [HIPIFY] CUDA 10.1 Update 2 support
[ROCm/clr commit: 8293f9a706]
2019-08-21 21:06:20 +03:00
Rahul Garg b7447d21c3 Remove -g from hipBusBandwidth makefile (#1351)
[ROCm/clr commit: 900590967d]
2019-08-21 10:01:31 +00:00
Maneesh Gupta 91fc0687fa Packaging and versioning changes (#1348)
- Fixes SWDEV-200435
- Brings versioning and packaging naming inline with rest of the ROCm
  components.
- Minor bug fixes.

Change-Id: I543a24bf18baffe74fa3ccac8c93bddba7231c5e

[ROCm/clr commit: e50f05f3f2]
2019-08-21 10:01:23 +00:00
Sarbojit2019 40c0bc5588 Compilation failure on nvcc path when using hipChannelFormatKind (#1345)
Fix for github #1183 issue reported

[ROCm/clr commit: 684801fc1d]
2019-08-21 10:01:03 +00:00
Jatin Chaudhary 4313f00563 Using HSA API for hipMemsetAsync (#1346)
[ROCm/clr commit: 9bbd09b04f]
2019-08-21 10:00:10 +00:00
kpyzhov b783820b90 Corrected declaration of __ockl_clz_u64() (#1340)
[ROCm/clr commit: d81267ae15]
2019-08-20 12:06:36 +00:00
Sarbojit2019 45c928dbba [hipcc] Fix for uninitialized variables on nvcc path
Fixes #1186

[ROCm/clr commit: f106375a34]
2019-08-20 12:06:22 +00:00
Maneesh Gupta 57c7e493e3 [ci] Enable tests on ROCm 2.7 (#1337)
Change-Id: I06fb58b48af7f52f309f659969bb42bf9e8e9faf

[ROCm/clr commit: fa891c7a21]
2019-08-20 12:05:30 +00:00
Yaxun (Sam) Liu 035568f6b5 Fix missing decl for hip-clang
Add back decl for hipHccModuleLaunchKernel and hipExtModuleLaunchKernel for HIP/VDI only


[ROCm/clr commit: 68a59ddbe8]
2019-08-19 18:27:13 -04:00
mhbliao a6e8fa0ca0 [hip] Allow from/to half conversion on host side. (#1334)
[ROCm/clr commit: 282f06014e]
2019-08-16 02:13:59 +00:00
Rahul Garg 147e516d1f Fix build issues seen with hip-clang path (#1331)
[ROCm/clr commit: a0cc627c42]
2019-08-16 02:13:45 +00:00
Yaxun (Sam) Liu 5cb8a914c2 Fix assert for windows. (#1329)
MSVC assert.h has no guard for include once. The macro assert overrides
device assert definition. Do not include it for device compilation.

[ROCm/clr commit: c4655d167e]
2019-08-16 02:13:33 +00:00
Rahul Garg 8966072920 Add hipMemcpy3DAsync (#1320)
* Add hipMemcpy3DAsync

* Fix CI build error

* Move back stream resolution to internal function

* Remove stream redefinition and check


[ROCm/clr commit: fbc9f7e20a]
2019-08-16 02:13:16 +00:00
Rahul Garg a160d3ee2e [dtests] refactor windows specific changes (#1313)
* [dtests] refactor windows specific changes

* Refactor hipMemoryAllocateCoherentDriver - PR- 1309

* Fix missing z in _putenv_s

* Revert "Fix missing z in _putenv_s"

This reverts commit 099a1b20a5c75c5f122d57c0ad2bca01745cdc9c.

* Refactor changes from PR 1299

* Update hipEnvVarDriver.cpp


[ROCm/clr commit: c741ad3c80]
2019-08-16 02:13:00 +00:00
Rahul Garg 1ca562e14c Fix undefined identifier issue for hipExtModuleLaunchKernel
[ROCm/clr commit: 7f9de881cb]
2019-08-14 16:46:32 -04:00
Maneesh Gupta 7a43bb6606 [hit] Add support for specifying dependencies in HIT syntax (#1323)
[ROCm/clr commit: 3634695737]
2019-08-14 11:30:42 +00:00
Sarbojit2019 2c441ad3b6 [HIP] Fix for hipArray_t failure on nvcc path
Fixes SWDEV-148407

[ROCm/clr commit: c4618f2c3b]
2019-08-14 11:30:06 +00:00
Rahul Garg 84fa282f9a [docs] Attribute details for launch kernel greater than 256 (#1319)
* Add attribute detail for kernel launch group > 256

* Add example


[ROCm/clr commit: d90c66f5bf]
2019-08-14 11:29:04 +00:00
eshcherb 572a076fa5 parser support for spaces after func name (#1294)
* parser support for spaces after func name

* removing reference


[ROCm/clr commit: 91c9f29cd1]
2019-08-14 11:28:42 +00:00
Evgeny Mankov e3cb2756c4 [HIPIFY] Add cudaMallocManaged -> hipMallocManaged
+ Add mapping for corresponding data types
+ Add a test
+ Update docs


[ROCm/clr commit: 9166b1554e]
2019-08-13 17:56:06 +03:00
Evgeny Mankov e6a0069d90 [HIPIFY][Driver API] Add support for hipFuncGetAttribute() and enum hipFunction_attribute
[ROCm/clr commit: 79de17dc84]
2019-08-13 16:48:07 +03:00
Evgeny Mankov d2231b9ad6 [HIPIFY] Add cuMemcpy2DAsync -> hipMemcpyParam2DAsync mapping
[ROCm/clr commit: b9deceab05]
2019-08-13 16:31:19 +03:00
Evgeny Mankov 1492370d1e [HIPIFY] Cooperative groups support
[ROCm/clr commit: af557aaaf6]
2019-08-12 19:20:13 +03:00