* first cut of the header implementation of cooperative group feature
* add diclarations for device library functions
* fixed various compile time issues in the CG headers
* enabled copy construction and copy assignment
* fixed a minor bug related to conditional compilation macro
* fixed few more CG constructor issues and added a unit testcase
* fixed typo
* extended unit testcase
* compute size of partitioned CG from mask
* bit of code refactoring
* removed boilerplate code
* fixed few of the review comments by Brian
* Changes to the sigantures of few grid and multi-grid related OCKL functions
* changes to declarations of OCKL functions related to CG feature
* removed all the block level support as it is not planned for 2.9
* Have taken care of review comments by Brian
* Have taken care of review comments by Brian
* removed unused functions which were initially intended to use in block level cg support
[ROCm/hip commit: d75dc4eb29]
1. Fix setting std c++ (11|14)
2. Get rid of WIN32, use MSVC instead
3. Use VERSION_GREATER_EQUAL and VERSION_MAJOR instead of logical expressions
[ROCm/hip commit: eeb4452b23]
- `result_of_t` is defined as the shortcut of
```
template< class T >
using result_of_t = typename result_of<T>::type;
```
[ROCm/hip commit: 63e47e525b]
[Reason] LLVM became c++14 last week due to the following change:
37508d3dd94b0154861a90b1909d17b01400df99
Replace llvm::integer_sequence and friends with the C++14 standard version
[ROCm/hip commit: e1d4f8510a]
* [hip] add initial implementation for hipLaunchCooperativeKernel API
* [hip] use total number of work groups to initialize the GWS resource
* [hip] use only one argument for init_gws kernel
* [hip] use the device associated with the stream for checking the device properties
[ROCm/hip commit: 5066700ace]
- Fixes SWDEV-200435
- Brings versioning and packaging naming inline with rest of the ROCm
components.
- Minor bug fixes.
Change-Id: I543a24bf18baffe74fa3ccac8c93bddba7231c5e
[ROCm/hip commit: c42075715b]
MSVC assert.h has no guard for include once. The macro assert overrides
device assert definition. Do not include it for device compilation.
[ROCm/hip commit: 7aa7a4ce22]
* Add hipMemcpy3DAsync
* Fix CI build error
* Move back stream resolution to internal function
* Remove stream redefinition and check
[ROCm/hip commit: 2405621f62]
Compilation error being observed due to popen(), pclose() and setenv() linux calls on windows. Replaced with appropriate calls on windows.
[ROCm/hip commit: 9b31d26237]
Compilation error being observed on windows due to aligned_alloc() call. Mapped the call to _aligned_malloc() for windows.
[ROCm/hip commit: 9abae7114c]
* Removed unwanted #include sys/time.h , gettimeofday() and timeval variables and this also helps avavoid compilation error in windows due to gettimeofday() call equivalent of which is not available in windows
* Changed the Macro name from GPU_PRINT_TIME to MY_LAUNCH_MACRO
[ROCm/hip commit: e94c0592de]
Changed the third arg of the functions __hip_as_write_block and __ockl_as_write_block from ulong to uint64_t so as to fix the compilation error in windows
[ROCm/hip commit: d3ffad7c83]
* add default visibility to most APIs in program_state
* remove unwanted C++ headers
* Add symbol visibility pragmas and compiler flags
* Add visibility attribute to APIs in channel_descriptor and hip_hcc
* remove unused headers
* simplify build flags with hcc
* add pragma visibility hidden to functional_grid_launch
* [CMake] add gfx908 back
[ROCm/hip commit: 83af327ef2]
* Add support for hipFunGetAttribute
* Support NVCC path
* Test using sample module_api_global
* Try fixing CI build failure due to hip_prof_gen scan
* Fix for CI build issue
* Resolve conflict
* Rebase and resolve conflicts with master
* Fix build error
* Fix NVCC path build error
[ROCm/hip commit: 6ce86f409d]