Граф коммитов

4140 Коммитов

Автор SHA1 Сообщение Дата
Rahul Garg dc702f5ffe Merge pull request #1616 from ROCm-Developer-Tools/hotfix_volatile_accessors
Accessors should work even when oddly volatile.
2019-11-01 13:45:54 -07:00
Alex Voicu 2d76dde05b Accessors should work even when oddly volatile. 2019-11-01 22:18:01 +02:00
Evgeny Mankov 48bb6df7e2 Merge pull request #1615 from emankov/hipify
[HIPIFY][CUB][#1460][perl] Add "cub::" namespace prefix support in hipify-perl as well
2019-11-01 14:35:55 +03:00
Evgeny Mankov c5a2a2daf2 [HIPIFY][CUB][#1460][perl] Add "cub::" namespace prefix support in hipify-perl as well 2019-11-01 14:34:18 +03:00
Rahul Garg 1bec1445bb Merge pull request #1582 from amd-lthakur/hipExtMLK
Adding a directed test case for hipExtModuleLaunchKernel() api.
2019-10-31 17:13:26 -07:00
Rahul Garg 2199f8a6c6 Merge pull request #1598 from lmoriche/master
Fix a code object memory corruption
2019-10-31 17:12:24 -07:00
Rahul Garg f556e15361 Add stream 2019-10-31 12:15:56 -04:00
Rahul Garg 0718ba0f00 Fix HIP init calls in hipMemcpy2DFromArray 2019-10-31 12:15:56 -04:00
Evgeny Mankov 8b99b0ffd8 Merge pull request #1612 from emankov/hipify
[HIPIFY][cmake][#1572] Fix: Do not override CMAKE_INSTALL_PREFIX
2019-10-31 16:58:36 +03:00
Evgeny Mankov e79fd55d01 [HIPIFY][cmake][#1572] Fix: Do not override CMAKE_INSTALL_PREFIX
Affects building with HIP, standalone building is not changed
2019-10-31 16:55:06 +03:00
Rahul Garg 07f4431de8 Formatting changes 2019-10-30 18:12:51 -07:00
Rahul Garg cd1435cbc7 Formatting changes ,variable name and check update 2019-10-30 18:09:21 -07:00
Rahul Garg aeb7cebbad Merge pull request #1515 from ansurya/tex_unbind_issue_fix
Fix undefined ref to hipUnbindTexture for texture types
2019-10-30 17:54:15 -07:00
Laurent Morichetti 3243f06eef Addressed review comments
Change comment "must exceed" to "must be no shorter than"
move the std::string instead of creating a copy
2019-10-30 13:14:41 -07:00
Evgeny Mankov 961bc5737e Merge pull request #1593 from emankov/doc
[HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
2019-10-30 22:10:05 +03:00
Rahul Garg b94f5bd667 Merge pull request #1607 from mhbliao/hliao/master/missing.api.hip.clang
[HIP] Correct headers and add missing function templates for hip-clang.
2019-10-30 07:48:57 -07:00
Michael LIAO 61bc68a5f4 [HIP] Correct headers and add missing function templates for hip-clang.
- Fix 2 runtime API prototypes
  `hipOccupancyMaxActiveBlocksPerMultiprocessor` and
  `hipOccupancyMaxActiveBlocksPerMultiprocessorWithFlags`
- Add missing function templates of them in hip-clang.
2019-10-29 22:00:11 -04:00
Rahul Garg 9840cdac99 Merge pull request #1602 from ROCm-Developer-Tools/revert-1560-satyanveshd/hipoccupy
Revert "Cooperative groups match with cuda SWDEV-205006"
2019-10-29 16:54:36 -07:00
Evgeny Mankov daab61e8e8 Merge pull request #1604 from emankov/hipify
[HIPIFY][#1603] Fix
2019-10-29 22:12:39 +03:00
Evgeny Mankov 050fdad7b7 [HIPIFY][#1603] Fix 2019-10-29 22:10:36 +03:00
Rahul Garg 27221bc823 Revert "Fix occupany APIs (#1560)"
This reverts commit 6c5fbf9b4a.
2019-10-29 11:41:08 -07:00
Evgeny Mankov 8a7e6fb747 Merge pull request #1601 from emankov/hipify
[HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
2019-10-29 20:55:29 +03:00
Evgeny Mankov dd2243f2fa [HIPIFY][Linux] Rollback --cuda-compile-host-device on Linux
[Reason] It doesn't work with LLVM 9 and higher; Windows is fine
2019-10-29 20:53:54 +03:00
Evgeny Mankov 99c4a40da1 Merge pull request #1600 from emankov/hipify
[HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
2019-10-29 19:47:15 +03:00
Evgeny Mankov 411b18a124 [HIPIFY] Introduce --cuda-compile-host-device for LLVM >= 9
* LLVM < 9 continues using --cuda-host-only
2019-10-29 19:42:53 +03:00
Evgeny Mankov 50df94be1e Merge pull request #1599 from emankov/hipify
[HIPIFY] cudaMemcpy2DFromArray(Async) support
2019-10-29 19:14:00 +03:00
Evgeny Mankov 5dd00bdf52 [HIPIFY] cudaMemcpy2DFromArray(Async) support 2019-10-29 19:12:42 +03:00
Laurent Morichetti 66c91b42e6 Fix a code object memory corruption
The lifetime of the buffer given to
hsa_code_object_reader_create_from_memory must exceed that of the
code object reader. We need to create a copy of the code object
binary memory (file) that is kept allocated until the code object
reader is destroyed.
2019-10-29 08:23:57 -07:00
Evgeny Mankov 3921ea9057 Merge pull request #1594 from emankov/HIP
[HIP][doc] Fix typo: AMD-clang -> HIP-clang
2019-10-28 23:22:57 +03:00
Evgeny Mankov 3df22b2fde [HIP][doc] NVIDIA-nvcc -> HIP-nvcc 2019-10-28 22:46:33 +03:00
Evgeny Mankov d312bce79d [HIP][doc] AMD-hcc -> HIP-hcc 2019-10-28 21:41:12 +03:00
Evgeny Mankov 6284b041e5 [HIP][doc] Fix typo: AMD-clang -> HIP-clang
HIP-clang is already used below instead of AMD-clang
2019-10-28 21:19:21 +03:00
Evgeny Mankov 8100e084b8 [HIP][cmake] Move all *_INSTALL_DIR variables up before first add_subdirectory()
[REASON]
Those vars (may) used by cmake in subdirectories (#1571)
2019-10-28 21:07:00 +03:00
Evgeny Mankov 7f367ff933 Merge pull request #1590 from emankov/doc
[HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h
2019-10-25 16:08:22 +03:00
Evgeny Mankov f68bee02f5 [HIPIFY][tests] Rename the ambiguous call as well 2019-10-25 16:07:31 +03:00
Evgeny Mankov 9529e1d91d [HIPIFY][tests] Fix ambiguous call to cusparseGetErrorString declared in cusparse.h 2019-10-25 16:04:20 +03:00
Anusha Godavarthy Surya 9332a39838 Merge branch 'master' into tex_unbind_issue_fix 2019-10-25 15:54:25 +05:30
amd-lthakur 626cd5d07a Excluded the test case for nvcc platform 2019-10-25 15:52:11 +05:30
Anusha Godavarthy Surya ae838f8cee merge from master 2019-10-25 15:52:09 +05:30
Alex Voicu 40522e2b6a Add missing operators, fix GCC compilation. (#1589) 2019-10-25 15:44:24 +05:30
Alex Voicu f909a393ff Fix deadlock, remove old __sync_* use. (#1584)
This fixes a deadlock introduced by the switch to TTAS loops, and is therefore mildly urgent (to prevent the CI from hoovering in the broken code).
2019-10-25 15:44:17 +05:30
Rahul Garg 66a3c874c8 [dtest] Fix hipMemset2D test (#1579)
Reverts changes made in #1399. This is a RT api test. For testing hipMemAllocPitch , a new test should be written and that should use correct memset API.
2019-10-25 15:44:05 +05:30
Rahul Garg 14b870d1ce Add hipMemcpy2DfromArray (#1510)
Adds hipMemcpy2DFromArray and hipMemcpy2DFromArrayAsync equivalent to cudaMemcpy2DFromArray and cudaMemcpy2DFromArrayAsync.
2019-10-25 15:43:33 +05:30
Anusha Godavarthy Surya c0fc5e718c Merge branch 'master' into tex_unbind_issue_fix 2019-10-25 15:36:55 +05:30
Anusha Godavarthy Surya b9c8dd8ac6 Fixed CI build failure 2019-10-25 12:21:41 +05:30
amd-lthakur b2238bacd4 Refactored the file as suggested 2019-10-25 10:44:38 +05:30
amd-lthakur 84fb936dfe Update matmul.cpp 2019-10-25 09:22:07 +05:30
amd-lthakur 9a860e6766 Update hipExtModuleLaunchKernel.cpp 2019-10-25 09:19:49 +05:30
Rahul Garg ff8d3fa446 Update profiling doc (#1576) 2019-10-24 17:51:55 +05:30
Jatin Chaudhary f53b1a1755 Adding New Analyze Target Merging with cppcheck (#1583) 2019-10-24 17:46:06 +05:30