Граф коммитов

163 Коммитов

Автор SHA1 Сообщение Дата
Rahul Garg 2dfc912094 Add cleanup code in module api samples 2019-06-07 05:23:11 +05:30
Evgeny Mankov 66c182946e [HIP][HIPIFY] Make hipMemcpyParam2D coherent with cuMemcpy2D
+ Makes hip_Memcpy2D struct compatible with CUDA_MEMCPY2D struct
+ Add hipMemcpyParam2D support in nvcc fallback path
+ Update hipify-clang, tests and docs accordingly
2019-05-22 18:31:39 +03:00
Evgeny Mankov f5f240744a [HIP][tests] Copyright notice update 2019-05-20 18:46:23 +03:00
Nick Curtis 7391b26fac Markdown fixes & Whitespace cleanup for samples (#1096)
* Fix multiline code blocks in README's

* Whitespace cleanup
2019-05-12 19:27:44 +05:30
Rahul Garg 2796df1b26 Test hipModuleGetGlobal 2019-03-15 04:08:03 +05:30
Rahul Garg 599ae3af2e Add extension for kernel concurrency on same stream 2019-03-06 12:55:39 +05:30
Maneesh Gupta 8244ad03d0 Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
Fix hipBusBW sample for P2P bidirectional test
2019-02-28 07:14:38 +05:30
Rahul Garg 5f54c5cb59 Fix hipBusBW sample for P2P bidirectional test 2019-02-28 00:56:07 +05:30
Rahul Garg dfc0ce51d0 Fix hipBusBW overflow with setting beats/iterations 2019-02-27 00:18:52 +05:30
Rahul Garg cea97d2beb Fix error codes in P2P sample test 2019-02-05 06:30:57 +05:30
Rahul Garg 39735ffedd Fixed HIP error check in P2P sample test 2019-01-23 01:10:07 +05:30
Yaxun Sam Liu e787b89865 Fix sample bit_extract for hip-clang 2018-11-12 16:50:47 -05:00
Alex Voicu e7270853cc Guard new includes. 2018-10-18 12:29:27 +01:00
Alex Voicu b76b5bb1b0 Re-sync with upstream. 2018-10-18 12:27:03 +01:00
Maneesh Gupta bfceb14751 Replace hipLaunchKernel -> hipLaunchKernelGGL
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Alex Voicu 6e4935f225 Update samples. 2018-10-13 23:28:22 +01:00
Rahul Garg 9e167ab02e Remove adipose extn from launchKernelHcc sample 2018-09-12 16:41:24 +05:30
Rahul Garg dbf1737658 Clean up module api samples 2018-08-08 22:28:13 +05:30
Rahul Garg 30c587d2b1 Fix hipCommander Makefile 2018-05-16 15:01:32 +05:30
Rahul Garg bd985285df Removed hidden args and hipLaunchParm from HIP/HCC path 2018-03-16 22:50:25 +05:30
Rahul Garg 65b2fc4b9b Change co file name 2018-03-16 12:54:44 +05:30
Rahul Garg 01ee90d564 Fixed function not found issue 2018-03-16 12:35:25 +05:30
Maneesh Gupta 6b09bde675 Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Alex Voicu 182156b12b This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose. 2017-12-08 04:22:57 +00:00
Ben Sander 1a23d5e95a Merge pull request #281 from mangupta/issue126
[samples] Adds a sample that shows using HIP with cmake
2017-12-05 11:42:11 -06:00
Maneesh Gupta 81bcfafe8d Simplify square sample's Makefile
Change-Id: I44349a880a3c57ca0e833d67d9c380b706655b1e
2017-12-05 11:54:50 +05:30
Maneesh Gupta c15d48c543 [samples] Adds a sample that shows using HIP with cmake
Change-Id: Ief983ea0894d7b5d1ea46a755f9134dda0a1bb8f
2017-12-05 10:48:29 +05:30
Ben Sander 421a50e830 Update square sample for recent HIP ease-of-use improvements 2017-12-02 07:44:27 -06:00
Alex Voicu 071b260cf6 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit fe3719a
2017-11-29 21:49:10 +00:00
Alex Voicu 6e2e720b26 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Ben Sander 75a4e404ca Merge pull request #256 from gargrahul/texture_driver_api_support
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Rahul Garg 3d2e40a5df Changed function hipMemcpy_2D to hipMemcpyParam2D 2017-11-21 12:36:24 +05:30
Rahul Garg 741702888f Update hipModuleGetTexRef API 2017-11-19 22:10:46 +05:30
Rahul Garg 657aa51d5d -Fixed texture driver API sample
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs
2017-11-15 18:23:28 +05:30
Rahul Garg ac124b3179 Fix module_api sample 2017-11-13 08:56:39 +05:30
Rahul Garg 0fffdeba92 Added texture 2D driver API usage example 2017-11-09 22:35:29 +05:30
Ben Sander 731c1afea6 Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
Feature support globals for module api
2017-10-27 01:53:34 +02:00
Rahul Garg 626521007d Example showing globals use with module APIs 2017-10-24 18:12:25 +05:30
Rahul Garg f19c685f88 Use 2X for bidir p2p memory bandwidth calc 2017-10-23 21:57:20 +05:30
Ben Sander 9fef6f860c Use 2X for bidir memory bandwidth calc 2017-10-21 07:47:32 -05:00
Sandeep Kumar 451f36a42a Add more info for inline asm in hip kernel guide and cookbook readme 2017-09-13 12:57:37 +05:30
Ben Sander 6ac55d2b34 Refactor dispatch latency test and fix several bugs. 2017-08-17 08:46:58 -05:00
Aditya Atluri 8e3e104313 fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta 8252ae785b GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Aditya Atluri 98905a7272 automate gcnarch detection
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri a491a49f98 removed rm for /opt/rocm/hip/src in inline asm sample
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar 5c530e7c32 Add peer2peer bandwidth and latency test
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar 7c6b0384bb Add readme for inline asm and unroll cookbook samples
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar 83472bfa78 Add unroll and inline asm cookbook samples
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar 3bc6df2044 Print msg for single gpu
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30