Commit Graph

180 Commits

Author SHA1 Message Date
Maneesh Gupta bfceb14751 Replace hipLaunchKernel -> hipLaunchKernelGGL
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Rahul Garg 9e167ab02e Remove adipose extn from launchKernelHcc sample 2018-09-12 16:41:24 +05:30
Rahul Garg dbf1737658 Clean up module api samples 2018-08-08 22:28:13 +05:30
Maneesh Gupta 1c622a59a4 Merge pull request #452 from gargrahul/fix_hipCommander_makefile
Fix hipCommander Makefile
2018-05-17 07:25:27 +05:30
Rahul Garg 30c587d2b1 Fix hipCommander Makefile 2018-05-16 15:01:32 +05:30
Maneesh Gupta f4613d1cef Merge pull request #373 from gargrahul/fix_function_not_found_tex_drv_sample
Fixed function not found issue in texture driver api sample
2018-03-19 14:11:28 +05:30
Rahul Garg bd985285df Removed hidden args and hipLaunchParm from HIP/HCC path 2018-03-16 22:50:25 +05:30
Rahul Garg 65b2fc4b9b Change co file name 2018-03-16 12:54:44 +05:30
Rahul Garg 01ee90d564 Fixed function not found issue 2018-03-16 12:35:25 +05:30
Maneesh Gupta 6b09bde675 Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Alex Voicu 182156b12b This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose. 2017-12-08 04:22:57 +00:00
Ben Sander 1a23d5e95a Merge pull request #281 from mangupta/issue126
[samples] Adds a sample that shows using HIP with cmake
2017-12-05 11:42:11 -06:00
Maneesh Gupta 81bcfafe8d Simplify square sample's Makefile
Change-Id: I44349a880a3c57ca0e833d67d9c380b706655b1e
2017-12-05 11:54:50 +05:30
Maneesh Gupta c15d48c543 [samples] Adds a sample that shows using HIP with cmake
Change-Id: Ief983ea0894d7b5d1ea46a755f9134dda0a1bb8f
2017-12-05 10:48:29 +05:30
Ben Sander 421a50e830 Update square sample for recent HIP ease-of-use improvements 2017-12-02 07:44:27 -06:00
Alex Voicu 071b260cf6 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit fe3719a
2017-11-29 21:49:10 +00:00
Alex Voicu 2d36d3d36e Merge branch 'feature_use_module_based_dispatch_instead_of_pfe' of https://github.com/AlexVlx/HIP into feature_use_module_based_dispatch_instead_of_pfe 2017-11-29 21:45:56 +00:00
Alex Voicu 6e2e720b26 Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Alex Voicu fe3719af15 Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions. 2017-11-29 21:01:28 +00:00
Ben Sander 75a4e404ca Merge pull request #256 from gargrahul/texture_driver_api_support
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Rahul Garg 3d2e40a5df Changed function hipMemcpy_2D to hipMemcpyParam2D 2017-11-21 12:36:24 +05:30
Rahul Garg 741702888f Update hipModuleGetTexRef API 2017-11-19 22:10:46 +05:30
Rahul Garg 657aa51d5d -Fixed texture driver API sample
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs
2017-11-15 18:23:28 +05:30
Maneesh Gupta c256089392 Merge pull request #261 from gargrahul/fix_module_api_sample
Fix module_api sample
2017-11-13 11:55:54 +05:30
Rahul Garg ac124b3179 Fix module_api sample 2017-11-13 08:56:39 +05:30
Rahul Garg 0fffdeba92 Added texture 2D driver API usage example 2017-11-09 22:35:29 +05:30
Ben Sander 7f96edc89a Merge pull request #222 from bensander/fix_device_prop
Fix device prop
2017-10-30 17:58:48 +01:00
Ben Sander 731c1afea6 Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
Feature support globals for module api
2017-10-27 01:53:34 +02:00
Rahul Garg 626521007d Example showing globals use with module APIs 2017-10-24 18:12:25 +05:30
Rahul Garg f19c685f88 Use 2X for bidir p2p memory bandwidth calc 2017-10-23 21:57:20 +05:30
Ben Sander 9fef6f860c Use 2X for bidir memory bandwidth calc 2017-10-21 07:47:32 -05:00
Maneesh Gupta 1188b3ba8f Merge branch 'master' into roc-1.6.x
Change-Id: I8c5861c83032c6006731595ec40e09fdc9102749
2017-09-22 12:01:40 +05:30
Sandeep Kumar 451f36a42a Add more info for inline asm in hip kernel guide and cookbook readme 2017-09-13 12:57:37 +05:30
Maneesh Gupta 5efad49773 Merge pull request #159 from bensander/hipDispatchLatency
Refactor dispatch latency test and fix several bugs.
2017-08-22 14:49:14 +05:30
Ben Sander 6ac55d2b34 Refactor dispatch latency test and fix several bugs. 2017-08-17 08:46:58 -05:00
Aditya Atluri cd48b06719 fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-28 10:43:11 +05:30
Ben Sander 2dbafb89ce Merge pull request #108 from adityaatluri/enum-fix
fixed device selection during compilation to use rocm_agent_enumerator
2017-07-21 16:42:48 -05:00
Aditya Atluri 8e3e104313 fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta b897d5599f Merge branch 'amd-develop' 2017-07-06 12:16:47 +05:30
Maneesh Gupta 8252ae785b GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Maneesh Gupta 465bc42a12 Merge branch 'roc-1.6.x' into master
Change-Id: I367a3940a0a9e5658abc28a7dc2bfb9cf4167dc8
2017-06-30 09:59:30 +05:30
Aditya Atluri 98905a7272 automate gcnarch detection
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri a491a49f98 removed rm for /opt/rocm/hip/src in inline asm sample
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar 5c530e7c32 Add peer2peer bandwidth and latency test
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar 7c6b0384bb Add readme for inline asm and unroll cookbook samples
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar 83472bfa78 Add unroll and inline asm cookbook samples
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar 3bc6df2044 Print msg for single gpu
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30
Maneesh Gupta fa64db5171 Merge branch 'rocm-rel-1.5'
Change-Id: Ib2318f9c0d01a1bc8be2fcb172a3075e82851877
2017-05-02 09:06:49 +05:30
Maneesh Gupta b8fd2f159a Merge branch 'amd-develop' into amd-master
Change-Id: I312fb9d1181733ef5160d1e993e2ae57ced0f6b3
(cherry picked from commit 88fb807af0)
2017-04-25 00:01:30 -04:00
Ben Sander 2335bcdd03 Fix compilation error with nvcc (c++ nullptr) 2017-04-21 09:01:34 -05:00