Maneesh Gupta
|
bfceb14751
|
Replace hipLaunchKernel -> hipLaunchKernelGGL
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
|
2018-10-17 14:32:25 +05:30 |
|
Rahul Garg
|
9e167ab02e
|
Remove adipose extn from launchKernelHcc sample
|
2018-09-12 16:41:24 +05:30 |
|
Rahul Garg
|
dbf1737658
|
Clean up module api samples
|
2018-08-08 22:28:13 +05:30 |
|
Maneesh Gupta
|
1c622a59a4
|
Merge pull request #452 from gargrahul/fix_hipCommander_makefile
Fix hipCommander Makefile
|
2018-05-17 07:25:27 +05:30 |
|
Rahul Garg
|
30c587d2b1
|
Fix hipCommander Makefile
|
2018-05-16 15:01:32 +05:30 |
|
Maneesh Gupta
|
f4613d1cef
|
Merge pull request #373 from gargrahul/fix_function_not_found_tex_drv_sample
Fixed function not found issue in texture driver api sample
|
2018-03-19 14:11:28 +05:30 |
|
Rahul Garg
|
bd985285df
|
Removed hidden args and hipLaunchParm from HIP/HCC path
|
2018-03-16 22:50:25 +05:30 |
|
Rahul Garg
|
65b2fc4b9b
|
Change co file name
|
2018-03-16 12:54:44 +05:30 |
|
Rahul Garg
|
01ee90d564
|
Fixed function not found issue
|
2018-03-16 12:35:25 +05:30 |
|
Maneesh Gupta
|
6b09bde675
|
Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
|
2018-03-12 11:29:03 +05:30 |
|
Alex Voicu
|
182156b12b
|
This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose.
|
2017-12-08 04:22:57 +00:00 |
|
Ben Sander
|
1a23d5e95a
|
Merge pull request #281 from mangupta/issue126
[samples] Adds a sample that shows using HIP with cmake
|
2017-12-05 11:42:11 -06:00 |
|
Maneesh Gupta
|
81bcfafe8d
|
Simplify square sample's Makefile
Change-Id: I44349a880a3c57ca0e833d67d9c380b706655b1e
|
2017-12-05 11:54:50 +05:30 |
|
Maneesh Gupta
|
c15d48c543
|
[samples] Adds a sample that shows using HIP with cmake
Change-Id: Ief983ea0894d7b5d1ea46a755f9134dda0a1bb8f
|
2017-12-05 10:48:29 +05:30 |
|
Ben Sander
|
421a50e830
|
Update square sample for recent HIP ease-of-use improvements
|
2017-12-02 07:44:27 -06:00 |
|
Alex Voicu
|
071b260cf6
|
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit fe3719a
|
2017-11-29 21:49:10 +00:00 |
|
Alex Voicu
|
2d36d3d36e
|
Merge branch 'feature_use_module_based_dispatch_instead_of_pfe' of https://github.com/AlexVlx/HIP into feature_use_module_based_dispatch_instead_of_pfe
|
2017-11-29 21:45:56 +00:00 |
|
Alex Voicu
|
6e2e720b26
|
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
This reverts commit d2fd1f5
|
2017-11-29 21:36:29 +00:00 |
|
Alex Voicu
|
fe3719af15
|
Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions.
|
2017-11-29 21:01:28 +00:00 |
|
Ben Sander
|
75a4e404ca
|
Merge pull request #256 from gargrahul/texture_driver_api_support
Texture driver APIs support
|
2017-11-27 13:52:39 -06:00 |
|
Rahul Garg
|
3d2e40a5df
|
Changed function hipMemcpy_2D to hipMemcpyParam2D
|
2017-11-21 12:36:24 +05:30 |
|
Rahul Garg
|
741702888f
|
Update hipModuleGetTexRef API
|
2017-11-19 22:10:46 +05:30 |
|
Rahul Garg
|
657aa51d5d
|
-Fixed texture driver API sample
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs
|
2017-11-15 18:23:28 +05:30 |
|
Maneesh Gupta
|
c256089392
|
Merge pull request #261 from gargrahul/fix_module_api_sample
Fix module_api sample
|
2017-11-13 11:55:54 +05:30 |
|
Rahul Garg
|
ac124b3179
|
Fix module_api sample
|
2017-11-13 08:56:39 +05:30 |
|
Rahul Garg
|
0fffdeba92
|
Added texture 2D driver API usage example
|
2017-11-09 22:35:29 +05:30 |
|
Ben Sander
|
7f96edc89a
|
Merge pull request #222 from bensander/fix_device_prop
Fix device prop
|
2017-10-30 17:58:48 +01:00 |
|
Ben Sander
|
731c1afea6
|
Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
Feature support globals for module api
|
2017-10-27 01:53:34 +02:00 |
|
Rahul Garg
|
626521007d
|
Example showing globals use with module APIs
|
2017-10-24 18:12:25 +05:30 |
|
Rahul Garg
|
f19c685f88
|
Use 2X for bidir p2p memory bandwidth calc
|
2017-10-23 21:57:20 +05:30 |
|
Ben Sander
|
9fef6f860c
|
Use 2X for bidir memory bandwidth calc
|
2017-10-21 07:47:32 -05:00 |
|
Maneesh Gupta
|
1188b3ba8f
|
Merge branch 'master' into roc-1.6.x
Change-Id: I8c5861c83032c6006731595ec40e09fdc9102749
|
2017-09-22 12:01:40 +05:30 |
|
Sandeep Kumar
|
451f36a42a
|
Add more info for inline asm in hip kernel guide and cookbook readme
|
2017-09-13 12:57:37 +05:30 |
|
Maneesh Gupta
|
5efad49773
|
Merge pull request #159 from bensander/hipDispatchLatency
Refactor dispatch latency test and fix several bugs.
|
2017-08-22 14:49:14 +05:30 |
|
Ben Sander
|
6ac55d2b34
|
Refactor dispatch latency test and fix several bugs.
|
2017-08-17 08:46:58 -05:00 |
|
Aditya Atluri
|
cd48b06719
|
fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
|
2017-07-28 10:43:11 +05:30 |
|
Ben Sander
|
2dbafb89ce
|
Merge pull request #108 from adityaatluri/enum-fix
fixed device selection during compilation to use rocm_agent_enumerator
|
2017-07-21 16:42:48 -05:00 |
|
Aditya Atluri
|
8e3e104313
|
fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
|
2017-07-21 15:50:12 -05:00 |
|
Maneesh Gupta
|
b897d5599f
|
Merge branch 'amd-develop'
|
2017-07-06 12:16:47 +05:30 |
|
Maneesh Gupta
|
8252ae785b
|
GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
|
2017-07-05 11:44:44 +05:30 |
|
Maneesh Gupta
|
465bc42a12
|
Merge branch 'roc-1.6.x' into master
Change-Id: I367a3940a0a9e5658abc28a7dc2bfb9cf4167dc8
|
2017-06-30 09:59:30 +05:30 |
|
Aditya Atluri
|
98905a7272
|
automate gcnarch detection
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
|
2017-06-29 12:01:40 -05:00 |
|
Aditya Atluri
|
a491a49f98
|
removed rm for /opt/rocm/hip/src in inline asm sample
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
|
2017-06-20 11:35:52 -05:00 |
|
Sandeep Kumar
|
5c530e7c32
|
Add peer2peer bandwidth and latency test
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
|
2017-06-14 09:44:56 +05:30 |
|
Sandeep Kumar
|
7c6b0384bb
|
Add readme for inline asm and unroll cookbook samples
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
|
2017-05-31 09:25:50 +05:30 |
|
Sandeep Kumar
|
83472bfa78
|
Add unroll and inline asm cookbook samples
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
|
2017-05-31 09:25:35 +05:30 |
|
Sandeep Kumar
|
3bc6df2044
|
Print msg for single gpu
Change-Id: I2d23c73542add8973990ba96592016726994422e
|
2017-05-31 09:25:17 +05:30 |
|
Maneesh Gupta
|
fa64db5171
|
Merge branch 'rocm-rel-1.5'
Change-Id: Ib2318f9c0d01a1bc8be2fcb172a3075e82851877
|
2017-05-02 09:06:49 +05:30 |
|
Maneesh Gupta
|
b8fd2f159a
|
Merge branch 'amd-develop' into amd-master
Change-Id: I312fb9d1181733ef5160d1e993e2ae57ced0f6b3
(cherry picked from commit 88fb807af0)
|
2017-04-25 00:01:30 -04:00 |
|
Ben Sander
|
2335bcdd03
|
Fix compilation error with nvcc (c++ nullptr)
|
2017-04-21 09:01:34 -05:00 |
|