Rahul Garg
2dfc912094
Add cleanup code in module api samples
2019-06-07 05:23:11 +05:30
Evgeny Mankov
66c182946e
[HIP][HIPIFY] Make hipMemcpyParam2D coherent with cuMemcpy2D
...
+ Makes hip_Memcpy2D struct compatible with CUDA_MEMCPY2D struct
+ Add hipMemcpyParam2D support in nvcc fallback path
+ Update hipify-clang, tests and docs accordingly
2019-05-22 18:31:39 +03:00
Evgeny Mankov
f5f240744a
[HIP][tests] Copyright notice update
2019-05-20 18:46:23 +03:00
Nick Curtis
7391b26fac
Markdown fixes & Whitespace cleanup for samples ( #1096 )
...
* Fix multiline code blocks in README's
* Whitespace cleanup
2019-05-12 19:27:44 +05:30
Rahul Garg
2796df1b26
Test hipModuleGetGlobal
2019-03-15 04:08:03 +05:30
Rahul Garg
599ae3af2e
Add extension for kernel concurrency on same stream
2019-03-06 12:55:39 +05:30
Maneesh Gupta
8244ad03d0
Merge pull request #938 from gargrahul/fix_hipBusBW_p2p_bidir
...
Fix hipBusBW sample for P2P bidirectional test
2019-02-28 07:14:38 +05:30
Rahul Garg
5f54c5cb59
Fix hipBusBW sample for P2P bidirectional test
2019-02-28 00:56:07 +05:30
Rahul Garg
dfc0ce51d0
Fix hipBusBW overflow with setting beats/iterations
2019-02-27 00:18:52 +05:30
Rahul Garg
cea97d2beb
Fix error codes in P2P sample test
2019-02-05 06:30:57 +05:30
Rahul Garg
39735ffedd
Fixed HIP error check in P2P sample test
2019-01-23 01:10:07 +05:30
Yaxun Sam Liu
e787b89865
Fix sample bit_extract for hip-clang
2018-11-12 16:50:47 -05:00
Alex Voicu
e7270853cc
Guard new includes.
2018-10-18 12:29:27 +01:00
Alex Voicu
b76b5bb1b0
Re-sync with upstream.
2018-10-18 12:27:03 +01:00
Maneesh Gupta
bfceb14751
Replace hipLaunchKernel -> hipLaunchKernelGGL
...
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Alex Voicu
6e4935f225
Update samples.
2018-10-13 23:28:22 +01:00
Rahul Garg
9e167ab02e
Remove adipose extn from launchKernelHcc sample
2018-09-12 16:41:24 +05:30
Rahul Garg
dbf1737658
Clean up module api samples
2018-08-08 22:28:13 +05:30
Rahul Garg
30c587d2b1
Fix hipCommander Makefile
2018-05-16 15:01:32 +05:30
Rahul Garg
bd985285df
Removed hidden args and hipLaunchParm from HIP/HCC path
2018-03-16 22:50:25 +05:30
Rahul Garg
65b2fc4b9b
Change co file name
2018-03-16 12:54:44 +05:30
Rahul Garg
01ee90d564
Fixed function not found issue
2018-03-16 12:35:25 +05:30
Maneesh Gupta
6b09bde675
Apply .clangformat to all repo source files
...
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Alex Voicu
182156b12b
This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose.
2017-12-08 04:22:57 +00:00
Ben Sander
1a23d5e95a
Merge pull request #281 from mangupta/issue126
...
[samples] Adds a sample that shows using HIP with cmake
2017-12-05 11:42:11 -06:00
Maneesh Gupta
81bcfafe8d
Simplify square sample's Makefile
...
Change-Id: I44349a880a3c57ca0e833d67d9c380b706655b1e
2017-12-05 11:54:50 +05:30
Maneesh Gupta
c15d48c543
[samples] Adds a sample that shows using HIP with cmake
...
Change-Id: Ief983ea0894d7b5d1ea46a755f9134dda0a1bb8f
2017-12-05 10:48:29 +05:30
Ben Sander
421a50e830
Update square sample for recent HIP ease-of-use improvements
2017-12-02 07:44:27 -06:00
Alex Voicu
071b260cf6
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit fe3719a
2017-11-29 21:49:10 +00:00
Alex Voicu
6e2e720b26
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Ben Sander
75a4e404ca
Merge pull request #256 from gargrahul/texture_driver_api_support
...
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Rahul Garg
3d2e40a5df
Changed function hipMemcpy_2D to hipMemcpyParam2D
2017-11-21 12:36:24 +05:30
Rahul Garg
741702888f
Update hipModuleGetTexRef API
2017-11-19 22:10:46 +05:30
Rahul Garg
657aa51d5d
-Fixed texture driver API sample
...
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs
2017-11-15 18:23:28 +05:30
Rahul Garg
ac124b3179
Fix module_api sample
2017-11-13 08:56:39 +05:30
Rahul Garg
0fffdeba92
Added texture 2D driver API usage example
2017-11-09 22:35:29 +05:30
Ben Sander
731c1afea6
Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
...
Feature support globals for module api
2017-10-27 01:53:34 +02:00
Rahul Garg
626521007d
Example showing globals use with module APIs
2017-10-24 18:12:25 +05:30
Rahul Garg
f19c685f88
Use 2X for bidir p2p memory bandwidth calc
2017-10-23 21:57:20 +05:30
Ben Sander
9fef6f860c
Use 2X for bidir memory bandwidth calc
2017-10-21 07:47:32 -05:00
Sandeep Kumar
451f36a42a
Add more info for inline asm in hip kernel guide and cookbook readme
2017-09-13 12:57:37 +05:30
Ben Sander
6ac55d2b34
Refactor dispatch latency test and fix several bugs.
2017-08-17 08:46:58 -05:00
Aditya Atluri
8e3e104313
fixed device selection during compilation to use rocm_agent_enumerator
...
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta
8252ae785b
GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
...
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Aditya Atluri
98905a7272
automate gcnarch detection
...
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri
a491a49f98
removed rm for /opt/rocm/hip/src in inline asm sample
...
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar
5c530e7c32
Add peer2peer bandwidth and latency test
...
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar
7c6b0384bb
Add readme for inline asm and unroll cookbook samples
...
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar
83472bfa78
Add unroll and inline asm cookbook samples
...
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar
3bc6df2044
Print msg for single gpu
...
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30