Maneesh Gupta
bfceb14751
Replace hipLaunchKernel -> hipLaunchKernelGGL
...
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be
2018-10-17 14:32:25 +05:30
Rahul Garg
9e167ab02e
Remove adipose extn from launchKernelHcc sample
2018-09-12 16:41:24 +05:30
Rahul Garg
dbf1737658
Clean up module api samples
2018-08-08 22:28:13 +05:30
Rahul Garg
30c587d2b1
Fix hipCommander Makefile
2018-05-16 15:01:32 +05:30
Rahul Garg
bd985285df
Removed hidden args and hipLaunchParm from HIP/HCC path
2018-03-16 22:50:25 +05:30
Rahul Garg
65b2fc4b9b
Change co file name
2018-03-16 12:54:44 +05:30
Rahul Garg
01ee90d564
Fixed function not found issue
2018-03-16 12:35:25 +05:30
Maneesh Gupta
6b09bde675
Apply .clangformat to all repo source files
...
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
2018-03-12 11:29:03 +05:30
Alex Voicu
182156b12b
This introduces LipoProteinLipase (lpl), a simple tool for creating fat binaries. It represents a direct replacement of the creaky hccgenco.sh script, which had various issues. The format it uses is that of a code object bundle, generated by the Clang Offload Bundler. The output is always suffixed with the ".adipose" extension. It is shared with HCC. The hipcc script and associated tests are modified to use lpl. Help can be obtained by invoking lpl --help. A more computer-sciency / corporate friendly name is likely to be beneficial, which is a reason for choosing easily searchable/replaceable names such as lpl or adipose.
2017-12-08 04:22:57 +00:00
Ben Sander
1a23d5e95a
Merge pull request #281 from mangupta/issue126
...
[samples] Adds a sample that shows using HIP with cmake
2017-12-05 11:42:11 -06:00
Maneesh Gupta
81bcfafe8d
Simplify square sample's Makefile
...
Change-Id: I44349a880a3c57ca0e833d67d9c380b706655b1e
2017-12-05 11:54:50 +05:30
Maneesh Gupta
c15d48c543
[samples] Adds a sample that shows using HIP with cmake
...
Change-Id: Ief983ea0894d7b5d1ea46a755f9134dda0a1bb8f
2017-12-05 10:48:29 +05:30
Ben Sander
421a50e830
Update square sample for recent HIP ease-of-use improvements
2017-12-02 07:44:27 -06:00
Alex Voicu
071b260cf6
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit fe3719a
2017-11-29 21:49:10 +00:00
Alex Voicu
6e2e720b26
Revert "Revert adoption of CUDA indexing in general - this can only work with later versions of the compiler, just like module based dispatch, and thus must be guarded against usage in earlier (e.g. 1.6) versions."
...
This reverts commit d2fd1f5
2017-11-29 21:36:29 +00:00
Ben Sander
75a4e404ca
Merge pull request #256 from gargrahul/texture_driver_api_support
...
Texture driver APIs support
2017-11-27 13:52:39 -06:00
Rahul Garg
3d2e40a5df
Changed function hipMemcpy_2D to hipMemcpyParam2D
2017-11-21 12:36:24 +05:30
Rahul Garg
741702888f
Update hipModuleGetTexRef API
2017-11-19 22:10:46 +05:30
Rahul Garg
657aa51d5d
-Fixed texture driver API sample
...
-Added hipTexRefSetAddress and hipTexRefSetAddress2D APIs
2017-11-15 18:23:28 +05:30
Rahul Garg
ac124b3179
Fix module_api sample
2017-11-13 08:56:39 +05:30
Rahul Garg
0fffdeba92
Added texture 2D driver API usage example
2017-11-09 22:35:29 +05:30
Ben Sander
731c1afea6
Merge pull request #198 from AlexVlx/feature_support_globals_for_module_api
...
Feature support globals for module api
2017-10-27 01:53:34 +02:00
Rahul Garg
626521007d
Example showing globals use with module APIs
2017-10-24 18:12:25 +05:30
Rahul Garg
f19c685f88
Use 2X for bidir p2p memory bandwidth calc
2017-10-23 21:57:20 +05:30
Ben Sander
9fef6f860c
Use 2X for bidir memory bandwidth calc
2017-10-21 07:47:32 -05:00
Sandeep Kumar
451f36a42a
Add more info for inline asm in hip kernel guide and cookbook readme
2017-09-13 12:57:37 +05:30
Ben Sander
6ac55d2b34
Refactor dispatch latency test and fix several bugs.
2017-08-17 08:46:58 -05:00
Aditya Atluri
8e3e104313
fixed device selection during compilation to use rocm_agent_enumerator
...
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta
8252ae785b
GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
...
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Aditya Atluri
98905a7272
automate gcnarch detection
...
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri
a491a49f98
removed rm for /opt/rocm/hip/src in inline asm sample
...
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar
5c530e7c32
Add peer2peer bandwidth and latency test
...
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar
7c6b0384bb
Add readme for inline asm and unroll cookbook samples
...
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar
83472bfa78
Add unroll and inline asm cookbook samples
...
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar
3bc6df2044
Print msg for single gpu
...
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30
Ben Sander
2335bcdd03
Fix compilation error with nvcc (c++ nullptr)
2017-04-21 09:01:34 -05:00
Maneesh Gupta
ad280696c6
Fix build issues with bit_extract sample
...
Change-Id: I628b3c83a16f7adf0ab8ca60aecde8c073c34fd9
2017-04-07 15:24:10 +05:30
Maneesh Gupta
59db2f453f
Fix build issues in hipCommander sample
...
- Remove -stdlib=libstdc++ from Makefile
- Removed deleted HIP header file fom includes
Change-Id: Ia189396bee19fc52b679259df56c6c6e2bafb6fe
2017-04-07 14:54:03 +05:30
Aditya Atluri
8e2b7147a5
added module api sample which uses hipHccModuleLaunchKernel
...
Change-Id: I7bce60b4480a3b5ff7ed69c3256078ded65a0945
2017-03-31 14:30:40 -05:00
Aditya Atluri
93a0b55616
added debug support for HIP sample
...
Change-Id: Ia7265234082039b68114f7421f4dbcb7149d4d2b
2017-03-31 14:13:46 -05:00
Aditya Atluri
8bc80debe4
Fixed bit_extract
...
Change-Id: I92d7b7a302e3fa0db84889fb5dc6b612e6a53c73
2017-03-31 13:35:26 -05:00
Aditya Atluri
7735b454a1
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng
c4c4d95db6
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
pensun
323807d02b
Initial integration with Alex' Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
2017-03-17 14:59:34 -05:00
Aditya Atluri
219343027f
Added default module launch api functionality
...
1. As in hipModuleLaunchKernel(..., kernelParams, nullptr); works with this commit
2. Added headers AMDGPUPTNote.h, AMDGPURuntimeMetadata.h to do code object meta data parsing
3. Changed CMake to look at llvm link libraries
4. HIP developer should set env variable LLVM_HOME to remove link errors
5. HIP depends on installed LLVM (not source, not build)
6. Added sample to test out the feature
7. Right now HCC does not support embedding metadata in code object. Use clang opencl
8. Changed HIPCC to read LLVM_HOME env var
9. New argument to CMake should be given -DLLVM_HOME=<where llvm 5.0 is installed>
Change-Id: Iba38194aa872d97cc2c90a8e5ff746c48055c868
2017-03-17 13:11:34 -05:00
Maneesh Gupta
8d6cb1f5a3
4_shfl and 5_2dshfl samples are unsupported on gfx701
...
Change-Id: I81eb880350f25e89573ba14c62b549c6c43f8c91
2017-03-14 15:56:18 +05:30
Ben Sander
705ab93664
Refactor registered memory calls.
2017-03-11 09:18:27 -06:00
Ben Sander
b25691cb87
Add first step to a "registerd" mode in hipBusBandwidth.
2017-03-11 09:18:27 -06:00
Rahul Garg
5eb39f1c6b
Fix for HCSWAP-128, make 5_2dshfl cookbook sample only for fiji
...
Change-Id: I8869c28151bca1bd47a053a2808e93a801d16d00
2017-03-10 10:29:52 +05:30
Aditya Atluri
a8fb90d9a9
make 4_shfl cookbook sample only for fiji
...
1. __shfl is not supported on hawaii gfx701
Change-Id: Iac09f5d30ee0674b8f58a6e74ec5c49b02be32ad
2017-03-09 08:52:50 -06:00