Commit Graph

123 Commits

Author SHA1 Message Date
Sandeep Kumar e43802035f Add more info for inline asm in hip kernel guide and cookbook readme 2017-09-13 12:57:37 +05:30
Ben Sander e87f7a4d00 Refactor dispatch latency test and fix several bugs. 2017-08-17 08:46:58 -05:00
Aditya Atluri 961440b593 fixed device selection during compilation to use rocm_agent_enumerator
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta db959e648b GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Aditya Atluri 3e610ab0fe automate gcnarch detection
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri 871c2fc8d6 removed rm for /opt/rocm/hip/src in inline asm sample
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar c17ab84882 Add peer2peer bandwidth and latency test
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar 0154c97ddd Add readme for inline asm and unroll cookbook samples
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar e4f0b28bb9 Add unroll and inline asm cookbook samples
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar dfcba01db6 Print msg for single gpu
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30
Ben Sander 6d5ac3fbac Fix compilation error with nvcc (c++ nullptr) 2017-04-21 09:01:34 -05:00
Maneesh Gupta 935e3cd649 Fix build issues with bit_extract sample
Change-Id: I628b3c83a16f7adf0ab8ca60aecde8c073c34fd9
2017-04-07 15:24:10 +05:30
Maneesh Gupta 6e565d8469 Fix build issues in hipCommander sample
- Remove -stdlib=libstdc++ from Makefile
- Removed deleted HIP header file fom includes

Change-Id: Ia189396bee19fc52b679259df56c6c6e2bafb6fe
2017-04-07 14:54:03 +05:30
Aditya Atluri 6e1756fe23 added module api sample which uses hipHccModuleLaunchKernel
Change-Id: I7bce60b4480a3b5ff7ed69c3256078ded65a0945
2017-03-31 14:30:40 -05:00
Aditya Atluri 3eed9aba5d added debug support for HIP sample
Change-Id: Ia7265234082039b68114f7421f4dbcb7149d4d2b
2017-03-31 14:13:46 -05:00
Aditya Atluri 7d0a406fba Fixed bit_extract
Change-Id: I92d7b7a302e3fa0db84889fb5dc6b612e6a53c73
2017-03-31 13:35:26 -05:00
Aditya Atluri b9091ba818 added new api hipHccModuleLaunchKernel
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming

Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng 329e2182d6 revert workaround for square sample and update doc on GGL
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
pensun 33c38de407 Initial integration with Alex' Generic Grid Launch
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
2017-03-17 14:59:34 -05:00
Aditya Atluri 99432cc12c Added default module launch api functionality
1. As in hipModuleLaunchKernel(..., kernelParams, nullptr); works with this commit
    2. Added headers AMDGPUPTNote.h, AMDGPURuntimeMetadata.h to do code object meta data parsing
    3. Changed CMake to look at llvm link libraries
    4. HIP developer should set env variable LLVM_HOME to remove link errors
    5. HIP depends on installed LLVM (not source, not build)
    6. Added sample to test out the feature
    7. Right now HCC does not support embedding metadata in code object. Use clang opencl
    8. Changed HIPCC to read LLVM_HOME env var
    9. New argument to CMake should be given -DLLVM_HOME=<where llvm 5.0 is installed>

Change-Id: Iba38194aa872d97cc2c90a8e5ff746c48055c868
2017-03-17 13:11:34 -05:00
Maneesh Gupta 9f5a11a3fb 4_shfl and 5_2dshfl samples are unsupported on gfx701
Change-Id: I81eb880350f25e89573ba14c62b549c6c43f8c91
2017-03-14 15:56:18 +05:30
Ben Sander 524e007db5 Refactor registered memory calls. 2017-03-11 09:18:27 -06:00
Ben Sander f022bd651f Add first step to a "registerd" mode in hipBusBandwidth. 2017-03-11 09:18:27 -06:00
Rahul Garg 6d815f84d1 Fix for HCSWAP-128, make 5_2dshfl cookbook sample only for fiji
Change-Id: I8869c28151bca1bd47a053a2808e93a801d16d00
2017-03-10 10:29:52 +05:30
Aditya Atluri 60f8908ad8 make 4_shfl cookbook sample only for fiji
1. __shfl is not supported on hawaii gfx701

Change-Id: Iac09f5d30ee0674b8f58a6e74ec5c49b02be32ad
2017-03-09 08:52:50 -06:00
Aditya Atluri f86f3b3b33 added new field to hipDeviceProp_t structure gcnArch.
1. It is an integer containing gfx values 701, 801, 802, 803
2. On NV path, it is zero

Change-Id: I2b4c7f48981d0214d8c6b1905d2cc85b16203419
2017-03-07 11:24:32 -06:00
Aditya Atluri 39548fb023 removed hipblas samples as it is not yet supported
Change-Id: I354b710e652ce0d0413d670530ceb8b70f4993d5
2017-02-17 08:51:02 -06:00
Rahul Garg 55df1b6ff2 Command scripts for latency measurements
Change-Id: I8c28765a09fb0358447367939de524b12699a317
2017-02-07 15:03:46 +05:30
Ben Sander 1f5d16afe7 Doc update - describe debug techniques
Also tweak sample to remove unneeded HIP_KERNEL_NAME.
Comment update
2017-01-19 12:40:45 -06:00
Rahul Garg cc0d2a6753 Fixed hipcommander default execution for HCSWAP-106
Change-Id: I9fbd10dfaeeb4928b2ec23ceed131b5200a658f9
2017-01-19 15:04:32 +05:30
Ben Sander 43635f51dc Print limits on CUDA devices 2016-12-16 08:55:11 -06:00
Ben Sander 4080fe209d remove TODO file 2016-12-15 14:42:52 -06:00
Sandeep Kumar d78649b978 Fixes in Makefile of couple of samples
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile

Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
2016-12-14 16:27:14 +05:30
Ben Sander ce92a53f25 Add more debug info 2016-11-26 08:56:02 -06:00
Sandeep Kumar 53e771fc75 fix_format
Change-Id: I34e265de434263a11654e5deba044c3f21e86578
2016-11-18 14:34:14 +05:30
Sandeep Kumar 09b157ca8c Add p2p for cookbook
Change-Id: Id2e77ab31123ef95885d665efe34bc0d4596733a
(cherry picked from commit 6fbd0352713ca36e399b1ed4f17c486207a53875)
2016-11-14 06:10:36 +05:30
Maneesh Gupta 36024deb3a hcc_dialects/Makefile: use clamp-config
Change-Id: I86df82f75b75125825e22d0545209a19386d9936
2016-11-10 11:31:50 +05:30
Ben Sander c59fe09a81 Update gitignore for some common output files
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd
2016-11-06 04:26:15 -06:00
Ben Sander 5832349e5e Print non-peers too
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac
2016-11-04 06:34:07 -05:00
Ben Sander bb58f4f6fc Add initial hipProfileStart/Stop
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.

Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e
2016-10-27 23:09:32 -05:00
Ben Sander 32b086400e show how to use variety of HIP_PROFILE features
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469
2016-10-27 23:09:32 -05:00
Ben Sander f95482c7c5 Add new hipdemangleatp and snapshot sample update for new functionality
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7
2016-10-27 23:09:31 -05:00
Ben Sander e9056798f6 Rename HIP_ATP_MARKER and profiling vars
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API

Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34
2016-10-27 23:09:31 -05:00
Sandeep Kumar d5c59830f3 Add more apps to 2_Cookbook
Change-Id: Iafe462df9726a32f450bd240a2de3eaa73a10057
2016-10-19 12:58:16 -04:00
Maneesh Gupta 501778898f samples: Updated copyright header
Change-Id: I821f514ced5e34d492cb167b65d7273e26ed7b84
2016-10-15 23:05:04 +05:30
Ben Sander 3ae3c39e44 Refactor module API test.
- Add PASSED/FAIL indication.
- Set args using struct rather than void* array.

Change-Id: Ic924f88c49cc46979b12b7fef8650081e3b5f58c
2016-10-14 23:19:25 -05:00
Aditya Atluri 1f28d992d3 indent correction for square.cu
Change-Id: I2ca008e260b920ac3a503ad2a4bb28cd32300c98
2016-10-12 19:26:59 -05:00
Aditya Atluri 068785ee30 added copyright to module sample kernel file
Change-Id: If57e0761df63c902e1677084ff85106ec49df5de
2016-10-12 19:23:48 -05:00
Maneesh Gupta 3b55dcbf6c clang-hipify -> hipify-clang in documentation
Change-Id: I86ebc8112477db0d3e09f240beb3f9222d909ee6
2016-10-05 23:07:14 +05:30
Maneesh Gupta 3a247efb2f samples: Updated to use new hip include path
Change-Id: I53a1385a17f13a997ea21d14315f15a3ad851dab
2016-10-04 22:21:10 +05:30