Ben Sander
9da4352142
Refactor dispatch latency test and fix several bugs.
2017-08-17 08:46:58 -05:00
Aditya Atluri
b10962a539
fixed device selection during compilation to use rocm_agent_enumerator
...
1. Changed hipcc to use rocm_agent_enumerator
2. Changed square sample test to use device variable
2017-07-21 15:50:12 -05:00
Maneesh Gupta
13e3e63fd8
GPUOpen-ProfessionalCompute-Tools -> ROCm-Developer-Tools
...
Change-Id: I9f5b29dd1097385acecb0c672770d8adca2fdcf7
2017-07-05 11:44:44 +05:30
Aditya Atluri
0fe0381608
automate gcnarch detection
...
Change-Id: Ibbad22db136f7f5e2be84c82e9169298a144cc77
2017-06-29 12:01:40 -05:00
Aditya Atluri
fba69b1ce7
removed rm for /opt/rocm/hip/src in inline asm sample
...
Change-Id: I0c02bccd4cd35e01a8e889ea1e586ea8baf0ab90
2017-06-20 11:35:52 -05:00
Sandeep Kumar
01842faa1c
Add peer2peer bandwidth and latency test
...
Change-Id: I6d88e4aa9f6e64096af16579eebef4740734203e
2017-06-14 09:44:56 +05:30
Sandeep Kumar
f6b98854ba
Add readme for inline asm and unroll cookbook samples
...
Change-Id: I71b7a5652c3dad181c5df60ab0dd1b81d79f1bfb
2017-05-31 09:25:50 +05:30
Sandeep Kumar
5696eaf842
Add unroll and inline asm cookbook samples
...
Change-Id: Ie5a0fbb01b7fca82959090d89299533d49e092f1
2017-05-31 09:25:35 +05:30
Sandeep Kumar
e104c2e3bf
Print msg for single gpu
...
Change-Id: I2d23c73542add8973990ba96592016726994422e
2017-05-31 09:25:17 +05:30
Ben Sander
f1f907cb51
Fix compilation error with nvcc (c++ nullptr)
2017-04-21 09:01:34 -05:00
Maneesh Gupta
cbb7f12b1d
Fix build issues with bit_extract sample
...
Change-Id: I628b3c83a16f7adf0ab8ca60aecde8c073c34fd9
2017-04-07 15:24:10 +05:30
Maneesh Gupta
6877554f5e
Fix build issues in hipCommander sample
...
- Remove -stdlib=libstdc++ from Makefile
- Removed deleted HIP header file fom includes
Change-Id: Ia189396bee19fc52b679259df56c6c6e2bafb6fe
2017-04-07 14:54:03 +05:30
Aditya Atluri
0fab80e515
added module api sample which uses hipHccModuleLaunchKernel
...
Change-Id: I7bce60b4480a3b5ff7ed69c3256078ded65a0945
2017-03-31 14:30:40 -05:00
Aditya Atluri
81d3725a16
added debug support for HIP sample
...
Change-Id: Ia7265234082039b68114f7421f4dbcb7149d4d2b
2017-03-31 14:13:46 -05:00
Aditya Atluri
d4f3fb092a
Fixed bit_extract
...
Change-Id: I92d7b7a302e3fa0db84889fb5dc6b612e6a53c73
2017-03-31 13:35:26 -05:00
Aditya Atluri
1cead6a4cd
added new api hipHccModuleLaunchKernel
...
1. hipHccModuleLaunchKernel is same as hipModuleLaunchKernel with OpenCL workitem model
2. Added copy right
3. Fixed header naming
Change-Id: I6a7c35a3566e2f8d3f5056613e34193775d4b236
2017-03-31 12:11:34 -05:00
Sun, Peng
14f8c7cf74
revert workaround for square sample and update doc on GGL
...
Change-Id: I731c68ca4111e7dc2e45bef51c4cad2c23fc81f8
2017-03-21 10:26:09 -05:00
pensun
ad882222b0
Initial integration with Alex' Generic Grid Launch
...
Change-Id: I559afb80e9e39ec0d119bb3bf3b85ef9e448caf6
2017-03-17 14:59:34 -05:00
Aditya Atluri
9645b0e0dc
Added default module launch api functionality
...
1. As in hipModuleLaunchKernel(..., kernelParams, nullptr); works with this commit
2. Added headers AMDGPUPTNote.h, AMDGPURuntimeMetadata.h to do code object meta data parsing
3. Changed CMake to look at llvm link libraries
4. HIP developer should set env variable LLVM_HOME to remove link errors
5. HIP depends on installed LLVM (not source, not build)
6. Added sample to test out the feature
7. Right now HCC does not support embedding metadata in code object. Use clang opencl
8. Changed HIPCC to read LLVM_HOME env var
9. New argument to CMake should be given -DLLVM_HOME=<where llvm 5.0 is installed>
Change-Id: Iba38194aa872d97cc2c90a8e5ff746c48055c868
2017-03-17 13:11:34 -05:00
Maneesh Gupta
f91583b294
4_shfl and 5_2dshfl samples are unsupported on gfx701
...
Change-Id: I81eb880350f25e89573ba14c62b549c6c43f8c91
2017-03-14 15:56:18 +05:30
Ben Sander
b7acb85fa8
Refactor registered memory calls.
2017-03-11 09:18:27 -06:00
Ben Sander
f23b5a1f90
Add first step to a "registerd" mode in hipBusBandwidth.
2017-03-11 09:18:27 -06:00
Rahul Garg
d48943699d
Fix for HCSWAP-128, make 5_2dshfl cookbook sample only for fiji
...
Change-Id: I8869c28151bca1bd47a053a2808e93a801d16d00
2017-03-10 10:29:52 +05:30
Aditya Atluri
7f4b24886f
make 4_shfl cookbook sample only for fiji
...
1. __shfl is not supported on hawaii gfx701
Change-Id: Iac09f5d30ee0674b8f58a6e74ec5c49b02be32ad
2017-03-09 08:52:50 -06:00
Aditya Atluri
9f575721aa
added new field to hipDeviceProp_t structure gcnArch.
...
1. It is an integer containing gfx values 701, 801, 802, 803
2. On NV path, it is zero
Change-Id: I2b4c7f48981d0214d8c6b1905d2cc85b16203419
2017-03-07 11:24:32 -06:00
Aditya Atluri
56744c0730
removed hipblas samples as it is not yet supported
...
Change-Id: I354b710e652ce0d0413d670530ceb8b70f4993d5
2017-02-17 08:51:02 -06:00
Rahul Garg
73b56e690f
Command scripts for latency measurements
...
Change-Id: I8c28765a09fb0358447367939de524b12699a317
2017-02-07 15:03:46 +05:30
Ben Sander
ca1cef4e06
Doc update - describe debug techniques
...
Also tweak sample to remove unneeded HIP_KERNEL_NAME.
Comment update
2017-01-19 12:40:45 -06:00
Rahul Garg
aa3f278475
Fixed hipcommander default execution for HCSWAP-106
...
Change-Id: I9fbd10dfaeeb4928b2ec23ceed131b5200a658f9
2017-01-19 15:04:32 +05:30
Ben Sander
ab07d9b4a7
Print limits on CUDA devices
2016-12-16 08:55:11 -06:00
Ben Sander
a30b36afdc
remove TODO file
2016-12-15 14:42:52 -06:00
Sandeep Kumar
8b0ce6e81b
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
2016-12-14 16:27:14 +05:30
Ben Sander
a504df955e
Add more debug info
2016-11-26 08:56:02 -06:00
Sandeep Kumar
1fe6890ba3
fix_format
...
Change-Id: I34e265de434263a11654e5deba044c3f21e86578
2016-11-18 14:34:14 +05:30
Sandeep Kumar
39e1b16d0b
Add p2p for cookbook
...
Change-Id: Id2e77ab31123ef95885d665efe34bc0d4596733a
(cherry picked from commit 6fbd0352713ca36e399b1ed4f17c486207a53875)
2016-11-14 06:10:36 +05:30
Maneesh Gupta
052a580d5b
hcc_dialects/Makefile: use clamp-config
...
Change-Id: I86df82f75b75125825e22d0545209a19386d9936
2016-11-10 11:31:50 +05:30
Ben Sander
0e5cfed3eb
Update gitignore for some common output files
...
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd
2016-11-06 04:26:15 -06:00
Ben Sander
44aee4b61c
Print non-peers too
...
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac
2016-11-04 06:34:07 -05:00
Ben Sander
18dbafe6e8
Add initial hipProfileStart/Stop
...
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.
Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e
2016-10-27 23:09:32 -05:00
Ben Sander
08c08d6f9c
show how to use variety of HIP_PROFILE features
...
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469
2016-10-27 23:09:32 -05:00
Ben Sander
5b7da93e61
Add new hipdemangleatp and snapshot sample update for new functionality
...
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7
2016-10-27 23:09:31 -05:00
Ben Sander
f7c42efa3f
Rename HIP_ATP_MARKER and profiling vars
...
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API
Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34
2016-10-27 23:09:31 -05:00
Sandeep Kumar
d025ed980b
Add more apps to 2_Cookbook
...
Change-Id: Iafe462df9726a32f450bd240a2de3eaa73a10057
2016-10-19 12:58:16 -04:00
Maneesh Gupta
945a09c241
samples: Updated copyright header
...
Change-Id: I821f514ced5e34d492cb167b65d7273e26ed7b84
2016-10-15 23:05:04 +05:30
Ben Sander
4668a08783
Refactor module API test.
...
- Add PASSED/FAIL indication.
- Set args using struct rather than void* array.
Change-Id: Ic924f88c49cc46979b12b7fef8650081e3b5f58c
2016-10-14 23:19:25 -05:00
Aditya Atluri
d8c0954e13
indent correction for square.cu
...
Change-Id: I2ca008e260b920ac3a503ad2a4bb28cd32300c98
2016-10-12 19:26:59 -05:00
Aditya Atluri
cb734a5b1b
added copyright to module sample kernel file
...
Change-Id: If57e0761df63c902e1677084ff85106ec49df5de
2016-10-12 19:23:48 -05:00
Maneesh Gupta
eb1fce0d77
clang-hipify -> hipify-clang in documentation
...
Change-Id: I86ebc8112477db0d3e09f240beb3f9222d909ee6
2016-10-05 23:07:14 +05:30
Maneesh Gupta
ead394ec09
samples: Updated to use new hip include path
...
Change-Id: I53a1385a17f13a997ea21d14315f15a3ad851dab
2016-10-04 22:21:10 +05:30
Ben Sander
e413869271
small typo fix
...
Change-Id: I01906b330be8e6ec149bcdfe82def73e15931c89
2016-10-02 05:57:02 -05:00