Rahul Garg
55df1b6ff2
Command scripts for latency measurements
...
Change-Id: I8c28765a09fb0358447367939de524b12699a317
2017-02-07 15:03:46 +05:30
Ben Sander
1f5d16afe7
Doc update - describe debug techniques
...
Also tweak sample to remove unneeded HIP_KERNEL_NAME.
Comment update
2017-01-19 12:40:45 -06:00
Rahul Garg
cc0d2a6753
Fixed hipcommander default execution for HCSWAP-106
...
Change-Id: I9fbd10dfaeeb4928b2ec23ceed131b5200a658f9
2017-01-19 15:04:32 +05:30
Ben Sander
43635f51dc
Print limits on CUDA devices
2016-12-16 08:55:11 -06:00
Ben Sander
4080fe209d
remove TODO file
2016-12-15 14:42:52 -06:00
Sandeep Kumar
d78649b978
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
2016-12-14 16:27:14 +05:30
Ben Sander
ce92a53f25
Add more debug info
2016-11-26 08:56:02 -06:00
Sandeep Kumar
53e771fc75
fix_format
...
Change-Id: I34e265de434263a11654e5deba044c3f21e86578
2016-11-18 14:34:14 +05:30
Sandeep Kumar
09b157ca8c
Add p2p for cookbook
...
Change-Id: Id2e77ab31123ef95885d665efe34bc0d4596733a
(cherry picked from commit 6fbd0352713ca36e399b1ed4f17c486207a53875)
2016-11-14 06:10:36 +05:30
Maneesh Gupta
36024deb3a
hcc_dialects/Makefile: use clamp-config
...
Change-Id: I86df82f75b75125825e22d0545209a19386d9936
2016-11-10 11:31:50 +05:30
Ben Sander
c59fe09a81
Update gitignore for some common output files
...
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd
2016-11-06 04:26:15 -06:00
Ben Sander
5832349e5e
Print non-peers too
...
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac
2016-11-04 06:34:07 -05:00
Ben Sander
bb58f4f6fc
Add initial hipProfileStart/Stop
...
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.
Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e
2016-10-27 23:09:32 -05:00
Ben Sander
32b086400e
show how to use variety of HIP_PROFILE features
...
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469
2016-10-27 23:09:32 -05:00
Ben Sander
f95482c7c5
Add new hipdemangleatp and snapshot sample update for new functionality
...
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7
2016-10-27 23:09:31 -05:00
Ben Sander
e9056798f6
Rename HIP_ATP_MARKER and profiling vars
...
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API
Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34
2016-10-27 23:09:31 -05:00
Sandeep Kumar
d5c59830f3
Add more apps to 2_Cookbook
...
Change-Id: Iafe462df9726a32f450bd240a2de3eaa73a10057
2016-10-19 12:58:16 -04:00
Maneesh Gupta
501778898f
samples: Updated copyright header
...
Change-Id: I821f514ced5e34d492cb167b65d7273e26ed7b84
2016-10-15 23:05:04 +05:30
Ben Sander
3ae3c39e44
Refactor module API test.
...
- Add PASSED/FAIL indication.
- Set args using struct rather than void* array.
Change-Id: Ic924f88c49cc46979b12b7fef8650081e3b5f58c
2016-10-14 23:19:25 -05:00
Aditya Atluri
1f28d992d3
indent correction for square.cu
...
Change-Id: I2ca008e260b920ac3a503ad2a4bb28cd32300c98
2016-10-12 19:26:59 -05:00
Aditya Atluri
068785ee30
added copyright to module sample kernel file
...
Change-Id: If57e0761df63c902e1677084ff85106ec49df5de
2016-10-12 19:23:48 -05:00
Maneesh Gupta
3b55dcbf6c
clang-hipify -> hipify-clang in documentation
...
Change-Id: I86ebc8112477db0d3e09f240beb3f9222d909ee6
2016-10-05 23:07:14 +05:30
Maneesh Gupta
3a247efb2f
samples: Updated to use new hip include path
...
Change-Id: I53a1385a17f13a997ea21d14315f15a3ad851dab
2016-10-04 22:21:10 +05:30
Ben Sander
848b27ccef
small typo fix
...
Change-Id: I01906b330be8e6ec149bcdfe82def73e15931c89
2016-10-02 05:57:02 -05:00
sandeep kumar
4f6112730b
Add 2_Cookbook
...
Change-Id: I10bbbd4bcb80a5900fe6af466c8f4c94ea5efe9a
2016-09-30 12:52:06 +05:30
Ben Sander
7bc1af2776
Small tool, doc, sample enhancements.
...
- Expand message when HIP version mismatch detected.
- Doc touchup.
- change sorting of hipBusBandwidth so byte results shown at top.
-
Change-Id: Ifb4e44a5fdfb65d59c4994b11e5f13385705f7e0
2016-09-26 16:36:01 -05:00
Ben Sander
1160cefc6d
Sample improvements.
...
- Enable -O3 for hipDispatchLatency.
- Use nearly-null kernel to prevent it from being optimized away.
- Formatting for hipDispatchLatency.
- Formatting for hipInfo.
2016-09-22 13:05:47 -05:00
Aditya Atluri
4911876371
added vimrc for current project
...
1. Added vimrc config file for HIP
2. Corrected square sample indent
Change-Id: I3e1d92403571148fe6825db6ad63ad925ae69519
2016-09-15 11:40:17 -05:00
Maneesh Gupta
9b93847628
module_api/Makefile: Update as per newer hipgenisa.sh
...
Change-Id: I479c74eae00d7521434f2740ce5930e326ea05cf
2016-09-06 17:47:10 +05:30
Maneesh Gupta
0cc51f2a72
module_api sample: Remove unnecessary platform checks
...
Change-Id: I1d531264d51ff952a3a68d554672b6d293e23379
2016-09-04 21:25:14 +05:30
Rahul Garg
7c132821a8
Removed NVCC check for hipCtxXXX functions in module_api/runKernel.cpp
...
Change-Id: I2bdd4fadf41063ec60626f1850e16f8307ebe6b5
2016-09-04 20:37:29 +05:30
Maneesh Gupta
7020584cdf
module_api: HCC path no longer needs mangled kernel name
...
Change-Id: I4c1cb218bfdd05c9fba57276167e3e4205b93614
2016-09-04 16:26:16 +05:30
Maneesh Gupta
ba8a05a3c0
module_api sample: no longer need EXTERN_C workaround
...
Change-Id: Ida087d832df8e1f3620b38f920ec2853aad641c8
2016-09-04 13:49:43 +05:30
Maneesh Gupta
07badd6f4e
module_api: workaround to use vcpy_kernel.cpp for NV path
...
Change-Id: Ib4868bf02c64070e846c19427c39289609909466
2016-09-04 12:35:08 +05:30
pensun
e08f4513c1
For module_api sample, use vcpy_kernel.cu to generate ptx file for NV path.
...
Change-Id: Id0033678834288c4eaa56b12e7d447119be99deb
2016-09-03 21:06:58 -05:00
Aditya Atluri
4f8c45d05a
corrected offline kernel compilation on hipcc path
...
1. hipgenisa.sh now adds int main(){} during kernel compilation. User does not have to put it there
2. Renamed vcpy_isa.cpp to vcpy_kernel.cpp
3. Removed vcpy_isa.cu as the kernel code should be common for both paths
4. Changed Makefile and runkernel.cpp to work with above changes
Change-Id: I9f8c84706b44bb500bc493a68e959762b55a0142
2016-09-02 13:17:17 -05:00
Ben Sander
cdba60a566
Fix double-lock of stream on hipModuleLaunchKernel
...
Change-Id: I4ca164971c25f4eb8fbcca11d6258367bb3d2ab4
2016-09-02 12:47:49 -05:00
Ben Sander
fb7046160f
Fix av::copy in dialects to use capture-by-value
...
Change-Id: Ibce1488a1326f66b92b4d5b351230666b691ed31
2016-09-02 09:46:59 -05:00
Ben Sander
cd3a0a2d61
enable hc_am example in hcc_ddialects example
...
Change-Id: Iec2f9eb05f95cb025c157fee8fd284aab844d1a2
2016-09-02 09:46:59 -05:00
Aditya Atluri
cc8ce4d58c
Fixed offline kernel compilation
...
1. Removed vcpy_isa.ptx as it should be generated during make
2. Made argument padding specific to hcc path
3. Renamed --gencodeobject to --genco
4. Changed Makefile to work on both nvcc and hcc path
Change-Id: Ifd053d541085d9ce4fd37bc21b07674786c7163e
2016-09-01 10:39:14 -05:00
Maneesh Gupta
811df250e6
Fixed module_api/Makefile to set flags based on HIP_PLATFORM
...
Change-Id: I2fa9a556e0c4f25f4963ecef1d25eb922f9af1b9
2016-09-01 15:11:12 +05:30
Maneesh Gupta
13d1721431
module_api/Makefile: Use gencodeobject instead of genisa
...
Change-Id: I7e3523810f5603ad727b1fda7ff2d0dc53ec72d7
2016-09-01 12:10:31 +05:30
Aditya Atluri
d5a6e22c59
added sample for how-to-use pre-compiled kernels1. Corrected the exit output of kernel compilation by hipcc
...
2. Added sample which loads/run kernel binary during runtime?
Change-Id: I26ccaca1f844fee317592e26c9e654ce548b96a8
2016-08-31 13:56:07 -05:00
Maneesh Gupta
4803ff12f1
Rename 2_Advanced to 7_Advanced
...
Change-Id: I51e5fa7f4c1dbf467f2d7182ec69d12d5fe548d0
2016-08-18 12:40:30 +05:30
Maneesh Gupta
d192976b00
Add simple hipblas saxpy sample
...
Change-Id: I67ae83e1e5397d5191a3c644aba068f06ff97830
2016-08-12 13:50:22 +05:30
Maneesh Gupta
506d4086a9
hipDispatchLatency: reduce iterations to 5120
...
Change-Id: I94ae4993ff5058cf15f9487a5a528fc24c1ad5fa
2016-06-13 14:23:51 +05:30
Maneesh Gupta
bd31d333e6
Fix bit_extract sample
...
Change-Id: I933f932bac26d9a9469d5d069973af166e11cbcd
2016-05-20 01:06:08 -04:00
Maneesh Gupta
3f83673b04
Fix square.cu to use cudaError_t instead of hipError_t
...
Change-Id: If3314910d1c03122741d3e0a45e14a4412c473b3
2016-05-12 10:13:07 +05:30
Maneesh Gupta
6181988232
hcc_dialects report PASSED when passed
2016-05-03 14:32:59 +05:30
Maneesh Gupta
cb6a5d9421
bit_extract reports PASSED when passed
2016-05-03 14:19:25 +05:30