sandeep kumar
db7e460626
Add 2_Cookbook
...
Change-Id: I10bbbd4bcb80a5900fe6af466c8f4c94ea5efe9a
2016-09-30 12:52:06 +05:30
Ben Sander
14ad755612
Small tool, doc, sample enhancements.
...
- Expand message when HIP version mismatch detected.
- Doc touchup.
- change sorting of hipBusBandwidth so byte results shown at top.
-
Change-Id: Ifb4e44a5fdfb65d59c4994b11e5f13385705f7e0
2016-09-26 16:36:01 -05:00
Ben Sander
fdb4ed3c5e
Sample improvements.
...
- Enable -O3 for hipDispatchLatency.
- Use nearly-null kernel to prevent it from being optimized away.
- Formatting for hipDispatchLatency.
- Formatting for hipInfo.
2016-09-22 13:05:47 -05:00
Aditya Atluri
be3fd69bd1
added vimrc for current project
...
1. Added vimrc config file for HIP
2. Corrected square sample indent
Change-Id: I3e1d92403571148fe6825db6ad63ad925ae69519
2016-09-15 11:40:17 -05:00
Maneesh Gupta
2145e2ba61
module_api/Makefile: Update as per newer hipgenisa.sh
...
Change-Id: I479c74eae00d7521434f2740ce5930e326ea05cf
2016-09-06 17:47:10 +05:30
Maneesh Gupta
e2469d5c55
module_api sample: Remove unnecessary platform checks
...
Change-Id: I1d531264d51ff952a3a68d554672b6d293e23379
2016-09-04 21:25:14 +05:30
Rahul Garg
71736d2ed2
Removed NVCC check for hipCtxXXX functions in module_api/runKernel.cpp
...
Change-Id: I2bdd4fadf41063ec60626f1850e16f8307ebe6b5
2016-09-04 20:37:29 +05:30
Maneesh Gupta
b250c5a7b3
module_api: HCC path no longer needs mangled kernel name
...
Change-Id: I4c1cb218bfdd05c9fba57276167e3e4205b93614
2016-09-04 16:26:16 +05:30
Maneesh Gupta
c63944fc08
module_api sample: no longer need EXTERN_C workaround
...
Change-Id: Ida087d832df8e1f3620b38f920ec2853aad641c8
2016-09-04 13:49:43 +05:30
Maneesh Gupta
6618c010b5
module_api: workaround to use vcpy_kernel.cpp for NV path
...
Change-Id: Ib4868bf02c64070e846c19427c39289609909466
2016-09-04 12:35:08 +05:30
pensun
49971e8c9e
For module_api sample, use vcpy_kernel.cu to generate ptx file for NV path.
...
Change-Id: Id0033678834288c4eaa56b12e7d447119be99deb
2016-09-03 21:06:58 -05:00
Aditya Atluri
6ca7a87e0e
corrected offline kernel compilation on hipcc path
...
1. hipgenisa.sh now adds int main(){} during kernel compilation. User does not have to put it there
2. Renamed vcpy_isa.cpp to vcpy_kernel.cpp
3. Removed vcpy_isa.cu as the kernel code should be common for both paths
4. Changed Makefile and runkernel.cpp to work with above changes
Change-Id: I9f8c84706b44bb500bc493a68e959762b55a0142
2016-09-02 13:17:17 -05:00
Ben Sander
512ff8ec8e
Fix double-lock of stream on hipModuleLaunchKernel
...
Change-Id: I4ca164971c25f4eb8fbcca11d6258367bb3d2ab4
2016-09-02 12:47:49 -05:00
Ben Sander
cb539b227c
Fix av::copy in dialects to use capture-by-value
...
Change-Id: Ibce1488a1326f66b92b4d5b351230666b691ed31
2016-09-02 09:46:59 -05:00
Ben Sander
2341e48842
enable hc_am example in hcc_ddialects example
...
Change-Id: Iec2f9eb05f95cb025c157fee8fd284aab844d1a2
2016-09-02 09:46:59 -05:00
Aditya Atluri
ebc17b0d6e
Fixed offline kernel compilation
...
1. Removed vcpy_isa.ptx as it should be generated during make
2. Made argument padding specific to hcc path
3. Renamed --gencodeobject to --genco
4. Changed Makefile to work on both nvcc and hcc path
Change-Id: Ifd053d541085d9ce4fd37bc21b07674786c7163e
2016-09-01 10:39:14 -05:00
Maneesh Gupta
176c74af6a
Fixed module_api/Makefile to set flags based on HIP_PLATFORM
...
Change-Id: I2fa9a556e0c4f25f4963ecef1d25eb922f9af1b9
2016-09-01 15:11:12 +05:30
Maneesh Gupta
52e3d0e799
module_api/Makefile: Use gencodeobject instead of genisa
...
Change-Id: I7e3523810f5603ad727b1fda7ff2d0dc53ec72d7
2016-09-01 12:10:31 +05:30
Aditya Atluri
c1b1086c71
added sample for how-to-use pre-compiled kernels1. Corrected the exit output of kernel compilation by hipcc
...
2. Added sample which loads/run kernel binary during runtime?
Change-Id: I26ccaca1f844fee317592e26c9e654ce548b96a8
2016-08-31 13:56:07 -05:00
Maneesh Gupta
6ad8ac5d95
Rename 2_Advanced to 7_Advanced
...
Change-Id: I51e5fa7f4c1dbf467f2d7182ec69d12d5fe548d0
2016-08-18 12:40:30 +05:30
Maneesh Gupta
339590da90
Add simple hipblas saxpy sample
...
Change-Id: I67ae83e1e5397d5191a3c644aba068f06ff97830
2016-08-12 13:50:22 +05:30
Maneesh Gupta
a685f7dc79
hipDispatchLatency: reduce iterations to 5120
...
Change-Id: I94ae4993ff5058cf15f9487a5a528fc24c1ad5fa
2016-06-13 14:23:51 +05:30
Maneesh Gupta
e89fba7fe1
Fix bit_extract sample
...
Change-Id: I933f932bac26d9a9469d5d069973af166e11cbcd
2016-05-20 01:06:08 -04:00
Maneesh Gupta
f6544a376b
Fix square.cu to use cudaError_t instead of hipError_t
...
Change-Id: If3314910d1c03122741d3e0a45e14a4412c473b3
2016-05-12 10:13:07 +05:30
Maneesh Gupta
07026bfdea
hcc_dialects report PASSED when passed
2016-05-03 14:32:59 +05:30
Maneesh Gupta
fb88bb1c17
bit_extract reports PASSED when passed
2016-05-03 14:19:25 +05:30
Maneesh Gupta
307b24b9d5
Fix makefiles in samples
2016-04-18 10:15:35 +05:30
Maneesh Gupta
4092a1efe8
Replace /opt/hcc -> /opt/rocm/hcc and /opt/hsa -> /opt/rocm/hsa
2016-04-15 12:56:31 +05:30
Ben Sander
40e3772d40
Fix HIP_PATH, CHECK macro in samples.
2016-04-13 17:37:39 -05:00
Ben Sander
a296b93281
add hcc dialects sample
2016-04-13 17:32:38 -05:00
Ben Sander
7eba742c66
fix peer query order
2016-04-11 07:58:59 -05:00
Ben Sander
c161c1ba9b
P2p checkpoint.
...
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-ware (self does not qualify
as a peer)
- device peerlist always includes self, so when we call allow_access
we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 07:58:59 -05:00
Ben Sander
565300acb0
Remove stray debug msgs, hipInfo don't print self as peer.
2016-04-11 07:58:58 -05:00
Ben Sander
2fd45f0a6d
Use HIP_PATH if set else use relative ../...
2016-04-11 07:58:58 -05:00
Ben Sander
4fca0a1bdf
Print peers in hipConfig.
...
Also include peer APIs in vim hilighting.
2016-04-11 07:58:58 -05:00
Maneesh Gupta
2433bca2b1
Remove deprecated KERNELBEGIN and KERNELEND from bit_extract sample
2016-04-04 14:47:02 +05:30
streamhsa
155c366e79
change makefile for samples
2016-03-29 16:02:09 +08:00
Aditya Atluri
4992ccfea6
Logging dispatch latency through database util
2016-03-23 11:39:57 -05:00
Ben Sander
c1dd930c92
Only include activity logger if CodeXL installed.
...
Fix hipHostMalloc in hipBusBandwidth.
2016-03-22 09:27:10 -05:00
Ben Sander
3b3bae3772
hipHostRegister and hipHostMalloc refactor.
...
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
- the hipHost* is used for all HIP APIs dealing with Host memory.
(including hipHostMalloc, hipHostFree, hipHostRegister,
hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
- hipMallocHost is consistent with "hipMalloc" for allocating device
memory. Enumerations hipHostMalloc* also used as optional
flags parm to hipHostMalloc.
2016-03-22 02:30:10 -05:00
Ben Sander
221973404f
Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
...
Conflicts:
src/hip_hcc.cpp
2016-03-19 03:22:09 -05:00
Ben Sander
d011d7cf6e
Add beastperiteration and onesize for testing.
...
onesize allows running tests at one specific size.
2016-03-19 02:43:04 -05:00
Ben Sander
f7e2c254df
Improve formatting - line up cols
2016-03-18 23:43:04 -05:00
Ben Sander
62fb06f54e
Print Pinned or Unpinned in result summary
2016-03-18 21:28:29 -05:00
Ben Sander
97493d2098
Supported --aliged mode. Add results check for H2D and D2H.
2016-03-18 03:09:52 -05:00
Aditya Atluri
9800567a53
corrected first and second kernel dispatch
2016-03-15 14:22:00 -05:00
Aditya Atluri
f10d879285
Added single kernel launch to sample
2016-03-15 21:05:15 -05:00
Aditya Atluri
93c30afe2f
added performance metrics for kernel dispatch
2016-03-15 12:37:24 -05:00
Aditya Atluri
e376b1baec
v2 deprecating hipMallocHost with hipHostAlloc
2016-03-15 13:39:15 -05:00
Ben Sander
6b34ae4797
print device config info
2016-03-14 23:02:49 -05:00