51 Commits

Author SHA1 Message Date
Maneesh Gupta 506d4086a9 hipDispatchLatency: reduce iterations to 5120
Change-Id: I94ae4993ff5058cf15f9487a5a528fc24c1ad5fa
2016-06-13 14:23:51 +05:30
Maneesh Gupta bd31d333e6 Fix bit_extract sample
Change-Id: I933f932bac26d9a9469d5d069973af166e11cbcd
2016-05-20 01:06:08 -04:00
Maneesh Gupta 3f83673b04 Fix square.cu to use cudaError_t instead of hipError_t
Change-Id: If3314910d1c03122741d3e0a45e14a4412c473b3
2016-05-12 10:13:07 +05:30
Maneesh Gupta 6181988232 hcc_dialects reports PASSED when passed 2016-05-03 14:32:59 +05:30
Maneesh Gupta cb6a5d9421 bit_extract reports PASSED when passed 2016-05-03 14:19:25 +05:30
Maneesh Gupta bcaefb81fc Fix makefiles in samples 2016-04-18 10:15:35 +05:30
Maneesh Gupta 5a31bad821 Replace /opt/hcc -> /opt/rocm/hcc and /opt/hsa -> /opt/rocm/hsa 2016-04-15 12:56:31 +05:30
Ben Sander 8bbe32a708 Fix HIP_PATH, CHECK macro in samples. 2016-04-13 17:37:39 -05:00
Ben Sander 624b2f35ff add hcc dialects sample 2016-04-13 17:32:38 -05:00
Ben Sander e4d1863ce8 fix peer query order 2016-04-11 07:58:59 -05:00
Ben Sander 83f0de7806 P2p checkpoint.
- set USE_PEER_TO_PEER=3 (requires HCC "am_memtracker_update_peers")
- when enabling peer, turn it on for previously allocated memory.
- hipDeviceCanAccessPeer is no longer self-aware (self does not qualify
  as a peer)
- device peerlist always includes self, so when we call allow_access
  we never remove self access.
- hipDeviceReset() removes old peer mappings.
2016-04-11 07:58:59 -05:00
Ben Sander d89539d40f Remove stray debug msgs, hipInfo doesn't print self as peer. 2016-04-11 07:58:58 -05:00
Ben Sander 40e72dcd4a Use HIP_PATH if set else use relative ../... 2016-04-11 07:58:58 -05:00
Ben Sander 0ac41ad143 Print peers in hipConfig.
Also include peer APIs in vim highlighting.
2016-04-11 07:58:58 -05:00
Maneesh Gupta 70f8236ac5 Remove deprecated KERNELBEGIN and KERNELEND from bit_extract sample 2016-04-04 14:47:02 +05:30
streamhsa d0f0bf5c8e change makefile for samples 2016-03-29 16:02:09 +08:00
Aditya Atluri 78407ea40a Logging dispatch latency through database util 2016-03-23 11:39:57 -05:00
Ben Sander 3a5f964c4f Only include activity logger if CodeXL installed.
Fix hipHostMalloc in hipBusBandwidth.
2016-03-22 09:27:10 -05:00
Ben Sander ab910efb96 hipHostRegister and hipHostMalloc refactor.
Note hipHostMalloc (not hipHostAlloc or hipMallocHost).
- The hipHost* prefix is used for all HIP APIs dealing with host memory
  (including hipHostMalloc, hipHostFree, hipHostRegister,
  hipHostUnregister, hipHostGetFlags, hipHostGetDevicePointer).
- hipMallocHost is consistent with "hipMalloc" for allocating device
  memory. Enumerations hipHostMalloc* are also used as the optional
  flags parameter to hipHostMalloc.
2016-03-22 02:30:10 -05:00
Ben Sander 1de63bfeea Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
Conflicts:
	src/hip_hcc.cpp
2016-03-19 03:22:09 -05:00
Ben Sander 7ff5b16d2a Add beastperiteration and onesize for testing.
onesize allows running tests at one specific size.
2016-03-19 02:43:04 -05:00
Ben Sander 85fce5f21e Improve formatting - line up cols 2016-03-18 23:43:04 -05:00
Ben Sander c2102847a4 Print Pinned or Unpinned in result summary 2016-03-18 21:28:29 -05:00
Ben Sander 618556eaf9 Support --aligned mode. Add results check for H2D and D2H. 2016-03-18 03:09:52 -05:00
Aditya Atluri e23bd0a23e corrected first and second kernel dispatch 2016-03-15 14:22:00 -05:00
Aditya Atluri 862817626b Added single kernel launch to sample 2016-03-15 21:05:15 -05:00
Aditya Atluri 31d8f60e56 added performance metrics for kernel dispatch 2016-03-15 12:37:24 -05:00
Aditya Atluri 58fa0524b6 v2 deprecating hipMallocHost with hipHostAlloc 2016-03-15 13:39:15 -05:00
Ben Sander 70c5f5e3f5 print device config info 2016-03-14 23:02:49 -05:00
Ben Sander e1617b9604 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging
Conflicts:
	src/hip_hcc.cpp
	tests/src/CMakeLists.txt
2016-03-14 15:01:26 -05:00
Ben Sander 5606bee076 Add Bidir copy test and help. 2016-03-14 14:39:23 -05:00
Ben Sander ac6ed35ba0 refactor, add support for speccing xfers in bytes 2016-03-13 09:41:06 -05:00
Aditya Atluri d3ba2b9782 corrected hipDeviceGetProperties to hipGetDeviceProperties - not docs 2016-03-06 08:31:04 -06:00
Ben Sander ff66ef0779 fixes for titan platform 2016-02-26 05:25:30 -06:00
Ben Sander c300ffe458 Merge branch 'privatestaging' of https://github.com/AMDComputeLibraries/HIP-privatestaging into privatestaging 2016-02-26 06:15:09 -06:00
Ben Sander 4adab7b7ef Merge branch 'memtracker' into privatestaging
Conflicts:
	src/hip_hcc.cpp
2016-02-25 19:38:46 -06:00
Evgeny Mankov 57e212606d Add attribute hipDeviceAttributeIsMultiGpuBoard for obtaining the device property isMultiGpuBoard.
On the HIP path, the property is obtained by calling hsa_iterate_agents and counting the devices of type HSA_DEVICE_TYPE_GPU.

P.S.
On multi-board systems there may be problems detecting which board a GPU is plugged into (not tested).
2016-02-25 23:44:39 +03:00
Evgeny Mankov 833c9e52ad Add #ifdef USE_ROCR_20 guard for ROCR_20 device properties (memoryClockRate, memoryBusWidth)
USE_ROCR_20 is not defined by default.
To add ROCR_20 support, HIP has to be compiled as follows: make CXX_DEFINES+=-DUSE_ROCR_20
2016-02-19 13:27:03 +03:00
Evgeny Mankov 1c19dbb807 Device property memoryBusWidth implementation.
+ Device property memoryBusWidth is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryBusWidth is added to hipDeviceAttribute_t struct.
+ Tests update.
2016-02-18 18:15:01 +03:00
Evgeny Mankov 5ea8543d2e Device property memoryClockRate implementation.
+ Device property memoryClockRate is added to hipDeviceProp_t struct.
+ Device attribute hipDeviceAttributeMemoryClockRate is added to hipDeviceAttribute_t struct.
+ Tests update.
+ Rename hipDevAttrConcurrentKernels to hipDeviceAttributeConcurrentKernels.
2016-02-18 17:25:28 +03:00
Evgeny Mankov 5b05a9fef1 hipInfo sample update with new Device Properties. 2016-02-18 15:08:55 +03:00
Evgeny Mankov 072d649d8d Formatting, no functional changes. 2016-02-15 13:16:05 +03:00
Ben Sander 928996fec7 Enable -O3, style points on array size 2016-02-13 03:17:42 -06:00
Ben Sander c3720c19a8 Result formatting 2016-02-13 01:14:01 -06:00
Ben Sander bcb5953d6e Add D2H test 2016-02-12 22:47:26 -06:00
Ben Sander 559db057d5 Add D2H test 2016-02-12 22:46:34 -06:00
Ben Sander f3fd6476eb Add Bus Bandwidth test, leveraged from SHOC. 2016-02-12 21:30:43 -06:00
Ben Sander 317566c1b6 Update links in docs to GPUOpen and to Doxygen 2016-01-27 00:23:47 -06:00
Aditya Avinash Atluri 1d74e7c05f Update README.md 2016-01-26 10:43:41 -05:00
Aditya Avinash Atluri 2d57a3dd0b Corrected compilation error 2016-01-26 10:40:06 -05:00