Commit Graph

6428 commits

Author SHA1 Message Date
pensun 24c621db5b Add documentation on threadfence_system workaround guidelines.
Change-Id: I9636a3808798f3dabe992285ce5652187cee6eb8
2016-11-09 21:36:30 -06:00
pensun f7e9f12bf1 Add option to alloc fine-grained system memory
Change-Id: Ia13c8e058cb988b5857e75a590a4d67411362ae1
2016-11-09 21:36:30 -06:00
Maneesh Gupta 72c722c3d6 Merge branch 'rocm-rel-1.3' into amd-develop
Conflicts:
	include/hip/nvcc_detail/hip_runtime_api.h

Change-Id: I990a7d008da9e8dcc68250cebbc8ee6e723c7e01
2016-11-10 08:56:38 +05:30
pensun 57cd3c8244 fix hipProfiler* apis on NV path
Change-Id: I6adca6151fef3a9b35348163eb6bd13f5c414172
2016-11-09 15:44:01 -06:00
pensun 76c3c20da6 fix for hipcallback function on NV path
Change-Id: If80c0cfe60b1f3b1a71627b5f3f79503cba4d491
2016-11-09 11:33:23 -06:00
Maneesh Gupta 7c68768a0d Update release notes for 1.0 release
Change-Id: I74fa2b41afc334a76c309b125c27aa141cd59554
2016-11-08 16:31:56 +05:30
Ben Sander 7106dc6796 Fix tests to read warpSize from device props.
Change-Id: I9583577793afad49f9eb1ee9069bd4c6963a6023
2016-11-06 04:26:28 -06:00
Ben Sander c59fe09a81 Update gitignore for some common output files
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd
2016-11-06 04:26:15 -06:00
Ben Sander d728819d17 Improve Peer support and testing.
Change-Id: Icadc65988aaf145a265587ab0357c5bf4d26f3eb
2016-11-06 03:22:36 -06:00
Ben Sander 092b3dacda Set forceHostCopyEngine for other copy dirs. Support HIP_FORCE_P2P_HOST
Also: more debug for copy and P2p.

Change-Id: I87030c525410e041b2a00baaf6c68e6c0977ff42
2016-11-04 19:53:23 -05:00
Ben Sander ff3298f40e Expand hipP2PSimple testing.
Cover cases where P2P is used for H2D copies, where host is pinned
but not accessible to the copy agent.

Change-Id: I9464b787228b40f93473708c3fde9726e1986365
2016-11-04 16:13:32 -05:00
Ben Sander 5d79384832 Refactor resolve-mem step1
Change-Id: I7b8b2bbb56d7b31a97b48ebd42002641cd07a460
2016-11-04 09:37:56 -05:00
Ben Sander 3f0a2b8dc1 Add debug for Peer APIs. Enable PeerMemcpy APIs by default.
Change-Id: I46e39a9e7b07686a78484c1f3b5495b08e052fbb
2016-11-04 08:51:16 -05:00
Ben Sander 5832349e5e Print non-peers too
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac
2016-11-04 06:34:07 -05:00
Ben Sander f0bb817c11 Pre-pend HIP_PATH/lib to linker, so we find developer object code
Previously might pick up libs from /opt/rocm/lib.

Change-Id: Ia7adb345defe433d5952aa61706fe03fd7cbcd35
2016-11-04 06:06:04 -05:00
pensun 774de273d0 Update document for workaround suggestion on threadfence_system()
Change-Id: Icccab8270604a0e578a8614b9afb3f95372f4966
2016-11-02 16:08:27 -05:00
pensun 4817131cdc Update hipStreamNonBlocking to use cuda define on NV path
Change-Id: I74ea09db99d602ba1c5f192b36ff7f2781176e6a
2016-11-01 20:30:56 -05:00
Aditya Atluri f48c53534e added inter thread data movement intrinsics
Change-Id: I2a8a8ed49429cb7f96439bd28c4b83b5142737df
2016-11-01 16:37:33 -05:00
Rahul Garg 2d15d0741c Added hipDeviceGetByPCIBusId in hip/hcc path
Change-Id: I3cca0dc533d0281689d8a407c7da16ca1ba6a3a8
2016-11-01 10:57:48 +05:30
Evgeny Mankov 1b64c167f6 [HIPIFY] wrap kernel name with HIP_KERNEL_NAME macros...
only in case of commas in it.
2016-10-28 20:05:51 +03:00
Evgeny Mankov b0f37d0d2e * [HIPIFY] Initial Profiler support.
CUDA Driver API porting to HIP:

+ cuProfilerStart, cuProfilerStop.
- cuProfilerInitialize & cudaProfilerInitialize - unsupported yet by HIP.
2016-10-28 18:32:13 +03:00
Ben Sander 87a2e8f12b add hip_profile.h
Change-Id: Id43a4336db53567020584cb7842baf5c1649fd8e
2016-10-28 07:08:46 -05:00
Maneesh Gupta c39ddad215 hipdemangleatp: Try handling HC kernels as well
Change-Id: Ie438ddd28e5bc6067fcd682df849d3183046b40a
2016-10-28 15:46:59 +05:30
Maneesh Gupta c5a2ad4964 CMakeLists.txt: Update include paths needed for Fedora support
Change-Id: Ib84f9dba30d2c64f344d6f8e85ddbe15f30af1a0
2016-10-28 14:12:53 +05:30
Maneesh Gupta 6fa9bc3b61 hipcc: Update flags for Fedora support
Change-Id: I90be7768410e491b4f11c3b0f08470246d781d80
2016-10-28 14:12:13 +05:30
Ben Sander 024d9ab090 Print short hipLaunchKernel correctly.
Change-Id: I6ca03d7c707cd03d6982199830213953d5855f17
2016-10-27 23:09:32 -05:00
Ben Sander bb58f4f6fc Add initial hipProfileStart/Stop
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.

Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e
2016-10-27 23:09:32 -05:00
Ben Sander 32b086400e show how to use variety of HIP_PROFILE features
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469
2016-10-27 23:09:32 -05:00
Ben Sander ef8eac9b66 Add two levels of HIP_PROFILE_API (1=short,2=long)
Change-Id: I7ef98589f8731fb879db109fd573c62b489f2b61
2016-10-27 23:09:31 -05:00
Ben Sander e5ef8a2fd7 HIPCC adds paths to profiler include and libs by default.
Users who desire otherwise can set HIP_ATP_MARKER=0.
Also remove old unused hipcc_explicit_lib option.

Change-Id: I2bf07ba880329e7a3b1365dd33a3b2be6794370f
2016-10-27 23:09:31 -05:00
Ben Sander f95482c7c5 Add new hipdemangleatp and snapshot sample update for new functionality
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7
2016-10-27 23:09:31 -05:00
Ben Sander ab1836544a Fix scoped marker so begin/end ATP timestamps correct
Change-Id: Ic944d3fc00d7bc31b756c0e6c327b99eb489537e
2016-10-27 23:09:31 -05:00
Ben Sander e9056798f6 Rename HIP_ATP_MARKER and profiling vars
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API

Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34
2016-10-27 23:09:31 -05:00
Ben Sander f5e8090f2f Allow HIP_DB to be number or string flags (ie HIP_DB=api+mem+sync)
Add callbacks for processing env vars.

Change-Id: I4ddf50e2da56b1dae43f50657bc693b07b23c03d
2016-10-27 23:09:31 -05:00
Ben Sander 710be682ca Add HIP_PROFILE_START_API, HIP_PROFILE_STOP_API
Refactor HIP_INIT_API to call recordApiTrace.

Change-Id: Ieff4b5018236f59e49e1b9841474440a34f821df
2016-10-27 23:09:31 -05:00
Ben Sander 739bc37503 Add per-thread API seqnum to debug
Change-Id: Ib13733a3e84cd56bae13a32bae40f936c20b7543
2016-10-27 23:09:31 -05:00
Ben Sander 354091f357 Don't call allow-access if allocating device's only peer is self.
Change-Id: Iac58e6c3e460675833f10b1e8b2e393de223654d
2016-10-27 23:09:31 -05:00
Ben Sander 346c519ace Improve HIP TID printing in debug mode.
Map long thread-id to a short one that is printed with each message.
Remove clunky string creation code for tid_tr.
Print TID on every message.

Change-Id: I780a91d8ce789cb4957789036b478bf5cde8c4e4
2016-10-27 23:09:31 -05:00
Ben Sander bff88d0f63 Update docs and improve hipcc error message.
Change-Id: I25636d06959d02cc46c8c476d3948e91ff83ea47
2016-10-27 23:09:31 -05:00
pensun 334e9c6f8e Add missing hipStream typedef for NV path
Change-Id: I915cd14a9ff32b55b0121062d7804a7fbbdc3341
2016-10-27 13:34:14 -05:00
pensun 2abf300797 Remove extra semicolons and extra spaces in header on NV path
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b
2016-10-26 10:23:10 +05:30
pensun 8a7dcfce0b Remove extra semicolons and extra spaces in header on NV path
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b
2016-10-25 15:29:52 -05:00
Aditya Atluri e1c1b4c009 reverted change for cache size query
Change-Id: I44a1f43818cd287a2a3b6265f43d183f9bd5b71c
2016-10-25 11:03:35 -05:00
Aditya Atluri 820a914b98 correct cachesize to output correct value
Change-Id: I5db031591eb718b0c12e78a35e4b19349de9526d
2016-10-25 09:33:45 -05:00
Maneesh Gupta db7a46adf6 hip_hcc package: install hip_ir.ll to lib folder
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6
2016-10-25 15:43:56 +05:30
Maneesh Gupta c2b6eee152 hip_hcc package: install hip_ir.ll to lib folder
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6
2016-10-25 15:42:24 +05:30
Maneesh Gupta 528b257004 hipcc: Turn back linking hip_ir.ll by default
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6
2016-10-25 12:27:24 +05:30
Maneesh Gupta f1a4303b04 hipcc: Turn back linking hip_ir.ll by default
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6
2016-10-25 12:26:54 +05:30
Maneesh Gupta b935a876f0 Merge branch 'amd-develop' into amd-master
Change-Id: I4d12b7ad56d743ea52e97a100f93818ee0fd916c
2016-10-24 21:40:25 +05:30
pensun 28aad46674 Modify hipStreamAddCallback test case to consider both NULL stream and stream
Change-Id: I8468c988f0f01c7d3f3176a01469d69bf7a68237
2016-10-23 12:36:02 -05:00