Граф коммитов

1179 Коммитов

Автор SHA1 Сообщение Дата
pensun daf19a2dbb resolve conflicts for git pull
Change-Id: Ie353b831e2241bc28042069b6cc7405257e871e1


[ROCm/clr commit: bbb619c732]
2016-11-09 21:38:43 -06:00
pensun ce1b4bdc06 Add documentation on threadfence_system workaround guidelines.
Change-Id: I9636a3808798f3dabe992285ce5652187cee6eb8


[ROCm/clr commit: 94dfff9db2]
2016-11-09 21:36:30 -06:00
pensun 61635b585f Add option to alloc fingrained system memory
Change-Id: Ia13c8e058cb988b5857e75a590a4d67411362ae1


[ROCm/clr commit: 23de0e1b50]
2016-11-09 21:36:30 -06:00
Maneesh Gupta 68b4d20b26 Merge branch 'rocm-rel-1.3' into amd-develop
Conflicts:
	include/hip/nvcc_detail/hip_runtime_api.h

Change-Id: I990a7d008da9e8dcc68250cebbc8ee6e723c7e01


[ROCm/clr commit: e3b5eef7c9]
2016-11-10 08:56:38 +05:30
pensun 85fa855e18 fix hipProfiler* apis on NV path
Change-Id: I6adca6151fef3a9b35348163eb6bd13f5c414172


[ROCm/clr commit: 4a8a6a4697]
2016-11-09 15:44:01 -06:00
pensun 12a5923a2b fix for hipcallback function on NV path
Change-Id: If80c0cfe60b1f3b1a71627b5f3f79503cba4d491


[ROCm/clr commit: e5277ab4b6]
2016-11-09 11:33:23 -06:00
Maneesh Gupta ded929878c Update release notes for 1.0 release
Change-Id: I74fa2b41afc334a76c309b125c27aa141cd59554


[ROCm/clr commit: 01a38c82a6]
2016-11-08 16:31:56 +05:30
Ben Sander 3d4a76d560 Fix tests to read warpSize from device props.
Change-Id: I9583577793afad49f9eb1ee9069bd4c6963a6023


[ROCm/clr commit: a13ec441bf]
2016-11-06 04:26:28 -06:00
Ben Sander 445f888d97 Update gitignore for some common output files
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd


[ROCm/clr commit: 0e5cfed3eb]
2016-11-06 04:26:15 -06:00
Ben Sander 03fcf556e9 Improve Peer support and testing.
Change-Id: Icadc65988aaf145a265587ab0357c5bf4d26f3eb


[ROCm/clr commit: f3d38c2615]
2016-11-06 03:22:36 -06:00
Ben Sander 0af2722827 Set forceHostCopyEngine for other copy dirs. Support HIP_FORCE_P2P_HOST
Also: more debug for copy and P2p.

Change-Id: I87030c525410e041b2a00baaf6c68e6c0977ff42


[ROCm/clr commit: 06ecfa3975]
2016-11-04 19:53:23 -05:00
Ben Sander 2bf51afaa1 Expand hipP2PSimple testing.
Cover cases where P2P is used for H2D copies, where host is pinned
but not accessible to the copy agent.

Change-Id: I9464b787228b40f93473708c3fde9726e1986365


[ROCm/clr commit: 60a8a5405d]
2016-11-04 16:13:32 -05:00
Ben Sander 06b9391974 Refactor resolve-mem step1
Change-Id: I7b8b2bbb56d7b31a97b48ebd42002641cd07a460


[ROCm/clr commit: 926e63c655]
2016-11-04 09:37:56 -05:00
Ben Sander 74c9c6e591 Add debug for Peer APIs. Enable PeerMemcpy APIs by default.
Change-Id: I46e39a9e7b07686a78484c1f3b5495b08e052fbb


[ROCm/clr commit: 00276d141e]
2016-11-04 08:51:16 -05:00
Ben Sander 43723d77cc Print non-peers too
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac


[ROCm/clr commit: 44aee4b61c]
2016-11-04 06:34:07 -05:00
Ben Sander 97d9a5722e Pre-pend HIP_PATH/lib to linker, so we find developer object code
Previously might pick up libs from /opt/rocm/lib.

Change-Id: Ia7adb345defe433d5952aa61706fe03fd7cbcd35


[ROCm/clr commit: d1db786910]
2016-11-04 06:06:04 -05:00
pensun 8911a02b17 Update document for workaround suggestion on threadfence_system()
Change-Id: Icccab8270604a0e578a8614b9afb3f95372f4966


[ROCm/clr commit: 212fa7033c]
2016-11-02 16:08:27 -05:00
pensun 00ce529177 Update hipStreamNonBlocking to use cuda define on NV path
Change-Id: I74ea09db99d602ba1c5f192b36ff7f2781176e6a


[ROCm/clr commit: 9f86e47800]
2016-11-01 20:30:56 -05:00
Aditya Atluri 2d299543bf added inter thread data movement intrinsics
Change-Id: I2a8a8ed49429cb7f96439bd28c4b83b5142737df


[ROCm/clr commit: f097b6ef81]
2016-11-01 16:37:33 -05:00
Rahul Garg 5040e8bcc3 Added hipDeviceGetByPCIBusId in hip/hcc path
Change-Id: I3cca0dc533d0281689d8a407c7da16ca1ba6a3a8


[ROCm/clr commit: 81c91f5b0b]
2016-11-01 10:57:48 +05:30
Evgeny Mankov 6b06d071b9 [HIPIFY] wrap kernel name with HIP_KERNEL_NAME macros...
only in case of commas in it.


[ROCm/clr commit: e1812a1319]
2016-10-28 20:05:51 +03:00
Evgeny Mankov 85b20ca376 * [HIPIFY] Initial Profiler support.
CUDA Driver API porting to HIP:

+ cuProfilerStart, cuProfilerStop.
- cuProfilerInitialize & cudaProfilerInitialize - unsupported yet by HIP.


[ROCm/clr commit: 3101c26d14]
2016-10-28 18:32:13 +03:00
Ben Sander f31e602346 add hip_profile.h
Change-Id: Id43a4336db53567020584cb7842baf5c1649fd8e


[ROCm/clr commit: 9edaf0e3f7]
2016-10-28 07:08:46 -05:00
Maneesh Gupta 63ffd01391 hipdemangleatp: Try handling HC kernels as well
Change-Id: Ie438ddd28e5bc6067fcd682df849d3183046b40a


[ROCm/clr commit: c26f5d7d5a]
2016-10-28 15:46:59 +05:30
Maneesh Gupta 04eb05f1a0 CMakeLists.txt: Update include paths needed for Fedora support
Change-Id: Ib84f9dba30d2c64f344d6f8e85ddbe15f30af1a0


[ROCm/clr commit: 1f08f2adaf]
2016-10-28 14:12:53 +05:30
Maneesh Gupta 6872cb2ceb hipcc: Update flags for Fedora support
Change-Id: I90be7768410e491b4f11c3b0f08470246d781d80


[ROCm/clr commit: 0d8aa10473]
2016-10-28 14:12:13 +05:30
Ben Sander c8aad6ee8e Print short hipLaunchKernel correctly.
Change-Id: I6ca03d7c707cd03d6982199830213953d5855f17


[ROCm/clr commit: 3d0fa30183]
2016-10-27 23:09:32 -05:00
Ben Sander 4378a14789 Add initial hipProfileStart/Stop
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.

Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e


[ROCm/clr commit: 18dbafe6e8]
2016-10-27 23:09:32 -05:00
Ben Sander 04b77f0688 show how to use variety of HIP_PROFILE features
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469


[ROCm/clr commit: 08c08d6f9c]
2016-10-27 23:09:32 -05:00
Ben Sander 6a06295e31 Add two levels of HIP_PROFILE_API (1=short,2=long)
Change-Id: I7ef98589f8731fb879db109fd573c62b489f2b61


[ROCm/clr commit: 1d5034f583]
2016-10-27 23:09:31 -05:00
Ben Sander 61c99f912d HIPCC adds paths to profiler include and libs by default.
Users who desire otherwise can set HIP_ATP_MARKER=0.
Also remove old unused hipcc_explicit_lib option.

Change-Id: I2bf07ba880329e7a3b1365dd33a3b2be6794370f


[ROCm/clr commit: e57baf8d95]
2016-10-27 23:09:31 -05:00
Ben Sander 9b2a6652c4 Add new hipdemangleatp and snapshot sample update for new functionality
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7


[ROCm/clr commit: 5b7da93e61]
2016-10-27 23:09:31 -05:00
Ben Sander 44127ad800 Fix scoped marker so begin/end ATP timestamps correct
Change-Id: Ic944d3fc00d7bc31b756c0e6c327b99eb489537e


[ROCm/clr commit: 90adf98a7e]
2016-10-27 23:09:31 -05:00
Ben Sander e0a3b2472c Rename HIP_ATP_MARKER and profiling vars
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API

Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34


[ROCm/clr commit: f7c42efa3f]
2016-10-27 23:09:31 -05:00
Ben Sander fd2e0db1f8 Allow HIP_DB to be number or string flags (ie HIP_DB=api+mem+sync)
Add callbacks for processing env vars.

Change-Id: I4ddf50e2da56b1dae43f50657bc693b07b23c03d


[ROCm/clr commit: caf446fe53]
2016-10-27 23:09:31 -05:00
Ben Sander 22ae54b4cc Add HIP_PROFILE_START_API, HIP_PROFILE_STOP_API
Refactor HIP_INIT_API to call recordApiTrace.

Change-Id: Ieff4b5018236f59e49e1b9841474440a34f821df


[ROCm/clr commit: 3132c36471]
2016-10-27 23:09:31 -05:00
Ben Sander 147cd45ebf Add per-thread API seqnum to debug
Change-Id: Ib13733a3e84cd56bae13a32bae40f936c20b7543


[ROCm/clr commit: 45e2dd4d69]
2016-10-27 23:09:31 -05:00
Ben Sander 5e20bb6370 Don't call allow-access if allocating device's only peer is self.
Change-Id: Iac58e6c3e460675833f10b1e8b2e393de223654d


[ROCm/clr commit: f8b356dce8]
2016-10-27 23:09:31 -05:00
Ben Sander 4029bc31e6 Improve HIP TID printing in debug mode.
Map long thread-id to a short one that is printed with each message.
Remove clunky stirng creation code for tid_tr.
Print TID on every message.

Change-Id: I780a91d8ce789cb4957789036b478bf5cde8c4e4


[ROCm/clr commit: 7d69e858a5]
2016-10-27 23:09:31 -05:00
Ben Sander 3b4fc56e30 Update docs and improve hipcc error message.
Change-Id: I25636d06959d02cc46c8c476d3948e91ff83ea47


[ROCm/clr commit: 03caa1de8a]
2016-10-27 23:09:31 -05:00
pensun 33b6db2322 Add missing hipStream typedef for NV path
Change-Id: I915cd14a9ff32b55b0121062d7804a7fbbdc3341


[ROCm/clr commit: dcdb7f6c68]
2016-10-27 13:34:14 -05:00
pensun 6818eaf657 Remove extra semicolons and extra spaces in header on NV path
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b


[ROCm/clr commit: 0dbc6a1e93]
2016-10-26 10:23:10 +05:30
pensun 6f4ae9b9c4 Remove extra semicolons and extra spaces in header on NV path
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b


[ROCm/clr commit: ee126928b1]
2016-10-25 15:29:52 -05:00
Aditya Atluri c58f5af811 reverted change for cache size query
Change-Id: I44a1f43818cd287a2a3b6265f43d183f9bd5b71c


[ROCm/clr commit: 8145e1b129]
2016-10-25 11:03:35 -05:00
Aditya Atluri 71ac51b965 correct cachesize to output correct value
Change-Id: I5db031591eb718b0c12e78a35e4b19349de9526d


[ROCm/clr commit: e28f65604b]
2016-10-25 09:33:45 -05:00
Maneesh Gupta 16b2e9abc5 hip_hcc package: install hip_ir.ll to lib folder
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6


[ROCm/clr commit: 0f0a77ae9f]
2016-10-25 15:43:56 +05:30
Maneesh Gupta 715afe3249 hip_hcc package: install hip_ir.ll to lib folder
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6


[ROCm/clr commit: 5b4127abda]
2016-10-25 15:42:24 +05:30
Maneesh Gupta 5a48591fc5 hipcc: Turn back linking hip_ir.ll by default
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6


[ROCm/clr commit: 72a5deb91e]
2016-10-25 12:27:24 +05:30
Maneesh Gupta 7baad0e7c1 hipcc: Turn back linking hip_ir.ll by default
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6


[ROCm/clr commit: 2d9ff6acd6]
2016-10-25 12:26:54 +05:30
Maneesh Gupta ac6b6474ea Merge branch 'amd-develop' into amd-master
Change-Id: I4d12b7ad56d743ea52e97a100f93818ee0fd916c


[ROCm/clr commit: a0ceaef5d4]
2016-10-24 21:40:25 +05:30