Commit Graph

1155 Commitit

Tekijä SHA1 Viesti Päivämäärä
pensun bbd526b7e5 resolve conflicts for git pull
Change-Id: Ie353b831e2241bc28042069b6cc7405257e871e1


[ROCm/hip commit: 4d7ac1e091]
2016-11-09 21:38:43 -06:00
pensun e1d9d27254 Add documentation on threadfence_system workaround guidelines.
Change-Id: I9636a3808798f3dabe992285ce5652187cee6eb8


[ROCm/hip commit: 24c621db5b]
2016-11-09 21:36:30 -06:00
pensun 74b0840cc6 Add option to alloc fingrained system memory
Change-Id: Ia13c8e058cb988b5857e75a590a4d67411362ae1


[ROCm/hip commit: f7e9f12bf1]
2016-11-09 21:36:30 -06:00
Maneesh Gupta bbc3811662 Merge branch 'rocm-rel-1.3' into amd-develop
Conflicts:
	include/hip/nvcc_detail/hip_runtime_api.h

Change-Id: I990a7d008da9e8dcc68250cebbc8ee6e723c7e01


[ROCm/hip commit: 72c722c3d6]
2016-11-10 08:56:38 +05:30
pensun 007513296a fix hipProfiler* apis on NV path
Change-Id: I6adca6151fef3a9b35348163eb6bd13f5c414172


[ROCm/hip commit: 57cd3c8244]
2016-11-09 15:44:01 -06:00
pensun ca0c0b4061 fix for hipcallback function on NV path
Change-Id: If80c0cfe60b1f3b1a71627b5f3f79503cba4d491


[ROCm/hip commit: 76c3c20da6]
2016-11-09 11:33:23 -06:00
Maneesh Gupta 56a686a0b9 Update release notes for 1.0 release
Change-Id: I74fa2b41afc334a76c309b125c27aa141cd59554


[ROCm/hip commit: 7c68768a0d]
2016-11-08 16:31:56 +05:30
Ben Sander 58bd8617e8 Fix tests to read warpSize from device props.
Change-Id: I9583577793afad49f9eb1ee9069bd4c6963a6023


[ROCm/hip commit: 7106dc6796]
2016-11-06 04:26:28 -06:00
Ben Sander b478d8a0b8 Update gitignore for some common output files
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd


[ROCm/hip commit: c59fe09a81]
2016-11-06 04:26:15 -06:00
Ben Sander 56cd9ce82f Improve Peer support and testing.
Change-Id: Icadc65988aaf145a265587ab0357c5bf4d26f3eb


[ROCm/hip commit: d728819d17]
2016-11-06 03:22:36 -06:00
Ben Sander 2a9aa89491 Set forceHostCopyEngine for other copy dirs. Support HIP_FORCE_P2P_HOST
Also: more debug for copy and P2p.

Change-Id: I87030c525410e041b2a00baaf6c68e6c0977ff42


[ROCm/hip commit: 092b3dacda]
2016-11-04 19:53:23 -05:00
Ben Sander 5741930acb Expand hipP2PSimple testing.
Cover cases where P2P is used for H2D copies, where host is pinned
but not accessible to the copy agent.

Change-Id: I9464b787228b40f93473708c3fde9726e1986365


[ROCm/hip commit: ff3298f40e]
2016-11-04 16:13:32 -05:00
Ben Sander fe1d895f07 Refactor resolve-mem step1
Change-Id: I7b8b2bbb56d7b31a97b48ebd42002641cd07a460


[ROCm/hip commit: 5d79384832]
2016-11-04 09:37:56 -05:00
Ben Sander 1e5540e07f Add debug for Peer APIs. Enable PeerMemcpy APIs by default.
Change-Id: I46e39a9e7b07686a78484c1f3b5495b08e052fbb


[ROCm/hip commit: 3f0a2b8dc1]
2016-11-04 08:51:16 -05:00
Ben Sander 411cf6c80a Print non-peers too
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac


[ROCm/hip commit: 5832349e5e]
2016-11-04 06:34:07 -05:00
Ben Sander 6444dddaa7 Pre-pend HIP_PATH/lib to linker, so we find developer object code
Previously might pick up libs from /opt/rocm/lib.

Change-Id: Ia7adb345defe433d5952aa61706fe03fd7cbcd35


[ROCm/hip commit: f0bb817c11]
2016-11-04 06:06:04 -05:00
pensun 9251040786 Update document for workaround suggestion on threadfence_system()
Change-Id: Icccab8270604a0e578a8614b9afb3f95372f4966


[ROCm/hip commit: 774de273d0]
2016-11-02 16:08:27 -05:00
pensun 959880e318 Update hipStreamNonBlocking to use cuda define on NV path
Change-Id: I74ea09db99d602ba1c5f192b36ff7f2781176e6a


[ROCm/hip commit: 4817131cdc]
2016-11-01 20:30:56 -05:00
Aditya Atluri 42130918bb added inter thread data movement intrinsics
Change-Id: I2a8a8ed49429cb7f96439bd28c4b83b5142737df


[ROCm/hip commit: f48c53534e]
2016-11-01 16:37:33 -05:00
Rahul Garg 0aa4468ac3 Added hipDeviceGetByPCIBusId in hip/hcc path
Change-Id: I3cca0dc533d0281689d8a407c7da16ca1ba6a3a8


[ROCm/hip commit: 2d15d0741c]
2016-11-01 10:57:48 +05:30
Evgeny Mankov 0fd5ec74c0 [HIPIFY] wrap kernel name with HIP_KERNEL_NAME macros...
only in case of commas in it.


[ROCm/hip commit: 1b64c167f6]
2016-10-28 20:05:51 +03:00
Evgeny Mankov 4b907cce25 * [HIPIFY] Initial Profiler support.
CUDA Driver API porting to HIP:

+ cuProfilerStart, cuProfilerStop.
- cuProfilerInitialize & cudaProfilerInitialize - unsupported yet by HIP.


[ROCm/hip commit: b0f37d0d2e]
2016-10-28 18:32:13 +03:00
Ben Sander ae8d55ccc1 add hip_profile.h
Change-Id: Id43a4336db53567020584cb7842baf5c1649fd8e


[ROCm/hip commit: 87a2e8f12b]
2016-10-28 07:08:46 -05:00
Maneesh Gupta d1a803e5f2 hipdemangleatp: Try handling HC kernels as well
Change-Id: Ie438ddd28e5bc6067fcd682df849d3183046b40a


[ROCm/hip commit: c39ddad215]
2016-10-28 15:46:59 +05:30
Maneesh Gupta 4009e17202 CMakeLists.txt: Update include paths needed for Fedora support
Change-Id: Ib84f9dba30d2c64f344d6f8e85ddbe15f30af1a0


[ROCm/hip commit: c5a2ad4964]
2016-10-28 14:12:53 +05:30
Maneesh Gupta 6aa40b6f21 hipcc: Update flags for Fedora support
Change-Id: I90be7768410e491b4f11c3b0f08470246d781d80


[ROCm/hip commit: 6fa9bc3b61]
2016-10-28 14:12:13 +05:30
Ben Sander 22b87992cf Print short hipLaunchKernel correctly.
Change-Id: I6ca03d7c707cd03d6982199830213953d5855f17


[ROCm/hip commit: 024d9ab090]
2016-10-27 23:09:32 -05:00
Ben Sander 2cfd770f1b Add initial hipProfileStart/Stop
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.

Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e


[ROCm/hip commit: bb58f4f6fc]
2016-10-27 23:09:32 -05:00
Ben Sander 73d0a04f44 show how to use variety of HIP_PROFILE features
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469


[ROCm/hip commit: 32b086400e]
2016-10-27 23:09:32 -05:00
Ben Sander 9022f04992 Add two levels of HIP_PROFILE_API (1=short,2=long)
Change-Id: I7ef98589f8731fb879db109fd573c62b489f2b61


[ROCm/hip commit: ef8eac9b66]
2016-10-27 23:09:31 -05:00
Ben Sander cd3db22eb6 HIPCC adds paths to profiler include and libs by default.
Users who desire otherwise can set HIP_ATP_MARKER=0.
Also remove old unused hipcc_explicit_lib option.

Change-Id: I2bf07ba880329e7a3b1365dd33a3b2be6794370f


[ROCm/hip commit: e5ef8a2fd7]
2016-10-27 23:09:31 -05:00
Ben Sander ed022118bb Add new hipdemangleatp and snapshot sample update for new functionality
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7


[ROCm/hip commit: f95482c7c5]
2016-10-27 23:09:31 -05:00
Ben Sander 591f91f2da Fix scoped marker so begin/end ATP timestamps correct
Change-Id: Ic944d3fc00d7bc31b756c0e6c327b99eb489537e


[ROCm/hip commit: ab1836544a]
2016-10-27 23:09:31 -05:00
Ben Sander 0845025a21 Rename HIP_ATP_MARKER and profiling vars
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API

Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34


[ROCm/hip commit: e9056798f6]
2016-10-27 23:09:31 -05:00
Ben Sander 6972e6d87a Allow HIP_DB to be number or string flags (ie HIP_DB=api+mem+sync)
Add callbacks for processing env vars.

Change-Id: I4ddf50e2da56b1dae43f50657bc693b07b23c03d


[ROCm/hip commit: f5e8090f2f]
2016-10-27 23:09:31 -05:00
Ben Sander 770ae2611f Add HIP_PROFILE_START_API, HIP_PROFILE_STOP_API
Refactor HIP_INIT_API to call recordApiTrace.

Change-Id: Ieff4b5018236f59e49e1b9841474440a34f821df


[ROCm/hip commit: 710be682ca]
2016-10-27 23:09:31 -05:00
Ben Sander 494239ad61 Add per-thread API seqnum to debug
Change-Id: Ib13733a3e84cd56bae13a32bae40f936c20b7543


[ROCm/hip commit: 739bc37503]
2016-10-27 23:09:31 -05:00
Ben Sander 04691cdff8 Don't call allow-access if allocating device's only peer is self.
Change-Id: Iac58e6c3e460675833f10b1e8b2e393de223654d


[ROCm/hip commit: 354091f357]
2016-10-27 23:09:31 -05:00
Ben Sander a27124c1f6 Improve HIP TID printing in debug mode.
Map long thread-id to a short one that is printed with each message.
Remove clunky stirng creation code for tid_tr.
Print TID on every message.

Change-Id: I780a91d8ce789cb4957789036b478bf5cde8c4e4


[ROCm/hip commit: 346c519ace]
2016-10-27 23:09:31 -05:00
Ben Sander a357a07982 Update docs and improve hipcc error message.
Change-Id: I25636d06959d02cc46c8c476d3948e91ff83ea47


[ROCm/hip commit: bff88d0f63]
2016-10-27 23:09:31 -05:00
pensun d449ce078c Add missing hipStream typedef for NV path
Change-Id: I915cd14a9ff32b55b0121062d7804a7fbbdc3341


[ROCm/hip commit: 334e9c6f8e]
2016-10-27 13:34:14 -05:00
pensun 35e0bdbc22 Remove extra semicolons and extra spaces in header on NV path
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b


[ROCm/hip commit: 2abf300797]
2016-10-26 10:23:10 +05:30
pensun 75ccef39c9 Remove extra semicolons and extra spaces in header on NV path
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b


[ROCm/hip commit: 8a7dcfce0b]
2016-10-25 15:29:52 -05:00
Aditya Atluri 84d64ef382 reverted change for cache size query
Change-Id: I44a1f43818cd287a2a3b6265f43d183f9bd5b71c


[ROCm/hip commit: e1c1b4c009]
2016-10-25 11:03:35 -05:00
Aditya Atluri 8580bba30c correct cachesize to output correct value
Change-Id: I5db031591eb718b0c12e78a35e4b19349de9526d


[ROCm/hip commit: 820a914b98]
2016-10-25 09:33:45 -05:00
Maneesh Gupta f34dc4d90c hip_hcc package: install hip_ir.ll to lib folder
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6


[ROCm/hip commit: db7a46adf6]
2016-10-25 15:43:56 +05:30
Maneesh Gupta 5be53b00c6 hip_hcc package: install hip_ir.ll to lib folder
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6


[ROCm/hip commit: c2b6eee152]
2016-10-25 15:42:24 +05:30
Maneesh Gupta b602b3d168 hipcc: Turn back linking hip_ir.ll by default
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6


[ROCm/hip commit: 528b257004]
2016-10-25 12:27:24 +05:30
Maneesh Gupta 0e36aae349 hipcc: Turn back linking hip_ir.ll by default
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6


[ROCm/hip commit: f1a4303b04]
2016-10-25 12:26:54 +05:30
pensun 5b80757b00 Modify hipStreamAddCallback test case to consider both NULL stream and stream
Change-Id: I8468c988f0f01c7d3f3176a01469d69bf7a68237


[ROCm/hip commit: 28aad46674]
2016-10-23 12:36:02 -05:00