pensun
bbd526b7e5
resolve conflicts for git pull
...
Change-Id: Ie353b831e2241bc28042069b6cc7405257e871e1
[ROCm/hip commit: 4d7ac1e091 ]
2016-11-09 21:38:43 -06:00
pensun
e1d9d27254
Add documentation on threadfence_system workaround guidelines.
...
Change-Id: I9636a3808798f3dabe992285ce5652187cee6eb8
[ROCm/hip commit: 24c621db5b ]
2016-11-09 21:36:30 -06:00
pensun
74b0840cc6
Add option to alloc fingrained system memory
...
Change-Id: Ia13c8e058cb988b5857e75a590a4d67411362ae1
[ROCm/hip commit: f7e9f12bf1 ]
2016-11-09 21:36:30 -06:00
Maneesh Gupta
bbc3811662
Merge branch 'rocm-rel-1.3' into amd-develop
...
Conflicts:
include/hip/nvcc_detail/hip_runtime_api.h
Change-Id: I990a7d008da9e8dcc68250cebbc8ee6e723c7e01
[ROCm/hip commit: 72c722c3d6 ]
2016-11-10 08:56:38 +05:30
pensun
007513296a
fix hipProfiler* apis on NV path
...
Change-Id: I6adca6151fef3a9b35348163eb6bd13f5c414172
[ROCm/hip commit: 57cd3c8244 ]
2016-11-09 15:44:01 -06:00
pensun
ca0c0b4061
fix for hipcallback function on NV path
...
Change-Id: If80c0cfe60b1f3b1a71627b5f3f79503cba4d491
[ROCm/hip commit: 76c3c20da6 ]
2016-11-09 11:33:23 -06:00
Maneesh Gupta
56a686a0b9
Update release notes for 1.0 release
...
Change-Id: I74fa2b41afc334a76c309b125c27aa141cd59554
[ROCm/hip commit: 7c68768a0d ]
2016-11-08 16:31:56 +05:30
Ben Sander
58bd8617e8
Fix tests to read warpSize from device props.
...
Change-Id: I9583577793afad49f9eb1ee9069bd4c6963a6023
[ROCm/hip commit: 7106dc6796 ]
2016-11-06 04:26:28 -06:00
Ben Sander
b478d8a0b8
Update gitignore for some common output files
...
Change-Id: I9cd60f042af4dba07fe0fdbd2ee442936ff8c7bd
[ROCm/hip commit: c59fe09a81 ]
2016-11-06 04:26:15 -06:00
Ben Sander
56cd9ce82f
Improve Peer support and testing.
...
Change-Id: Icadc65988aaf145a265587ab0357c5bf4d26f3eb
[ROCm/hip commit: d728819d17 ]
2016-11-06 03:22:36 -06:00
Ben Sander
2a9aa89491
Set forceHostCopyEngine for other copy dirs. Support HIP_FORCE_P2P_HOST
...
Also: more debug for copy and P2p.
Change-Id: I87030c525410e041b2a00baaf6c68e6c0977ff42
[ROCm/hip commit: 092b3dacda ]
2016-11-04 19:53:23 -05:00
Ben Sander
5741930acb
Expand hipP2PSimple testing.
...
Cover cases where P2P is used for H2D copies, where host is pinned
but not accessible to the copy agent.
Change-Id: I9464b787228b40f93473708c3fde9726e1986365
[ROCm/hip commit: ff3298f40e ]
2016-11-04 16:13:32 -05:00
Ben Sander
fe1d895f07
Refactor resolve-mem step1
...
Change-Id: I7b8b2bbb56d7b31a97b48ebd42002641cd07a460
[ROCm/hip commit: 5d79384832 ]
2016-11-04 09:37:56 -05:00
Ben Sander
1e5540e07f
Add debug for Peer APIs. Enable PeerMemcpy APIs by default.
...
Change-Id: I46e39a9e7b07686a78484c1f3b5495b08e052fbb
[ROCm/hip commit: 3f0a2b8dc1 ]
2016-11-04 08:51:16 -05:00
Ben Sander
411cf6c80a
Print non-peers too
...
Change-Id: I2a6905edcdf144aa732ae3120c17780477f232ac
[ROCm/hip commit: 5832349e5e ]
2016-11-04 06:34:07 -05:00
Ben Sander
6444dddaa7
Pre-pend HIP_PATH/lib to linker, so we find developer object code
...
Previously might pick up libs from /opt/rocm/lib.
Change-Id: Ia7adb345defe433d5952aa61706fe03fd7cbcd35
[ROCm/hip commit: f0bb817c11 ]
2016-11-04 06:06:04 -05:00
pensun
9251040786
Update document for workaround suggestion on threadfence_system()
...
Change-Id: Icccab8270604a0e578a8614b9afb3f95372f4966
[ROCm/hip commit: 774de273d0 ]
2016-11-02 16:08:27 -05:00
pensun
959880e318
Update hipStreamNonBlocking to use cuda define on NV path
...
Change-Id: I74ea09db99d602ba1c5f192b36ff7f2781176e6a
[ROCm/hip commit: 4817131cdc ]
2016-11-01 20:30:56 -05:00
Aditya Atluri
42130918bb
added inter thread data movement intrinsics
...
Change-Id: I2a8a8ed49429cb7f96439bd28c4b83b5142737df
[ROCm/hip commit: f48c53534e ]
2016-11-01 16:37:33 -05:00
Rahul Garg
0aa4468ac3
Added hipDeviceGetByPCIBusId in hip/hcc path
...
Change-Id: I3cca0dc533d0281689d8a407c7da16ca1ba6a3a8
[ROCm/hip commit: 2d15d0741c ]
2016-11-01 10:57:48 +05:30
Evgeny Mankov
0fd5ec74c0
[HIPIFY] wrap kernel name with HIP_KERNEL_NAME macros...
...
only in case of commas in it.
[ROCm/hip commit: 1b64c167f6 ]
2016-10-28 20:05:51 +03:00
Evgeny Mankov
4b907cce25
* [HIPIFY] Initial Profiler support.
...
CUDA Driver API porting to HIP:
+ cuProfilerStart, cuProfilerStop.
- cuProfilerInitialize & cudaProfilerInitialize - unsupported yet by HIP.
[ROCm/hip commit: b0f37d0d2e ]
2016-10-28 18:32:13 +03:00
Ben Sander
ae8d55ccc1
add hip_profile.h
...
Change-Id: Id43a4336db53567020584cb7842baf5c1649fd8e
[ROCm/hip commit: 87a2e8f12b ]
2016-10-28 07:08:46 -05:00
Maneesh Gupta
d1a803e5f2
hipdemangleatp: Try handling HC kernels as well
...
Change-Id: Ie438ddd28e5bc6067fcd682df849d3183046b40a
[ROCm/hip commit: c39ddad215 ]
2016-10-28 15:46:59 +05:30
Maneesh Gupta
4009e17202
CMakeLists.txt: Update include paths needed for Fedora support
...
Change-Id: Ib84f9dba30d2c64f344d6f8e85ddbe15f30af1a0
[ROCm/hip commit: c5a2ad4964 ]
2016-10-28 14:12:53 +05:30
Maneesh Gupta
6aa40b6f21
hipcc: Update flags for Fedora support
...
Change-Id: I90be7768410e491b4f11c3b0f08470246d781d80
[ROCm/hip commit: 6fa9bc3b61 ]
2016-10-28 14:12:13 +05:30
Ben Sander
22b87992cf
Print short hipLaunchKernel correctly.
...
Change-Id: I6ca03d7c707cd03d6982199830213953d5855f17
[ROCm/hip commit: 024d9ab090 ]
2016-10-27 23:09:32 -05:00
Ben Sander
2cfd770f1b
Add initial hipProfileStart/Stop
...
And modify sample to show how to use.
Still needs some work to understand interaction with CXL.
Change-Id: I2579824d2dd7863ea23874d34f0dabb3cb305d3e
[ROCm/hip commit: bb58f4f6fc ]
2016-10-27 23:09:32 -05:00
Ben Sander
73d0a04f44
show how to use variety of HIP_PROFILE features
...
Change-Id: I6edd66ac4c068b64e1dc3787d7f1f69ab3238469
[ROCm/hip commit: 32b086400e ]
2016-10-27 23:09:32 -05:00
Ben Sander
9022f04992
Add two levels of HIP_PROFILE_API (1=short,2=long)
...
Change-Id: I7ef98589f8731fb879db109fd573c62b489f2b61
[ROCm/hip commit: ef8eac9b66 ]
2016-10-27 23:09:31 -05:00
Ben Sander
cd3db22eb6
HIPCC adds paths to profiler include and libs by default.
...
Users who desire otherwise can set HIP_ATP_MARKER=0.
Also remove old unused hipcc_explicit_lib option.
Change-Id: I2bf07ba880329e7a3b1365dd33a3b2be6794370f
[ROCm/hip commit: e5ef8a2fd7 ]
2016-10-27 23:09:31 -05:00
Ben Sander
ed022118bb
Add new hipdemangleatp and snapshot sample update for new functionality
...
Change-Id: Ie19c683b2b0bdfeb0c3fcf89444c2e21b7f606e7
[ROCm/hip commit: f95482c7c5 ]
2016-10-27 23:09:31 -05:00
Ben Sander
591f91f2da
Fix scoped marker so begin/end ATP timestamps correct
...
Change-Id: Ic944d3fc00d7bc31b756c0e6c327b99eb489537e
[ROCm/hip commit: ab1836544a ]
2016-10-27 23:09:31 -05:00
Ben Sander
0845025a21
Rename HIP_ATP_MARKER and profiling vars
...
HIP_PROFILE_API
HIP_DB_START_API
HIP_DB_STOP_API
Change-Id: I6c4da67212ff8217e6356a2622d4c6278a188c34
[ROCm/hip commit: e9056798f6 ]
2016-10-27 23:09:31 -05:00
Ben Sander
6972e6d87a
Allow HIP_DB to be number or string flags (ie HIP_DB=api+mem+sync)
...
Add callbacks for processing env vars.
Change-Id: I4ddf50e2da56b1dae43f50657bc693b07b23c03d
[ROCm/hip commit: f5e8090f2f ]
2016-10-27 23:09:31 -05:00
Ben Sander
770ae2611f
Add HIP_PROFILE_START_API, HIP_PROFILE_STOP_API
...
Refactor HIP_INIT_API to call recordApiTrace.
Change-Id: Ieff4b5018236f59e49e1b9841474440a34f821df
[ROCm/hip commit: 710be682ca ]
2016-10-27 23:09:31 -05:00
Ben Sander
494239ad61
Add per-thread API seqnum to debug
...
Change-Id: Ib13733a3e84cd56bae13a32bae40f936c20b7543
[ROCm/hip commit: 739bc37503 ]
2016-10-27 23:09:31 -05:00
Ben Sander
04691cdff8
Don't call allow-access if allocating device's only peer is self.
...
Change-Id: Iac58e6c3e460675833f10b1e8b2e393de223654d
[ROCm/hip commit: 354091f357 ]
2016-10-27 23:09:31 -05:00
Ben Sander
a27124c1f6
Improve HIP TID printing in debug mode.
...
Map long thread-id to a short one that is printed with each message.
Remove clunky stirng creation code for tid_tr.
Print TID on every message.
Change-Id: I780a91d8ce789cb4957789036b478bf5cde8c4e4
[ROCm/hip commit: 346c519ace ]
2016-10-27 23:09:31 -05:00
Ben Sander
a357a07982
Update docs and improve hipcc error message.
...
Change-Id: I25636d06959d02cc46c8c476d3948e91ff83ea47
[ROCm/hip commit: bff88d0f63 ]
2016-10-27 23:09:31 -05:00
pensun
d449ce078c
Add missing hipStream typedef for NV path
...
Change-Id: I915cd14a9ff32b55b0121062d7804a7fbbdc3341
[ROCm/hip commit: 334e9c6f8e ]
2016-10-27 13:34:14 -05:00
pensun
35e0bdbc22
Remove extra semicolons and extra spaces in header on NV path
...
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b
[ROCm/hip commit: 2abf300797 ]
2016-10-26 10:23:10 +05:30
pensun
75ccef39c9
Remove extra semicolons and extra spaces in header on NV path
...
Change-Id: Ib33aec2451a4e0b298d537dbb1b9df000405871b
[ROCm/hip commit: 8a7dcfce0b ]
2016-10-25 15:29:52 -05:00
Aditya Atluri
84d64ef382
reverted change for cache size query
...
Change-Id: I44a1f43818cd287a2a3b6265f43d183f9bd5b71c
[ROCm/hip commit: e1c1b4c009 ]
2016-10-25 11:03:35 -05:00
Aditya Atluri
8580bba30c
correct cachesize to output correct value
...
Change-Id: I5db031591eb718b0c12e78a35e4b19349de9526d
[ROCm/hip commit: 820a914b98 ]
2016-10-25 09:33:45 -05:00
Maneesh Gupta
f34dc4d90c
hip_hcc package: install hip_ir.ll to lib folder
...
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6
[ROCm/hip commit: db7a46adf6 ]
2016-10-25 15:43:56 +05:30
Maneesh Gupta
5be53b00c6
hip_hcc package: install hip_ir.ll to lib folder
...
Change-Id: Ieaa4ff83597c211f221b4c1b0b93e08ad7c92ea6
[ROCm/hip commit: c2b6eee152 ]
2016-10-25 15:42:24 +05:30
Maneesh Gupta
b602b3d168
hipcc: Turn back linking hip_ir.ll by default
...
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6
[ROCm/hip commit: 528b257004 ]
2016-10-25 12:27:24 +05:30
Maneesh Gupta
0e36aae349
hipcc: Turn back linking hip_ir.ll by default
...
Change-Id: I9cec5a1a5a4791eddf6ef3e3332143ffc3ceadb6
[ROCm/hip commit: f1a4303b04 ]
2016-10-25 12:26:54 +05:30
pensun
5b80757b00
Modify hipStreamAddCallback test case to consider both NULL stream and stream
...
Change-Id: I8468c988f0f01c7d3f3176a01469d69bf7a68237
[ROCm/hip commit: 28aad46674 ]
2016-10-23 12:36:02 -05:00