Maneesh Gupta
0dd56073db
Fix build issues due to refactoring changes
...
Change-Id: I0a709ff4864244ba1b49e1a25327e3901ed6e17f
[ROCm/clr commit: fcafb975a6 ]
2016-08-09 22:49:32 +05:30
Rahul Garg
dbf9fd8312
Changed StagingBuffer class to UnpinnedCopyEngine
...
Change-Id: I1e212bfc8030dcf225ecf78fd7b23fda9b1de92f
[ROCm/clr commit: d08b1239ee ]
2016-08-09 21:29:42 +05:30
Rahul Garg
cb771c60d3
Moved sync copy decision logic to staging buffer class
...
Change-Id: I5c398772375fcc1f174a7597eea1215ce7bf80b4
[ROCm/clr commit: 4044cb412c ]
2016-08-09 09:28:18 +05:30
Ben Sander
0c8ca4b37d
Add initial context implementation.
...
APIs: hipInit, hipCtxCreate.
Track TLS default ctx. Set deviceID now changes the ctx.
Add first context test.
Change-Id: If1cb9989b5a04a36147e25e84904336c7b6f3d88
[ROCm/clr commit: f19f2248bf ]
2016-08-08 17:49:02 -05:00
Ben Sander
c85f34bf8a
Code cleanup, use camelCase where appropriate.
...
Change-Id: I5a7ec50df8bbb3e7a3b313c0b12e2dd55ae4a09c
[ROCm/clr commit: ac173c3ae8 ]
2016-08-08 14:54:38 -05:00
Ben Sander
4ef3cb1796
Coding guidelines update
...
Change-Id: Ib8d8da4c3897d157aeb26eb2e99718d66fd260b1
[ROCm/clr commit: 7c5a611df4 ]
2016-08-08 13:12:22 -05:00
Ben Sander
e6dd8281d0
Move copy kernel templates into hip_memory.cpp
...
Change-Id: I862529f3fa8232372c6bacaa5d36f035bbdd32a1
[ROCm/clr commit: 6dbe554894 ]
2016-08-08 12:07:12 -05:00
Ben Sander
3ee6e7f8ab
Split ihipCtx_t into ihipCtx_t and ihipDevice_t .
...
Major change to existing code base.
Ctx holds streams, enables peers, and flags.
Device holds accelerator, hsa-agent, device props.
Add hipCtx_t.
Add peer APIs that accept hipCtx_t (in addition to deviceId)
Compiles and passes directed tests.
Change-Id: Iddab1eb9edbf90caad2ef5959c6b811d658197f1
[ROCm/clr commit: d09b19bb6c ]
2016-08-08 11:55:57 -05:00
Ben Sander
4c7160a134
Add initial/partial coding guidelines
...
Change-Id: Ifd8cb3ad74b15d3ab2f38c3daa038a2808af6fa9
[ROCm/clr commit: 694c0bad62 ]
2016-08-08 11:55:41 -05:00
Ben Sander
e5300e2b40
Change Device->Ctx
...
Change ihipDevice_t -> ihipCtx_t (new)
Change ihipGetTlsDefaultDevice->ihipGetTlsDefaultCtx
Some other changes from device->ctx where appropriate.
Change-Id: I5c4ae93b2fd42c6303aa23d748eb166b7431925d
[ROCm/clr commit: 0d16565061 ]
2016-08-07 21:47:12 -05:00
Ben Sander
aa43c9e006
Remove ihipStream_r::_device_index
...
Replace with direct pointer to device. Cleaner, and prep
for transition to contexts.
Change-Id: I0e550f34412923d46c541c0a14bb7d29c3fd4b11
[ROCm/clr commit: 3c604b6430 ]
2016-08-07 20:47:06 -05:00
Evgeny Mankov
67629d0c65
clang-hipify: Transformation of declarations with external linkage and shared attribute for IncompleteArrayType (aka C array[]) only.
...
Example:
extern __shared__ uint sRadix1[]; => HIP_DYNAMIC_SHARED(unsigned int, sRadix1);
[ROCm/clr commit: b7ac63e202 ]
2016-08-05 21:35:58 +03:00
Rahul Garg
7b7f9dab90
Region based apis to pool based api changes
...
Change-Id: If53019eebafe051ab4e811863995f78315297080
[ROCm/clr commit: 52712e2134 ]
2016-08-05 15:05:57 +05:30
Maneesh Gupta
f468d7b0b8
.hipconfig now lives in lib
...
Also moved reading .hipconfig to begining of hipcc
Change-Id: Ibb9d32bef64a79f189aa037c36814759c8cc8052
[ROCm/clr commit: 6a2e92f7c8 ]
2016-08-03 12:01:48 +05:30
scchan
b9a6362bf1
compile all hip objects with -fPIC so that they work in a shared library
...
Change-Id: I1f8355d4a81dbd5f408062c317c16a5efc668876
[ROCm/clr commit: 992195b395 ]
2016-08-03 11:48:46 +05:30
Maneesh Gupta
9739e7b9c6
hipcc reads .hipconfig
...
hipcc dies if HCC_VERSION is not same as the one used to build HIP
Change-Id: I180c0108812fe5cb6c5304477557c524a4a61f75
[ROCm/clr commit: 1381681335 ]
2016-08-03 11:32:08 +05:30
Maneesh Gupta
0cb75139c7
Only create .hipconfig on install
...
Change-Id: I9c20440023401f5794b33a0165e0918372783a68
[ROCm/clr commit: b732fd74b4 ]
2016-08-03 11:25:59 +05:30
Maneesh Gupta
4adc12f379
Merge branch 'release_0.92.00' into amd-develop
...
[ROCm/clr commit: 25c65c5fc4 ]
2016-08-03 09:10:44 +05:30
Maneesh Gupta
1d1d071b2d
Fix hipcc to work correctly when HIP_ATP_MARKER is not defined
...
Change-Id: I7cc525daccf896704e6ccf6d04ed395fda622031
[ROCm/clr commit: 58097bb717 ]
2016-08-03 09:08:40 +05:30
Maneesh Gupta
0d8126a9eb
hip_hcc rpm package does not depend on rocm-profiler
...
Change-Id: I291c046371e704ce395781d5de3f6430c0dcba7b
[ROCm/clr commit: 8b272278f5 ]
2016-08-03 09:07:58 +05:30
Aditya Atluri
e1a140f43c
Added configuration file to be used by hipcc
...
1. CMake will create .hip-config file in bin directory
Future Work: Need to make changes to hipcc to read the file
Change-Id: Ia7dc48d43787921d5af4ab07d7a5befbcf904465
[ROCm/clr commit: 5971c64a7b ]
2016-08-02 15:45:46 -05:00
Ben Sander
ffd5a709f6
Remove faulty assert for kernelCnt==0
...
Change-Id: I8a925c95f48e857c0a31f44561499e90dc6df552
[ROCm/clr commit: bb005d1755 ]
2016-08-01 13:38:47 -05:00
Maneesh Gupta
22fefac98f
Merge branch 'release_0.92.00' into amd-develop
...
Conflicts:
RELEASE.md
docs/markdown/hip_faq.md
Change-Id: Ifae1b64b6255a7872dfdc885bb8fb52f622464b7
[ROCm/clr commit: 2d6eb727e9 ]
2016-08-01 10:47:25 +05:30
Aditya Atluri
33f459c541
Signal Fix: The signals in a stream are re-used
...
1. Before, the signal pool is increased depending on the usage
2. After, a static number of signals are allocated to the pool
Only these are used by hip in a stream
3. If the signals required are more than the pool size, the
stream has to wait to make sure all the signals are available
4. Once they are available, the stream can use them
5. Removed HIP_NUM_SIGNALS_PER_STREAM because of redundancy with HIP_STREAM_SIGNALS
6. Increased signal count from 2 to 32.
Future Work: Dynamically increase the pool size depending on the number of
streams allocated by the application. And, null stream should have more signals
Change-Id: I6be36e084f26bb04766fabf776c7210aee0f9e91
[ROCm/clr commit: 9c7ee12822 ]
2016-07-28 23:01:35 -05:00
Ben Sander
381f67e067
Remove dead enqueueBarrier function.
...
Change-Id: Ib18fe6bd96ce24dbeb342961ddb5721f7d03f2b2
[ROCm/clr commit: f7ab82cb39 ]
2016-07-28 22:48:22 -05:00
Ben Sander
a853d10446
Cleanup sync code.
...
Remove dead depFutures, enqueueBarrier call.
Rename some parms to reflect usage.
Add comments to better explain tricky parts of sync code.
Change-Id: I763296421d9c2b3b58fc8cef5f010b12ab49553c
[ROCm/clr commit: ef61aae878 ]
2016-07-27 18:31:11 -05:00
Ben Sander
058e927c9b
Fix API string message for hipDeviceGetAttribute
...
Change-Id: I30f54627630c8ee835506be8c9921742bb68a43a
[ROCm/clr commit: f5118ce3cd ]
2016-07-27 16:18:14 -05:00
Aditya Atluri
96fe085832
Signal Fix: Added signal limit to allocSignal
...
1. Did not change the logic in allocSignal
2. Added guard to wait on signal limit
Change-Id: I78f29097e6a584b3c3d78319dac19869067bd1fe
[ROCm/clr commit: 1b2a24d0b8 ]
2016-07-27 13:48:49 -05:00
Maneesh Gupta
5fa1029861
Update release notes for 0.92.00 release
...
Change-Id: I9ca588cd0d5d752dc6521e76ba943500eb55525f
[ROCm/clr commit: de7c9769a4 ]
2016-07-27 20:30:04 +05:30
Aditya Atluri
b99410bd07
Signal Fix: Moved kernel count to critical stream
...
1. Added environment variable HIP_NUM_KERNELS_INFLIGHT
2. Moved kernelcount variable inside stream critical section
Change-Id: I51d24d0a2a109467209170de117a6d02ba4e308e
[ROCm/clr commit: 7be196de48 ]
2016-07-26 17:09:27 -05:00
Aditya Atluri
d8ea207f61
Signal Fix: Changed global signal count to per stream signal count
...
1. The number of kernels that can use signals are increased to 128
2. The kernel count is now specific to the stream
Change-Id: Ie6d1aa3f437aad8f08c3333fe48bd3f46e551e60
[ROCm/clr commit: 2e754d27dc ]
2016-07-26 14:03:51 -05:00
Aditya Atluri
bb363f9d63
removed redundant signal destroy
...
Change-Id: Icf0cd76b2620d34c87cfb6c7a83049087c0a0bc4
[ROCm/clr commit: 524127b4a4 ]
2016-07-26 13:35:35 -05:00
Ben Sander
0ed0ad30da
Doc update for FAQ and future RELEASE notes
...
Change-Id: I7e7c32d9a19fdaea0a0e41f1d4fa4652e53640f2
[ROCm/clr commit: 77be00e8d3 ]
2016-07-26 12:14:15 -05:00
Ben Sander
542948ad75
Doc update for README.md - add more intro text, example
...
Change-Id: I99b8eaacd6460dfdbdbc8ddba3fe589647d877e7
[ROCm/clr commit: 75d1f8c297 ]
2016-07-26 12:14:02 -05:00
Ben Sander
4094170fd7
Make HCC ignore register keyword
...
- (previously would emit a warning)
- Also tweak documentation.
Change-Id: I0f4f00f82f8cc53d420112570f2d7675535e6aea
[ROCm/clr commit: b9e48d6066 ]
2016-07-26 12:13:48 -05:00
Aditya Atluri
e207805f81
Added re-fix for memcpy kernel sync
...
1. The patch uses HIP signal pools to sync between copy and kernel commands
2. The hsa_signal_create is removed
3. Left the redundant enqueueBarrier method just in case
Change-Id: I3dff3e8ee57fff3cd49bec802ff735ed128e5ca1
[ROCm/clr commit: 0232e6bbb4 ]
2016-07-26 09:22:59 -05:00
Maneesh Gupta
cc9a901498
Documentation updates
...
Change-Id: Ia624d86915c4c96da0ac0242f767135f30ff73c6
[ROCm/clr commit: b29ed98f9a ]
2016-07-25 14:53:15 +05:30
Rahul Garg
7a70966cb1
D2H and H2D unpinned memory transfer support
...
Change-Id: If6d6c970f435e5d917d5cc6cddc2ee2918cd1c37
Conflicts:
src/hip_hcc.cpp
[ROCm/clr commit: d11d65d401 ]
2016-07-25 14:36:07 +05:30
Aditya Atluri
776ea7a1f0
Partial fix async after kernel launch signal issue
...
Change-Id: Ib48d6564379160035bded9493b93663fba361710
[ROCm/clr commit: 1704006bed ]
2016-07-23 14:54:20 -05:00
pensun
2c945dba23
Add empty stubs for threadfence family routines, changes include:
...
- stubs and documentation in include/hcc_details/hip_runtime.h
- stubs with "no-op" in src/hip_memory.cpp
- document update in hip_kernel_language.md, add suggestions to
disable L1 and L2 caches when using the threadfence routines.
Change-Id: Ic0753170f802003055bca9d7476d7f48817b98b7
[ROCm/clr commit: 6db08e5135 ]
2016-07-22 10:40:58 -05:00
Maneesh Gupta
64d0ffb404
Replace calls to ihipInit with use of HIP_INIT_API macro
...
Change-Id: Iabf7df79f0238a8ddffea4607fe945df36642850
[ROCm/clr commit: b485470819 ]
2016-07-22 15:46:55 +05:30
Maneesh Gupta
21ae09859d
hip_hcc package now depends on rocm-profiler
...
Change-Id: I80d1c6048cc18c47c2024efb90368b17139a09ad
[ROCm/clr commit: 5c18187741 ]
2016-07-22 15:45:32 +05:30
Maneesh Gupta
68b1a199c4
Fix using ATP markers
...
Change-Id: If2d04f80b580237426c569737551e2001a8cd35a
[ROCm/clr commit: dffed956fb ]
2016-07-21 16:02:51 +05:30
Maneesh Gupta
997cb27be8
Merge branch 'hiparray' into amd-develop
...
Change-Id: I63ca7b1db7b593ac5cfb3fd7cd5d08d6e4075a4c
[ROCm/clr commit: 7d5cffdc17 ]
2016-07-21 12:29:56 +05:30
Maneesh Gupta
0b97390948
Bump HIP version to 0.92.00
...
Change-Id: I4d653213dcf3ddf7d09d36433afe31e21d17a7ee
[ROCm/clr commit: 8c4cd59903 ]
2016-07-21 09:52:08 +05:30
Aditya Atluri
794d6bfda8
added kernel launch stress test
...
Change-Id: Ib50d47e55079839ba31aabb524b70b704d7a46fe
[ROCm/clr commit: 300ec2af1e ]
2016-07-19 13:57:56 -05:00
Aditya Atluri
2098afc04d
added fix for signal overflow in kernels
...
Change-Id: Ie0b1f97f69b7d7b34e445f6f120472819be03a0e
[ROCm/clr commit: 77d7134619 ]
2016-07-19 13:51:44 -05:00
Maneesh Gupta
1d3164ac80
Add markdown documentation to hip_docs package
...
Change-Id: I4075e6baaf287356ac8b485cccc231bfb729f078
[ROCm/clr commit: 538eeb7cc0 ]
2016-07-19 14:30:03 +05:30
Maneesh Gupta
bb2296d48b
Set cmake policy CMP0037 to old behavior for newer cmake versions
...
Change-Id: Ib2a7da53a238a489e73d6c006c50f12f07f866a1
[ROCm/clr commit: 89cbfadf86 ]
2016-07-19 06:52:38 +05:30
Ben Sander
5586cc02b4
Update documentation.
...
- Add more detailed feature comparison HIP vs specific CUDA SDK revs.
- Remove reference to old codenames.
- Fix TOC in some md files.
Change-Id: If0f6a05dc61237c4710c4d4bf2d933d556dcaeae
[ROCm/clr commit: 4b3102b578 ]
2016-07-18 13:56:22 -05:00