Ben Sander
77bbf9c832
Fix delete[]
...
[ROCm/hip commit: a15d236de3 ]
2017-01-09 21:03:11 -06:00
Ben Sander
1908d9504b
Add HIP_MAX_QUEUES feature.
...
Includes some tricky manipulation of the locks for contexts and streams.
issue is that stealing a stream requires we lock the context to
walk the streams to find a victim. To avoid deadlock, we can't
have a stream locked when we lock the context. This implementation
releases the stream lock, then acquires the context and selects the
victim.
A more stable implemenation might be to copy the stream list
from a context so that a lock is not required to walk all streams.
Smart shared_ptr could be used to prevent the streams from being
deallocated during the walk.
[ROCm/hip commit: a3e0012567 ]
2017-01-09 21:02:56 -06:00
Ben Sander
bbb1485a83
First pass at virtualized queue support.
...
Also updated stream debug messages to consistently use trace_helper.
[ROCm/hip commit: 93fbc9cf7b ]
2017-01-09 21:02:53 -06:00
Ben Sander
653248458f
Add more notes on debugging HIP apps.
...
[ROCm/hip commit: fd209f37d9 ]
2017-01-09 21:02:50 -06:00
Ben Sander
72f6afab45
tolerate spaces in hip args
...
[ROCm/hip commit: 3a42a7642a ]
2017-01-09 20:57:13 -06:00
Rahul Garg
2e9defbef8
Added state for hipDevice.
...
Change-Id: Idbc3c04cd054a01b634856a1e0a23ff172e991aa
[ROCm/hip commit: 5fb09879c7 ]
2017-01-09 23:54:01 +05:30
scchan
eb9be747f2
[cmake] add library dependencies to hip_hcc libraries
...
[ROCm/hip commit: 4fd48084a6 ]
2017-01-05 18:26:54 -05:00
Maneesh Gupta
977cd8fcb6
hipcc: Link to shared HIP runtime by default
...
Change-Id: I5030e3245e4afb6863b401656ca5d1ad9ae84310
[ROCm/hip commit: a42da10c44 ]
2017-01-04 12:39:09 +05:30
Evgeny Mankov
688d9f459e
[HIPIFY] Elapsed time is added to statistics.
...
[ROCm/hip commit: 14e9cf7e62 ]
2016-12-28 20:44:05 +03:00
Evgeny Mankov
9d41561ab4
[HIPIFY] Added the rest of cuBlas API.
...
CUBLAS API 7.5 now is supported by hipify;
API calls, which are not yet supported by hcblas/hipblas, are listed as HIP_UNSUPPORTED.
[ROCm/hip commit: 6ceb85a03a ]
2016-12-28 18:08:10 +03:00
Evgeny Mankov
2250218e10
[HIPIFY] Formatting, no functional changes.
...
[ROCm/hip commit: d7d3fcc77d ]
2016-12-27 19:48:59 +03:00
Evgeny Mankov
12bf2e3f73
[HIPIFY] [Fix] An argument of a function used as macro argument is not hipified.
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/35
[ROCm/hip commit: 5ec0488ce8 ]
2016-12-27 18:54:02 +03:00
Evgeny Mankov
154576f772
[HIPIFY] Pointer to typedef declaration is not hipified
...
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/60
[ROCm/hip commit: 24703944de ]
2016-12-26 19:03:50 +03:00
Evgeny Mankov
db5b0439de
[HIPIFY] Add hipconvertinplace2.sh and hipexamine2.sh scripts for hipify-clang.
...
The differences from the similar scripts for hipify.pl:
1. CSV file with extended statistics is produced.
2. scripts' arguments are changed a bit:
DIRNAME [hipify options] [--] [clang options]
where -- is a delimiter; all the arguments are optional, except DIRNAME.
Usage example:
./hipexamine2.sh ./tmp -o-stats ./tmp/stats.csv -- -I/usr/local/cuda-7.5/include -I/usr/local/hipify-clang/hipblas/include 2>&1 | tee log
[ROCm/hip commit: bcbbc32fa6 ]
2016-12-23 22:06:20 +03:00
Evgeny Mankov
d2cf5ba147
[HIPIFY] Fix line endings.
...
[ROCm/hip commit: ab00e2a627 ]
2016-12-23 18:01:26 +03:00
Evgeny Mankov
d152a89e79
[HIPIFY] Stats: Calculation of changed code amount, based on actually replaced bytes.
...
+ REPLACED bytes, TOTAL bytes & CODE CHANGED are added to statistics.
+ -o-stats option for specifying the file with statistic.
[ROCm/hip commit: 6882057fd2 ]
2016-12-23 17:40:06 +03:00
Maneesh Gupta
5d44da9e46
hipcc: link to hip runtime using absolute path
...
Change-Id: I714b3e9da0bc1d49665b079d9c4cec1c1a2efa80
[ROCm/hip commit: f6e9f6f0bf ]
2016-12-23 11:49:00 +05:30
Ben Sander
ec4f4a643d
Support size_t in memset kernel.
...
Add disable for HSA_AMD_AGENT_INFO_MAX_WAVES_PER_CU
Remove one copy of completion_future in memset.
[ROCm/hip commit: c325c988b1 ]
2016-12-22 12:25:09 -06:00
Maneesh Gupta
63ab898d17
hip_hcc package changes
...
- updated hip_hcc package creation dependencies
- support build hip_hcc package for HCC-1.0
Change-Id: Idf23e415eff8cb352a8906191c79bd822c7618e7
[ROCm/hip commit: 9b6d1588ba ]
2016-12-22 15:30:38 +05:30
Ben Sander
c1ac04322a
Increment API sequence number.
...
Change name to tls_tidInfo
[ROCm/hip commit: 37d8cafb12 ]
2016-12-21 15:30:36 -06:00
Evgeny Mankov
0215c67ba6
[HIPIFY] Statistics in CSV file.
...
+ Stats by CUDA ref name.
+ Conversion %.
TODO: Calculation of changed code amount, based on actually replaced bytes.
[ROCm/hip commit: 4bb8bf8dab ]
2016-12-21 23:08:01 +03:00
Rahul Garg
191e8fc08e
Removed redundant GetPCIBusID int version function
...
Change-Id: I37f2ff87d09fcfb1e3b104c44c51f606fcb83c01
[ROCm/hip commit: 4704547bab ]
2016-12-20 23:25:16 +05:30
Evgeny Mankov
121b909f55
[HIPIFY] Reflect unsupported CUDA API refs in statistics
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/53
+ Unsupported refs (by HIP) are now might be listed along with the supported ones.
+ Warnings are added for the unhandled (by HIPIFY) refs, for instance:
"warning: the following reference is not handled: 'cublasContext' [param decl ptr]."
+ Reflect unsupported CUDA API refs in statistics.
+ Occupancy API [HIP_UNSUPPORTED].
+ A few CUBLAS refs are listed as HIP_UNSUPPORTED.
TODO: Statistics in CSV file.
[ROCm/hip commit: 3dd32e969d ]
2016-12-19 14:38:19 +03:00
Rahul Garg
a93a5a41f3
Fix for HCSWAP-67
...
Change-Id: I0b2ce5ab933237947fb41d89769db3da16e5be6a
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: fbf7ed63a8 ]
2016-12-19 16:19:51 +05:30
Ben Sander
a5f13421b5
Add name for function
...
[ROCm/hip commit: 90c69e14bb ]
2016-12-17 08:54:09 -06:00
Ben Sander
a212162612
Remove HSA dependency from hipFunction_t
...
Place _groupSegmentSize and _privateSegmentSize inside Function,
remove hsa_executable_symbol_t.
[ROCm/hip commit: 8bf4bd2f7d ]
2016-12-17 07:22:56 -06:00
Ben Sander
f97cffdc48
Remove USE_DISPATCH_HSA_KERNEL=0 path.
...
[ROCm/hip commit: 6ed7e1c1c1 ]
2016-12-17 07:22:56 -06:00
Ben Sander
7402308098
Refactor Module and Function APIs.
...
- hipFunction_t is now returned by value. This eliminates dynamic
allocation / memory management complexity in the module. Removed
the kernel
name so the structure is just 16 bytes now.
- Moved the hsa_executable_load_module and hsa_executable_freeze
calls to the hipModuleLoad and hipModuleLoadData calls.
- Apply sharedMemBytes in hipModuleLaunchKernel to group segment
size (not private).
[ROCm/hip commit: 4d29885be3 ]
2016-12-17 07:22:33 -06:00
Rahul Garg
ae83f93ba4
Mapped hipDevice_t to int
...
Change-Id: I6cfa56c42b7cd04aa0e0bce510c0d72d34ea211a
[ROCm/hip commit: 263a9614ff ]
2016-12-17 16:53:03 +05:30
Aditya Atluri
d93219fc00
disabled half native support as inline asm is not working
...
Change-Id: I3073d8ae39eed321987f0f2f0e689eec4cdbb48c
[ROCm/hip commit: 2665ad2762 ]
2016-12-16 09:24:59 -06:00
Ben Sander
fcdaa05a4d
Print limits on CUDA devices
...
[ROCm/hip commit: 43635f51dc ]
2016-12-16 08:55:11 -06:00
Ben Sander
70f125f2a8
Fix typo
...
[ROCm/hip commit: bd19bb4074 ]
2016-12-15 14:42:52 -06:00
Ben Sander
5a3c11dd9a
fix copyright
...
[ROCm/hip commit: 8ed38bae69 ]
2016-12-15 14:42:52 -06:00
Ben Sander
89523da1c2
remove TODO file
...
[ROCm/hip commit: 4080fe209d ]
2016-12-15 14:42:52 -06:00
Evgeny Mankov
e29cc18289
[HIPIFY] nested macro is not hipified, when it isAnyIdentifier
...
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/55
[ROCm/hip commit: 2383d9bc1a ]
2016-12-15 21:00:34 +03:00
Brecht Carlier
fd99cec62d
Update hip_faq.md
...
Fixed navigation and list.
[ROCm/hip commit: 05b3e2928d ]
2016-12-15 14:08:34 +05:30
Martin Schleiss
92dae66b21
Fix another typo
...
[ROCm/hip commit: b1eace4348 ]
2016-12-15 14:07:20 +05:30
Martin Schleiss
d3ca48fc45
Fix various typos
...
Conflicts:
README.md
Change-Id: Ie296d503d16121a62fed1a208352ec2b81c97fd9
[ROCm/hip commit: 263dcfed83 ]
2016-12-15 14:06:38 +05:30
Peter Steinberger
beb82e5109
Fixes a typo: perforamnce -> performance
...
Change-Id: I85e3b3d22c98c16556227283bfb33530e1bce2cf
[ROCm/hip commit: 961b7890c1 ]
2016-12-15 14:01:08 +05:30
Aditya Atluri
4ebb6e569f
fixed compilation issues
...
Change-Id: I96692538736e2e4f2da9dba9c8c29a164aec4c0d
[ROCm/hip commit: 68c57c38ff ]
2016-12-14 16:50:16 -06:00
Aditya Atluri
727aab2304
added half2 support
...
Change-Id: I0f3b9b7037fed97e80ec99f5369c75a63f001aae
[ROCm/hip commit: d2daf6ad75 ]
2016-12-14 14:18:48 -06:00
Sandeep Kumar
1c6c618247
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
[ROCm/hip commit: d78649b978 ]
2016-12-14 16:27:14 +05:30
Aditya Atluri
d4b7fe8385
added simple half math ops
...
Change-Id: I10b1d1023a9e5f2ba63f28c4a2bbe60ee49a8aee
[ROCm/hip commit: 7c7d948fc6 ]
2016-12-13 20:20:58 -06:00
Aditya Atluri
0b445d68de
disabled compiler flag hcc 4.0 for half support
...
Change-Id: I32175113f4c05d43310b3a05c2a14e12f6d48b09
[ROCm/hip commit: ed39a7f43b ]
2016-12-13 20:06:56 -06:00
Ben Sander
1a7ecbcd04
Add USE_IPC to disable use of IPC APIs. Set to 0.
...
[ROCm/hip commit: b30e4b4781 ]
2016-12-13 15:07:04 -06:00
Aditya Atluri
ada705544f
added few type reinterpret cast device functions
...
1. __int_as_float
2. __hiloint2double
Change-Id: Id247c196887b24a12090f0521bf91e13afeec733
[ROCm/hip commit: 04ab5f7f56 ]
2016-12-13 14:41:36 -06:00
Aditya Atluri
8ad8f7ce26
added half math addition ISA support
...
Change-Id: I293b771f695b499b795d7e53f600c9e4fe2a2071
[ROCm/hip commit: a6fe6222c4 ]
2016-12-13 09:18:34 -06:00
Evgeny Mankov
4fd6a82aaf
[HIPIFY] Code refactoring and total stat collecting
...
+ Single base class for Preprocessor and MatchFinder classes.
+ Total Stats for multiple files is added.
[ROCm/hip commit: ad3ec15d85 ]
2016-12-13 18:01:08 +03:00
Evgeny Mankov
b2377f20ba
[HIPIFY] Rename -n to -examine
...
[ROCm/hip commit: 95ae514511 ]
2016-12-12 20:03:01 +03:00
Evgeny Mankov
a5ded34092
[HIPIFY] Multiple source files support.
...
[ROCm/hip commit: 02468d31fc ]
2016-12-12 19:57:19 +03:00