Graphe des révisions

1284 Révisions

Auteur SHA1 Message Date
Ben Sander 77bbf9c832 Fix delete[]
[ROCm/hip commit: a15d236de3]
2017-01-09 21:03:11 -06:00
Ben Sander 1908d9504b Add HIP_MAX_QUEUES feature.
Includes some tricky manipulation of the locks for contexts and streams.
issue is that stealing a stream requires we lock the context to
walk the streams to find a victim.  To avoid deadlock, we can't
have a stream locked when we lock the context.  This implementation
releases the stream lock, then acquires the context and selects the
victim.
A more stable implemenation might be to copy the stream list
from a context so that a lock is not required to walk all streams.
Smart shared_ptr could be used to prevent the streams from being
deallocated during the walk.


[ROCm/hip commit: a3e0012567]
2017-01-09 21:02:56 -06:00
Ben Sander bbb1485a83 First pass at virtualized queue support.
Also updated stream debug messages to consistently use trace_helper.


[ROCm/hip commit: 93fbc9cf7b]
2017-01-09 21:02:53 -06:00
Ben Sander 653248458f Add more notes on debugging HIP apps.
[ROCm/hip commit: fd209f37d9]
2017-01-09 21:02:50 -06:00
Ben Sander 72f6afab45 tolerate spaces in hip args
[ROCm/hip commit: 3a42a7642a]
2017-01-09 20:57:13 -06:00
Rahul Garg 2e9defbef8 Added state for hipDevice.
Change-Id: Idbc3c04cd054a01b634856a1e0a23ff172e991aa


[ROCm/hip commit: 5fb09879c7]
2017-01-09 23:54:01 +05:30
scchan eb9be747f2 [cmake] add library dependencies to hip_hcc libraries
[ROCm/hip commit: 4fd48084a6]
2017-01-05 18:26:54 -05:00
Maneesh Gupta 977cd8fcb6 hipcc: Link to shared HIP runtime by default
Change-Id: I5030e3245e4afb6863b401656ca5d1ad9ae84310


[ROCm/hip commit: a42da10c44]
2017-01-04 12:39:09 +05:30
Evgeny Mankov 688d9f459e [HIPIFY] Elapsed time is added to statistics.
[ROCm/hip commit: 14e9cf7e62]
2016-12-28 20:44:05 +03:00
Evgeny Mankov 9d41561ab4 [HIPIFY] Added the rest of cuBlas API.
CUBLAS API 7.5 now is supported by hipify;
API calls, which are not yet supported by hcblas/hipblas, are listed as HIP_UNSUPPORTED.


[ROCm/hip commit: 6ceb85a03a]
2016-12-28 18:08:10 +03:00
Evgeny Mankov 2250218e10 [HIPIFY] Formatting, no functional changes.
[ROCm/hip commit: d7d3fcc77d]
2016-12-27 19:48:59 +03:00
Evgeny Mankov 12bf2e3f73 [HIPIFY] [Fix] An argument of a function used as macro argument is not hipified.
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/35


[ROCm/hip commit: 5ec0488ce8]
2016-12-27 18:54:02 +03:00
Evgeny Mankov 154576f772 [HIPIFY] Pointer to typedef declaration is not hipified
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/60


[ROCm/hip commit: 24703944de]
2016-12-26 19:03:50 +03:00
Evgeny Mankov db5b0439de [HIPIFY] Add hipconvertinplace2.sh and hipexamine2.sh scripts for hipify-clang.
The differences from the similar scripts for hipify.pl:
1. CSV file with extended statistics is produced.
2. scripts' arguments are changed a bit:
DIRNAME [hipify options] [--] [clang options]

where -- is a delimiter; all the arguments are optional, except DIRNAME.

Usage example:
./hipexamine2.sh ./tmp -o-stats ./tmp/stats.csv -- -I/usr/local/cuda-7.5/include -I/usr/local/hipify-clang/hipblas/include 2>&1 | tee log


[ROCm/hip commit: bcbbc32fa6]
2016-12-23 22:06:20 +03:00
Evgeny Mankov d2cf5ba147 [HIPIFY] Fix line endings.
[ROCm/hip commit: ab00e2a627]
2016-12-23 18:01:26 +03:00
Evgeny Mankov d152a89e79 [HIPIFY] Stats: Calculation of changed code amount, based on actually replaced bytes.
+ REPLACED bytes, TOTAL bytes & CODE CHANGED are added to statistics.
+ -o-stats option for specifying the file with statistic.


[ROCm/hip commit: 6882057fd2]
2016-12-23 17:40:06 +03:00
Maneesh Gupta 5d44da9e46 hipcc: link to hip runtime using absolute path
Change-Id: I714b3e9da0bc1d49665b079d9c4cec1c1a2efa80


[ROCm/hip commit: f6e9f6f0bf]
2016-12-23 11:49:00 +05:30
Ben Sander ec4f4a643d Support size_t in memset kernel.
Add disable for HSA_AMD_AGENT_INFO_MAX_WAVES_PER_CU
Remove one copy of completion_future in memset.


[ROCm/hip commit: c325c988b1]
2016-12-22 12:25:09 -06:00
Maneesh Gupta 63ab898d17 hip_hcc package changes
- updated hip_hcc package creation dependencies
- support build hip_hcc package for HCC-1.0

Change-Id: Idf23e415eff8cb352a8906191c79bd822c7618e7


[ROCm/hip commit: 9b6d1588ba]
2016-12-22 15:30:38 +05:30
Ben Sander c1ac04322a Increment API sequence number.
Change name to tls_tidInfo


[ROCm/hip commit: 37d8cafb12]
2016-12-21 15:30:36 -06:00
Evgeny Mankov 0215c67ba6 [HIPIFY] Statistics in CSV file.
+ Stats by CUDA ref name.
+ Conversion %.

TODO: Calculation of changed code amount, based on actually replaced bytes.


[ROCm/hip commit: 4bb8bf8dab]
2016-12-21 23:08:01 +03:00
Rahul Garg 191e8fc08e Removed redundant GetPCIBusID int version function
Change-Id: I37f2ff87d09fcfb1e3b104c44c51f606fcb83c01


[ROCm/hip commit: 4704547bab]
2016-12-20 23:25:16 +05:30
Evgeny Mankov 121b909f55 [HIPIFY] Reflect unsupported CUDA API refs in statistics
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/53

+ Unsupported refs (by HIP) are now might be listed along with the supported ones.
+ Warnings are added for the unhandled (by HIPIFY) refs, for instance:
  "warning: the following reference is not handled: 'cublasContext' [param decl ptr]."
+ Reflect unsupported CUDA API refs in statistics.
+ Occupancy API [HIP_UNSUPPORTED].
+ A few CUBLAS refs are listed as HIP_UNSUPPORTED.

TODO: Statistics in CSV file.


[ROCm/hip commit: 3dd32e969d]
2016-12-19 14:38:19 +03:00
Rahul Garg a93a5a41f3 Fix for HCSWAP-67
Change-Id: I0b2ce5ab933237947fb41d89769db3da16e5be6a

Conflicts:
	src/hip_hcc.cpp


[ROCm/hip commit: fbf7ed63a8]
2016-12-19 16:19:51 +05:30
Ben Sander a5f13421b5 Add name for function
[ROCm/hip commit: 90c69e14bb]
2016-12-17 08:54:09 -06:00
Ben Sander a212162612 Remove HSA dependency from hipFunction_t
Place _groupSegmentSize and _privateSegmentSize inside Function,
remove hsa_executable_symbol_t.


[ROCm/hip commit: 8bf4bd2f7d]
2016-12-17 07:22:56 -06:00
Ben Sander f97cffdc48 Remove USE_DISPATCH_HSA_KERNEL=0 path.
[ROCm/hip commit: 6ed7e1c1c1]
2016-12-17 07:22:56 -06:00
Ben Sander 7402308098 Refactor Module and Function APIs.
- hipFunction_t is now returned by value.  This eliminates dynamic
      allocation / memory management complexity in the module.  Removed
the kernel
      name so the structure is just 16 bytes now.

    - Moved the hsa_executable_load_module and hsa_executable_freeze
      calls to the hipModuleLoad and hipModuleLoadData calls.

    - Apply sharedMemBytes in hipModuleLaunchKernel to group segment
      size (not private).


[ROCm/hip commit: 4d29885be3]
2016-12-17 07:22:33 -06:00
Rahul Garg ae83f93ba4 Mapped hipDevice_t to int
Change-Id: I6cfa56c42b7cd04aa0e0bce510c0d72d34ea211a


[ROCm/hip commit: 263a9614ff]
2016-12-17 16:53:03 +05:30
Aditya Atluri d93219fc00 disabled half native support as inline asm is not working
Change-Id: I3073d8ae39eed321987f0f2f0e689eec4cdbb48c


[ROCm/hip commit: 2665ad2762]
2016-12-16 09:24:59 -06:00
Ben Sander fcdaa05a4d Print limits on CUDA devices
[ROCm/hip commit: 43635f51dc]
2016-12-16 08:55:11 -06:00
Ben Sander 70f125f2a8 Fix typo
[ROCm/hip commit: bd19bb4074]
2016-12-15 14:42:52 -06:00
Ben Sander 5a3c11dd9a fix copyright
[ROCm/hip commit: 8ed38bae69]
2016-12-15 14:42:52 -06:00
Ben Sander 89523da1c2 remove TODO file
[ROCm/hip commit: 4080fe209d]
2016-12-15 14:42:52 -06:00
Evgeny Mankov e29cc18289 [HIPIFY] nested macro is not hipified, when it isAnyIdentifier
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/55


[ROCm/hip commit: 2383d9bc1a]
2016-12-15 21:00:34 +03:00
Brecht Carlier fd99cec62d Update hip_faq.md
Fixed navigation and list.


[ROCm/hip commit: 05b3e2928d]
2016-12-15 14:08:34 +05:30
Martin Schleiss 92dae66b21 Fix another typo
[ROCm/hip commit: b1eace4348]
2016-12-15 14:07:20 +05:30
Martin Schleiss d3ca48fc45 Fix various typos
Conflicts:
	README.md

Change-Id: Ie296d503d16121a62fed1a208352ec2b81c97fd9


[ROCm/hip commit: 263dcfed83]
2016-12-15 14:06:38 +05:30
Peter Steinberger beb82e5109 Fixes a typo: perforamnce -> performance
Change-Id: I85e3b3d22c98c16556227283bfb33530e1bce2cf


[ROCm/hip commit: 961b7890c1]
2016-12-15 14:01:08 +05:30
Aditya Atluri 4ebb6e569f fixed compilation issues
Change-Id: I96692538736e2e4f2da9dba9c8c29a164aec4c0d


[ROCm/hip commit: 68c57c38ff]
2016-12-14 16:50:16 -06:00
Aditya Atluri 727aab2304 added half2 support
Change-Id: I0f3b9b7037fed97e80ec99f5369c75a63f001aae


[ROCm/hip commit: d2daf6ad75]
2016-12-14 14:18:48 -06:00
Sandeep Kumar 1c6c618247 Fixes in Makefile of couple of samples
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile

Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2


[ROCm/hip commit: d78649b978]
2016-12-14 16:27:14 +05:30
Aditya Atluri d4b7fe8385 added simple half math ops
Change-Id: I10b1d1023a9e5f2ba63f28c4a2bbe60ee49a8aee


[ROCm/hip commit: 7c7d948fc6]
2016-12-13 20:20:58 -06:00
Aditya Atluri 0b445d68de disabled compiler flag hcc 4.0 for half support
Change-Id: I32175113f4c05d43310b3a05c2a14e12f6d48b09


[ROCm/hip commit: ed39a7f43b]
2016-12-13 20:06:56 -06:00
Ben Sander 1a7ecbcd04 Add USE_IPC to disable use of IPC APIs. Set to 0.
[ROCm/hip commit: b30e4b4781]
2016-12-13 15:07:04 -06:00
Aditya Atluri ada705544f added few type reinterpret cast device functions
1. __int_as_float
2. __hiloint2double

Change-Id: Id247c196887b24a12090f0521bf91e13afeec733


[ROCm/hip commit: 04ab5f7f56]
2016-12-13 14:41:36 -06:00
Aditya Atluri 8ad8f7ce26 added half math addition ISA support
Change-Id: I293b771f695b499b795d7e53f600c9e4fe2a2071


[ROCm/hip commit: a6fe6222c4]
2016-12-13 09:18:34 -06:00
Evgeny Mankov 4fd6a82aaf [HIPIFY] Code refactoring and total stat collecting
+ Single base class for Preprocessor and MatchFinder classes.
+ Total Stats for multiple files is added.


[ROCm/hip commit: ad3ec15d85]
2016-12-13 18:01:08 +03:00
Evgeny Mankov b2377f20ba [HIPIFY] Rename -n to -examine
[ROCm/hip commit: 95ae514511]
2016-12-12 20:03:01 +03:00
Evgeny Mankov a5ded34092 [HIPIFY] Multiple source files support.
[ROCm/hip commit: 02468d31fc]
2016-12-12 19:57:19 +03:00