Evgeny Mankov
3cd1adcb24
[HIPIFY] Pointer to typedef declaration is not hipified
...
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/60
2016-12-26 19:03:50 +03:00
Evgeny Mankov
f6aa3ddb0e
[HIPIFY] Add hipconvertinplace2.sh and hipexamine2.sh scripts for hipify-clang.
...
The differences from the similar scripts for hipify.pl:
1. CSV file with extended statistics is produced.
2. The scripts' arguments are changed slightly:
DIRNAME [hipify options] [--] [clang options]
where -- is a delimiter; all the arguments are optional, except DIRNAME.
Usage example:
./hipexamine2.sh ./tmp -o-stats ./tmp/stats.csv -- -I/usr/local/cuda-7.5/include -I/usr/local/hipify-clang/hipblas/include 2>&1 | tee log
2016-12-23 22:06:20 +03:00
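The argument layout from the commit above, DIRNAME [hipify options] [--] [clang options], can be illustrated with a minimal Python sketch. The function name split_args is hypothetical and is not part of the scripts; it only shows how the -- delimiter separates the two option groups, assuming DIRNAME is always the first argument.

```python
def split_args(argv):
    """Split DIRNAME [hipify options] [--] [clang options] into three parts.

    Hypothetical helper for illustration; not taken from hipexamine2.sh.
    """
    if not argv:
        raise ValueError("DIRNAME is required")
    dirname, rest = argv[0], argv[1:]
    if "--" in rest:
        # Everything before the first "--" goes to hipify, the rest to clang.
        i = rest.index("--")
        return dirname, rest[:i], rest[i + 1:]
    # No delimiter: all remaining arguments are treated as hipify options.
    return dirname, rest, []
```

Applied to the usage example above, the call would yield ./tmp as DIRNAME, the -o-stats pair as hipify options, and the -I flags as clang options.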
Evgeny Mankov
e8c5906062
[HIPIFY] Fix line endings.
2016-12-23 18:01:26 +03:00
Evgeny Mankov
52b3fb9f79
[HIPIFY] Stats: Calculation of changed code amount, based on actually replaced bytes.
...
+ REPLACED bytes, TOTAL bytes & CODE CHANGED are added to statistics.
+ -o-stats option for specifying the statistics file.
2016-12-23 17:40:06 +03:00
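The commit above does not state how CODE CHANGED is derived. A minimal Python sketch follows, assuming it is the share of REPLACED bytes in TOTAL bytes expressed as a percentage; the function name and the zero-size handling are assumptions for illustration only.

```python
def code_changed_percent(replaced_bytes, total_bytes):
    """Percentage of the source that was rewritten, by byte count.

    Assumed formula: 100 * REPLACED / TOTAL. Not confirmed by the commit.
    """
    if total_bytes <= 0:
        # An empty input has nothing to change; avoid dividing by zero.
        return 0.0
    return 100.0 * replaced_bytes / total_bytes
```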
Maneesh Gupta
16b705d912
hipcc: link to hip runtime using absolute path
...
Change-Id: I714b3e9da0bc1d49665b079d9c4cec1c1a2efa80
2016-12-23 11:49:00 +05:30
Ben Sander
fd5b0c68b1
Support size_t in memset kernel.
...
Add disable for HSA_AMD_AGENT_INFO_MAX_WAVES_PER_CU
Remove one copy of completion_future in memset.
2016-12-22 12:25:09 -06:00
Maneesh Gupta
1b50462ce0
hip_hcc package changes
...
- updated hip_hcc package creation dependencies
- support build hip_hcc package for HCC-1.0
Change-Id: Idf23e415eff8cb352a8906191c79bd822c7618e7
2016-12-22 15:30:38 +05:30
Ben Sander
cf338d716b
Increment API sequence number.
...
Change name to tls_tidInfo
2016-12-21 15:30:36 -06:00
Evgeny Mankov
ac8166d051
[HIPIFY] Statistics in CSV file.
...
+ Stats by CUDA ref name.
+ Conversion %.
TODO: Calculation of changed code amount, based on actually replaced bytes.
2016-12-21 23:08:01 +03:00
Rahul Garg
0578febb99
Removed redundant GetPCIBusID int version function
...
Change-Id: I37f2ff87d09fcfb1e3b104c44c51f606fcb83c01
2016-12-20 23:25:16 +05:30
Evgeny Mankov
ea8e886077
[HIPIFY] Reflect unsupported CUDA API refs in statistics
...
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/53
+ Refs unsupported by HIP may now be listed along with the supported ones.
+ Warnings are added for the unhandled (by HIPIFY) refs, for instance:
"warning: the following reference is not handled: 'cublasContext' [param decl ptr]."
+ Reflect unsupported CUDA API refs in statistics.
+ Occupancy API [HIP_UNSUPPORTED].
+ A few CUBLAS refs are listed as HIP_UNSUPPORTED.
TODO: Statistics in CSV file.
2016-12-19 14:38:19 +03:00
Rahul Garg
4988975b59
Fix for HCSWAP-67
...
Change-Id: I0b2ce5ab933237947fb41d89769db3da16e5be6a
Conflicts:
src/hip_hcc.cpp
2016-12-19 16:19:51 +05:30
Ben Sander
5d815937de
Add name for function
2016-12-17 08:54:09 -06:00
Ben Sander
2bd70ff345
Remove HSA dependency from hipFunction_t
...
Place _groupSegmentSize and _privateSegmentSize inside Function,
remove hsa_executable_symbol_t.
2016-12-17 07:22:56 -06:00
Ben Sander
06d382bc6d
Remove USE_DISPATCH_HSA_KERNEL=0 path.
2016-12-17 07:22:56 -06:00
Ben Sander
3f9404d0e1
Refactor Module and Function APIs.
...
- hipFunction_t is now returned by value. This eliminates dynamic
allocation / memory management complexity in the module. Removed
the kernel name so the structure is just 16 bytes now.
- Moved the hsa_executable_load_module and hsa_executable_freeze
calls to the hipModuleLoad and hipModuleLoadData calls.
- Apply sharedMemBytes in hipModuleLaunchKernel to group segment
size (not private).
2016-12-17 07:22:33 -06:00
Rahul Garg
bddaa0e81c
Mapped hipDevice_t to int
...
Change-Id: I6cfa56c42b7cd04aa0e0bce510c0d72d34ea211a
2016-12-17 16:53:03 +05:30
Aditya Atluri
c673aec971
disabled half native support as inline asm is not working
...
Change-Id: I3073d8ae39eed321987f0f2f0e689eec4cdbb48c
2016-12-16 09:24:59 -06:00
Ben Sander
ab07d9b4a7
Print limits on CUDA devices
2016-12-16 08:55:11 -06:00
Ben Sander
d39ef58daf
Fix typo
2016-12-15 14:42:52 -06:00
Ben Sander
b53ba931e8
fix copyright
2016-12-15 14:42:52 -06:00
Ben Sander
a30b36afdc
remove TODO file
2016-12-15 14:42:52 -06:00
Evgeny Mankov
a9488603b4
[HIPIFY] nested macro is not hipified, when it isAnyIdentifier
...
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/55
2016-12-15 21:00:34 +03:00
Brecht Carlier
cc73dd4ed5
Update hip_faq.md
...
Fixed navigation and list.
2016-12-15 14:08:34 +05:30
Martin Schleiss
8a4fe76de2
Fix another typo
2016-12-15 14:07:20 +05:30
Martin Schleiss
dc8db38f04
Fix various typos
...
Conflicts:
README.md
Change-Id: Ie296d503d16121a62fed1a208352ec2b81c97fd9
2016-12-15 14:06:38 +05:30
Peter Steinberger
e2dc1cc27f
Fixes a typo: perforamnce -> performance
...
Change-Id: I85e3b3d22c98c16556227283bfb33530e1bce2cf
2016-12-15 14:01:08 +05:30
Aditya Atluri
a1d1fcfdac
fixed compilation issues
...
Change-Id: I96692538736e2e4f2da9dba9c8c29a164aec4c0d
2016-12-14 16:50:16 -06:00
Aditya Atluri
c20a86d866
added half2 support
...
Change-Id: I0f3b9b7037fed97e80ec99f5369c75a63f001aae
2016-12-14 14:18:48 -06:00
Sandeep Kumar
8b0ce6e81b
Fixes in Makefile of couple of samples
...
- modified Makefile for hipblas_saxpy to replace hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
2016-12-14 16:27:14 +05:30
Aditya Atluri
01ed8e91e9
added simple half math ops
...
Change-Id: I10b1d1023a9e5f2ba63f28c4a2bbe60ee49a8aee
2016-12-13 20:20:58 -06:00
Aditya Atluri
26934a920c
disabled compiler flag hcc 4.0 for half support
...
Change-Id: I32175113f4c05d43310b3a05c2a14e12f6d48b09
2016-12-13 20:06:56 -06:00
Ben Sander
5eed20be12
Add USE_IPC to disable use of IPC APIs. Set to 0.
2016-12-13 15:07:04 -06:00
Aditya Atluri
7a712aa76b
added few type reinterpret cast device functions
...
1. __int_as_float
2. __hiloint2double
Change-Id: Id247c196887b24a12090f0521bf91e13afeec733
2016-12-13 14:41:36 -06:00
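The two functions named in the commit above mirror the CUDA intrinsics of the same names: __int_as_float reinterprets the 32 bits of an int as a float, and __hiloint2double builds a double from a high and a low 32-bit word. A host-side Python sketch of that bit-level behavior follows; it is an illustration using the struct module, not the HIP device implementation.

```python
import struct

def int_as_float(x):
    """Reinterpret the low 32 bits of an integer as an IEEE-754 float."""
    return struct.unpack("<f", struct.pack("<I", x & 0xFFFFFFFF))[0]

def hiloint2double(hi, lo):
    """Combine two 32-bit words into an IEEE-754 double.

    hi supplies the upper 32 bits, lo the lower 32 bits.
    """
    # Little-endian layout: the low word is stored first.
    packed = struct.pack("<II", lo & 0xFFFFFFFF, hi & 0xFFFFFFFF)
    return struct.unpack("<d", packed)[0]
```

For example, the bit pattern 0x3F800000 is the float 1.0, and a high word of 0x3FF00000 with a zero low word is the double 1.0.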
Aditya Atluri
02eab122c5
added half math addition ISA support
...
Change-Id: I293b771f695b499b795d7e53f600c9e4fe2a2071
2016-12-13 09:18:34 -06:00
Evgeny Mankov
6bcacf83a4
[HIPIFY] Code refactoring and total stat collecting
...
+ Single base class for Preprocessor and MatchFinder classes.
+ Total Stats for multiple files is added.
2016-12-13 18:01:08 +03:00
Evgeny Mankov
b73ffa4e7c
[HIPIFY] Rename -n to -examine
2016-12-12 20:03:01 +03:00
Evgeny Mankov
d5eb7f494f
[HIPIFY] Multiple source files support.
2016-12-12 19:57:19 +03:00
Aditya Atluri
812cf130b6
Changed threadfences to match target parsing by hcc
...
Change-Id: I28fcabbaacd13495b707f263fd09afaead0665fa
2016-12-12 10:16:58 -06:00
Evgeny Mankov
c5f5e662ef
[HIPIFY] -no-backup option is added.
...
Useful for hipifying a release in place.
2016-12-08 23:14:19 +03:00
Evgeny Mankov
f5deff4ba3
[HIPIFY] -no-output support.
...
The output file is still created by clang itself, but it is not updated and is deleted after processing.
Used together, -print-stat -no-output (or just -n) serve to examine the source CUDA code.
Conflicting options:
-inplace
-o
2016-12-08 22:45:10 +03:00
Evgeny Mankov
a81dfc5a0f
Fix the limitation of supported input files.
...
Previously, only .cu and .cuda files were supported.
+ All the file names are allowed, including file names without extension.
[IMPORTANT]
To hipify a CUDA input file whose name is not *.cu, add the option "-x cuda" after the tool's options ending marker "--", for instance:
./hipify-clang NeuralNet -- -x cuda
This option will go to clang itself, not the hipify tool.
2016-12-08 20:28:43 +03:00
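The rule in the commit above (pass "-x cuda" to clang after "--" when the input is not named *.cu) can be sketched as a small command builder. The helper name hipify_command and the default tool path are hypothetical, chosen to match the example invocation; the sketch only assembles an argument list and does not run the tool.

```python
import os

def hipify_command(path, tool="./hipify-clang"):
    """Build an argv list for hipify-clang (hypothetical helper).

    Adds "-- -x cuda" when the file name does not end in .cu, so that
    clang itself is told to treat the input as CUDA.
    """
    cmd = [tool, path]
    if os.path.splitext(path)[1] != ".cu":
        cmd += ["--", "-x", "cuda"]
    return cmd
```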
Rahul Garg
a6b2f9c3a0
Fixed build error due to GetPCIBusId overloaded function
...
Change-Id: I626446f2c72c8143f08c95367bc1c528abeaf69d
2016-12-08 14:35:58 +05:30
Maneesh Gupta
c677041b37
hcc_detail/hip_runtime_api.h: Fix IPC API signature
...
Change-Id: I0be0f09c62f231620341141bd66183c3338be56a
2016-12-08 12:50:25 +05:30
pensun
7ac5f2e8c3
HIP IPC implementation on ROCr IPC APIs
...
Change-Id: I1ca9d520f5d0b1b56694211471b81eb7c6c23d16
2016-12-07 15:38:36 -06:00
Rahul Garg
d8fdd6c6fc
hipDeviceGetPCIBusId int version changes for CUDA runtime API
...
Change-Id: I4d3b995f1d1ac83415ca84808a074e5c8cd72f3c
2016-12-07 12:12:40 +05:30
pensun
8f2a1752c1
Fix issue of ihipModule_t double defined
...
Change-Id: I508677e595776fd573a2f224691116d01288dc78
2016-12-06 19:10:17 -06:00
pensun
8fcae3acc0
change hipgetPCIID to take int as third parameter
...
Change-Id: I4429b36756a6d868a769abd783bf28a55147c0d0
2016-12-06 14:24:09 -06:00
pensun
092924d660
IPC prototypes and part of the implementation included
...
Change-Id: Id88c7f155d23ec63f57a6ef05098fba43f8af336
2016-12-06 14:24:09 -06:00
pensun
808e555247
local changes for hipnccl
...
Change-Id: I05a1f0381ce2914a800f573342cc954eb5ff82d9
2016-12-06 14:22:02 -06:00