Ben Sander
8288cbdae4
fix copyright
...
[ROCm/clr commit: b53ba931e8 ]
2016-12-15 14:42:52 -06:00
Ben Sander
8353b8ed72
remove TODO file
...
[ROCm/clr commit: a30b36afdc ]
2016-12-15 14:42:52 -06:00
Evgeny Mankov
be8dd18914
[HIPIFY] nested macro is not hipified, when it isAnyIdentifier
...
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/55
[ROCm/clr commit: a9488603b4 ]
2016-12-15 21:00:34 +03:00
Brecht Carlier
bceb8949c7
Update hip_faq.md
...
Fixed navigation and list.
[ROCm/clr commit: cc73dd4ed5 ]
2016-12-15 14:08:34 +05:30
Martin Schleiss
23ddb5aec4
Fix another typo
...
[ROCm/clr commit: 8a4fe76de2 ]
2016-12-15 14:07:20 +05:30
Martin Schleiss
716fac9a99
Fix various typos
...
Conflicts:
README.md
Change-Id: Ie296d503d16121a62fed1a208352ec2b81c97fd9
[ROCm/clr commit: dc8db38f04 ]
2016-12-15 14:06:38 +05:30
Peter Steinberger
432f1c2b7a
Fixes a typo: perforamnce -> performance
...
Change-Id: I85e3b3d22c98c16556227283bfb33530e1bce2cf
[ROCm/clr commit: e2dc1cc27f ]
2016-12-15 14:01:08 +05:30
Aditya Atluri
1844a95003
fixed compilation issues
...
Change-Id: I96692538736e2e4f2da9dba9c8c29a164aec4c0d
[ROCm/clr commit: a1d1fcfdac ]
2016-12-14 16:50:16 -06:00
Aditya Atluri
657739b19f
added half2 support
...
Change-Id: I0f3b9b7037fed97e80ec99f5369c75a63f001aae
[ROCm/clr commit: c20a86d866 ]
2016-12-14 14:18:48 -06:00
Sandeep Kumar
4d78e20eed
Fixes in the Makefiles of a couple of samples
...
- modified Makefile for hipblas_saxpy to replace hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile
Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2
[ROCm/clr commit: 8b0ce6e81b ]
2016-12-14 16:27:14 +05:30
Aditya Atluri
a86ef4e577
added simple half math ops
...
Change-Id: I10b1d1023a9e5f2ba63f28c4a2bbe60ee49a8aee
[ROCm/clr commit: 01ed8e91e9 ]
2016-12-13 20:20:58 -06:00
Aditya Atluri
b26857a631
disabled compiler flag hcc 4.0 for half support
...
Change-Id: I32175113f4c05d43310b3a05c2a14e12f6d48b09
[ROCm/clr commit: 26934a920c ]
2016-12-13 20:06:56 -06:00
Ben Sander
1f638c1da6
Add USE_IPC to disable use of IPC APIs. Set to 0.
...
[ROCm/clr commit: 5eed20be12 ]
2016-12-13 15:07:04 -06:00
Aditya Atluri
9691299b0d
added a few type reinterpret cast device functions
...
1. __int_as_float
2. __hiloint2double
Change-Id: Id247c196887b24a12090f0521bf91e13afeec733
[ROCm/clr commit: 7a712aa76b ]
2016-12-13 14:41:36 -06:00
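The two device functions named in the entry above are bit-level reinterpretations rather than numeric conversions. A minimal host-side sketch of the semantics these names have in CUDA, assuming IEEE 754 float and double; the `host_`-prefixed helpers are hypothetical stand-ins written for illustration and are not the HIP implementation from this commit:

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical host-side stand-in for __int_as_float:
// copies the 32 bits of an int into a float without converting the value.
inline float host_int_as_float(std::int32_t bits) {
    static_assert(sizeof(float) == sizeof(std::int32_t), "float must be 32 bits");
    float out;
    std::memcpy(&out, &bits, sizeof(out));
    return out;
}

// Hypothetical host-side stand-in for __hiloint2double:
// hi supplies the upper 32 bits of the double, lo the lower 32 bits.
inline double host_hiloint2double(std::int32_t hi, std::int32_t lo) {
    static_assert(sizeof(double) == sizeof(std::uint64_t), "double must be 64 bits");
    const std::uint64_t bits =
        (static_cast<std::uint64_t>(static_cast<std::uint32_t>(hi)) << 32) |
        static_cast<std::uint64_t>(static_cast<std::uint32_t>(lo));
    double out;
    std::memcpy(&out, &bits, sizeof(out));
    return out;
}
```

Under these assumptions, `host_int_as_float(0x3F800000)` yields `1.0f` and `host_hiloint2double(0x3FF00000, 0)` yields `1.0`, since those are the IEEE 754 bit patterns for one.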
Aditya Atluri
c865d93792
added half math addition ISA support
...
Change-Id: I293b771f695b499b795d7e53f600c9e4fe2a2071
[ROCm/clr commit: 02eab122c5 ]
2016-12-13 09:18:34 -06:00
Evgeny Mankov
e22ed56bd7
[HIPIFY] Code refactoring and total stat collecting
...
+ Single base class for Preprocessor and MatchFinder classes.
+ Total Stats for multiple files is added.
[ROCm/clr commit: 6bcacf83a4 ]
2016-12-13 18:01:08 +03:00
Evgeny Mankov
1da43c7216
[HIPIFY] Rename -n to -examine
...
[ROCm/clr commit: b73ffa4e7c ]
2016-12-12 20:03:01 +03:00
Evgeny Mankov
c9aa0841d8
[HIPIFY] Multiple source files support.
...
[ROCm/clr commit: d5eb7f494f ]
2016-12-12 19:57:19 +03:00
Aditya Atluri
4c61746162
Changed threadfences to match target parsing by hcc
...
Change-Id: I28fcabbaacd13495b707f263fd09afaead0665fa
[ROCm/clr commit: 812cf130b6 ]
2016-12-12 10:16:58 -06:00
Evgeny Mankov
131f601cf7
[HIPIFY] -no-backup option is added.
...
Useful for hipifying a release in place.
[ROCm/clr commit: c5f5e662ef ]
2016-12-08 23:14:19 +03:00
Evgeny Mankov
845f7a00db
[HIPIFY] -no-output support.
...
The output file is still created by clang itself, but it isn't updated and is deleted after processing.
Combined with -print-stat, -no-output (or just -n) is used to examine the source CUDA code.
Conflicting options:
-inplace
-o
[ROCm/clr commit: f5deff4ba3 ]
2016-12-08 22:45:10 +03:00
Evgeny Mankov
3dcd27d0b1
Fix the limitation of supported input files.
...
Previously only .cu and .cuda were supported.
+ All file names are now allowed, including file names without an extension.
[IMPORTANT]
To hipify a CUDA input file whose name is not *.cu, add the option "-x cuda" after the tool's options-ending marker "--", for instance:
./hipify-clang NeuralNet -- -x cuda
This option is passed to clang itself, not to the hipify tool.
[ROCm/clr commit: a81dfc5a0f ]
2016-12-08 20:28:43 +03:00
Rahul Garg
f45c6facf0
Fixed build error due to GetPCIBusId overloaded function
...
Change-Id: I626446f2c72c8143f08c95367bc1c528abeaf69d
[ROCm/clr commit: a6b2f9c3a0 ]
2016-12-08 14:35:58 +05:30
Maneesh Gupta
6fbb70b974
hcc_detail/hip_runtime_api.h: Fix IPC API signature
...
Change-Id: I0be0f09c62f231620341141bd66183c3338be56a
[ROCm/clr commit: c677041b37 ]
2016-12-08 12:50:25 +05:30
pensun
0d8957fb9a
HIP IPC implementation on ROCr IPC APIs
...
Change-Id: I1ca9d520f5d0b1b56694211471b81eb7c6c23d16
[ROCm/clr commit: 7ac5f2e8c3 ]
2016-12-07 15:38:36 -06:00
Rahul Garg
89490a8065
hipDeviceGetPCIBusId int version changes for CUDA runtime API
...
Change-Id: I4d3b995f1d1ac83415ca84808a074e5c8cd72f3c
[ROCm/clr commit: d8fdd6c6fc ]
2016-12-07 12:12:40 +05:30
pensun
87f69ab605
Fix issue of ihipModule_t double defined
...
Change-Id: I508677e595776fd573a2f224691116d01288dc78
[ROCm/clr commit: 8f2a1752c1 ]
2016-12-06 19:10:17 -06:00
pensun
e885147485
change hipgetPCIID to take int as third parameter
...
Change-Id: I4429b36756a6d868a769abd783bf28a55147c0d0
[ROCm/clr commit: 8fcae3acc0 ]
2016-12-06 14:24:09 -06:00
pensun
707358d0b4
IPC prototypes and part of the implementation included
...
Change-Id: Id88c7f155d23ec63f57a6ef05098fba43f8af336
[ROCm/clr commit: 092924d660 ]
2016-12-06 14:24:09 -06:00
pensun
aaecc3fe04
local changes for hipnccl
...
Change-Id: I05a1f0381ce2914a800f573342cc954eb5ff82d9
[ROCm/clr commit: 808e555247 ]
2016-12-06 14:22:02 -06:00
Ben Sander
448c6b3119
Enabled USE_DISPATCH_HSA_KERNEL, with serialization in hipModuleUnload.
...
[ROCm/clr commit: 3f810a203b ]
2016-12-06 10:14:01 -06:00
Rahul Garg
74803e3ddf
Simple test case for hipDeviceGetPCIBusId
...
Change-Id: I0fb6d1bef4739a5b6280928f7b349d95c1656431
[ROCm/clr commit: afc81c955e ]
2016-12-06 17:31:54 +05:30
Rahul Garg
8a210f721d
Build Error correction in hipDeviceGetPCIBusId
...
Change-Id: I50ff4d95b7a732924c7a991cba60400b1c93c0de
[ROCm/clr commit: a019f1ce39 ]
2016-12-06 17:09:21 +05:30
Rahul Garg
48e9aaf696
Changed hipDeviceGetPCIBusId to return Bus ID as string
...
Change-Id: I6d5aa7362084109d34bc015d948f8723b2a38ee9
[ROCm/clr commit: 0017419521 ]
2016-12-06 16:55:17 +05:30
Maneesh Gupta
b9bde4a4be
Remove redundant variable g_malloc_heap_size
...
Change-Id: Idaf47be70488f0deb3eab05a86d9c5a413d3fff7
[ROCm/clr commit: b8413a69c8 ]
2016-12-06 10:19:03 +05:30
pensun
70f0a2f43b
HIP resource leaks fix from Jack
...
Change-Id: I93f3ad7cb94ff1cba1577bd8acc90e826693d12e
[ROCm/clr commit: 4cb1579d4a ]
2016-12-05 20:21:33 -06:00
Maneesh Gupta
136d220c10
Revert "Enable USE_DISPATCH_HSA_KERNEL."
...
This reverts commit bbd9d98af7 .
[ROCm/clr commit: ac93376c26 ]
2016-12-05 16:55:26 +05:30
Maneesh Gupta
b01694f524
Don't share g_malloc_heap_size between libraries
...
Change-Id: Ic70bf83d4f865bc5c453941fdbc1814c77f0ad9d
[ROCm/clr commit: ac368cc60f ]
2016-12-05 11:03:45 +05:30
Ben Sander
bbd9d98af7
Enable USE_DISPATCH_HSA_KERNEL.
...
Optimize hipLaunchModule dispatch latency.
[ROCm/clr commit: f8bcbe8680 ]
2016-12-04 00:13:19 -06:00
Ben Sander
bccb897f84
Add additional controls for forcing serialization and blocking.
...
Move HIP_COHERENT_HOST_ALLOC so it is read once at init time.
Add HIP_LAUNCH_BLOCKING_KERNELS, HIP_API_BLOCKING.
Update docs on debug and chicken bits.
Conflicts:
src/hip_hcc.cpp
[ROCm/clr commit: 783ac156ce ]
2016-12-02 18:03:59 -06:00
Maneesh Gupta
5e38114363
Improve shared and static library support
...
hipcc accepts new parameters -use-staticlib and -use-sharedlib to
control linking behavior. The default is still the static library.
Change-Id: I28fb9a939f8177c75abefd8b77d8118a6666d1f4
[ROCm/clr commit: fd8b1c7d41 ]
2016-12-01 15:33:12 +05:30
Maneesh Gupta
ce325da03f
hipcc: HCC workaround no longer needs env var
...
Change-Id: I81f4eafddbda5e9e2f1082932dd502ab451cfc24
[ROCm/clr commit: f14198956f ]
2016-12-01 12:51:58 +05:30
Maneesh Gupta
97d8017e80
Fix some broken directed tests
...
Change-Id: I40f2661a74951f8d91824df8fd9ef0cc2312d183
[ROCm/clr commit: 9cab8d4260 ]
2016-12-01 12:47:37 +05:30
Aditya Atluri
2c51f72e41
added half add and fma intrinsic
...
Change-Id: Ifa60c1a7065f524f069291bb00d987b11c836cc4
[ROCm/clr commit: e6df8cb28c ]
2016-11-29 19:46:01 -06:00
pensun
946e9b4163
Change to use product device name by default
...
Change-Id: Ie2cee2a2e94a08b5874a2f5abee5d1ab6c9fdf47
[ROCm/clr commit: 504fcaf786 ]
2016-11-29 11:34:06 -06:00
Rahul Garg
4421ff4fb6
Added support for hipMemGetAddressRange
...
Change-Id: I99a796a4eb765152cf15a12d6a86b58684d34f50
[ROCm/clr commit: bda0704213 ]
2016-11-29 22:04:09 +05:30
pensun
b4b8de3cd7
Change the parameter type of hipDeviceGetPCIBusID to char*
...
Change-Id: Ia72f403126e95f65da53208fc246f45d1417381f
[ROCm/clr commit: 8e2980c7ef ]
2016-11-28 10:47:18 -06:00
Ben Sander
68f1a627e0
Add more debug info
...
[ROCm/clr commit: a504df955e ]
2016-11-26 08:56:02 -06:00
Ben Sander
e29f253256
Tweak profiler doc
...
Change-Id: I7be7c44467510e38ae850e1e0a14209a3b4380f1
[ROCm/clr commit: a990806b6c ]
2016-11-25 14:06:59 -06:00
Aditya Atluri
62e0623d18
added support for rcp for float and double
...
Change-Id: Ibeba3a9f64494fc0a176bcb4a854fb2f56567b55
[ROCm/clr commit: de89b25d52 ]
2016-11-23 20:01:18 -06:00