Commit Graph

1266 Commits

Author SHA1 Message Date
Aditya Atluri a86ef4e577 added simple half math ops
Change-Id: I10b1d1023a9e5f2ba63f28c4a2bbe60ee49a8aee


[ROCm/clr commit: 01ed8e91e9]
2016-12-13 20:20:58 -06:00
Aditya Atluri b26857a631 disabled compiler flag hcc 4.0 for half support
Change-Id: I32175113f4c05d43310b3a05c2a14e12f6d48b09


[ROCm/clr commit: 26934a920c]
2016-12-13 20:06:56 -06:00
Ben Sander 1f638c1da6 Add USE_IPC to disable use of IPC APIs. Set to 0.
[ROCm/clr commit: 5eed20be12]
2016-12-13 15:07:04 -06:00
Aditya Atluri 9691299b0d added few type reinterpret cast device functions
1. __int_as_float
2. __hiloint2double

Change-Id: Id247c196887b24a12090f0521bf91e13afeec733


[ROCm/clr commit: 7a712aa76b]
2016-12-13 14:41:36 -06:00
Aditya Atluri c865d93792 added half math addition ISA support
Change-Id: I293b771f695b499b795d7e53f600c9e4fe2a2071


[ROCm/clr commit: 02eab122c5]
2016-12-13 09:18:34 -06:00
Evgeny Mankov e22ed56bd7 [HIPIFY] Code refactoring and total stat collecting
+ Single base class for Preprocessor and MatchFinder classes.
+ Total Stats for multiple files is added.


[ROCm/clr commit: 6bcacf83a4]
2016-12-13 18:01:08 +03:00
Evgeny Mankov 1da43c7216 [HIPIFY] Rename -n to -examine
[ROCm/clr commit: b73ffa4e7c]
2016-12-12 20:03:01 +03:00
Evgeny Mankov c9aa0841d8 [HIPIFY] Multiple source files support.
[ROCm/clr commit: d5eb7f494f]
2016-12-12 19:57:19 +03:00
Aditya Atluri 4c61746162 Changed threadfences to match target parsing by hcc
Change-Id: I28fcabbaacd13495b707f263fd09afaead0665fa


[ROCm/clr commit: 812cf130b6]
2016-12-12 10:16:58 -06:00
Evgeny Mankov 131f601cf7 [HIPIFY] -no-backup option is added.
Is useful for release hipifying in place.


[ROCm/clr commit: c5f5e662ef]
2016-12-08 23:14:19 +03:00
Evgeny Mankov 845f7a00db [HIPIFY] -no-output support.
Actually output file is created by clang itself, but isn't updated and is deleted after processing.
In cooperation with -print-stat -no-output (or single -n) is used for examine the source CUDA code.

Conflicting options:
-inplce
-o


[ROCm/clr commit: f5deff4ba3]
2016-12-08 22:45:10 +03:00
Evgeny Mankov 3dcd27d0b1 Fix the limitation of supported input files.
Actually .cu and .cuda was supported.

+ All the file names are allowed, including file names without extension.

[IMPORTANT]
To hipify CUDA input file, which name is not *.cu, please add option "-x cuda" after tool's options ending marker "--", for instance:

./hipify-clang NeuralNet -- -x cuda

This option will go to clang itself, not the hipify tool.


[ROCm/clr commit: a81dfc5a0f]
2016-12-08 20:28:43 +03:00
Rahul Garg f45c6facf0 Fixed build error due to GetPCIBusId overloaded function
Change-Id: I626446f2c72c8143f08c95367bc1c528abeaf69d


[ROCm/clr commit: a6b2f9c3a0]
2016-12-08 14:35:58 +05:30
Maneesh Gupta 6fbb70b974 hcc_detail/hip_runtime_api.h: Fix IPC API signature
Change-Id: I0be0f09c62f231620341141bd66183c3338be56a


[ROCm/clr commit: c677041b37]
2016-12-08 12:50:25 +05:30
pensun 0d8957fb9a HIP IPC implementation on ROCr IPC APIs
Change-Id: I1ca9d520f5d0b1b56694211471b81eb7c6c23d16


[ROCm/clr commit: 7ac5f2e8c3]
2016-12-07 15:38:36 -06:00
Rahul Garg 89490a8065 hipDeviceGetPCIBusId int version changes for CUDA runtime API
Change-Id: I4d3b995f1d1ac83415ca84808a074e5c8cd72f3c


[ROCm/clr commit: d8fdd6c6fc]
2016-12-07 12:12:40 +05:30
pensun 87f69ab605 Fix issue of ihipModule_t double defined
Change-Id: I508677e595776fd573a2f224691116d01288dc78


[ROCm/clr commit: 8f2a1752c1]
2016-12-06 19:10:17 -06:00
pensun e885147485 change hipgetPCIID to take int as third parameter
Change-Id: I4429b36756a6d868a769abd783bf28a55147c0d0


[ROCm/clr commit: 8fcae3acc0]
2016-12-06 14:24:09 -06:00
pensun 707358d0b4 IPC prototyps and part of the implementation included
Change-Id: Id88c7f155d23ec63f57a6ef05098fba43f8af336


[ROCm/clr commit: 092924d660]
2016-12-06 14:24:09 -06:00
pensun aaecc3fe04 local changes for hipnccl
Change-Id: I05a1f0381ce2914a800f573342cc954eb5ff82d9


[ROCm/clr commit: 808e555247]
2016-12-06 14:22:02 -06:00
Ben Sander 448c6b3119 Enabled USE_DISPATCH_HSA_KERNEL, with serialization in hipModuleUnload.
[ROCm/clr commit: 3f810a203b]
2016-12-06 10:14:01 -06:00
Rahul Garg 74803e3ddf Simple test case for hipDeviceGetPCIBusId
Change-Id: I0fb6d1bef4739a5b6280928f7b349d95c1656431


[ROCm/clr commit: afc81c955e]
2016-12-06 17:31:54 +05:30
Rahul Garg 8a210f721d Build Error correction in hipDeviceGetPCIBusId
Change-Id: I50ff4d95b7a732924c7a991cba60400b1c93c0de


[ROCm/clr commit: a019f1ce39]
2016-12-06 17:09:21 +05:30
Rahul Garg 48e9aaf696 Changed hipDeviceGetPCIBusId to return Bus ID as string
Change-Id: I6d5aa7362084109d34bc015d948f8723b2a38ee9


[ROCm/clr commit: 0017419521]
2016-12-06 16:55:17 +05:30
Maneesh Gupta b9bde4a4be Remove redundant variable g_malloc_heap_size
Change-Id: Idaf47be70488f0deb3eab05a86d9c5a413d3fff7


[ROCm/clr commit: b8413a69c8]
2016-12-06 10:19:03 +05:30
pensun 70f0a2f43b HIP resource leaks fix from Jack
Change-Id: I93f3ad7cb94ff1cba1577bd8acc90e826693d12e


[ROCm/clr commit: 4cb1579d4a]
2016-12-05 20:21:33 -06:00
Maneesh Gupta 136d220c10 Revert "Enable USE_DISPATCH_HSA_KERNEL."
This reverts commit bbd9d98af7.


[ROCm/clr commit: ac93376c26]
2016-12-05 16:55:26 +05:30
Maneesh Gupta b01694f524 Don't share g_malloc_heap_size between libraries
Change-Id: Ic70bf83d4f865bc5c453941fdbc1814c77f0ad9d


[ROCm/clr commit: ac368cc60f]
2016-12-05 11:03:45 +05:30
Ben Sander bbd9d98af7 Enable USE_DISPATCH_HSA_KERNEL.
Optimize hipLaunchModule dispatch latency.


[ROCm/clr commit: f8bcbe8680]
2016-12-04 00:13:19 -06:00
Ben Sander bccb897f84 Add additional controls for forcing serialization and blocking.
Move HIP_COHERENT_HOST_ALLOC so it is read once at init time.
Add HIP_LAUNCH_BLOCKING_KERNELS, HIP_API_BLOCKING.
Update docs on debug and chicken bits.

Conflicts:
	src/hip_hcc.cpp


[ROCm/clr commit: 783ac156ce]
2016-12-02 18:03:59 -06:00
Maneesh Gupta 5e38114363 Improve shared and static library support
hipcc accepts new parameter -use-staticlib and -use-sharedlib to
control linking behavior. Default is still static library.

Change-Id: I28fb9a939f8177c75abefd8b77d8118a6666d1f4


[ROCm/clr commit: fd8b1c7d41]
2016-12-01 15:33:12 +05:30
Maneesh Gupta ce325da03f hipcc: HCC workaround no longer needs env var
Change-Id: I81f4eafddbda5e9e2f1082932dd502ab451cfc24


[ROCm/clr commit: f14198956f]
2016-12-01 12:51:58 +05:30
Maneesh Gupta 97d8017e80 Fix some broken directed tests
Change-Id: I40f2661a74951f8d91824df8fd9ef0cc2312d183


[ROCm/clr commit: 9cab8d4260]
2016-12-01 12:47:37 +05:30
Aditya Atluri 2c51f72e41 added half add and fma intrinsic
Change-Id: Ifa60c1a7065f524f069291bb00d987b11c836cc4


[ROCm/clr commit: e6df8cb28c]
2016-11-29 19:46:01 -06:00
pensun 946e9b4163 Change to use produce device name by default
Change-Id: Ie2cee2a2e94a08b5874a2f5abee5d1ab6c9fdf47


[ROCm/clr commit: 504fcaf786]
2016-11-29 11:34:06 -06:00
Rahul Garg 4421ff4fb6 Added support for hipMemGetAddressRange
Change-Id: I99a796a4eb765152cf15a12d6a86b58684d34f50


[ROCm/clr commit: bda0704213]
2016-11-29 22:04:09 +05:30
pensun b4b8de3cd7 Change the parameter type of hipDeviceGetPCIBusID to char*
Change-Id: Ia72f403126e95f65da53208fc246f45d1417381f


[ROCm/clr commit: 8e2980c7ef]
2016-11-28 10:47:18 -06:00
Ben Sander 68f1a627e0 Add more debug info
[ROCm/clr commit: a504df955e]
2016-11-26 08:56:02 -06:00
Ben Sander e29f253256 Tweak profiler doc
Change-Id: I7be7c44467510e38ae850e1e0a14209a3b4380f1


[ROCm/clr commit: a990806b6c]
2016-11-25 14:06:59 -06:00
Aditya Atluri 62e0623d18 added support for rcp for float and double
Change-Id: Ibeba3a9f64494fc0a176bcb4a854fb2f56567b55


[ROCm/clr commit: de89b25d52]
2016-11-23 20:01:18 -06:00
Aditya Atluri 11018fa0f6 added fma for double and float
1. Added fma intrinsic support for double and float
2. Added test for fma

Change-Id: I909fdbec34a3d12c03ba6eff3a39376a7128ee43


[ROCm/clr commit: cc1f8a1011]
2016-11-23 18:22:05 -06:00
pensun 29a69adda7 Add some missing APIs on nv path and hipify
Change-Id: Ic0f4740ab06bf70b1de61b39fedc7a6e7605cb61


[ROCm/clr commit: 69b43ec17c]
2016-11-23 14:36:30 -06:00
pensun bf34811dbe Add several missing APIs in hipify
Change-Id: I58912871cb0b10128f221ef26a11b0d69fb7873c


[ROCm/clr commit: a4b43a6610]
2016-11-23 14:06:18 -06:00
Aditya Atluri a2ecbc2d6e Added fast math flag
1. Use -DHIP_FAST_MATH to make precise math functions compiled to fast math
2. Added double fast math functions for sqrt
3. Changed hipcc to parse -use_fast_math (not working)
4. Added passed tag to hipFloatMath test

Change-Id: I72884b2436b4efe61e9a9297346c1358fee38a2d


[ROCm/clr commit: c2f6ecf264]
2016-11-23 11:19:15 -06:00
Ben Sander 79c92fdb23 Add toc to hip_profiling.md
Change-Id: I3ae100f12686d0398a0403b78ca571382acce135


[ROCm/clr commit: ddb1845ab5]
2016-11-23 08:36:08 -06:00
Ben Sander e2f024fc55 Improve docs in some places
Change-Id: If31e84fbf0c8595ca72edb842dce7ce47783579b


[ROCm/clr commit: 9db93a1b96]
2016-11-23 08:16:18 -06:00
Ben Sander 2e63b780f4 Improve debug capabilities.
Print TID mapping at init when HIP_TRACE_API=1.
Print base host/dev info from tracker during copy.

Change-Id: I84e26d7b801567e5a91baad36126fb590920ec87


[ROCm/clr commit: 111b57ddd0]
2016-11-23 08:16:18 -06:00
Ben Sander 4a60fbc922 Improve profiler and debug documentation
[ROCm/clr commit: 4bcb0fac22]
2016-11-23 08:15:40 -06:00
Rahul Garg cc61f4a720 Removed nested HIP calls from hip_device functions
Change-Id: I18785b0ee27e32fb8950982fa5c3a64d1ae6a9b8


[ROCm/clr commit: 8a2685e6cd]
2016-11-23 18:37:06 +05:30
Aditya Atluri 06969472b2 added fast math intrinsics to HIP
1. Added fast math intrinsics for single precision data types
2. Added test to check the intrinsics
3. Added HIP_PRECISE_MATH macro to enable precise math on fast math

Change-Id: Iadacbb6182c31252c5e3252854372d1b80dfd27b


[ROCm/clr commit: d9a3527769]
2016-11-22 15:26:00 -06:00