Wykres commitów

1210 Commity

Autor SHA1 Wiadomość Data
Maneesh Gupta 97d8017e80 Fix some broken directed tests
Change-Id: I40f2661a74951f8d91824df8fd9ef0cc2312d183


[ROCm/clr commit: 9cab8d4260]
2016-12-01 12:47:37 +05:30
Aditya Atluri 2c51f72e41 added half add and fma intrinsic
Change-Id: Ifa60c1a7065f524f069291bb00d987b11c836cc4


[ROCm/clr commit: e6df8cb28c]
2016-11-29 19:46:01 -06:00
pensun 946e9b4163 Change to use produce device name by default
Change-Id: Ie2cee2a2e94a08b5874a2f5abee5d1ab6c9fdf47


[ROCm/clr commit: 504fcaf786]
2016-11-29 11:34:06 -06:00
Rahul Garg 4421ff4fb6 Added support for hipMemGetAddressRange
Change-Id: I99a796a4eb765152cf15a12d6a86b58684d34f50


[ROCm/clr commit: bda0704213]
2016-11-29 22:04:09 +05:30
pensun b4b8de3cd7 Change the parameter type of hipDeviceGetPCIBusID to char*
Change-Id: Ia72f403126e95f65da53208fc246f45d1417381f


[ROCm/clr commit: 8e2980c7ef]
2016-11-28 10:47:18 -06:00
Ben Sander 68f1a627e0 Add more debug info
[ROCm/clr commit: a504df955e]
2016-11-26 08:56:02 -06:00
Ben Sander e29f253256 Tweak profiler doc
Change-Id: I7be7c44467510e38ae850e1e0a14209a3b4380f1


[ROCm/clr commit: a990806b6c]
2016-11-25 14:06:59 -06:00
Aditya Atluri 62e0623d18 added support for rcp for float and double
Change-Id: Ibeba3a9f64494fc0a176bcb4a854fb2f56567b55


[ROCm/clr commit: de89b25d52]
2016-11-23 20:01:18 -06:00
Aditya Atluri 11018fa0f6 added fma for double and float
1. Added fma intrinsic support for double and float
2. Added test for fma

Change-Id: I909fdbec34a3d12c03ba6eff3a39376a7128ee43


[ROCm/clr commit: cc1f8a1011]
2016-11-23 18:22:05 -06:00
pensun 29a69adda7 Add some missing APIs on nv path and hipify
Change-Id: Ic0f4740ab06bf70b1de61b39fedc7a6e7605cb61


[ROCm/clr commit: 69b43ec17c]
2016-11-23 14:36:30 -06:00
pensun bf34811dbe Add several missing APIs in hipify
Change-Id: I58912871cb0b10128f221ef26a11b0d69fb7873c


[ROCm/clr commit: a4b43a6610]
2016-11-23 14:06:18 -06:00
Aditya Atluri a2ecbc2d6e Added fast math flag
1. Use -DHIP_FAST_MATH to make precise math functions compiled to fast math
2. Added double fast math functions for sqrt
3. Changed hipcc to parse -use_fast_math (not working)
4. Added passed tag to hipFloatMath test

Change-Id: I72884b2436b4efe61e9a9297346c1358fee38a2d


[ROCm/clr commit: c2f6ecf264]
2016-11-23 11:19:15 -06:00
Ben Sander 79c92fdb23 Add toc to hip_profiling.md
Change-Id: I3ae100f12686d0398a0403b78ca571382acce135


[ROCm/clr commit: ddb1845ab5]
2016-11-23 08:36:08 -06:00
Ben Sander e2f024fc55 Improve docs in some places
Change-Id: If31e84fbf0c8595ca72edb842dce7ce47783579b


[ROCm/clr commit: 9db93a1b96]
2016-11-23 08:16:18 -06:00
Ben Sander 2e63b780f4 Improve debug capabilities.
Print TID mapping at init when HIP_TRACE_API=1.
Print base host/dev info from tracker during copy.

Change-Id: I84e26d7b801567e5a91baad36126fb590920ec87


[ROCm/clr commit: 111b57ddd0]
2016-11-23 08:16:18 -06:00
Ben Sander 4a60fbc922 Improve profiler and debug documentation
[ROCm/clr commit: 4bcb0fac22]
2016-11-23 08:15:40 -06:00
Rahul Garg cc61f4a720 Removed nested HIP calls from hip_device functions
Change-Id: I18785b0ee27e32fb8950982fa5c3a64d1ae6a9b8


[ROCm/clr commit: 8a2685e6cd]
2016-11-23 18:37:06 +05:30
Aditya Atluri 06969472b2 added fast math intrinsics to HIP
1. Added fast math intrinsics for single precision data types
2. Added test to check the intrinsics
3. Added HIP_PRECISE_MATH macro to enable precise math on fast math

Change-Id: Iadacbb6182c31252c5e3252854372d1b80dfd27b


[ROCm/clr commit: d9a3527769]
2016-11-22 15:26:00 -06:00
Aditya Atluri b8b1459a64 added fast math APIs
1. Added fast math apis for sin, cos, tan, sincos
2. Added test for trig math functions
3. Added logarithm fast math
4. Changed how hipGetDevice, hipDeviceGetCacheConfig emit errors

Change-Id: Ie6ab594ddd5853cbe85e39a2f6d3479a807fa323


[ROCm/clr commit: 1a85762f53]
2016-11-22 10:20:09 -06:00
Rahul Garg 5f24847ec0 Removed hsaKmtReleaseSystemProperties call
Change-Id: I7cb992cccf587c333f0ca0cb518409f3944bdb06


[ROCm/clr commit: afbd278804]
2016-11-22 06:15:35 +05:30
Aditya Atluri 66bd811c0d fixed error output for hipDeviceGetAttribute
Change-Id: I1e343a4e4e20e1a550d419f701cc1e60e9d03af4


[ROCm/clr commit: 912426716b]
2016-11-21 18:07:01 -06:00
Aditya Atluri a7bb6239ab fixed texture header on nvcc
Change-Id: Ibe19f94be5edf972b6b51dea263e1088b6c60c1d


[ROCm/clr commit: 2ded0ce302]
2016-11-21 13:53:28 -06:00
Aditya Atluri 72ff879c0a removed warnings in macros
Change-Id: I992b11f6aee2bab09f46885a2d12234aa6814cc5


[ROCm/clr commit: fef766df88]
2016-11-21 09:04:36 -06:00
Aditya Atluri 59c423b36a fixed compilation bugs
1. Texture functions are now compiling fine
2. Fixed hipFuncCache to hipFuncCache_t

Change-Id: I8f815887e4de43ee115bbaff249905b236541c39


[ROCm/clr commit: 2611de2477]
2016-11-21 08:56:30 -06:00
Aditya Atluri df27127960 Fixed hipDeviceGetCacheConfig on nvcc path
1. Changed test macro to emit line numbers
2. Added getcacheconfig api test for nvcc path
3. Fixed hipFuncCache_t data type

TODO: With this commit, right now there are 2 func cache datatypes
a. hipFuncCache_t for runtime API
b. hipFuncCache for driver API

Map these to a single data type

Change-Id: Ia47c9f5d7c2633638051bf17b1103048a1ede973


[ROCm/clr commit: b3c16ea7b5]
2016-11-20 12:18:08 -06:00
Aditya Atluri 979e54d54e added new test for getting attribute
1. Added copyright to all new tests
2. Added test for hipDeviceGetAttribute

Change-Id: I7a070c5b8316ef6575b3f4c49bda2769aea2a7c4


[ROCm/clr commit: e0aba8647f]
2016-11-20 11:53:16 -06:00
Aditya Atluri 76fbc233af added copy right to new header
Change-Id: I16e1d02194551e4b20019bcb6850a3f84882ef18


[ROCm/clr commit: cc829f04c5]
2016-11-19 23:02:56 -06:00
Aditya Atluri 21470101c7 added tests to check nvcc runtime api output
Change-Id: Ifdd39b5d0a6a58d20a8e9745e59dd82d50a90e2f


[ROCm/clr commit: 6692ee09d7]
2016-11-19 21:36:28 -06:00
Sandeep Kumar b8eda419a8 fix_format
Change-Id: I34e265de434263a11654e5deba044c3f21e86578


[ROCm/clr commit: 1fe6890ba3]
2016-11-18 14:34:14 +05:30
Maneesh Gupta b13556367d Refactor for building HIP as dynamic library
Change-Id: I65a3d9d589c4fdbbdcf1611e5427224253be8260


[ROCm/clr commit: 2195e3c37d]
2016-11-18 14:33:20 +05:30
scchan 782c7fca8b Add extra linker flags to the shared library build
Change-Id: I19e569d566fb5e25e343e364a3053a3f12659361


[ROCm/clr commit: 04049feaf4]
2016-11-18 14:18:29 +05:30
Maneesh Gupta 333e994a57 Fix broken tests due to d3559bf
Change-Id: I847c80f8462e1c955bdef957e6de2841a3a6ab29


[ROCm/clr commit: 1a67766dc5]
2016-11-18 12:20:47 +05:30
Aditya Atluri cc573d140d moved runtime macros to runtime_api.h
Change-Id: Ib47e449328e8e6ec55d1b6ee19899de4b591ea8e


[ROCm/clr commit: 3b1f0e903c]
2016-11-17 14:19:18 -06:00
Aditya Atluri e2dd339cfd added texture header to memory api source
Change-Id: I1af6d60aca5a9a9ef1cadf8c304bea892acbe061


[ROCm/clr commit: 84d0d10fad]
2016-11-17 11:57:53 -06:00
Aditya Atluri d3559bffb4 make texture as seperate header as of now
Change-Id: I3c65aa75f2f729eedd8c3292fa3cbc37709c1cfe


[ROCm/clr commit: 94984470d4]
2016-11-17 11:55:29 -06:00
Aditya Atluri a997f0f074 Added i8 packed math intrinsics
1. Added add, sub, mul packed math i8 intrinsics
2. Removed c++ packed data structures included from HCC

Change-Id: I1d109c5ce10c48b7cd3ea059478b88fc1de78499
TODO: Add better packed data structures support


[ROCm/clr commit: 603bb321ec]
2016-11-17 01:09:12 -06:00
Maneesh Gupta 65782bbaeb Enable USE_COPY_EXT_V2 by default
Change-Id: I2c0dc80f85a0ccb5744715b5418a604e38b249ed


[ROCm/clr commit: 0696d4417f]
2016-11-15 10:42:27 +05:30
Ben Sander ab0362087f tweak hcc demangler
[ROCm/clr commit: 2ea3f8f68e]
2016-11-14 15:26:27 -07:00
Sandeep Kumar dc599cf2b8 Add p2p for cookbook
Change-Id: Id2e77ab31123ef95885d665efe34bc0d4596733a
(cherry picked from commit 6fbd0352713ca36e399b1ed4f17c486207a53875)


[ROCm/clr commit: 39e1b16d0b]
2016-11-14 06:10:36 +05:30
Maneesh Gupta d8564db4a5 Revert "hipcc: Turn back linking hip_ir.ll by default"
This reverts commit 5a48591fc5.


[ROCm/clr commit: f9d598d66c]
2016-11-14 06:05:31 +05:30
Ben Sander b8fb23009b Add draft doc on profiling with hip.
Change-Id: I79727dd2500333b3f16acb381dd5852a15ed408a


[ROCm/clr commit: 09d88d3b97]
2016-11-13 10:01:05 -06:00
Ben Sander 603c3a3a38 Add &nbsp to demangler
Change-Id: I89586c7c17f5152b7a6850d0d6c2aa1d3ebc8190


[ROCm/clr commit: d3dbf66ab1]
2016-11-11 16:50:56 -06:00
pensun dd1061b874 Add direct test case for threadfence_system workaround
Change-Id: I5b21b590e957c901044741ac94e816cd8b1426f9


[ROCm/clr commit: 992f94b3a1]
2016-11-11 15:09:43 -06:00
Aditya Atluri a3286737aa fixed multi-dim module kernel launch
Change-Id: Id1d81f2375d058979ab526433f905cf0ea3d23d6


[ROCm/clr commit: 6dcdf08e0d]
2016-11-11 12:25:23 -06:00
Ben Sander 5e354dcd77 Add option to deny peer access.
Also fix test.

Change-Id: I1b247f6c4271442b008e560669bca4daf8eb94c7


[ROCm/clr commit: d666fbaafe]
2016-11-10 23:12:48 -06:00
Ben Sander 40f8947cc3 Use forceUnpinnedCopy to resolve P2p corner cases.
Change-Id: I2aebb419881246cebb696bec87798635bc71acc2


[ROCm/clr commit: 6e54a600b6]
2016-11-10 23:12:48 -06:00
Ben Sander f634f73fef Enable async copy again.
Also add HIP_FORCE_SYNC_COPY chicken bit.

Change-Id: I76a385410494b99bf27305d3c08f55dd81987565


[ROCm/clr commit: 0eeaa3bcd5]
2016-11-10 23:12:48 -06:00
Ben Sander 0c66772f37 Doc change only - add comments to test.
Change-Id: Ie42087cf3c78e49337b18bb71f3f0e1e7950ee1b


[ROCm/clr commit: 85e65b55ff]
2016-11-10 23:12:48 -06:00
Ben Sander ee41609b48 Refactor copy and P2P logic.
Prefer use of source-engine for DMA copies, even if user submits copy
in a stream attached to a different device.
The stream is now used only for synchronization, and HIP
makes the most optimal decision for which engine to perform the
copy - typically the source copy engine.

HIP now makes decision on which engine should perform the copy
and passes this to HCC using new apis.
HIP has additional information about peer
visibility and will make a decision which agent should perform
the copy .

Change-Id: I0cf4cfebeae256e6ca795f08a7ed7130f4857d1f


[ROCm/clr commit: e767e0032e]
2016-11-10 23:12:48 -06:00
Ben Sander ae2992bcb9 Improve memory debug
Change-Id: I0f033139aa4e4b47039eb016e404009127bd0a44


[ROCm/clr commit: e9835617f1]
2016-11-10 23:12:48 -06:00