1221 Commits

Author SHA1 Message Date
Ben Sander e2f024fc55 Improve docs in some places
Change-Id: If31e84fbf0c8595ca72edb842dce7ce47783579b


[ROCm/clr commit: 9db93a1b96]
2016-11-23 08:16:18 -06:00
Ben Sander 2e63b780f4 Improve debug capabilities.
Print TID mapping at init when HIP_TRACE_API=1.
Print base host/dev info from tracker during copy.

Change-Id: I84e26d7b801567e5a91baad36126fb590920ec87


[ROCm/clr commit: 111b57ddd0]
2016-11-23 08:16:18 -06:00
Ben Sander 4a60fbc922 Improve profiler and debug documentation
[ROCm/clr commit: 4bcb0fac22]
2016-11-23 08:15:40 -06:00
Rahul Garg cc61f4a720 Removed nested HIP calls from hip_device functions
Change-Id: I18785b0ee27e32fb8950982fa5c3a64d1ae6a9b8


[ROCm/clr commit: 8a2685e6cd]
2016-11-23 18:37:06 +05:30
Aditya Atluri 06969472b2 added fast math intrinsics to HIP
1. Added fast math intrinsics for single precision data types
2. Added test to check the intrinsics
3. Added HIP_PRECISE_MATH macro to use precise math in place of fast math

Change-Id: Iadacbb6182c31252c5e3252854372d1b80dfd27b


[ROCm/clr commit: d9a3527769]
2016-11-22 15:26:00 -06:00
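
As an illustration of item 3 in the commit above, here is a minimal host-side C++ sketch of how a macro such as HIP_PRECISE_MATH could switch a fast math call over to a precise one. Only the macro name comes from the commit message. The function names `sketch_fast_sinf` and `sketch_sinf`, the polynomial stand-in for a hardware intrinsic, and the way the macro is consumed are all assumptions made for this example, not the contents of the HIP headers.

```cpp
#include <cmath>

// Hypothetical stand-in for a hardware fast-math intrinsic: a two-term
// Taylor polynomial that is cheap but only roughly accurate near zero.
inline float sketch_fast_sinf(float x) {
    return x - (x * x * x) / 6.0f;
}

// Hypothetical wrapper: callers always use sketch_sinf, and the build
// decides (via -DHIP_PRECISE_MATH) whether that means fast or precise.
inline float sketch_sinf(float x) {
#ifdef HIP_PRECISE_MATH
    return std::sin(x);          // precise path
#else
    return sketch_fast_sinf(x);  // fast, approximate path
#endif
}
```

Compiling with `-DHIP_PRECISE_MATH` selects the `std::sin` path without touching call sites, which is the usual reason to put the switch behind a macro.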
Aditya Atluri b8b1459a64 added fast math APIs
1. Added fast math APIs for sin, cos, tan, sincos
2. Added test for trig math functions
3. Added logarithm fast math
4. Changed how hipGetDevice, hipDeviceGetCacheConfig emit errors

Change-Id: Ie6ab594ddd5853cbe85e39a2f6d3479a807fa323


[ROCm/clr commit: 1a85762f53]
2016-11-22 10:20:09 -06:00
Rahul Garg 5f24847ec0 Removed hsaKmtReleaseSystemProperties call
Change-Id: I7cb992cccf587c333f0ca0cb518409f3944bdb06


[ROCm/clr commit: afbd278804]
2016-11-22 06:15:35 +05:30
Aditya Atluri 66bd811c0d fixed error output for hipDeviceGetAttribute
Change-Id: I1e343a4e4e20e1a550d419f701cc1e60e9d03af4


[ROCm/clr commit: 912426716b]
2016-11-21 18:07:01 -06:00
Aditya Atluri a7bb6239ab fixed texture header on nvcc
Change-Id: Ibe19f94be5edf972b6b51dea263e1088b6c60c1d


[ROCm/clr commit: 2ded0ce302]
2016-11-21 13:53:28 -06:00
Aditya Atluri 72ff879c0a removed warnings in macros
Change-Id: I992b11f6aee2bab09f46885a2d12234aa6814cc5


[ROCm/clr commit: fef766df88]
2016-11-21 09:04:36 -06:00
Aditya Atluri 59c423b36a fixed compilation bugs
1. Texture functions now compile
2. Fixed hipFuncCache to hipFuncCache_t

Change-Id: I8f815887e4de43ee115bbaff249905b236541c39


[ROCm/clr commit: 2611de2477]
2016-11-21 08:56:30 -06:00
Aditya Atluri df27127960 Fixed hipDeviceGetCacheConfig on nvcc path
1. Changed test macro to emit line numbers
2. Added getcacheconfig api test for nvcc path
3. Fixed hipFuncCache_t data type

TODO: As of this commit there are two func cache data types:
a. hipFuncCache_t for the runtime API
b. hipFuncCache for the driver API

Map these to a single data type.

Change-Id: Ia47c9f5d7c2633638051bf17b1103048a1ede973


[ROCm/clr commit: b3c16ea7b5]
2016-11-20 12:18:08 -06:00
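
One way the TODO in the commit above could be resolved is with a type alias, so both spellings name the same enum. The following is a sketch only: the `sketch` namespace and the `prefersShared` helper are hypothetical, the enumerator names are assumed to match the HIP runtime API, and nothing here is taken from the actual HIP headers.

```cpp
#include <type_traits>

namespace sketch {

// Cache preference values. This standalone copy exists only for
// illustration; the enumerator names are an assumption.
enum hipFuncCache_t {
    hipFuncCachePreferNone,
    hipFuncCachePreferShared,
    hipFuncCachePreferL1,
    hipFuncCachePreferEqual
};

// The driver-API spelling becomes an alias of the runtime-API type,
// so there is a single underlying data type.
using hipFuncCache = hipFuncCache_t;

// Hypothetical helper that accepts either spelling.
inline bool prefersShared(hipFuncCache_t cfg) {
    return cfg == hipFuncCachePreferShared;
}

}  // namespace sketch

static_assert(std::is_same<sketch::hipFuncCache, sketch::hipFuncCache_t>::value,
              "both names must refer to one type");
```

With the alias in place, a value declared as `sketch::hipFuncCache` can be passed to any function that takes `sketch::hipFuncCache_t` without a cast.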
Aditya Atluri 979e54d54e added new test for getting attribute
1. Added copyright to all new tests
2. Added test for hipDeviceGetAttribute

Change-Id: I7a070c5b8316ef6575b3f4c49bda2769aea2a7c4


[ROCm/clr commit: e0aba8647f]
2016-11-20 11:53:16 -06:00
Aditya Atluri 76fbc233af added copyright to new header
Change-Id: I16e1d02194551e4b20019bcb6850a3f84882ef18


[ROCm/clr commit: cc829f04c5]
2016-11-19 23:02:56 -06:00
Aditya Atluri 21470101c7 added tests to check nvcc runtime api output
Change-Id: Ifdd39b5d0a6a58d20a8e9745e59dd82d50a90e2f


[ROCm/clr commit: 6692ee09d7]
2016-11-19 21:36:28 -06:00
Sandeep Kumar b8eda419a8 fix_format
Change-Id: I34e265de434263a11654e5deba044c3f21e86578


[ROCm/clr commit: 1fe6890ba3]
2016-11-18 14:34:14 +05:30
Maneesh Gupta b13556367d Refactor for building HIP as dynamic library
Change-Id: I65a3d9d589c4fdbbdcf1611e5427224253be8260


[ROCm/clr commit: 2195e3c37d]
2016-11-18 14:33:20 +05:30
scchan 782c7fca8b Add extra linker flags to the shared library build
Change-Id: I19e569d566fb5e25e343e364a3053a3f12659361


[ROCm/clr commit: 04049feaf4]
2016-11-18 14:18:29 +05:30
Maneesh Gupta 333e994a57 Fix broken tests due to d3559bf
Change-Id: I847c80f8462e1c955bdef957e6de2841a3a6ab29


[ROCm/clr commit: 1a67766dc5]
2016-11-18 12:20:47 +05:30
Aditya Atluri cc573d140d moved runtime macros to runtime_api.h
Change-Id: Ib47e449328e8e6ec55d1b6ee19899de4b591ea8e


[ROCm/clr commit: 3b1f0e903c]
2016-11-17 14:19:18 -06:00
Aditya Atluri e2dd339cfd added texture header to memory api source
Change-Id: I1af6d60aca5a9a9ef1cadf8c304bea892acbe061


[ROCm/clr commit: 84d0d10fad]
2016-11-17 11:57:53 -06:00
Aditya Atluri d3559bffb4 make texture a separate header for now
Change-Id: I3c65aa75f2f729eedd8c3292fa3cbc37709c1cfe


[ROCm/clr commit: 94984470d4]
2016-11-17 11:55:29 -06:00
Aditya Atluri a997f0f074 Added i8 packed math intrinsics
1. Added add, sub, mul packed math i8 intrinsics
2. Removed c++ packed data structures included from HCC

Change-Id: I1d109c5ce10c48b7cd3ea059478b88fc1de78499
TODO: Add better packed data structures support


[ROCm/clr commit: 603bb321ec]
2016-11-17 01:09:12 -06:00
Maneesh Gupta 65782bbaeb Enable USE_COPY_EXT_V2 by default
Change-Id: I2c0dc80f85a0ccb5744715b5418a604e38b249ed


[ROCm/clr commit: 0696d4417f]
2016-11-15 10:42:27 +05:30
Ben Sander ab0362087f tweak hcc demangler
[ROCm/clr commit: 2ea3f8f68e]
2016-11-14 15:26:27 -07:00
Sandeep Kumar dc599cf2b8 Add p2p for cookbook
Change-Id: Id2e77ab31123ef95885d665efe34bc0d4596733a
(cherry picked from commit 6fbd0352713ca36e399b1ed4f17c486207a53875)


[ROCm/clr commit: 39e1b16d0b]
2016-11-14 06:10:36 +05:30
Maneesh Gupta d8564db4a5 Revert "hipcc: Turn back linking hip_ir.ll by default"
This reverts commit 5a48591fc5.


[ROCm/clr commit: f9d598d66c]
2016-11-14 06:05:31 +05:30
Ben Sander b8fb23009b Add draft doc on profiling with hip.
Change-Id: I79727dd2500333b3f16acb381dd5852a15ed408a


[ROCm/clr commit: 09d88d3b97]
2016-11-13 10:01:05 -06:00
Ben Sander 603c3a3a38 Add &nbsp to demangler
Change-Id: I89586c7c17f5152b7a6850d0d6c2aa1d3ebc8190


[ROCm/clr commit: d3dbf66ab1]
2016-11-11 16:50:56 -06:00
pensun dd1061b874 Add direct test case for threadfence_system workaround
Change-Id: I5b21b590e957c901044741ac94e816cd8b1426f9


[ROCm/clr commit: 992f94b3a1]
2016-11-11 15:09:43 -06:00
Aditya Atluri a3286737aa fixed multi-dim module kernel launch
Change-Id: Id1d81f2375d058979ab526433f905cf0ea3d23d6


[ROCm/clr commit: 6dcdf08e0d]
2016-11-11 12:25:23 -06:00
Ben Sander 5e354dcd77 Add option to deny peer access.
Also fix test.

Change-Id: I1b247f6c4271442b008e560669bca4daf8eb94c7


[ROCm/clr commit: d666fbaafe]
2016-11-10 23:12:48 -06:00
Ben Sander 40f8947cc3 Use forceUnpinnedCopy to resolve P2p corner cases.
Change-Id: I2aebb419881246cebb696bec87798635bc71acc2


[ROCm/clr commit: 6e54a600b6]
2016-11-10 23:12:48 -06:00
Ben Sander f634f73fef Enable async copy again.
Also add HIP_FORCE_SYNC_COPY chicken bit.

Change-Id: I76a385410494b99bf27305d3c08f55dd81987565


[ROCm/clr commit: 0eeaa3bcd5]
2016-11-10 23:12:48 -06:00
Ben Sander 0c66772f37 Doc change only - add comments to test.
Change-Id: Ie42087cf3c78e49337b18bb71f3f0e1e7950ee1b


[ROCm/clr commit: 85e65b55ff]
2016-11-10 23:12:48 -06:00
Ben Sander ee41609b48 Refactor copy and P2P logic.
Prefer the source device's engine for DMA copies, even if the user submits the copy
in a stream attached to a different device.
The stream is now used only for synchronization, and HIP
decides which engine performs the
copy, typically the source copy engine.

HIP passes this decision to HCC using new APIs.
HIP has additional information about peer
visibility, which it uses to decide which agent should perform
the copy.

Change-Id: I0cf4cfebeae256e6ca795f08a7ed7130f4857d1f


[ROCm/clr commit: e767e0032e]
2016-11-10 23:12:48 -06:00
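
The engine-selection policy described in the copy and P2P refactor commit above can be sketched in plain C++. This is a model under stated assumptions, not HIP's implementation: `PeerMap`, `pickCopyDevice`, the fallback to the destination device, and the `-1` result meaning "no direct path" are all hypothetical. Only the preference for the source engine, the use of peer visibility, and the stream being used for synchronization alone come from the commit message.

```cpp
#include <cassert>
#include <vector>

// Hypothetical peer-visibility table for deviceCount devices.
// canAccess[a * deviceCount + b] is true when device a can access
// memory that lives on device b.
struct PeerMap {
    int deviceCount;
    std::vector<bool> canAccess;

    bool visible(int from, int to) const {
        assert(from >= 0 && from < deviceCount);
        assert(to >= 0 && to < deviceCount);
        return from == to || canAccess[from * deviceCount + to];
    }
};

// Returns the device whose copy engine should perform the transfer,
// or -1 when neither side can see the other (assumed to mean some
// staged fallback is needed).
int pickCopyDevice(const PeerMap& peers, int srcDev, int dstDev, int streamDev) {
    (void)streamDev;  // the stream only orders the copy; it does not pick the engine

    if (peers.visible(srcDev, dstDev)) {
        return srcDev;  // preferred: the source device's engine
    }
    if (peers.visible(dstDev, srcDev)) {
        return dstDev;  // assumed fallback: the destination device's engine
    }
    return -1;
}
```

Note that `streamDev` is deliberately ignored when choosing the engine, which mirrors the statement that a copy submitted in a stream attached to a different device still prefers the source engine.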
Ben Sander ae2992bcb9 Improve memory debug
Change-Id: I0f033139aa4e4b47039eb016e404009127bd0a44


[ROCm/clr commit: e9835617f1]
2016-11-10 23:12:48 -06:00
pensun 8ea566f2b2 Update deprecated information for threadfence_system()
Change-Id: Id13d2f81edb51eb42b896a5c06913d59ec907c55


[ROCm/clr commit: 9aa2269d5c]
2016-11-10 11:55:12 -06:00
Maneesh Gupta df096e871e CMakeLists.txt: Cascade CMAKE_BUILD_TYPE to tests
Change-Id: I53a3ea951c1fd57e43a02381a457c1dedc1a34f7


[ROCm/clr commit: cdcf04d744]
2016-11-10 21:26:34 +05:30
Rahul Garg cd7ad3d620 hipDeviceGetByPCIBusId support for HIP/NVCC
Change-Id: I8f82890e88d2a15f592bff192179e7d5c5362722


[ROCm/clr commit: f86c7b5b3c]
2016-11-10 11:40:59 +05:30
Maneesh Gupta d55b32b765 hipcc: Default to HIP_LIB_TYPE=1
Change-Id: I83b05accd76f7bc94bd724c66ae060fa0095bc8d


[ROCm/clr commit: 462ffb8117]
2016-11-10 11:34:00 +05:30
Maneesh Gupta 99678b0000 hcc_dialects/Makefile: use clamp-config
Change-Id: I86df82f75b75125825e22d0545209a19386d9936


[ROCm/clr commit: 052a580d5b]
2016-11-10 11:31:50 +05:30
pensun daf19a2dbb resolve conflicts for git pull
Change-Id: Ie353b831e2241bc28042069b6cc7405257e871e1


[ROCm/clr commit: bbb619c732]
2016-11-09 21:38:43 -06:00
pensun ce1b4bdc06 Add documentation on threadfence_system workaround guidelines.
Change-Id: I9636a3808798f3dabe992285ce5652187cee6eb8


[ROCm/clr commit: 94dfff9db2]
2016-11-09 21:36:30 -06:00
pensun 61635b585f Add option to alloc fine-grained system memory
Change-Id: Ia13c8e058cb988b5857e75a590a4d67411362ae1


[ROCm/clr commit: 23de0e1b50]
2016-11-09 21:36:30 -06:00
Maneesh Gupta 68b4d20b26 Merge branch 'rocm-rel-1.3' into amd-develop
Conflicts:
	include/hip/nvcc_detail/hip_runtime_api.h

Change-Id: I990a7d008da9e8dcc68250cebbc8ee6e723c7e01


[ROCm/clr commit: e3b5eef7c9]
2016-11-10 08:56:38 +05:30
pensun 85fa855e18 fix hipProfiler* apis on NV path
Change-Id: I6adca6151fef3a9b35348163eb6bd13f5c414172


[ROCm/clr commit: 4a8a6a4697]
2016-11-09 15:44:01 -06:00
pensun 12a5923a2b fix for hipcallback function on NV path
Change-Id: If80c0cfe60b1f3b1a71627b5f3f79503cba4d491


[ROCm/clr commit: e5277ab4b6]
2016-11-09 11:33:23 -06:00
Maneesh Gupta ded929878c Update release notes for 1.0 release
Change-Id: I74fa2b41afc334a76c309b125c27aa141cd59554


[ROCm/clr commit: 01a38c82a6]
2016-11-08 16:31:56 +05:30
Ben Sander 3d4a76d560 Fix tests to read warpSize from device props.
Change-Id: I9583577793afad49f9eb1ee9069bd4c6963a6023


[ROCm/clr commit: a13ec441bf]
2016-11-06 04:26:28 -06:00