pensun
2b557e949e
local changes for hipnccl
...
Change-Id: I05a1f0381ce2914a800f573342cc954eb5ff82d9
[ROCm/hip commit: eba2595611 ]
2016-12-06 14:22:02 -06:00
Ben Sander
1c91897cb8
Enabled USE_DISPATCH_HSA_KERNEL, with serialization in hipModuleUnload.
...
[ROCm/hip commit: 6209565ec0 ]
2016-12-06 10:14:01 -06:00
Rahul Garg
bb1b5c3a25
Simple test case for hipDeviceGetPCIBusId
...
Change-Id: I0fb6d1bef4739a5b6280928f7b349d95c1656431
[ROCm/hip commit: 9aebbe00de ]
2016-12-06 17:31:54 +05:30
Rahul Garg
094bca8b5d
Build Error correction in hipDeviceGetPCIBusId
...
Change-Id: I50ff4d95b7a732924c7a991cba60400b1c93c0de
[ROCm/hip commit: ca06747e1f ]
2016-12-06 17:09:21 +05:30
Rahul Garg
61dffc7e18
Changed hipDeviceGetPCIBusId to return Bus ID as string
...
Change-Id: I6d5aa7362084109d34bc015d948f8723b2a38ee9
[ROCm/hip commit: 27072b8972 ]
2016-12-06 16:55:17 +05:30
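Note on the hipDeviceGetPCIBusId commits above: they concern returning the PCI bus ID as a string. As an illustration only, the sketch below formats PCI location fields in the `[domain]:[bus]:[device].[function]` hexadecimal layout that CUDA documents for its equivalent call. Whether HIP used exactly this layout at this commit is an assumption, and `formatPciBusId` is a hypothetical helper, not a HIP API.

```cpp
#include <cstdio>
#include <string>

// Hypothetical helper (not part of HIP): builds a bus-ID string of the form
// "[domain]:[bus]:[device].[function]" with every field in hexadecimal.
std::string formatPciBusId(unsigned domain, unsigned bus,
                           unsigned device, unsigned function) {
    char buf[32];  // large enough for four hex fields plus separators
    std::snprintf(buf, sizeof(buf), "%04x:%02x:%02x.%x",
                  domain, bus, device, function);
    return std::string(buf);
}
```

For example, `formatPciBusId(0, 3, 0, 0)` yields `0000:03:00.0`.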
Maneesh Gupta
55aa1d85d9
Remove redundant variable g_malloc_heap_size
...
Change-Id: Idaf47be70488f0deb3eab05a86d9c5a413d3fff7
[ROCm/hip commit: 6d5145eba4 ]
2016-12-06 10:19:03 +05:30
pensun
569dfe1526
HIP resource leaks fix from Jack
...
Change-Id: I93f3ad7cb94ff1cba1577bd8acc90e826693d12e
[ROCm/hip commit: 778c6626fd ]
2016-12-05 20:21:33 -06:00
Maneesh Gupta
73e0f36148
Revert "Enable USE_DISPATCH_HSA_KERNEL."
...
This reverts commit 891f956313 .
[ROCm/hip commit: 46ffc69557 ]
2016-12-05 16:55:26 +05:30
Maneesh Gupta
7a48767acc
Don't share g_malloc_heap_size between libraries
...
Change-Id: Ic70bf83d4f865bc5c453941fdbc1814c77f0ad9d
[ROCm/hip commit: 1cf9332c3f ]
2016-12-05 11:03:45 +05:30
Ben Sander
891f956313
Enable USE_DISPATCH_HSA_KERNEL.
...
Optimize hipLaunchModule dispatch latency.
[ROCm/hip commit: 097e4eb9d8 ]
2016-12-04 00:13:19 -06:00
Ben Sander
ce01d26a87
Add additional controls for forcing serialization and blocking.
...
Move HIP_COHERENT_HOST_ALLOC so it is read once at init time.
Add HIP_LAUNCH_BLOCKING_KERNELS, HIP_API_BLOCKING.
Update docs on debug and chicken bits.
Conflicts:
src/hip_hcc.cpp
[ROCm/hip commit: ff2f54c1bf ]
2016-12-02 18:03:59 -06:00
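Note on "read once at init time": a minimal sketch of that pattern, assuming HIP_COHERENT_HOST_ALLOC is interpreted as an integer flag. `readEnvInt`, `DebugControls`, and `initControls` are hypothetical names for illustration and are not taken from src/hip_hcc.cpp.

```cpp
#include <cstdlib>

// Hypothetical helper: integer value of an environment variable, or
// `fallback` when the variable is unset or empty.
int readEnvInt(const char* name, int fallback) {
    const char* value = std::getenv(name);
    if (value == nullptr || *value == '\0') return fallback;
    return std::atoi(value);
}

// Hypothetical settings struct holding values captured at startup.
struct DebugControls {
    int coherentHostAlloc;
};

// The function-local static is initialized on the first call only, so later
// changes to the environment do not affect the cached value.
const DebugControls& initControls() {
    static const DebugControls controls = {
        readEnvInt("HIP_COHERENT_HOST_ALLOC", 0),
    };
    return controls;
}
```

Reading the variable once keeps the environment lookup off hot paths and gives every later call a consistent view of the setting.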
Maneesh Gupta
eaf16d02f2
Improve shared and static library support
...
hipcc accepts the new parameters -use-staticlib and -use-sharedlib to
control linking behavior. The default is still the static library.
Change-Id: I28fb9a939f8177c75abefd8b77d8118a6666d1f4
[ROCm/hip commit: ef046c7098 ]
2016-12-01 15:33:12 +05:30
Maneesh Gupta
dead181892
hipcc: HCC workaround no longer needs env var
...
Change-Id: I81f4eafddbda5e9e2f1082932dd502ab451cfc24
[ROCm/hip commit: df9faffe93 ]
2016-12-01 12:51:58 +05:30
Maneesh Gupta
50830c58e7
Fix some broken directed tests
...
Change-Id: I40f2661a74951f8d91824df8fd9ef0cc2312d183
[ROCm/hip commit: 1e9dc5b521 ]
2016-12-01 12:47:37 +05:30
Aditya Atluri
cdbc90d490
added half add and fma intrinsic
...
Change-Id: Ifa60c1a7065f524f069291bb00d987b11c836cc4
[ROCm/hip commit: 7b0650773c ]
2016-11-29 19:46:01 -06:00
pensun
cd92d2a145
Change to use product device name by default
...
Change-Id: Ie2cee2a2e94a08b5874a2f5abee5d1ab6c9fdf47
[ROCm/hip commit: 0dfcd3e664 ]
2016-11-29 11:34:06 -06:00
Rahul Garg
a523bc5632
Added support for hipMemGetAddressRange
...
Change-Id: I99a796a4eb765152cf15a12d6a86b58684d34f50
[ROCm/hip commit: fe6ba656c9 ]
2016-11-29 22:04:09 +05:30
pensun
ab5cb4bce6
Change the parameter type of hipDeviceGetPCIBusID to char*
...
Change-Id: Ia72f403126e95f65da53208fc246f45d1417381f
[ROCm/hip commit: 2fbbf2b136 ]
2016-11-28 10:47:18 -06:00
Ben Sander
e7edba491b
Add more debug info
...
[ROCm/hip commit: ce92a53f25 ]
2016-11-26 08:56:02 -06:00
Ben Sander
a7d73614ad
Tweak profiler doc
...
Change-Id: I7be7c44467510e38ae850e1e0a14209a3b4380f1
[ROCm/hip commit: 501c7d6317 ]
2016-11-25 14:06:59 -06:00
Aditya Atluri
3203341551
added support for rcp for float and double
...
Change-Id: Ibeba3a9f64494fc0a176bcb4a854fb2f56567b55
[ROCm/hip commit: 7131d0b961 ]
2016-11-23 20:01:18 -06:00
Aditya Atluri
a04dad833b
added fma for double and float
...
1. Added fma intrinsic support for double and float
2. Added test for fma
Change-Id: I909fdbec34a3d12c03ba6eff3a39376a7128ee43
[ROCm/hip commit: 30674382a4 ]
2016-11-23 18:22:05 -06:00
pensun
d4d76b7018
Add some missing APIs on nv path and hipify
...
Change-Id: Ic0f4740ab06bf70b1de61b39fedc7a6e7605cb61
[ROCm/hip commit: 8a8c7a6b4d ]
2016-11-23 14:36:30 -06:00
pensun
16ff329c51
Add several missing APIs in hipify
...
Change-Id: I58912871cb0b10128f221ef26a11b0d69fb7873c
[ROCm/hip commit: a836395350 ]
2016-11-23 14:06:18 -06:00
Aditya Atluri
3b1150f0b7
Added fast math flag
...
1. Use -DHIP_FAST_MATH to compile precise math functions as fast math
2. Added double fast math functions for sqrt
3. Changed hipcc to parse -use_fast_math (not working)
4. Added passed tag to hipFloatMath test
Change-Id: I72884b2436b4efe61e9a9297346c1358fee38a2d
[ROCm/hip commit: 043da795f6 ]
2016-11-23 11:19:15 -06:00
Ben Sander
0272f36892
Add toc to hip_profiling.md
...
Change-Id: I3ae100f12686d0398a0403b78ca571382acce135
[ROCm/hip commit: 75c540fe3c ]
2016-11-23 08:36:08 -06:00
Ben Sander
1cbb76ba24
Improve docs in some places
...
Change-Id: If31e84fbf0c8595ca72edb842dce7ce47783579b
[ROCm/hip commit: dec59d9909 ]
2016-11-23 08:16:18 -06:00
Ben Sander
2693958431
Improve debug capabilities.
...
Print TID mapping at init when HIP_TRACE_API=1.
Print base host/dev info from tracker during copy.
Change-Id: I84e26d7b801567e5a91baad36126fb590920ec87
[ROCm/hip commit: b6ae6b08fb ]
2016-11-23 08:16:18 -06:00
Ben Sander
53a672ae9e
Improve profiler and debug documentation
...
[ROCm/hip commit: e4e14211b3 ]
2016-11-23 08:15:40 -06:00
Rahul Garg
a4123e291e
Removed nested HIP calls from hip_device functions
...
Change-Id: I18785b0ee27e32fb8950982fa5c3a64d1ae6a9b8
[ROCm/hip commit: 6a4f44bce0 ]
2016-11-23 18:37:06 +05:30
Aditya Atluri
467d8d1aea
added fast math intrinsics to HIP
...
1. Added fast math intrinsics for single precision data types
2. Added test to check the intrinsics
3. Added HIP_PRECISE_MATH macro to enable precise math on fast math
Change-Id: Iadacbb6182c31252c5e3252854372d1b80dfd27b
[ROCm/hip commit: f843928ddd ]
2016-11-22 15:26:00 -06:00
Aditya Atluri
c8071e3d84
added fast math APIs
...
1. Added fast math apis for sin, cos, tan, sincos
2. Added test for trig math functions
3. Added logarithm fast math
4. Changed how hipGetDevice, hipDeviceGetCacheConfig emit errors
Change-Id: Ie6ab594ddd5853cbe85e39a2f6d3479a807fa323
[ROCm/hip commit: 94d2115d6d ]
2016-11-22 10:20:09 -06:00
Rahul Garg
c49562d59c
Removed hsaKmtReleaseSystemProperties call
...
Change-Id: I7cb992cccf587c333f0ca0cb518409f3944bdb06
[ROCm/hip commit: 2dcf20ac6f ]
2016-11-22 06:15:35 +05:30
Aditya Atluri
a9737bd2d0
fixed error output for hipDeviceGetAttribute
...
Change-Id: I1e343a4e4e20e1a550d419f701cc1e60e9d03af4
[ROCm/hip commit: 7145ea6a4a ]
2016-11-21 18:07:01 -06:00
Aditya Atluri
d1d588f763
fixed texture header on nvcc
...
Change-Id: Ibe19f94be5edf972b6b51dea263e1088b6c60c1d
[ROCm/hip commit: d6ad91ffa4 ]
2016-11-21 13:53:28 -06:00
Aditya Atluri
5239788014
removed warnings in macros
...
Change-Id: I992b11f6aee2bab09f46885a2d12234aa6814cc5
[ROCm/hip commit: 6052eaa761 ]
2016-11-21 09:04:36 -06:00
Aditya Atluri
75fbf16ca9
fixed compilation bugs
...
1. Texture functions are now compiling fine
2. Fixed hipFuncCache to hipFuncCache_t
Change-Id: I8f815887e4de43ee115bbaff249905b236541c39
[ROCm/hip commit: 2412c9a061 ]
2016-11-21 08:56:30 -06:00
Aditya Atluri
a5bb0c6960
Fixed hipDeviceGetCacheConfig on nvcc path
...
1. Changed test macro to emit line numbers
2. Added getcacheconfig api test for nvcc path
3. Fixed hipFuncCache_t data type
TODO: With this commit, right now there are 2 func cache datatypes
a. hipFuncCache_t for runtime API
b. hipFuncCache for driver API
Map these to a single data type
Change-Id: Ia47c9f5d7c2633638051bf17b1103048a1ede973
[ROCm/hip commit: afaa5fcf96 ]
2016-11-20 12:18:08 -06:00
Aditya Atluri
dd68cbb486
added new test for getting attribute
...
1. Added copyright to all new tests
2. Added test for hipDeviceGetAttribute
Change-Id: I7a070c5b8316ef6575b3f4c49bda2769aea2a7c4
[ROCm/hip commit: 0edc082ff6 ]
2016-11-20 11:53:16 -06:00
Aditya Atluri
1b4c7d082c
added copyright to new header
...
Change-Id: I16e1d02194551e4b20019bcb6850a3f84882ef18
[ROCm/hip commit: a6c4304725 ]
2016-11-19 23:02:56 -06:00
Aditya Atluri
bab6543ea9
added tests to check nvcc runtime api output
...
Change-Id: Ifdd39b5d0a6a58d20a8e9745e59dd82d50a90e2f
[ROCm/hip commit: 428041cfc2 ]
2016-11-19 21:36:28 -06:00
Sandeep Kumar
7d583dde51
fix_format
...
Change-Id: I34e265de434263a11654e5deba044c3f21e86578
[ROCm/hip commit: 53e771fc75 ]
2016-11-18 14:34:14 +05:30
Maneesh Gupta
2234b57513
Refactor for building HIP as dynamic library
...
Change-Id: I65a3d9d589c4fdbbdcf1611e5427224253be8260
[ROCm/hip commit: c0419cc749 ]
2016-11-18 14:33:20 +05:30
scchan
f167855b74
Add extra linker flags to the shared library build
...
Change-Id: I19e569d566fb5e25e343e364a3053a3f12659361
[ROCm/hip commit: 3d6bf5e799 ]
2016-11-18 14:18:29 +05:30
Maneesh Gupta
ca142c34b7
Fix broken tests due to ffb2f23
...
Change-Id: I847c80f8462e1c955bdef957e6de2841a3a6ab29
[ROCm/hip commit: 4fc082ff09 ]
2016-11-18 12:20:47 +05:30
Aditya Atluri
68c2b11ed8
moved runtime macros to runtime_api.h
...
Change-Id: Ib47e449328e8e6ec55d1b6ee19899de4b591ea8e
[ROCm/hip commit: 1618cb3f85 ]
2016-11-17 14:19:18 -06:00
Aditya Atluri
7192577e38
added texture header to memory api source
...
Change-Id: I1af6d60aca5a9a9ef1cadf8c304bea892acbe061
[ROCm/hip commit: c20c524400 ]
2016-11-17 11:57:53 -06:00
Aditya Atluri
ffb2f233b9
make texture a separate header for now
...
Change-Id: I3c65aa75f2f729eedd8c3292fa3cbc37709c1cfe
[ROCm/hip commit: dc64a732d8 ]
2016-11-17 11:55:29 -06:00
Aditya Atluri
da0a8cd032
Added i8 packed math intrinsics
...
1. Added add, sub, mul packed math i8 intrinsics
2. Removed c++ packed data structures included from HCC
Change-Id: I1d109c5ce10c48b7cd3ea059478b88fc1de78499
TODO: Add better packed data structures support
[ROCm/hip commit: 12dd9df88f ]
2016-11-17 01:09:12 -06:00
Maneesh Gupta
c7f6ddd612
Enable USE_COPY_EXT_V2 by default
...
Change-Id: I2c0dc80f85a0ccb5744715b5418a604e38b249ed
[ROCm/hip commit: 888a3528d2 ]
2016-11-15 10:42:27 +05:30