提交線圖

1223 次程式碼提交

作者 SHA1 備註 日期
pensun 2b557e949e local changes for hipnccl
Change-Id: I05a1f0381ce2914a800f573342cc954eb5ff82d9


[ROCm/hip commit: eba2595611]
2016-12-06 14:22:02 -06:00
Ben Sander 1c91897cb8 Enabled USE_DISPATCH_HSA_KERNEL, with serialization in hipModuleUnload.
[ROCm/hip commit: 6209565ec0]
2016-12-06 10:14:01 -06:00
Rahul Garg bb1b5c3a25 Simple test case for hipDeviceGetPCIBusId
Change-Id: I0fb6d1bef4739a5b6280928f7b349d95c1656431


[ROCm/hip commit: 9aebbe00de]
2016-12-06 17:31:54 +05:30
Rahul Garg 094bca8b5d Build Error correction in hipDeviceGetPCIBusId
Change-Id: I50ff4d95b7a732924c7a991cba60400b1c93c0de


[ROCm/hip commit: ca06747e1f]
2016-12-06 17:09:21 +05:30
Rahul Garg 61dffc7e18 Changed hipDeviceGetPCIBusId to return Bus ID as string
Change-Id: I6d5aa7362084109d34bc015d948f8723b2a38ee9


[ROCm/hip commit: 27072b8972]
2016-12-06 16:55:17 +05:30
Maneesh Gupta 55aa1d85d9 Remove redundant variable g_malloc_heap_size
Change-Id: Idaf47be70488f0deb3eab05a86d9c5a413d3fff7


[ROCm/hip commit: 6d5145eba4]
2016-12-06 10:19:03 +05:30
pensun 569dfe1526 HIP resource leaks fix from Jack
Change-Id: I93f3ad7cb94ff1cba1577bd8acc90e826693d12e


[ROCm/hip commit: 778c6626fd]
2016-12-05 20:21:33 -06:00
Maneesh Gupta 73e0f36148 Revert "Enable USE_DISPATCH_HSA_KERNEL."
This reverts commit 891f956313.


[ROCm/hip commit: 46ffc69557]
2016-12-05 16:55:26 +05:30
Maneesh Gupta 7a48767acc Don't share g_malloc_heap_size between libraries
Change-Id: Ic70bf83d4f865bc5c453941fdbc1814c77f0ad9d


[ROCm/hip commit: 1cf9332c3f]
2016-12-05 11:03:45 +05:30
Ben Sander 891f956313 Enable USE_DISPATCH_HSA_KERNEL.
Optimize hipLaunchModule dispatch latency.


[ROCm/hip commit: 097e4eb9d8]
2016-12-04 00:13:19 -06:00
Ben Sander ce01d26a87 Add additional controls for forcing serialization and blocking.
Move HIP_COHERENT_HOST_ALLOC so it is read once at init time.
Add HIP_LAUNCH_BLOCKING_KERNELS, HIP_API_BLOCKING.
Update docs on debug and chicken bits.

Conflicts:
	src/hip_hcc.cpp


[ROCm/hip commit: ff2f54c1bf]
2016-12-02 18:03:59 -06:00
Maneesh Gupta eaf16d02f2 Improve shared and static library support
hipcc accepts new parameter -use-staticlib and -use-sharedlib to
control linking behavior. Default is still static library.

Change-Id: I28fb9a939f8177c75abefd8b77d8118a6666d1f4


[ROCm/hip commit: ef046c7098]
2016-12-01 15:33:12 +05:30
Maneesh Gupta dead181892 hipcc: HCC workaround no longer needs env var
Change-Id: I81f4eafddbda5e9e2f1082932dd502ab451cfc24


[ROCm/hip commit: df9faffe93]
2016-12-01 12:51:58 +05:30
Maneesh Gupta 50830c58e7 Fix some broken directed tests
Change-Id: I40f2661a74951f8d91824df8fd9ef0cc2312d183


[ROCm/hip commit: 1e9dc5b521]
2016-12-01 12:47:37 +05:30
Aditya Atluri cdbc90d490 added half add and fma intrinsic
Change-Id: Ifa60c1a7065f524f069291bb00d987b11c836cc4


[ROCm/hip commit: 7b0650773c]
2016-11-29 19:46:01 -06:00
pensun cd92d2a145 Change to use produce device name by default
Change-Id: Ie2cee2a2e94a08b5874a2f5abee5d1ab6c9fdf47


[ROCm/hip commit: 0dfcd3e664]
2016-11-29 11:34:06 -06:00
Rahul Garg a523bc5632 Added support for hipMemGetAddressRange
Change-Id: I99a796a4eb765152cf15a12d6a86b58684d34f50


[ROCm/hip commit: fe6ba656c9]
2016-11-29 22:04:09 +05:30
pensun ab5cb4bce6 Change the parameter type of hipDeviceGetPCIBusID to char*
Change-Id: Ia72f403126e95f65da53208fc246f45d1417381f


[ROCm/hip commit: 2fbbf2b136]
2016-11-28 10:47:18 -06:00
Ben Sander e7edba491b Add more debug info
[ROCm/hip commit: ce92a53f25]
2016-11-26 08:56:02 -06:00
Ben Sander a7d73614ad Tweak profiler doc
Change-Id: I7be7c44467510e38ae850e1e0a14209a3b4380f1


[ROCm/hip commit: 501c7d6317]
2016-11-25 14:06:59 -06:00
Aditya Atluri 3203341551 added support for rcp for float and double
Change-Id: Ibeba3a9f64494fc0a176bcb4a854fb2f56567b55


[ROCm/hip commit: 7131d0b961]
2016-11-23 20:01:18 -06:00
Aditya Atluri a04dad833b added fma for double and float
1. Added fma intrinsic support for double and float
2. Added test for fma

Change-Id: I909fdbec34a3d12c03ba6eff3a39376a7128ee43


[ROCm/hip commit: 30674382a4]
2016-11-23 18:22:05 -06:00
pensun d4d76b7018 Add some missing APIs on nv path and hipify
Change-Id: Ic0f4740ab06bf70b1de61b39fedc7a6e7605cb61


[ROCm/hip commit: 8a8c7a6b4d]
2016-11-23 14:36:30 -06:00
pensun 16ff329c51 Add several missing APIs in hipify
Change-Id: I58912871cb0b10128f221ef26a11b0d69fb7873c


[ROCm/hip commit: a836395350]
2016-11-23 14:06:18 -06:00
Aditya Atluri 3b1150f0b7 Added fast math flag
1. Use -DHIP_FAST_MATH to make precise math functions compiled to fast math
2. Added double fast math functions for sqrt
3. Changed hipcc to parse -use_fast_math (not working)
4. Added passed tag to hipFloatMath test

Change-Id: I72884b2436b4efe61e9a9297346c1358fee38a2d


[ROCm/hip commit: 043da795f6]
2016-11-23 11:19:15 -06:00
Ben Sander 0272f36892 Add toc to hip_profiling.md
Change-Id: I3ae100f12686d0398a0403b78ca571382acce135


[ROCm/hip commit: 75c540fe3c]
2016-11-23 08:36:08 -06:00
Ben Sander 1cbb76ba24 Improve docs in some places
Change-Id: If31e84fbf0c8595ca72edb842dce7ce47783579b


[ROCm/hip commit: dec59d9909]
2016-11-23 08:16:18 -06:00
Ben Sander 2693958431 Improve debug capabilities.
Print TID mapping at init when HIP_TRACE_API=1.
Print base host/dev info from tracker during copy.

Change-Id: I84e26d7b801567e5a91baad36126fb590920ec87


[ROCm/hip commit: b6ae6b08fb]
2016-11-23 08:16:18 -06:00
Ben Sander 53a672ae9e Improve profiler and debug documentation
[ROCm/hip commit: e4e14211b3]
2016-11-23 08:15:40 -06:00
Rahul Garg a4123e291e Removed nested HIP calls from hip_device functions
Change-Id: I18785b0ee27e32fb8950982fa5c3a64d1ae6a9b8


[ROCm/hip commit: 6a4f44bce0]
2016-11-23 18:37:06 +05:30
Aditya Atluri 467d8d1aea added fast math intrinsics to HIP
1. Added fast math intrinsics for single precision data types
2. Added test to check the intrinsics
3. Added HIP_PRECISE_MATH macro to enable precise math on fast math

Change-Id: Iadacbb6182c31252c5e3252854372d1b80dfd27b


[ROCm/hip commit: f843928ddd]
2016-11-22 15:26:00 -06:00
Aditya Atluri c8071e3d84 added fast math APIs
1. Added fast math apis for sin, cos, tan, sincos
2. Added test for trig math functions
3. Added logarithm fast math
4. Changed how hipGetDevice, hipDeviceGetCacheConfig emit errors

Change-Id: Ie6ab594ddd5853cbe85e39a2f6d3479a807fa323


[ROCm/hip commit: 94d2115d6d]
2016-11-22 10:20:09 -06:00
Rahul Garg c49562d59c Removed hsaKmtReleaseSystemProperties call
Change-Id: I7cb992cccf587c333f0ca0cb518409f3944bdb06


[ROCm/hip commit: 2dcf20ac6f]
2016-11-22 06:15:35 +05:30
Aditya Atluri a9737bd2d0 fixed error output for hipDeviceGetAttribute
Change-Id: I1e343a4e4e20e1a550d419f701cc1e60e9d03af4


[ROCm/hip commit: 7145ea6a4a]
2016-11-21 18:07:01 -06:00
Aditya Atluri d1d588f763 fixed texture header on nvcc
Change-Id: Ibe19f94be5edf972b6b51dea263e1088b6c60c1d


[ROCm/hip commit: d6ad91ffa4]
2016-11-21 13:53:28 -06:00
Aditya Atluri 5239788014 removed warnings in macros
Change-Id: I992b11f6aee2bab09f46885a2d12234aa6814cc5


[ROCm/hip commit: 6052eaa761]
2016-11-21 09:04:36 -06:00
Aditya Atluri 75fbf16ca9 fixed compilation bugs
1. Texture functions are now compiling fine
2. Fixed hipFuncCache to hipFuncCache_t

Change-Id: I8f815887e4de43ee115bbaff249905b236541c39


[ROCm/hip commit: 2412c9a061]
2016-11-21 08:56:30 -06:00
Aditya Atluri a5bb0c6960 Fixed hipDeviceGetCacheConfig on nvcc path
1. Changed test macro to emit line numbers
2. Added getcacheconfig api test for nvcc path
3. Fixed hipFuncCache_t data type

TODO: With this commit, right now there are 2 func cache datatypes
a. hipFuncCache_t for runtime API
b. hipFuncCache for driver API

Map these to a single data type

Change-Id: Ia47c9f5d7c2633638051bf17b1103048a1ede973


[ROCm/hip commit: afaa5fcf96]
2016-11-20 12:18:08 -06:00
Aditya Atluri dd68cbb486 added new test for getting attribute
1. Added copyright to all new tests
2. Added test for hipDeviceGetAttribute

Change-Id: I7a070c5b8316ef6575b3f4c49bda2769aea2a7c4


[ROCm/hip commit: 0edc082ff6]
2016-11-20 11:53:16 -06:00
Aditya Atluri 1b4c7d082c added copy right to new header
Change-Id: I16e1d02194551e4b20019bcb6850a3f84882ef18


[ROCm/hip commit: a6c4304725]
2016-11-19 23:02:56 -06:00
Aditya Atluri bab6543ea9 added tests to check nvcc runtime api output
Change-Id: Ifdd39b5d0a6a58d20a8e9745e59dd82d50a90e2f


[ROCm/hip commit: 428041cfc2]
2016-11-19 21:36:28 -06:00
Sandeep Kumar 7d583dde51 fix_format
Change-Id: I34e265de434263a11654e5deba044c3f21e86578


[ROCm/hip commit: 53e771fc75]
2016-11-18 14:34:14 +05:30
Maneesh Gupta 2234b57513 Refactor for building HIP as dynamic library
Change-Id: I65a3d9d589c4fdbbdcf1611e5427224253be8260


[ROCm/hip commit: c0419cc749]
2016-11-18 14:33:20 +05:30
scchan f167855b74 Add extra linker flags to the shared library build
Change-Id: I19e569d566fb5e25e343e364a3053a3f12659361


[ROCm/hip commit: 3d6bf5e799]
2016-11-18 14:18:29 +05:30
Maneesh Gupta ca142c34b7 Fix broken tests due to ffb2f23
Change-Id: I847c80f8462e1c955bdef957e6de2841a3a6ab29


[ROCm/hip commit: 4fc082ff09]
2016-11-18 12:20:47 +05:30
Aditya Atluri 68c2b11ed8 moved runtime macros to runtime_api.h
Change-Id: Ib47e449328e8e6ec55d1b6ee19899de4b591ea8e


[ROCm/hip commit: 1618cb3f85]
2016-11-17 14:19:18 -06:00
Aditya Atluri 7192577e38 added texture header to memory api source
Change-Id: I1af6d60aca5a9a9ef1cadf8c304bea892acbe061


[ROCm/hip commit: c20c524400]
2016-11-17 11:57:53 -06:00
Aditya Atluri ffb2f233b9 make texture as seperate header as of now
Change-Id: I3c65aa75f2f729eedd8c3292fa3cbc37709c1cfe


[ROCm/hip commit: dc64a732d8]
2016-11-17 11:55:29 -06:00
Aditya Atluri da0a8cd032 Added i8 packed math intrinsics
1. Added add, sub, mul packed math i8 intrinsics
2. Removed c++ packed data structures included from HCC

Change-Id: I1d109c5ce10c48b7cd3ea059478b88fc1de78499
TODO: Add better packed data structures support


[ROCm/hip commit: 12dd9df88f]
2016-11-17 01:09:12 -06:00
Maneesh Gupta c7f6ddd612 Enable USE_COPY_EXT_V2 by default
Change-Id: I2c0dc80f85a0ccb5744715b5418a604e38b249ed


[ROCm/hip commit: 888a3528d2]
2016-11-15 10:42:27 +05:30