rocm-systems

Author	SHA1	Message	Date
Aaron Enye Shi	58dfeff27a	Add *_rn functions back into HIP intrinsics. Add the round-to-nearest-even intrinsics back to the HIP math intrinsics, as their removal caused a regression.	2018-12-18 19:31:54 +00:00
Aaron Enye Shi	0cfaa52d15	Guard rcp rounded implementation as well Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.	2018-11-06 19:53:28 +00:00
Maneesh Gupta	52e320f396	Replace hipLaunchKernel -> hipLaunchKernelGGL Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be	2018-10-17 14:32:25 +05:30
Aaron Enye Shi	c11220f224	Disable non-default-rounded functions Device library has removed the non-default-rounded functions, so hipFloatMath will fail to build. These include the removal of __ocml_sqrt_rte, __ocml_sqrt_rtn, __ocml_sqrt_rtp, and __ocml_sqrt_rtz. As seen here: https://github.com/RadeonOpenCompute/ROCm-Device-Libs/commit/2fc04e10e1354edee331ce700f98a60f8255effb . Disable these function tests for now, until they are re-enabled, or deleted completely.	2018-09-20 16:33:32 -04:00
Maneesh Gupta	1ba06f63c4	Apply .clangformat to all repo source files Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344	2018-03-12 11:29:03 +05:30
Phaneendr-kumar Lanka	eea7d495c7	[nvccWarnings] Fix warnings seen with dtests on nvcc path	2017-12-14 14:10:37 +05:30
Alex Voicu	cffd0e14eb	This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility.	2017-11-19 01:54:12 +00:00
Aditya Atluri	b723169ee9	Moved device code to mimic cuda header behavior 1. All fp32, fp64 math device/host functions should be in math_functions.h/.cpp 2. All fp32, fp64 fast math intrinsics for device/host functions should be in device_functions.h/.cpp 3. All the device code implementations should be in device_util.h/.cpp 4. Hence, made changes appropriately by moving code and creating new header files 5. Added math_functions.cpp/.h 6. Changed #ifndef signature to make sure no conflicts between headers with same names in hip/hip_runtime.h and hip/hcc_detail/hip_runtime.h 7. Changed tests to fit the code changes, making them to include appropriate headers 8. Added math_functions.cpp to CMakeLists.txt 9. Some of the tests are still broken, mostly host math functions will fix them in next commit 10. TODO: FIX compilation issues for host math functions Change-Id: I7a17637d7e294a7d224ffba932c1a08668febd26	2017-01-17 14:57:51 -06:00
Aditya Atluri	043da795f6	Added fast math flag 1. Use -DHIP_FAST_MATH to make precise math functions compiled to fast math 2. Added double fast math functions for sqrt 3. Changed hipcc to parse -use_fast_math (not working) 4. Added passed tag to hipFloatMath test Change-Id: I72884b2436b4efe61e9a9297346c1358fee38a2d	2016-11-23 11:19:15 -06:00
Aditya Atluri	f843928ddd	added fast math intrinsics to HIP 1. Added fast math intrinsics for single precision data types 2. Added test to check the intrinsics 3. Added HIP_PRECISE_MATH macro to enable precise math on fast math Change-Id: Iadacbb6182c31252c5e3252854372d1b80dfd27b	2016-11-22 15:26:00 -06:00

10 Commits