커밋 그래프

13 커밋

작성자 SHA1 메시지 날짜
Aaron Enye Shi 8e7b49d1cf Add *_rn functions back into HIP intrinsics
Add back the round-to-nearest-even intrinsics back to HIP math
intrinsics as it caused regression.


[ROCm/hip commit: 58dfeff27a]
2018-12-18 19:31:54 +00:00
Aaron Enye Shi 8bf2429663 Guard rcp rounded implementation as well
Since rcp implementations of non-default rounded versions are not correct or supported in OCML, guard them using the same macro OCML_BASIC_ROUNDED_OPERATIONS. Also update the docs and tests.


[ROCm/hip commit: 0cfaa52d15]
2018-11-06 19:53:28 +00:00
Aaron Enye Shi eb4053e3fe Guard the OCML rounded operations instead
Instead of commenting all these functions out, guard the functions with a macro OCML_BASIC_ROUNDED_OPERATIONS.


[ROCm/hip commit: e59c33250a]
2018-11-06 16:32:14 +00:00
Aaron Enye Shi 1c6f47ef55 Remove non-working non-default-rounded math apis
In ROCm-Device-Libs, they have dropped the non-default-rounded versions of add, sub, mul, div, sqrt and fma. Therefore, ocml has removed the rte, rtp, rtn, and rtz counterparts. This will remove the same math APIs in HIP for _ru, _rd, _rn, and _rz.


[ROCm/hip commit: 2b108a2cfd]
2018-11-05 22:34:16 +00:00
Alex Voicu de15c6988d Re-sync with upstream.
[ROCm/hip commit: fe959f7bd7]
2018-10-18 12:27:03 +01:00
Maneesh Gupta c4a93adb74 Replace hipLaunchKernel -> hipLaunchKernelGGL
Change-Id: I4d99009e1199811d417becf1e1b934ec4d4e30be


[ROCm/hip commit: 52e320f396]
2018-10-17 14:32:25 +05:30
Alex Voicu 69b0baf37f Minimal should mean minimal.
[ROCm/hip commit: 5312336ce2]
2018-10-11 00:21:41 +01:00
Alex Voicu 312d29ebaf Align with HC Next.
[ROCm/hip commit: c6720e882b]
2018-09-17 11:50:29 +03:00
Maneesh Gupta 4f42ee762d Apply .clangformat to all repo source files
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344


[ROCm/hip commit: 1ba06f63c4]
2018-03-12 11:29:03 +05:30
Aditya Atluri 9d445cbf35 changed device code tests to work not to work as one
Change-Id: I0eec1eab19dda3b703bc3a0d778a6bbb2802a412


[ROCm/hip commit: 66dc2d42db]
2017-01-27 09:20:14 -06:00
Aditya Atluri 77401c9b64 Moved device code to mimic cuda header behavior
1. All fp32, fp64 math device/host functions should be in math_functions.h/.cpp
2. All fp32, fp64 fast math intrinsics for device/host functions should be in device_functions.h/.cpp
3. All the device code implementations should be in device_util.h/.cpp
4. Hence, made changes appropriately by moving code and creating new header files
5. Added math_functions.cpp/.h
6. Changed #ifndef signature to make sure no conflicts between headers with same names in hip/hip_runtime.h and hip/hcc_detail/hip_runtime.h
7. Changed tests to fit the code changes, making them to include appropriate headers
8. Added math_functions.cpp to CMakeLists.txt
9. Some of the tests are still broken, mostly host math functions will fix them in next commit
10. TODO: FIX compilation issues for host math functions

Change-Id: I7a17637d7e294a7d224ffba932c1a08668febd26


[ROCm/hip commit: b723169ee9]
2017-01-17 14:57:51 -06:00
Maneesh Gupta 724a024bb6 directed tests: Updated to use new hip include path
Change-Id: Iab1aee638b7158eb9674a19625dedf6471947e51


[ROCm/hip commit: 7e04884d0f]
2016-10-04 22:20:50 +05:30
Ben Sander 89a14a6d73 Refactor directed test infrastructue.
- Add hierarchy.  Tests now live in directories, each with its own
  CMakeFiles.txt.  Reduces merge conflicts.
- Change make_hip_executable -> build_hip_executable.
- Refresh docs.
- Enable some tests that were previously built but not run.

Change-Id: I8c5de3c954400bf233904282b8b42861a2b7c536


[ROCm/hip commit: 1b69b2e9f0]
2016-06-17 15:18:57 -05:00