Alex Voicu
90e7799bf4
Add missing interop with volatile. Fix unit tests.
...
[ROCm/hip commit: 0108819e2a ]
2018-05-31 15:27:12 +01:00
Alex Voicu
8b039f989c
Move converting constructor from _Float16 under macro guard. Refactor.
...
[ROCm/hip commit: 0eec115bcd ]
2018-05-25 19:46:41 +01:00
Alex Voicu
328e946c59
Update hipTestHalf to actually test behaviour. Add missing hipHostfree.
...
[ROCm/hip commit: 9948b5961e ]
2018-05-24 13:55:30 +01:00
Alex Voicu
4ceb9cbc09
Missing commit.
...
[ROCm/hip commit: 6f819f226b ]
2018-05-23 17:57:47 +01:00
Maneesh Gupta
bb8918faf7
Disable incomplete unit tests that don't work on nvcc path
...
Change-Id: If5823ec96a3b2497a08c46ab802c5a0158271053
[ROCm/hip commit: 305592d622 ]
2018-05-21 11:35:03 +05:30
Siu Chi Chan
c05e2e2261
initial gfx906 support
...
[ROCm/hip commit: b898049412 ]
2018-05-10 19:28:00 +00:00
Deven Desai
2b3332ac72
Checkin to add unit tests for fixes in my previous commit
...
This change adds unit tests for
1. __hisinf
2. __hisnan
2. abs(long long)
3. lgamma(double)
[ROCm/hip commit: 4be4cf644f ]
2018-04-25 17:53:21 +00:00
Maneesh Gupta
4f42ee762d
Apply .clangformat to all repo source files
...
Change-Id: I7e79c6058f0303f9a98911e3b7dd2e8596079344
[ROCm/hip commit: 1ba06f63c4 ]
2018-03-12 11:29:03 +05:30
Maneesh Gupta
d807f528e6
Merge pull request #266 from gargrahul/fix_half2_gfx900
...
Fixed half2 issue on gfx900
[ROCm/hip commit: db378fbc9e ]
2017-11-20 07:28:41 +05:30
Alex Voicu
962bf7bfda
This implements the trivial change needed to move back from the hip{Something}_{x, y, z} macros to the natural CUDA syntax of Something.{x, y, z}. This is contained in lines 384-404 in hip_runtime.h. All of the other changes have to do with changing unit tests to use this syntax. The macros are retained for backwards compatibility.
...
[ROCm/hip commit: cffd0e14eb ]
2017-11-19 01:54:12 +00:00
Rahul Garg
5de28551c1
Fixed test case for GFX900
...
[ROCm/hip commit: 9af0f9cbc1 ]
2017-11-16 09:34:52 +05:30
Rahul Garg
d0ee6e00fe
Fixed half2 issue on gfx900
...
[ROCm/hip commit: fef496d4f1 ]
2017-11-15 18:52:59 +05:30
Aditya Atluri
d48f7e93cc
added architecture specific macros
...
1. Added __HIP_ARCH_GFXNUM__
2. Usage, -D__HIP_ARCH_GFX803__=1
Change-Id: I68b3a85d62cfab3a45d2b7a70cb3518ab2565236
[ROCm/hip commit: 9a1989193a ]
2017-02-08 19:45:32 -06:00
Aditya Atluri
8794954411
fixed hipcc for new compiler flags
...
Change-Id: I49ec059be20ff26b7482c84d91ab7a43826c6a8d
[ROCm/hip commit: 55d92833fb ]
2017-02-08 14:06:01 -06:00
Aditya Atluri
c692cd5d4a
include arch specific ir on fallback path
...
Change-Id: Ib04996aae2c21eb73ef2a9f6305915e0caccd704
[ROCm/hip commit: 27d2fc99ca ]
2017-02-08 12:19:06 -06:00
Aditya Atluri
6d5992d445
changed __global__ attribute
...
1. Moved around tests and added them to HIT
Change-Id: I5d75280c42a5af852670ebabc7305ee56721ec7b
[ROCm/hip commit: 5e3d63c0a3 ]
2017-02-03 10:53:36 -06:00
Aditya Atluri
8e411f0beb
Added type conversion intrinsics
...
1. Added all type conversion intrinsics
2. NO TESTS have been added. (Will add in next commit)
3. Sanatized code in hip_runtime.h
4. Added passed() to hipTestHalf to make it pass on HIT
Change-Id: I0987963c802fc7ff4d7e07d7b88d86da35da53c9
[ROCm/hip commit: 6f2cfddc67 ]
2017-01-16 12:10:05 -06:00
Aditya Atluri
fa376d6d71
added half2 log, log10, exp, exp10 math functions
...
1. Enabled tests for log, log10, exp, exp10 half2
2. h2rint is still disabled.
Change-Id: I01f6002f6992259919893c524c526db5ee09473a
[ROCm/hip commit: bf45105c7c ]
2017-01-13 13:26:10 -06:00
Aditya Atluri
1f5fe6714b
added half2 math operations
...
1. They use SDWA + LLVM IR
2. Added these functions to test
3. Need to do exp, exp10, log, log10, rint
Change-Id: I06176acc6cb8bb054495310531777406a41b54e4
[ROCm/hip commit: 0e576295b4 ]
2017-01-13 12:27:11 -06:00
Aditya Atluri
0d5b2539d3
added math functions for half
...
1. Added math functions for half precision
2. HRCP is not available due to device code linking errors, will be enabled once it is fixed
3. Added math functions to half test file
Change-Id: Ie317ce70ef518a4fc3f27142143d01e0327f5df3
[ROCm/hip commit: 8c978c210c ]
2017-01-13 12:05:29 -06:00
Aditya Atluri
b00361b981
added half2 cmp and conv, data movement device functions
...
1. Added half2 comparision functions
2. Added conversion and data movement half apis
Change-Id: Ia33c0e957d9deb1f2b7a8fde8e22168f4d41b88b
[ROCm/hip commit: 3f52f76194 ]
2017-01-13 10:56:07 -06:00
Aditya Atluri
d30f6a1d1b
added comparision device functions for fp16
...
1. Added comparision device functions
2. Added test to check correct isa getting generated
Change-Id: I16732f5a1438bdce145f7bfcecd28198e3cc4b79
[ROCm/hip commit: 2dcd7600dc ]
2017-01-12 14:52:14 -06:00
Aditya Atluri
0180125a29
added packed math fp16 native device functions
...
1. Added SDWA implementation inside IR file
2. Added device functions to header + used them in test
Change-Id: Ib4e059a58eee201cc82438689e3e9bc5f9d26653
[ROCm/hip commit: 5ef8ef3bd7 ]
2017-01-12 14:10:51 -06:00
Aditya Atluri
7f00c120a7
Started adding native half math library support
...
1. Removed HIP_EXPERIMENTAL env variable so that device code will be accessed from LLVM IR
2. Removed soft support from headers and moved to hip_fp16.cpp
3. Added LLVM IR + inline asm to hip_ir.ll
4. Added test for fp16
5. Added barriers for hcc 3.5 and hcc 4.0 for half support
a. Which means, hcc 4.0 can parse __fp16 but hcc 3.5 cant
b. HCC 4.0 code is implemented now, hcc 3.5 will be added later
Change-Id: Ic37859b2688ebb02e168bab643d1882bf4727952
[ROCm/hip commit: d180fdaae0 ]
2017-01-12 11:30:20 -06:00