Alex Voicu
5325b6535e
Update hip_module.cpp
2018-05-14 17:15:36 +01:00
Evgeny Mankov
a69b4c3a06
Merge pull request #449 from emankov/cuDNN
...
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
2018-05-14 16:30:34 +03:00
Evgeny Mankov
b0fd0c310d
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation 2
...
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN .
- testing on https://github.com/baidu-research/DeepBench - almost pass, except cusparse (not supported yet).
- started testing of examples from libcudnn7-dev_7.1.3.16-1+cuda8.0_amd64 package.
2018-05-14 16:23:59 +03:00
949f45ac
79480d7cbd
Provide correct __mul64hi and __umul64hi builtins, using code from ROCm-Device-Libs
2018-05-14 08:34:56 +02:00
Alex Voicu
b8c98d2357
Switch over to using native vector types, for better codegen. Remove noise.
2018-05-12 00:14:17 +01:00
foreman
fbd86acc9f
P4 to Git Change 1553709 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/11 11:24:30
...
SWDEV-145570 - [HIP] Make streamSet global and protect it
By default from the spec, streamSet should be global and not per thread.
There is a flag to make it per thread but we don't handle this yet. We
would just add another variable that will be thread local and use it instead.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#12 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#11 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#8 edit
2018-05-11 11:34:01 -04:00
Aaron Enye Shi
848a24b524
Fix hipMathFunction for gfx906
2018-05-11 10:53:07 -04:00
Alex Voicu
1ba8a35dba
Don't use magic constants, they're evil.
...
Also clarify that the register count cannot be queried at the moment.
2018-05-11 11:31:46 +01:00
Alex Voicu
13274ce559
Add support for the hipFuncGetAttributes interface.
2018-05-11 03:35:10 +01:00
Siu Chi Chan
b898049412
initial gfx906 support
2018-05-10 19:28:00 +00:00
Evgeny Mankov
ace018501d
Merge pull request #443 from emankov/cuDNN
...
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
2018-05-10 19:42:05 +03:00
Evgeny Mankov
dffe1802be
[HIPIFY][DNN] support of cuDNN 7.1.3 - continuation
...
- not finished yet.
- based on https://github.com/ROCmSoftwarePlatform/hipDNN .
- testing on https://github.com/baidu-research/DeepBench .
2018-05-10 17:36:51 +03:00
foreman
ac011af36b
P4 to Git Change 1552807 by skudchad@skudchad_test2_win_opencl on 2018/05/09 18:08:07
...
SWDEV-145570 - [HIP] - Release a stream first before taking it off from the set.
- Queue::create() needs to be called before returning a valid queue.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14830/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#7 edit
2018-05-09 18:15:41 -04:00
Jenkins
5fab78cf09
Merge 'master' into 'amd-master'
...
Change-Id: Ic79e86ea3cd8cd7f85d7f9f2a24e6acfad02d942
2018-05-09 04:09:39 -05:00
foreman
50810d28b8
P4 to Git Change 1552209 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/08 20:24:32
...
SWDEV-145570 - [HIP] Use the as_amd()->asHostQueue with streamSet
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#6 edit
2018-05-08 20:30:17 -04:00
foreman
7176a5fe81
P4 to Git Change 1552198 by skudchad@skudchad_rocm on 2018/05/08 18:57:32
...
SWDEV-145570 - [HIP] - Synchronize the legacy default stream with other blocking streams as per the spec.
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14796/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#11 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_stream.cpp#5 edit
2018-05-08 19:04:35 -04:00
Yaxun (Sam) Liu
19f3ed6f62
Fix warning about inlined function is not defined
2018-05-08 16:38:50 -04:00
foreman
2df122be02
P4 to Git Change 1552011 by skudchad@skudchad_test2_win_opencl on 2018/05/08 14:48:45
...
SWDEV-145570 - [HIP] Fix offset calculation when getting a memory object. Also include case when destination VA may just be a CPU host VA and not nessarily device alloced.
- Fix hipMemset* to write each byte and now a dword as per the spec
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14787/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#24 edit
2018-05-08 15:43:13 -04:00
Yaxun (Sam) Liu
7672b44c79
Add __assert_fail, __device_trap and hipErrorAssert for clang
2018-05-08 15:42:27 -04:00
Evgeny Mankov
fea366cc89
Merge pull request #438 from emankov/hipBLAS
...
[HIPIFY][Blas] Sync with CUDA 9.1
2018-05-08 20:50:39 +03:00
Siu Chi Chan
b285145966
add intrinsics mbcnt_lo, mbcnt_hi, lane_id
2018-05-08 13:43:53 -04:00
Evgeny Mankov
e5ba9668fc
[HIPIFY][Blas] Sync with CUDA 9.1
2018-05-08 20:42:30 +03:00
Evgeny Mankov
6559e57b24
Merge pull request #435 from emankov/hipBLAS
...
[HIPIFY] Sync with hipBLAS
2018-05-08 19:22:32 +03:00
Evgeny Mankov
7681775662
[HIPIFY] Sync with hipBLAS
2018-05-08 19:20:47 +03:00
Jenkins
1d0d352cf2
Merge 'master' into 'amd-master'
...
Change-Id: I387d49269f0314b38db5e77eacc1be636280620d
2018-05-08 04:09:43 -05:00
Maneesh Gupta
3095f67281
Merge pull request #432 from moosichu/patch-1
...
Add space between `###` and `Notes` in hip_terms
2018-05-08 12:30:27 +05:30
Rahul Garg
da302c3e93
Added hipMemset3D
2018-05-07 10:24:30 +05:30
Tom Maenan Read Cutting
9d76f5839e
Add space between ### and Notes in hip_terms
...
Makes `Notes` an H3 heading.
2018-05-05 13:30:11 +01:00
foreman
2c5606ea44
P4 to Git Change 1550648 by skudchad@skudchad_test2_win_opencl on 2018/05/04 20:56:00
...
SWDEV-145570 - [HIP]
- Implement hipMemcpyHtoD/DtoH/DtoD/ and their Async APIs
- Combine logic for hipMemset/Memcpy/Memset2D/Memcpy2D that can be shared across multiple APIs
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14782/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#23 edit
2018-05-05 00:34:05 -04:00
Evgeny Mankov
991f817441
Merge pull request #431 from emankov/master
...
[HIPIFY][test] Undo commit "Apply .clangformat to all repo source files"
2018-05-04 22:23:19 +03:00
emankov
21b79cd467
[HIPIFY][test] Undo commit "Apply .clangformat to all repo source files"
...
Commit broke tests due to code and comments formatting changes, thus FileCheck fails on checks, which are in comments.
2018-05-04 22:23:16 +03:00
foreman
65b557ecce
P4 to Git Change 1550526 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/04 15:00:44
...
SWDEV-145570 - [HIP] Missing HIP_INIT_API macro in hipDeviceSynchronize.
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_device_runtime.cpp#9 edit
2018-05-04 15:12:48 -04:00
foreman
9129aa17f6
P4 to Git Change 1550486 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/04 14:00:33
...
SWDEV-145570 - [HIP] Fix multithread init
Make the g_ihipInitialized variable per thread
And make sure to assign a default g_context
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_context.cpp#10 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_internal.hpp#9 edit
2018-05-04 14:13:13 -04:00
emankov
01f146e1bc
Merge branch 'master' of https://github.com/ROCm-Developer-Tools/HIP
2018-05-04 21:04:34 +03:00
Evgeny Mankov
dbe1583f28
Merge pull request #429 from emankov/master
...
[HIPIFY][test] add cuDNN test
2018-05-04 12:40:48 +03:00
Evgeny Mankov
a56b480c5e
[HIPIFY][test] add cuDNN test
2018-05-04 12:37:15 +03:00
Evgeny Mankov
9073bee7a7
Merge pull request #428 from emankov/docs
...
[HIPIFY][doc] Readme.md update
2018-05-04 11:06:59 +03:00
Evgeny Mankov
054c3f71f0
[HIPIFY][doc] Readme.md update
...
+ supported CUDA version to LLVM version correspondence table is added.
+ Test section is rewritten.
+ Windows support is added.
2018-05-04 10:50:18 +03:00
Evgeny Mankov
49024a5a55
Merge pull request #423 from emankov/cuDNN
...
[HIPIFY] Initial cuDNN support
2018-05-04 10:01:47 +03:00
foreman
0508a4ed37
P4 to Git Change 1550010 by cpaquot@cpaquot-ocl-lc-lnx on 2018/05/03 21:16:06
...
SWDEV-145570 - [HIP] Fixed region initialization
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#22 edit
2018-05-03 21:26:33 -04:00
foreman
99d3444f00
P4 to Git Change 1549971 by skudchad@skudchad_rocm on 2018/05/03 18:39:37
...
SWDEV-145570 - [HIP] - Fix build
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#21 edit
... //depot/stg/opencl/drivers/opencl/runtime/device/rocm/rockernel.hpp#21 edit
2018-05-03 18:56:01 -04:00
foreman
3ad5f2a01b
P4 to Git Change 1549965 by skudchad@skudchad_test2_win_opencl on 2018/05/03 18:15:18
...
SWDEV-145570 - [HIP] - Minor fixes. Use coarse grain for hipMallocPitch
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14768/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#20 edit
2018-05-03 18:26:45 -04:00
Jenkins
d7bd818004
Merge 'master' into 'amd-master'
...
Change-Id: I1d6d23c5381138aaf2d4555229183a6810612da2
2018-05-03 04:09:39 -05:00
Evgeny Mankov
fe421c89b2
[HIPIFY] Initial cuDNN support
...
- based on https://github.com/ROCmSoftwarePlatform/hipDNN .
- lit testing was supplemented with CUDA_DNN_ROOT_DIR for cuDNN testing.
- single cuDNN test was added.
2018-05-03 11:33:40 +03:00
Evgeny Mankov
e1d1835798
Merge pull request #417 from emankov/master
...
[HIPIFY] Sync with HIP (Execution Control, Surfaces, Memory)
2018-05-03 10:45:51 +03:00
Maneesh Gupta
67cb81c1d1
Merge pull request #422 from luckynikki/NULL-FIXES
...
Null checks added for hipmallocpitch and hipmemcpy apis
2018-05-03 10:14:40 +05:30
Lakhan Singh
6411ca1f6d
Null checks added for hipmallocpitch and hipmemcpy apis
2018-05-03 09:27:50 +05:30
foreman
a10512c510
P4 to Git Change 1549067 by skudchad@skudchad_rocm on 2018/05/02 17:40:37
...
SWDEV-145570 - [HIP] - Fix build
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#19 edit
2018-05-02 22:25:53 -04:00
foreman
445d300d63
P4 to Git Change 1549060 by skudchad@skudchad_test2_win_opencl on 2018/05/02 17:36:53
...
SWDEV-145570 - [HIP] - Add hipMemcpyDeviceToDevice/hipMemcpyHostToHost/hipMemcpyDefault cases to hipMemcpy/hipMemcpyAsync, hipMemset2DAsync
ReviewBoardURL = http://ocltc.amd.com/reviews/r/14746/diff/
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#18 edit
2018-05-02 21:08:53 -04:00
foreman
0ddf288904
P4 to Git Change 1549012 by lmoriche@lmoriche_opencl_dev2 on 2018/05/02 16:56:02
...
SWDEV-145570 - [HIP] - Add hipMemset2DAsync, fix the HCC triple for device binaries (make sure to update to the latest HCC).
Affected files ...
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.def.in#4 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_hcc.map.in#4 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_memory.cpp#17 edit
... //depot/stg/opencl/drivers/opencl/api/hip/hip_platform.cpp#10 edit
2018-05-02 19:14:52 -04:00