Grafik Komit

1267 Melakukan

Penulis SHA1 Pesan Tanggal
Ben Sander ec4f4a643d Support size_t in memset kernel.
Add disable for HSA_AMD_AGENT_INFO_MAX_WAVES_PER_CU
Remove one copy of completion_future in memset.


[ROCm/hip commit: c325c988b1]
2016-12-22 12:25:09 -06:00
Maneesh Gupta 63ab898d17 hip_hcc package changes
- updated hip_hcc package creation dependencies
- support build hip_hcc package for HCC-1.0

Change-Id: Idf23e415eff8cb352a8906191c79bd822c7618e7


[ROCm/hip commit: 9b6d1588ba]
2016-12-22 15:30:38 +05:30
Ben Sander c1ac04322a Increment API sequence number.
Change name to tls_tidInfo


[ROCm/hip commit: 37d8cafb12]
2016-12-21 15:30:36 -06:00
Evgeny Mankov 0215c67ba6 [HIPIFY] Statistics in CSV file.
+ Stats by CUDA ref name.
+ Conversion %.

TODO: Calculation of changed code amount, based on actually replaced bytes.


[ROCm/hip commit: 4bb8bf8dab]
2016-12-21 23:08:01 +03:00
Rahul Garg 191e8fc08e Removed redundant GetPCIBusID int version function
Change-Id: I37f2ff87d09fcfb1e3b104c44c51f606fcb83c01


[ROCm/hip commit: 4704547bab]
2016-12-20 23:25:16 +05:30
Evgeny Mankov 121b909f55 [HIPIFY] Reflect unsupported CUDA API refs in statistics
https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/53

+ Unsupported refs (by HIP) are now might be listed along with the supported ones.
+ Warnings are added for the unhandled (by HIPIFY) refs, for instance:
  "warning: the following reference is not handled: 'cublasContext' [param decl ptr]."
+ Reflect unsupported CUDA API refs in statistics.
+ Occupancy API [HIP_UNSUPPORTED].
+ A few CUBLAS refs are listed as HIP_UNSUPPORTED.

TODO: Statistics in CSV file.


[ROCm/hip commit: 3dd32e969d]
2016-12-19 14:38:19 +03:00
Rahul Garg a93a5a41f3 Fix for HCSWAP-67
Change-Id: I0b2ce5ab933237947fb41d89769db3da16e5be6a

Conflicts:
	src/hip_hcc.cpp


[ROCm/hip commit: fbf7ed63a8]
2016-12-19 16:19:51 +05:30
Ben Sander a5f13421b5 Add name for function
[ROCm/hip commit: 90c69e14bb]
2016-12-17 08:54:09 -06:00
Ben Sander a212162612 Remove HSA dependency from hipFunction_t
Place _groupSegmentSize and _privateSegmentSize inside Function,
remove hsa_executable_symbol_t.


[ROCm/hip commit: 8bf4bd2f7d]
2016-12-17 07:22:56 -06:00
Ben Sander f97cffdc48 Remove USE_DISPATCH_HSA_KERNEL=0 path.
[ROCm/hip commit: 6ed7e1c1c1]
2016-12-17 07:22:56 -06:00
Ben Sander 7402308098 Refactor Module and Function APIs.
- hipFunction_t is now returned by value.  This eliminates dynamic
      allocation / memory management complexity in the module.  Removed
the kernel
      name so the structure is just 16 bytes now.

    - Moved the hsa_executable_load_module and hsa_executable_freeze
      calls to the hipModuleLoad and hipModuleLoadData calls.

    - Apply sharedMemBytes in hipModuleLaunchKernel to group segment
      size (not private).


[ROCm/hip commit: 4d29885be3]
2016-12-17 07:22:33 -06:00
Rahul Garg ae83f93ba4 Mapped hipDevice_t to int
Change-Id: I6cfa56c42b7cd04aa0e0bce510c0d72d34ea211a


[ROCm/hip commit: 263a9614ff]
2016-12-17 16:53:03 +05:30
Aditya Atluri d93219fc00 disabled half native support as inline asm is not working
Change-Id: I3073d8ae39eed321987f0f2f0e689eec4cdbb48c


[ROCm/hip commit: 2665ad2762]
2016-12-16 09:24:59 -06:00
Ben Sander fcdaa05a4d Print limits on CUDA devices
[ROCm/hip commit: 43635f51dc]
2016-12-16 08:55:11 -06:00
Ben Sander 70f125f2a8 Fix typo
[ROCm/hip commit: bd19bb4074]
2016-12-15 14:42:52 -06:00
Ben Sander 5a3c11dd9a fix copyright
[ROCm/hip commit: 8ed38bae69]
2016-12-15 14:42:52 -06:00
Ben Sander 89523da1c2 remove TODO file
[ROCm/hip commit: 4080fe209d]
2016-12-15 14:42:52 -06:00
Evgeny Mankov e29cc18289 [HIPIFY] nested macro is not hipified, when it isAnyIdentifier
Fix for https://github.com/GPUOpen-ProfessionalCompute-Tools/HIP/issues/55


[ROCm/hip commit: 2383d9bc1a]
2016-12-15 21:00:34 +03:00
Brecht Carlier fd99cec62d Update hip_faq.md
Fixed navigation and list.


[ROCm/hip commit: 05b3e2928d]
2016-12-15 14:08:34 +05:30
Martin Schleiss 92dae66b21 Fix another typo
[ROCm/hip commit: b1eace4348]
2016-12-15 14:07:20 +05:30
Martin Schleiss d3ca48fc45 Fix various typos
Conflicts:
	README.md

Change-Id: Ie296d503d16121a62fed1a208352ec2b81c97fd9


[ROCm/hip commit: 263dcfed83]
2016-12-15 14:06:38 +05:30
Peter Steinberger beb82e5109 Fixes a typo: perforamnce -> performance
Change-Id: I85e3b3d22c98c16556227283bfb33530e1bce2cf


[ROCm/hip commit: 961b7890c1]
2016-12-15 14:01:08 +05:30
Aditya Atluri 4ebb6e569f fixed compilation issues
Change-Id: I96692538736e2e4f2da9dba9c8c29a164aec4c0d


[ROCm/hip commit: 68c57c38ff]
2016-12-14 16:50:16 -06:00
Aditya Atluri 727aab2304 added half2 support
Change-Id: I0f3b9b7037fed97e80ec99f5369c75a63f001aae


[ROCm/hip commit: d2daf6ad75]
2016-12-14 14:18:48 -06:00
Sandeep Kumar 1c6c618247 Fixes in Makefile of couple of samples
- modified Makefile for hipblas_saxpy to replaced hcblas.so with hipblas.so as part of HCSWAP-100
- Resolved missing separator issue in peer2peer cookbook Makefile

Change-Id: I678fea267eee1481f02da09379339ed78d3f95f2


[ROCm/hip commit: d78649b978]
2016-12-14 16:27:14 +05:30
Aditya Atluri d4b7fe8385 added simple half math ops
Change-Id: I10b1d1023a9e5f2ba63f28c4a2bbe60ee49a8aee


[ROCm/hip commit: 7c7d948fc6]
2016-12-13 20:20:58 -06:00
Aditya Atluri 0b445d68de disabled compiler flag hcc 4.0 for half support
Change-Id: I32175113f4c05d43310b3a05c2a14e12f6d48b09


[ROCm/hip commit: ed39a7f43b]
2016-12-13 20:06:56 -06:00
Ben Sander 1a7ecbcd04 Add USE_IPC to disable use of IPC APIs. Set to 0.
[ROCm/hip commit: b30e4b4781]
2016-12-13 15:07:04 -06:00
Aditya Atluri ada705544f added few type reinterpret cast device functions
1. __int_as_float
2. __hiloint2double

Change-Id: Id247c196887b24a12090f0521bf91e13afeec733


[ROCm/hip commit: 04ab5f7f56]
2016-12-13 14:41:36 -06:00
Aditya Atluri 8ad8f7ce26 added half math addition ISA support
Change-Id: I293b771f695b499b795d7e53f600c9e4fe2a2071


[ROCm/hip commit: a6fe6222c4]
2016-12-13 09:18:34 -06:00
Evgeny Mankov 4fd6a82aaf [HIPIFY] Code refactoring and total stat collecting
+ Single base class for Preprocessor and MatchFinder classes.
+ Total Stats for multiple files is added.


[ROCm/hip commit: ad3ec15d85]
2016-12-13 18:01:08 +03:00
Evgeny Mankov b2377f20ba [HIPIFY] Rename -n to -examine
[ROCm/hip commit: 95ae514511]
2016-12-12 20:03:01 +03:00
Evgeny Mankov a5ded34092 [HIPIFY] Multiple source files support.
[ROCm/hip commit: 02468d31fc]
2016-12-12 19:57:19 +03:00
Aditya Atluri 90938b804f Changed threadfences to match target parsing by hcc
Change-Id: I28fcabbaacd13495b707f263fd09afaead0665fa


[ROCm/hip commit: 765947aaf5]
2016-12-12 10:16:58 -06:00
Evgeny Mankov 0ec50d179c [HIPIFY] -no-backup option is added.
Is useful for release hipifying in place.


[ROCm/hip commit: 428a1bc79f]
2016-12-08 23:14:19 +03:00
Evgeny Mankov d773facac2 [HIPIFY] -no-output support.
Actually output file is created by clang itself, but isn't updated and is deleted after processing.
In cooperation with -print-stat -no-output (or single -n) is used for examine the source CUDA code.

Conflicting options:
-inplce
-o


[ROCm/hip commit: 2374153c10]
2016-12-08 22:45:10 +03:00
Evgeny Mankov 70a6fee7ef Fix the limitation of supported input files.
Actually .cu and .cuda was supported.

+ All the file names are allowed, including file names without extension.

[IMPORTANT]
To hipify CUDA input file, which name is not *.cu, please add option "-x cuda" after tool's options ending marker "--", for instance:

./hipify-clang NeuralNet -- -x cuda

This option will go to clang itself, not the hipify tool.


[ROCm/hip commit: 23bbe6e467]
2016-12-08 20:28:43 +03:00
Rahul Garg 7d1d1b2a9c Fixed build error due to GetPCIBusId overloaded function
Change-Id: I626446f2c72c8143f08c95367bc1c528abeaf69d


[ROCm/hip commit: d35c8128a8]
2016-12-08 14:35:58 +05:30
Maneesh Gupta 2b9117b353 hcc_detail/hip_runtime_api.h: Fix IPC API signature
Change-Id: I0be0f09c62f231620341141bd66183c3338be56a


[ROCm/hip commit: 6e6b518098]
2016-12-08 12:50:25 +05:30
pensun f6d6861ad4 HIP IPC implementation on ROCr IPC APIs
Change-Id: I1ca9d520f5d0b1b56694211471b81eb7c6c23d16


[ROCm/hip commit: a53d35fd6c]
2016-12-07 15:38:36 -06:00
Rahul Garg 8420edd5e4 hipDeviceGetPCIBusId int version changes for CUDA runtime API
Change-Id: I4d3b995f1d1ac83415ca84808a074e5c8cd72f3c


[ROCm/hip commit: 266b27ac83]
2016-12-07 12:12:40 +05:30
pensun 7b8766bfd2 Fix issue of ihipModule_t double defined
Change-Id: I508677e595776fd573a2f224691116d01288dc78


[ROCm/hip commit: 6fcfab2552]
2016-12-06 19:10:17 -06:00
pensun 4606e364e0 change hipgetPCIID to take int as third parameter
Change-Id: I4429b36756a6d868a769abd783bf28a55147c0d0


[ROCm/hip commit: 01f688587e]
2016-12-06 14:24:09 -06:00
pensun 6d2bf52f9f IPC prototyps and part of the implementation included
Change-Id: Id88c7f155d23ec63f57a6ef05098fba43f8af336


[ROCm/hip commit: 17b98d59b8]
2016-12-06 14:24:09 -06:00
pensun 2b557e949e local changes for hipnccl
Change-Id: I05a1f0381ce2914a800f573342cc954eb5ff82d9


[ROCm/hip commit: eba2595611]
2016-12-06 14:22:02 -06:00
Ben Sander 1c91897cb8 Enabled USE_DISPATCH_HSA_KERNEL, with serialization in hipModuleUnload.
[ROCm/hip commit: 6209565ec0]
2016-12-06 10:14:01 -06:00
Rahul Garg bb1b5c3a25 Simple test case for hipDeviceGetPCIBusId
Change-Id: I0fb6d1bef4739a5b6280928f7b349d95c1656431


[ROCm/hip commit: 9aebbe00de]
2016-12-06 17:31:54 +05:30
Rahul Garg 094bca8b5d Build Error correction in hipDeviceGetPCIBusId
Change-Id: I50ff4d95b7a732924c7a991cba60400b1c93c0de


[ROCm/hip commit: ca06747e1f]
2016-12-06 17:09:21 +05:30
Rahul Garg 61dffc7e18 Changed hipDeviceGetPCIBusId to return Bus ID as string
Change-Id: I6d5aa7362084109d34bc015d948f8723b2a38ee9


[ROCm/hip commit: 27072b8972]
2016-12-06 16:55:17 +05:30
Maneesh Gupta 55aa1d85d9 Remove redundant variable g_malloc_heap_size
Change-Id: Idaf47be70488f0deb3eab05a86d9c5a413d3fff7


[ROCm/hip commit: 6d5145eba4]
2016-12-06 10:19:03 +05:30