rocm-systems

Συγγραφέας	SHA1	Μήνυμα	Ημερομηνία
David Yat Sin	8d3fee5095	Use HybridMutex for signal mutexes Implement HybridMutex to improve latencies compared to KernelMutex when there is contention between several threads calling hsa_signal_create and hsa_amd_signal_async_handler. Change-Id: If53377033e749b0050727964c9303f09b02527cc	2024-01-16 21:29:39 +00:00
Sean Keely	df55cb0450	Rework memory locks to allow device parallelism in alloc/free. Prior solution used a single global lock to protect the memory tracking structures. This change protects the memory tracking structure with a shared mutex (rw lock) in shared (r) mode for memory allocations and frees so that long duration processes, calling to kfd, can be done in parallel. Operations which must modify the memory map take the mutex in exclusive mode (w) and must not call to the thunk while holding the mutex. The fragment allocator now requires separate protection and is protected with a mutex at the device level. Protecting at the device level, rather than pool, allows retention of the current recursive design and allows calling Trim from withing Allocate. This could be made finer (pool level locks) but would require backing out of Allocate entirely to call Trim. Trim and any retried Allocation must be done in isolation (per device) or we may report OOM when memory is actually available in some pool's fragment cache. So some device level serialization is required in at least some paths. Change-Id: I7c1e94d6965ffcc602b12fefdd3a6e97b84b5e00	2021-11-24 19:22:05 -06:00
Ramesh Errabolu	fa13208698	Add rocr namespace to core header and impl files Change-Id: I1e1b33f9bba1078d049bc19797889988c3e43360	2020-06-19 22:34:21 -04:00
Sean Keely	ce19721c88	Update copyright date. Change-Id: If4bf4c20cf051878bfe759080bb7345d884dd53d	2020-06-19 22:34:01 -04:00
Sean Keely	9dfdce5b3c	Improve branch elimination in ScopedAcquire. GCC can't reasonably be told that the lock ptr isn't null. Adding a private bool allows the branch to be eliminated, along with the bool. Change-Id: I0605d69474d6a6e6951be93c0af1d8caf3f77124	2017-09-19 06:08:36 -04:00
Sean Keely	117be0b55a	Add suballocator for ordinary VRAM allocations smaller than 2MB. Track pointer info for sub 2MB fragment allocations in allocation_map_. Add fragment support to IPC. Change-Id: I00cfc2e2fa289aac90a4718c392f9bb056a61a87	2017-09-19 06:08:36 -04:00
James Edwards (xN/A) TX	7d2bc9d113	Separate open source core runtime code from DK makefiles. [git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1250152]	2016-03-22 18:10:13 -05:00
James Edwards (xN/A) TX	7d1e6c3a57	Remove opensrc test files. [git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1249961]	2016-03-22 13:39:51 -05:00
James Edwards (xN/A) TX	c9ffe0004e	Check open source core runtime code into perforce. This includes license and README files. [git-p4: depot-paths = "//depot/stg/hsa/drivers/hsa/runtime/": change = 1249136]	2016-03-20 15:39:40 -05:00

9 Υποβολές