Commit Graph

3919 Commits

Author SHA1 Message Date
Mateusz Jablonski 9337911742 fix: add self-assign check in operator=
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-03 08:09:16 +02:00
Mateusz Jablonski 16d94a0b8c refactor: remove dead code
Related-To: NEO-9038
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-10-03 07:34:31 +02:00
Dunajski, Bartosz 891cc2d09b feature: enable waiting for in-order events from regular CmdList
This is prework. Functional only on immediate CmdLists

Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-10-02 16:42:10 +02:00
Spruit, Neil R 5a22477b83 fix: Update Relaxed Memory Size to check physical before global mem size
Related-To: NEO-9012

- Allows for the memory size requested by the user to be within the
physical memory size if that is set, otherwise the limit is the global
memory size.

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-10-02 16:02:35 +02:00
John Falkowski 56f05303c9 feature: add support for zeMemGet/SetAtomicAccessAttributeExp
Resolves: NEO-8219

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-10-02 15:59:17 +02:00
Compute-Runtime-Validation 1ac37d4a49 Revert "refactor: Enable CSR heap sharing on Older Generation platforms"
This reverts commit 58ff9c6d94.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-10-02 15:13:23 +02:00
Mateusz Jablonski 4a80380aaf refactor: remove redundant comparing array address to zero
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-29 15:36:22 +02:00
Dunajski, Bartosz 5402c9a13f feature: pass in-order exec info from Regular CmdList to Event
Related-To: NEO-8145

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-29 15:29:19 +02:00
Hoppe, Mateusz 5c565efe28 feature: bindless global heap with debugger
- program debugSurface's SurfaceState at the beginning of Bindless Surface
State Heap - SPECIAL_SSH
- ensure SPECIAL_SSH is resident

Related-To: NEO-7063

Signed-off-by: Hoppe, Mateusz <mateusz.hoppe@intel.com>
2023-09-29 13:13:46 +02:00
Brandon Yates 253e1df43b feature(debugger): Filter page fault threads by start IP
When a thread is stopped due to potential page fault
we must check AIP against start IP to ensure it is not
a newly started thread accidentally caught by PF algorithm

Related-to: NEO-8617

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-09-29 12:45:32 +02:00
Mateusz Jablonski 81b535d3f1 fix: correct handling ze group count in appendLaunchMultipleKernelsIndirect
Related-To: HSD-18033883770
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-29 12:16:32 +02:00
Jitendra Sharma 58ff9c6d94 refactor: Enable CSR heap sharing on Older Generation platforms
Related-To: LOCI-4312
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2023-09-29 11:54:51 +02:00
Dunajski, Bartosz c5f5f60f8d feature: cross CmdList and Event in-order data sharing [1/n]
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-29 11:14:18 +02:00
Mateusz Jablonski b8c3dea8dd refactor: simplify KernelImmutableData dtor
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-28 08:36:01 +02:00
Brandon Yates 9c56a11c2e refactor(debugger): Move PF exception check into bitmask
Related-to: NEO-8617
Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-09-28 08:28:00 +02:00
Mateusz Jablonski 3a21b3b228 refactor: remove not needed code
Related-To: NEO-7527

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-28 07:52:39 +02:00
Mateusz Jablonski cb730d11f4 refactor: pass ze group count as a reference, not a pointer
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-28 07:41:04 +02:00
Dunajski, Bartosz 2aaf5a1f03 feature: enable RelaxedOrdering for in-order Events chaining
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-27 18:36:31 +02:00
Dunajski, Bartosz 4e8600d8d0 feature: initial support for RelaxedOrdering of in-order Events chaining
Disabled by default.

Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-27 16:45:20 +02:00
Mateusz Jablonski 09044dfbaa refactor: remove not needed code
Related-To: NEO-7527

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 14:35:49 +02:00
Katarzyna Cencelewska e71db368db fix: use proper pointer inside appendWriteGlobalTimestamp
Resolves: NEO-8105
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2023-09-27 14:00:37 +02:00
Dunajski, Bartosz 96a7f63371 feature: allow waiting for immediate in-order event from Regular CmdList
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-27 13:03:44 +02:00
Mateusz Jablonski 64fd270d56 refactor: pass reference to pointer as arg to findAllocationDataForRange
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-27 12:20:23 +02:00
Kacper Nowak 160303924d refactor: Correct logic for SIMD1
- For calculating number of threads per workgroup, for SIMD 1, return
local work size (each software thread should be mapped into a whole hardware
thread).
- Correct logic of calculating space for per thread data for SIMD 1.
- Minor: unit tests refactor.
- Corrected naming.
Related-To: NEO-8261
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-09-26 15:28:37 +02:00
Bari, Pratik 3c2ef4fbbf feature(sysman): Boilerplate for Sysman Product Helper
- Added the product specific files and some functions
- Have added APIs to call the interface to the Sysman product helper
- Have added ULTs for the new APIs

Related-To: NEO-8584

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-09-26 15:22:13 +02:00
Brandon Yates b3464a2621 feature(debugger): Expose DBG regset
Related-To: NEO-8383

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2023-09-26 11:50:29 +02:00
Dunajski, Bartosz 97b8cac1d1 feature: enable in-order regular cmd lists patching
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-26 09:19:05 +02:00
Dominik Dabek eebf2bbd26 performance(ocl): timestamp packet count per gfx
Add support for different timestamp packet counts per gfx family.
Change all packet counts to 1 except for xe-hpc.

Related-To: NEO-8154

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-09-25 20:34:58 +02:00
Dunajski, Bartosz 529aa60563 performance: optimize RelaxedOrdering in-order Barrier programming
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 15:45:36 +02:00
Dunajski, Bartosz c115eeb108 feature: dont program pipe_control for in-order barrier profiling
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 15:32:18 +02:00
Dunajski, Bartosz 202a33ffac feature: track InOrder allocation ownership
CmdList can be released before Event. In this case, GfxAllocation
destruction must be deferred.

Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 15:03:25 +02:00
Dunajski, Bartosz 80d0c74605 fix: track registered CSR clients 2
Related-To: NEO-8884

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-25 09:35:55 +02:00
Compute-Runtime-Validation ade538ce54 Revert "fix: track registered CSR clients"
This reverts commit 53f635e392.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-24 10:07:26 +02:00
Dunajski, Bartosz d515ba965c feature: return not ready state for not signaled in-order event
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-22 15:56:50 +02:00
Dunajski, Bartosz 480c058cb2 feature: in-order patching for ComputeWalker
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-22 15:00:44 +02:00
Zbigniew Zdanowicz d725d454ea performance: emit fence on immediate copy engine only for blocking calls
- this change handles level zero immediate command lists on copy engine
- monitor fence will be dispatched for blocking calls
- asynchronous mode will dispatch monitor fence only on host synchronization

Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-22 13:14:37 +02:00
Compute-Runtime-Validation f19898d178 Revert "fix: enable cache env variables for level-zero"
This reverts commit 74689fa58a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-09-22 13:10:02 +02:00
Dunajski, Bartosz 53f635e392 fix: track registered CSR clients
In L0 its not possible to track objects relations. For example CmdList
may be removed before Event.
In such case, Event needs to safely skip unregister call, without
accessing CmdList/CmdQueue object.

Related-To: NEO-8884

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-22 09:42:55 +02:00
Kacper Kasper 74689fa58a fix: enable cache env variables for level-zero
Related-To: NEO-4262

Signed-off-by: Kacper Kasper <kacper.k.kasper@intel.com>
2023-09-21 17:03:33 +02:00
Maciej Plewka 8658fdb04e fix: Use stack vec for api specific prefix
Related-To: NEO-8388, GSD-6296

Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-09-21 16:10:54 +02:00
Dunajski, Bartosz 42496ac96d feature: initial support for patching regular in-order CmdList
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-21 14:20:50 +02:00
Maciej Bielski 97e7cda912 feature: Optimize intra-module kernel ISA allocations
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.

Improve the situation by reusing the parent allocation (owned by the
module instance) for modules, which kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.

Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2023-09-21 13:55:45 +02:00
Dunajski, Bartosz b94f58abaa feature: debug flag to enable in-order events
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-21 11:22:48 +02:00
Kulkarni, Ashwin Kumar 3a5b038c45 fix(sysman): bugfix for return of correct uuid with zesInit flow
Related-To: LOCI-4907

Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com>
2023-09-21 09:41:49 +02:00
Latif, Raiyan 9bb559d468 feature: Ray Tracing Acceleration Structure (RTAS) Support
Related-To: LOCI-3419

Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com>
2023-09-20 19:57:56 +02:00
Dunajski, Bartosz 020822fe29 feature: re-enable in-order semaphores for regular CmdLists
Related-To: NEO-7966

Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-09-20 15:34:05 +02:00
Mateusz Jablonski b1808f7830 fix: correct suggested number of work groups for concurrent kernels on PVC
value depends on CCS count:
- single CCS mode (default) - 50% available
- two CCS mode - 25% available
- four CCS mode - 12.5% available

Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-09-20 13:40:22 +02:00
Mateusz Hoppe 69f5ca6345 feature: bindless addressing - flush state cache after reusing SS slot
- when Surface State is reused for new resource, State Cache needs to be
invalidated

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-09-20 12:53:32 +02:00
Zbigniew Zdanowicz 09ab0ca1d2 performance: remove not needed validation loop
- validation of arguments must be moved to validation layer

Related-To: NEO-7828

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-20 12:35:18 +02:00
Zbigniew Zdanowicz 40e169f8e2 refactor: add batch buffer attribute for explicit monitor fence dispatch
Related-To: NEO-8395

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-09-20 12:18:32 +02:00