Vinod Tipparaju
248e955bad
Fix to optimize PC dispatched during appendMemory*() calls.
...
Includes fix to optimize PC post walker in case of timestamp events.
Related-To: LOCI-1995
Signed-off-by: Vinod Tipparaju <vinod.tipparaju@intel.com>
2021-02-05 13:05:46 +01:00
Igor Venevtsev
3df6110a17
Add extra parameters to setArgStateful()
...
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2021-02-05 12:24:27 +01:00
Jaime Arteaga
2fa27ebe73
Move ISA at kernel creation time (2)
...
For internal kernels, move ISA only at creation time. For user
kernels, do it at module creation time.
Related-To: LOCI-2009
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-02-04 21:03:41 +01:00
Jaime Arteaga
5a7f844e46
Revert "Defer builtin initialization until first use"
...
This reverts commit df84232ef2.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-02-04 18:21:12 +01:00
Zbigniew Zdanowicz
eac124f7b2
Add cmake file
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-02-04 16:49:53 +01:00
Mateusz Hoppe
c7767de7e9
Revert "Fix version-script for ze_intel_gpu"
...
This reverts commit 7e27ebe70b.
2021-02-04 15:44:01 +01:00
Zbigniew Zdanowicz
e4eb6d4c64
Reorganize files
...
Related-To: NEO-5244
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2021-02-04 10:22:08 +01:00
Mateusz Hoppe
f7613a5827
Program surface state when debugging is enabled
...
- surface state for debugSurface should be set when
kernel is compiled for debug
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-02-02 17:53:12 +01:00
Bartosz Dunajski
580fdd757c
Improve buffer surface state programming
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-02-02 14:42:18 +01:00
Mateusz Hoppe
0ca1cdc565
Add State Sip end WA
...
Related-To: NEO-5479
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-02-01 19:15:57 +01:00
Mateusz Hoppe
7e27ebe70b
Fix version-script for ze_intel_gpu
...
- export only ze* functions
Related-To: NEO-5509
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-02-01 18:00:27 +01:00
Jaime Arteaga
afffedebb2
Move ISA at kernel creation time
...
Instead of moving the ISAs for all kernels in a module when the module
is created, move the ISA when the kernel is created, to avoid
unnecessary memory transfers.
Related-To: LOCI-2009
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-02-01 13:28:38 +01:00
Jaime Arteaga
df84232ef2
Defer builtin initialization until first use
...
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-02-01 10:29:20 +01:00
Daria Hinz
c55886e9ff
Disable adding an event for each split kernel
...
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
2021-01-29 15:14:54 +01:00
Bartosz Dunajski
c2e333fe38
Update compression encoding interface + test traits
...
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2021-01-29 13:57:15 +01:00
Lukasz Jobczyk
8ed14d0c9f
Use ZE_AFFINITY_MASK for neo devices
...
Resolves: NEO-5438
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-01-29 11:13:29 +01:00
Vinod Tipparaju
7e9461ffa0
Fix to optimize PC dispatched during appendMemoryCopy() call.
...
Eliminate redundant PC in case event signal scope is either device or host.
The optimization is applicable to timestamp events as well.
Related-To: LOCI-1995
Signed-off-by: Vinod Tipparaju <vinod.tipparaju@intel.com>
2021-01-29 01:26:40 +01:00
Aravind Gopalakrishnan
8f36ca5736
Flip order of instructions in appendWaitOnEvents
...
It might be beneficial to flush the caches in parallel with
the wait instruction
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2021-01-29 01:26:24 +01:00
Mateusz Hoppe
aa79af46ac
Fix state sip programming
...
- STATE_SIP should be added once for MidThread preemption
or when debugger is used for non-internal cmdQs
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-01-28 16:59:35 +01:00
Igor Venevtsev
bb72beac6b
Add extra parameters to programStateBaseAddress()
...
Signed-off-by: Igor Venevtsev <igor.venevtsev@intel.com>
2021-01-28 16:03:17 +01:00
Aravind Gopalakrishnan
72e002a3ca
Optimize appendBarrier to remove any redundant Pipe Controls
...
If we know a signal event is provided, we can remove the need to use
a separate pipe control for a barrier action.
Related-To: LOCI-1954
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2021-01-27 07:27:41 +01:00
Young Jin Yoon
4168ee8120
Print output from Kernel dtor to avoid segfault
...
Related-to: LOCI-1912
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2021-01-21 23:00:15 +01:00
Vinod Tipparaju
494a59c7df
Fix to dispatch MEDIA_VFE_STATE with updated scratchSpacePointer & Size
...
Minor fix to reset scratch space size and indirect params during cmdList reset
Signed-off-by: Vinod Tipparaju <vinod.tipparaju@intel.com>
2021-01-21 14:29:34 +01:00
Mateusz Hoppe
5d2ea72db6
Handle relocations from zeBin
...
Related-To: NEO-5323
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-01-21 13:16:38 +01:00
Lukasz Jobczyk
18bfc3be8b
Revert "Extend ZE_AFFINITY_MASK to OCL"
...
This reverts commit 452b53fb4d02ebf4205018b9fd982422e2d85f30.
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-01-21 10:31:40 +01:00
Daria Hinz
5a05d7643c
Correct event reset
...
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
2021-01-20 19:22:07 +01:00
Lukasz Jobczyk
d9b6280511
Extend ZE_AFFINITY_MASK to OCL
...
Resolves: NEO-5438
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2021-01-19 14:10:44 +01:00
Maciej Plewka
24a88d22aa
Fix scratch in bindless mode
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-01-18 15:46:18 +01:00
Jaime Arteaga
05b5ad37ea
Initialize kernel private surface when kernel is created
...
Do this instead of when the associated module is created, to avoid
allocating memory for kernels that are never created or used.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-13 17:22:40 +01:00
Jaime Arteaga
5c9d43e618
Revert "Remove unnecessary flush in event profiling"
...
This reverts commit a0db607083.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-10 22:59:41 +01:00
Jaime Arteaga
08655a315c
Revert "Initialize kernel private surface when kernel is created"
...
This reverts commit be2a87fe98.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-10 22:56:32 +01:00
Jaime Arteaga
26b036ab97
Improve zeCommandListAppendMemoryFill Performance (2)
...
Add missing kernel for remainder kernel when pattern size is 1.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-09 08:41:17 +01:00
Jaime Arteaga
a0db607083
Remove unnecessary flush in event profiling
...
Remove flush before appending appendWriteKernelTimestamp, since
the associated PC is only to ensure kernel execution has completed
in the device, so flush to the host is not required.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-09 02:40:45 +01:00
Jaime Arteaga
be2a87fe98
Initialize kernel private surface when kernel is created
...
Do this instead of when the associated module is created, to avoid
allocating memory for kernels that are never created or used.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-08 19:22:17 +01:00
Mateusz Hoppe
6f69bd3560
Initialize SIP kernel in Device::create()
...
Related-To: NEO-4878
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2021-01-08 15:46:23 +01:00
Slawomir Milczarek
55f3c8f134
Add resolve capability for compressed USM device allocations
...
Related-To: NEO-5107
Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>
2021-01-08 12:16:26 +01:00
Jaime Arteaga
479d01c118
Improve zeCommandListAppendMemoryFill Performance
...
Improve L0 fill operations by copying the pattern using
two kernels: one that copies four bytes at a time, and one
that takes care of the remainder. Additionally, a new
allocation is created to fill up at least a cacheline.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-08 07:24:55 +01:00
Daria Hinz
4591101541
Implementation of profiling for copy/fill APIs
...
Signed-off-by: Daria Hinz <daria.hinz@intel.com>
2021-01-07 09:43:00 +01:00
vinod krishna
7d29a0342a
maxHardwareContexts now reports 64K
...
Related-To: NEO-1582
Signed-off-by: vinod krishna <vinod.krishna@intel.com>
2021-01-06 07:04:55 +01:00
Maciej Plewka
3ca77a6cbe
Program sba for global bindless heaps
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2021-01-04 14:23:47 +01:00
Jaime Arteaga
d7ea713c5f
Revert "Initialize kernel immutable data when kernel is created"
...
This reverts commit a6ac10088c.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2021-01-04 11:11:39 +01:00
Young Jin Yoon
e09ac446c4
Mask bit 0 of timestamp for event profiling
...
Related-to: LOCI-1161
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2020-12-31 23:51:12 +01:00
Kamil Diedrich
e0da0e1868
Fix for block load and store
...
Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com>
2020-12-31 12:28:26 +01:00
Mateusz Jablonski
1b7d7afc07
Refactor USM properties
...
store reference to std of root device indices and device bitfields
store NEO::Device in USM properties
Related-To: NEO-3691
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2020-12-30 11:36:56 +01:00
Young Jin Yoon
06dad67c5a
Add a return statement for getCsrForLowPriority
...
Related-to: LOCI-1124
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2020-12-29 21:34:05 +01:00
Jaime Arteaga
a6ac10088c
Initialize kernel immutable data when kernel is created
...
Do this instead of when the associated module is created, to avoid
allocating memory for kernels that are never created or used.
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
2020-12-29 19:29:10 +01:00
Vinod Tipparaju
bbb0e9d095
Fix to append signal/wait events for appendMemoryFill & appendMemoryCopyRegion
...
Signed-off-by: Vinod Tipparaju <vinod.tipparaju@intel.com>
2020-12-29 07:28:50 +01:00
Young Jin Yoon
571007cfe9
Support low priority command queues
...
Related-to: LOCI-1124
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2020-12-24 05:00:34 +01:00
Vinod Tipparaju
5440c83539
Fix to acquire lock during migrating shared allocation via makeResident() call
...
This fix is required to guarantee thread safety
Signed-off-by: Vinod Tipparaju <vinod.tipparaju@intel.com>
2020-12-24 04:42:08 +01:00
Jitendra Sharma
7c70a14bc4
Implement ZE_ENABLE_PCI_ID_DEVICE_ORDER environment variable
...
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2020-12-23 12:08:43 +01:00