Commit Graph

12272 Commits

Author SHA1 Message Date
Kacper Nowak d2a2656caa fix(zebin): Enforce fallback to CTNI on TGL/ICL for nGEN dummy kernel
For TGL and ICL platforms - if on clCreateProgramWithSource()
call we detect a nGen dummy kernel usage - enforce fallback to the
patchtokens format (only for this kernel).
- corrected naming
- minor ULTs refactor (less dependencies).
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-01-13 14:36:56 +01:00
Dominik Dabek 0c3cde2141 fix(ocl): adjust pool buffer allocator
Increase chunk alignment from 256 to 512.
Restores performance in some workloads with pool enabled but lowers maximum
possible number of buffers in pool from 256 to 128.

MemObj size will keep the value passed to clCreateBuffer ie. will not be
aligned up by chunk alignment.
CL_MEM_SIZE will now return same value as with pool disabled.

Related-To: NEO-7332

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-01-13 14:20:29 +01:00
Zbigniew Zdanowicz ee99df18aa Fix event signaling in command list extension function
Related-To: NEO-7490

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-01-13 14:13:00 +01:00
Mateusz Jablonski d17b1e9019 test: add missing test for sampler WA conditions for DG2 platform
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-13 13:51:30 +01:00
Mateusz Jablonski a8c125ceb9 test: add missing test for compression selector
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-13 13:42:58 +01:00
Mateusz Jablonski 91a9b925f7 fix: Append device id to ambigous device names
Related-To: NEO-7537
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-13 13:31:18 +01:00
Zbigniew Zdanowicz e8b0024b5c Enable signal all packets
Related-To: NEO-7490

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-01-13 13:15:59 +01:00
Mateusz Jablonski c4759884d8 fix: defer initialization of cross root device tag allocations
additional tag allocations are not needed before creating OCL contexts
with multiple root devices

Related-To: NEO-7634

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-13 13:13:05 +01:00
Lukasz Jobczyk dff2c2d8e3 Align external host ptr allocation gpu va to 2MB
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-13 12:37:10 +01:00
Cencelewska, Katarzyna f94528097a fix: add set stateCacheInvalidationEnable to flush cache
when blit operation and dcflush needed

resolves problem with corruptions visible when switch ccs with bcs
on platforms without engines coherency

Resolves: NEO-7577
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-01-13 10:35:54 +01:00
HeFan2017 2ea734491a Deferred deletion of allocations in main thread
Add a clearQueueTillFirstFailure interface to DeferredDeleter, which
iterates the queue from the front and delete the allocations in the
queue till a failure. It is called by defer deletion of allocations
occupied by mutliple contexts to unlock the execution in main thread

Related-To: NEO-7532

Signed-off-by: HeFan2017 <fan.f.he@intel.com>
2023-01-13 09:20:35 +01:00
Warchulski, Jaroslaw 1ad4b81b28 Cleanup includes 30
Cleaned up files:
opencl/test/unit_test/fixtures/cl_device_fixture.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-13 08:22:14 +01:00
Maciej Plewka 16bc84e27d feature(ocl) use tags to synchronize multi root device events
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2023-01-13 08:09:32 +01:00
Warchulski, Jaroslaw fecb52ac49 Cleanup includes 29
Cleaned up files:
opencl/source/helpers/cl_memory_properties_helpers.h
shared/source/memory_manager/surface.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-13 07:53:03 +01:00
Bari, Pratik 21a0a4af52 Removal of Template Specializations and NiceMocks
Template Specializations and NiceMocks have been removed in
Linux Common Code, ECC, Firmware, Frequency, Global Operations,
Power, PCI, RAS and Standby ULTs.

Related-To: LOCI-3391

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-01-13 05:33:53 +01:00
Joshua Santosh Ranjan 2e1d20883b Fix the bpp to 1 for platforms not using blit for region copy
For platforms which do not using blit commands for copying,
the bpp could be set as 1, which leaves the transfer dimensions
unmodified.

Related-TO: LOCI-3723

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-01-13 05:31:16 +01:00
Bari, Pratik a5ac8197e5 Removal of Template Specializations and NiceMocks
Template Specializations and NiceMocks have been removed in
Memory, Performance, Scheduler and Temperature ULT.

Related-To: LOCI-3391

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-01-13 05:21:56 +01:00
Bari, Pratik 90ea027c7d Removal of Template Specializations and NiceMocks
Template Specializations and NiceMocks have been removed in
Diagnostics, Engine, Events and Fabric Port ULT.

Related-To: LOCI-3391

Signed-off-by: Bari, Pratik <pratik.bari@intel.com>
2023-01-13 05:17:50 +01:00
Joshua Santosh Ranjan 3fdb9ae0dd Fix zero elapsed time for AppendMemoryCopy
1. If cpu based copy is used, it is possible that copy time is
less than device timestamp resolution. In this scenario, this patch
returns 1 instead of 0.

2. This patch also fixes usage of CPU time instead of CPU timestamp
for end timestamp calculation.

Related-To: LOCI-3754

Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-01-13 04:40:50 +01:00
Spruit, Neil R d1ee840ba5 Enforce 64KB alignment when mmaping GPU_TIMESTAMP_DEVICE_BUFFER
Related-To: LOCI-3866

Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-01-12 22:52:25 +01:00
John Falkowski 01017a5df3 L0: add support for deferred memory free as per ContextImp::freeMemExt
Signed-off-by: John Falkowski <john.falkowski@intel.com>
2023-01-12 21:05:02 +01:00
Aravind Gopalakrishnan 889c2fe4e9 feature: Enable additional engine queries
Adds flexibility to query for additional engines

Related-To: LOCI-3346

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2023-01-12 19:43:02 +01:00
Andrzej Koska be9775891c Narrowing the usDeviceID range for WA
This patch narrows down the scope covered by WA
to G10 machines only

Related-To: NEO-7475
Signed-off-by: Andrzej Koska andrzej.koska@intel.com
2023-01-12 16:40:18 +01:00
Mateusz Jablonski bc5bce6847 fix: remove device id value from device name string
Resolves: NEO-7537
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-12 16:34:22 +01:00
Mateusz Jablonski 0a75560d7d Reduce scope of WAs for DG2 platforms
Related-To: NEO-7607, HSD-14010744585, HSD-14010847105
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-12 14:13:54 +01:00
Zbigniew Zdanowicz 5a82b84219 refactor event interface for better use of functions
Related-To: NEO-7490

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-01-12 11:50:35 +01:00
Zbigniew Zdanowicz 5339d5c0af refactor level zero event for better access of completion field
Related-To: NEO-7490

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-01-12 10:57:57 +01:00
Lukasz Jobczyk 6eb66ee133 [PVC] Enable tlb flush
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-12 09:37:14 +01:00
Compute-Runtime-Validation ddb7eb07b0 Revert "Align external host ptr allocation gpu va to 2MB"
This reverts commit d24f403cc4.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-01-12 09:33:16 +01:00
Krystian Chmielewski 7c408c23b2 refactor(zeinfo): improve code readibility
Signed-off-by: Krystian Chmielewski <krystian.chmielewski@intel.com>
2023-01-12 09:33:00 +01:00
Lukasz Jobczyk 3f0e8f52bc [PVC] Remove tlb flush
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-12 07:46:29 +01:00
Compute-Runtime-Validation e53eae6e5f Revert "[PVC] Remove tlb flush"
This reverts commit 181d2021ed.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-01-11 19:27:55 +01:00
Warchulski, Jaroslaw 191d4de033 Cleanup includes 28
Cleaned up files:
opencl/source/command_queue/command_queue.h
opencl/source/command_queue/gpgpu_walker.h
shared/source/helpers/memory_properties_helpers.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-11 17:47:33 +01:00
Zbigniew Zdanowicz 0586ceb613 add missing include
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-01-11 17:26:36 +01:00
Kamil Diedrich ad579be87c Use common definition of UUID for Intel GPUs
This patch will eventaully allow matching UUIDs queried
from NEO with UUIDs queried from other GPU APIs implementations
for Intel GPUs

Signed-off-by: Kamil Diedrich <kamil.diedrich@intel.com>
2023-01-11 16:31:05 +01:00
Kamil Kopryk 200734892b refactor: don't use global gfxCoreHelper getter 7/7
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-11 16:19:32 +01:00
Warchulski, Jaroslaw 3d59dce80c Cleanup includes 27
Cleaned up files:
opencl/source/command_queue/command_queue.h
shared/source/built_ins/registry/built_ins_registry.h
shared/source/kernel/kernel_descriptor.h

Related-To: NEO-5548
Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-11 16:10:28 +01:00
Zbigniew Zdanowicz d19e8ffbf9 Fix issues in signal all event packets 8/n
- fix: signal correct remainder packets for single kernels
- fix: signal correct reminder packets for fill and copy memory operations
- fix: all level of event compaction properly estimate reminder events
- refactor: unify code for event signal for kernels

Related-To: NEO-7490

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-01-11 15:41:12 +01:00
Lukasz Jobczyk 181d2021ed [PVC] Remove tlb flush
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-11 15:11:35 +01:00
Lukasz Jobczyk d24f403cc4 Align external host ptr allocation gpu va to 2MB
Related-To: NEO-7116

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-11 13:45:06 +01:00
Mateusz Jablonski 70bd006223 feature: brand string update for data center platforms
Resolves: NEO-7045
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-11 13:32:05 +01:00
Fabian Zwolinski 9dfed7cd54 Use cached group sizes in zeKernelSetGroupSize
Optimize zeKernelSetGroupSize by early returning success if group size
values have not changed since last function call.

Moved ImplicitArgs construction above setGroupSize call
in kernel initialization to prevent pImplicitArgs being nullptr
in calls in which we use cached group sizes and early return.

Related-To: NEO-7394
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com>
2023-01-11 12:50:51 +01:00
Mateusz Jablonski 3a5a418488 Reduce scope of compression WA for DG2 platforms
Related-To: NEO-7607
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-01-11 12:46:53 +01:00
Mateusz Hoppe 5e4604f66d fix: Do not wait infinitely for fence when context is hung
- when driver is unloaded cleanup paths wait for not completed
fences. In case a context has hung - set finite wait timeout

Resolves: NEO-7613

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-01-11 11:29:44 +01:00
Kamil Kopryk af40c80162 refactor: don't use global gfxCoreHelper getter 6/n
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-11 11:07:28 +01:00
Kamil Kopryk 100f7e3745 refactor: don't use global gfxCoreHelper getter 5/n
Related-To: NEO-6853
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2023-01-11 10:47:39 +01:00
Lukasz Jobczyk 2d21d42edb Adjust bcs split to flush task submission
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-01-11 09:16:44 +01:00
Warchulski, Jaroslaw 4794648978 Cleanup includes 26
Cleaned up files:
opencl/source/command_queue/csr_selection_args.h
opencl/source/event/event.h
shared/source/helpers/engine_control.h
shared/source/sku_info/definitions/sku_info.h

Related-To: NEO-5548

Signed-off-by: Warchulski, Jaroslaw <jaroslaw.warchulski@intel.com>
2023-01-11 09:10:45 +01:00
Mateusz Hoppe d623ef391b feature: print printf contents right after gpu hang detection
- printf used in kernel is printed on synchronize() call, if
hang is detected - printf buffer was not printed immediately but
only when Kernel was destroyed
- this change adds copying printf buffer with internal engine
(whenever available) right after hang detection on
CommandQueue::synchronize() call

Related-To: NEO-6427

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-01-11 08:14:00 +01:00
Compute-Runtime-Validation bd1da1b095 Revert "feature: brand string update for data center platforms"
This reverts commit 06f44a4b94.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-01-11 05:19:43 +01:00