Commit Graph

499 Commits

Author SHA1 Message Date
Kamil Kopryk
8757ecf2f7 refactor: reuse tag allocation for host function data
Related-To: NEO-14577
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-11-05 14:51:20 +01:00
Kamil Kopryk
f84a5fbee9 feature: add host functions workers
* add common host function worker interface
* add worker as a single thread per csr with 3 modes
* add logic for waiting on internal tag, check gpu hang
* if tag is in pending state, read callback data, run callback
and signal completion
* threads will exit the work loop once stop request
is called in finish
* add multi thread unit tests

Related-To: NEO-14577
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2025-11-03 12:11:17 +01:00
Compute-Runtime-Validation
681eac2f13 Revert "performance: increase cl buffer pool size"
This reverts commit 6e67271454.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-10-20 13:55:23 +02:00
Szymon Morek
6e67271454 performance: increase cl buffer pool size
Related-To: NEO-16452

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-10-17 12:09:23 +02:00
Szymon Morek
c78c1515de performance: reuse cmd buffer without dc flush
Related-To: NEO-16348

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-10-16 16:26:54 +02:00
Aleksander Czerwionka
cfbbe04e32 refactor: decrease UsmMemAllocPoolsManager pool sizes in unit tests
Related-To: NEO-16082

Might fix NEO-16307

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2025-10-16 11:48:04 +02:00
Compute-Runtime-Validation
2eb8928ec5 Revert "performance: increase heap size to 4MB"
This reverts commit f41bb3517a.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-10-10 22:23:23 +02:00
Mateusz Jablonski
1460ae5202 test: improve capturing logs in tests timeout scenario
perform OnTestIterationEnd callback of xml generator prior to abort

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-10-10 19:13:31 +02:00
Brandon Yates
7b27c3668e fix: calculate subSlicesPerSlice correctly on XeKmd
Populate SliceInfo during topology creation.
Set L0 device props subslicesPerSlice to max
enabled subslice on any slice.

Related-to: NEO-16283

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-10-10 10:44:23 +02:00
Szymon Morek
f41bb3517a performance: increase heap size to 4MB
Related-To: NEO-16348

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-10-09 13:03:53 +02:00
Radoslaw Jablonski
54de14a9dc refactor: apply clang-format's InsertBraces rule
Formats code before InsertBraces rule is enabled.

Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-10-06 15:32:46 +02:00
Bellekallu Rajkiran
9cccbcabe1 fix: Reset kernelWithAssertAppended flag
On new append calls, reset flag if previous submissions
are completed.

Related-To: NEO-16184

Signed-off-by: Bellekallu Rajkiran <bellekallu.rajkiran@intel.com>
2025-09-24 18:30:51 +02:00
Jaroslaw Warchulski
195bf66a49 refactor: fix typos
Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>
2025-09-22 12:46:41 +02:00
Szymon Morek
6b37243bfa performance: increase USM pool sizes
Related-To: NEO-16045

Also, unify pool config parameters and
reduce pool size for ULTs.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-09-16 12:10:25 +02:00
Lukasz Jobczyk
aa74555682 refactor: remove unused variable
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-09-11 12:01:32 +02:00
Filip Hazubski
40ad3cf6f1 feature: Query L3 bank group info from device blob
Related-To: HSD-14025718322, HSD-18041858866

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-08-19 14:46:19 +02:00
Jack Myers
05d8322a97 test: add helper method to ult csr
Related-To: NEO-13007
Signed-off-by: Jack Myers <jack.myers@intel.com>
2025-08-13 00:59:33 +02:00
Radoslaw Jablonski
c2d64cae6a refactor: queryDeviceCaps ioctl
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-08-08 14:43:23 +02:00
Dominik Dabek
77470acf7a fix: l0, tag update on mem copy ext host ptr
Require tag update on mem copy with external host ptr.
Without this, temporary allocation might not be cleaned before next copy
operation.
If a second copy operation is passed same ptr that has been reallocated,
there will be a pagefault.

Related-To: NEO-15663

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2025-08-08 12:42:53 +02:00
Compute-Runtime-Validation
b7472fa76f Revert "refactor: queryDeviceCaps ioctl"
This reverts commit 3117d2d2cb.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2025-08-08 07:05:24 +02:00
Radoslaw Jablonski
3117d2d2cb refactor: queryDeviceCaps ioctl
Signed-off-by: Radoslaw Jablonski <radoslaw.jablonski@intel.com>
2025-08-06 12:22:03 +02:00
Mateusz Jablonski
abb00a5ce3 fix: ensure thread-safety in zeDeviceSynchronize
get task count and flush stamp within single critical section

Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-31 15:59:57 +02:00
John Falkowski
5aed2bcdce refactor: Use min GPU/CPU address range for system allocator vmBind
Related-To:  NEO-15680

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2025-07-29 17:08:00 +02:00
Kindracki, Jakub Tomasz
a73d5724e3 feature: Return insufficient permissions when driver device cannot be loaded
Return ZE_RESULT_ERROR_INSUFFICIENT_PERMISSIONS when user doesn't have
permissions to access the GPU driver device file.

Related-To: NEO-8682
Signed-off-by: Kindracki, Jakub Tomasz <jakub.tomasz.kindracki@intel.com>
2025-07-28 16:31:07 +02:00
Mateusz Jablonski
05cf05e10e refactor: remove redundant tracking of media sampler programming
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-07-28 15:17:46 +02:00
Grochowski, Stanislaw
e652679b95 fix: correct pollForCompletion in aub on program termination
Related-To: NEO-14867

Signed-off-by: Grochowski, Stanislaw <stanislaw.grochowski@intel.com>
2025-07-17 12:59:00 +02:00
Tomasz Biernacik
2c5cbec033 feature: control post sync completion check
Related-To: NEO-14844

Signed-off-by: Tomasz Biernacik <tomasz.biernacik@intel.com>
2025-07-15 13:51:19 +02:00
Filip Hazubski
f8dc80c7fc test: Adjust direct submission tests
Disable direct submission at the end of the test for cases where
direct submission is enabled in test body.

Adjust mock csr to restore os context in cases where
context has been replaced in test body.

Related-To: NEO-15017

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-07-10 11:01:33 +02:00
Kamil Kopryk
bb52c358b9 refactor: flush Task method
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
Related-To: NEO-15011

Also correct ults
2025-06-11 16:36:56 +02:00
Chodor, Jaroslaw
46c60290b9 refactor: introducing igfxfmid_wrapper
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-05-30 19:26:04 +02:00
Szymon Morek
a2c41e4749 fix: add mock for staging buffer size
No need to allocate 2MB in unit tests.
This commit changes this size to 4KB.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2025-05-22 08:36:01 +02:00
Lukasz Jobczyk
1d1414febc refactor: remove unused dc flush mitigation
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2025-05-09 08:26:26 +02:00
Mateusz Jablonski
adc40934ae test: ensure thread safety when capturing input params for wait function
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-08 10:22:14 +02:00
Mateusz Jablonski
1b42ebf7fb feature: add L0 API to wait for completion of all submissions to given device
Related-To: NEO-14560
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-05-07 17:26:22 +02:00
Chodor, Jaroslaw
9e14eaaed1 feature: Adding support for scoped debug variables
This allows for limitting debug variables to specific
components.

Related-To: NEO-14473
Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2025-04-11 17:52:17 +02:00
Marcel Skierkowski
b75fbe8e2c refactor: mock filesystem in ocloc ult pt.1
Mocked IO operations in ./ocloc_tests application

Mocked gtest stdout capture in ocloc tests

Related-To: NEO-14084
Signed-off-by: Marcel Skierkowski <marcel.skierkowski@intel.com>
2025-04-08 16:08:42 +02:00
Brandon Yates
a48d66ad75 feature: Add programExceptions stub to CSR
Related-to: NEO-12967

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-04-01 18:33:40 +02:00
Maciej Plewka
36fa6d66ae fix: lock csr in stopDirectSubmission if needed
Related-To: NEO-13875, NEO-14143, HSD-16026538384, HSD-16026780358
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2025-03-18 09:30:38 +01:00
Maciej Bielski
a7c46b8213 feature: support L2 cache reservation
Related-To: NEO-12837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-03-17 19:41:55 +01:00
Zbigniew Zdanowicz
2b370f6a6f feature: handle passing separate epilogue immediate command buffer
Related-To: NEO-10356

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2025-03-06 14:52:34 +01:00
Chuansheng Liu
1876a43024 performance: add low latency hint for i915
i915 has provided the low latency hint interface, it helps
improve the performance for light workload a lot.

Related-To: NEO-14250

Signed-off-by: Chuansheng Liu <chuansheng.liu@intel.com>
2025-03-04 11:14:04 +01:00
Wojciech Konior
7f81d0e3e8 fix: vmbind of i915 canonization check
Related-To: NEO-7313

Signed-off-by: Wojciech Konior <wojciech.konior@intel.com>
2025-02-25 19:00:31 +01:00
Filip Hazubski
b60c02d597 fix: Add asserts to ensure NonCopyable and NonMovable n/n
Related-To: NEO-14068

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-19 11:36:24 +01:00
Mateusz Jablonski
87f4cd8e83 feature: expose separate L0 driver handle per product family in Core/Tools path
Related-To: NEO-14062
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-17 18:54:07 +01:00
Filip Hazubski
4c7900008f refactor: Change wording from NonCopyableOrMovable to NonCopyableAndNonMovable
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2025-02-17 14:19:10 +01:00
Mateusz Jablonski
28be16771d test: add test for creating separate platform per product family in OCL path
Related-To: NEO-14062
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2025-02-17 13:59:07 +01:00
Maciej Bielski
6924a48ca6 refactor: prepare CLOS logic for extension
Prepare cache setup and reservation logic to be extended w.r.t other
cache-levels.

Conceptually this change is like adding a switch-statement, in several
places, in which existing code makes a single (and only) case. This is
caused by splitting larger development to ease the review. Further cases
will be added in following steps. Such approach sometimes creates code
which may seem redundant but it is meant to simplify plugging following
extensions in an easy way.

Related-To: NEO-12837
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2025-02-17 10:43:08 +01:00
Mateusz Hoppe
6e35d055f2 feature: make contextGroupSize dependent on number of processes
Related-To: NEO-12952

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2025-02-03 18:13:33 +01:00
Michał Pryba
75bc74089b refactor: adjust file names after pre-gen12 removal 2/3
Related-To: NEO-12681
Signed-off-by: Michał Pryba <michal.pryba@intel.com>
2025-02-03 15:31:51 +01:00
Brandon Yates
635f69e54a fix: Configure scratch pages for debugger
DG2 requires scratch pages on for debugger. Other platforms do not.

Related-to: NEO-13883

Signed-off-by: Brandon Yates <brandon.yates@intel.com>
2025-01-31 06:49:49 +01:00