1392 Commits

Author SHA1 Message Date
Filip Hazubski 72cf384c7d refactor: Fix typo
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-10-01 09:31:02 +02:00
Compute-Runtime-Validation ef1b569a85 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit 6bf5183eff.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-10-01 08:14:39 +02:00
Lukasz Jobczyk 6bf5183eff performance: Do not create global fence allocation on integrated
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-30 13:13:27 +02:00
Compute-Runtime-Validation 6cb0e45330 Revert "performance: Do not create global fence allocation on integrated"
This reverts commit 50eb6af9ac.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-27 11:48:01 +02:00
Lukasz Jobczyk 50eb6af9ac performance: Do not create global fence allocation on integrated
Resolves: NEO-12324

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-27 09:32:42 +02:00
Lukasz Jobczyk c93998bcb9 performance: Do not program additional synchronization on integrated
Related-To: NEO-12324

Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2024-09-26 10:54:31 +02:00
Mateusz Hoppe 8000133b2a refactor: add method to adjust regular context count
Related-To: NEO-12258

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-25 13:46:46 +02:00
Maciej Plewka 80f75ceace fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-09-23 14:43:59 +02:00
Bartosz Dunajski d7ce841081 fix: InOrderExecInfo upload to tbx
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-09-20 16:03:09 +02:00
Mateusz Hoppe 4a068c8eab fix: correctly program StateBaseAddress in global bindless mode
- prepare bindful ssh when kernel requires ssh heap and
SurfaceStateBaseAddress
- remove lastAppendedKernelBindlesMode - local ssh heap may be needed
for bindless kernels with scratch or misaligned buffer args
- use ssh heap gpu address to program SurfaceStateBaseAddress, global base is
used for BindlessSurfaceState and DynamicState

Related-To: NEO-7063

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-20 11:57:05 +02:00
Filip Hazubski ebc19b4a70 feature: Add logic to disable bindless addressing via AIL
Add mockable Device functions to get ReleaseHelper and AILConfiguration.

Resolves: NEO-12699

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-18 13:49:51 +02:00
Jitendra Sharma 25f2504bf5 fix: set eudebug extension property when debugging enabled
In order to debug on XE, XE_EXEC_QUEUE_SET_PROPERTY_EUDEBUG
needs to be set up.

Related-To: NEO-12691
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-09-18 08:32:03 +02:00
Zbigniew Zdanowicz 6e0aa1781c refactor: modify kernel helper method to pass kernel data directly
Related-To: NEO-12639

Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-16 16:25:13 +02:00
Mateusz Jablonski 78604bd475 refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-16 12:12:43 +02:00
Dominik Dabek 3bd2befe74 refactor: indirect detection helpers, VC
Allow for different required version for VC compiled kernels.
Define constant for detection disabled.

Related-To: NEO-12491

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-09-13 19:42:57 +02:00
Dominik Dabek 571d703135 refactor: indirect detection helpers
Check indirect detection version from igc header for JIT.
Move required version to its own method.

This allows for different required versions per platform.

Related-To: NEO-12491

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-09-11 14:49:54 +02:00
Zbigniew Zdanowicz 7ce4a8adc2 performance: replace virtual calls with native class methods
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2024-09-11 11:10:40 +02:00
Fabian Zwoliński ea5b586c37 fix: move pathExists out of sys calls and do not mock it
Moved pathExists from SysCalls to path.h.
In ULTs, use unchanged pathExists and mock stat, getFileAttributesA instead.
Add Windows and Linux ULTs for pathExists.

Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-10 19:24:45 +02:00
Mateusz Hoppe a6bf424417 refactor: determine hp copy engine once after gfx core helper is created
- do not determine engine on every call to getDefaultHpCopyEngine()

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-09-10 10:00:48 +02:00
Wenbin Lu 7ac991fa3f fix: add workaround for incorrect DRM system info
Related-To: NEO-9489

Signed-off-by: Wenbin Lu <wenbin.lu@intel.com>
2024-09-10 08:48:01 +02:00
Filip Hazubski 7d16521c7b performance: Correct alignment checks
Only use checks in debug builds.

Resolves: HSD-18039597713

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-09-10 08:43:36 +02:00
Mateusz Jablonski 54bda0e986 fix: In Linux CL/GL sharing
- always issue flush request before export

Apparently it's expected to flush the object (which might convert it
from one format to another for export, or remove aux buffer uses or
anything not supported by export).

- use modifier to select tiling mode

Previously we just assumed that whatever tiling mode was picked by mesa
will match the one picked by GMMLIB but that's not always the case
and in particular on Arc and Xe it doesn't work ... Mesa picks Tile4
and GMMLIB picks Tile64 ...

Fixes: #761
Fixes: #736

Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-09-06 21:40:18 +02:00
Fabian Zwoliński 38e1614f4a fix: create and use new allocation type for syncBuffer
Related-To: NEO-11533
Related-To: HSD-18039788811
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-09-06 14:11:33 +02:00
Kamil Kopryk 95b035a071 fix: global stateless heap creation in heapless path
Related-To: NEO-10681
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-09-05 16:56:00 +02:00
Andrzej Koska b0e7a11e9a refactor: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-09-05 16:11:52 +02:00
Compute-Runtime-Validation d842f65cf1 Revert "fix: submit dummy exec to pin memory during zeContextMakeMemoryReside...
This reverts commit f9b87d53e6.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-09-05 03:28:03 +02:00
Maciej Plewka f9b87d53e6 fix: submit dummy exec to pin memory during zeContextMakeMemoryResident call
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>

Related-To: NEO-11879
2024-09-04 14:07:29 +02:00
Mateusz Hoppe 482e077252 feature: add regular and hp contexts in group without dedicated hp engine
- if no hp copy engine available, create group with regular and hp
contexts

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-28 14:01:10 +02:00
Mateusz Hoppe 3f6fc1d4d2 feature: switch to new aubstream createHardwareContext2 interface
Related-To: NEO-12257

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-08-26 14:10:53 +02:00
Damian Tomczak df9fa343d1 feature: limit max workgroup size to 1k items
Resolves: NEO-11881

Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>
2024-08-26 09:20:51 +02:00
Mateusz Jablonski f617093a6a fix: add missing nullptr check before accessing ail helper
Fixes: #755
Fixes: #754
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-20 16:58:53 +02:00
Compute-Runtime-Validation 9b652f4a34 Revert "feature: Improving information transfer about the copy engine"
This reverts commit 17ffdff4f1.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-15 22:06:31 +02:00
Andrzej Koska 17ffdff4f1 feature: Improving information transfer about the copy engine
Related-To: NEO-11934

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-08-14 11:28:29 +02:00
Ben Ashbaugh 32f80e1131 feature: add support for the cl_khr_extended_bit_ops extension
Related-To: NEO-10337

Signed-off-by: Ben Ashbaugh <ben.ashbaugh@intel.com>
2024-08-12 17:44:31 +02:00
Filip Hazubski 54b5adb767 fix: Add checks to align functions
Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-08-12 10:08:15 +02:00
Maciej Bielski a4060013de refactor: move CLOS-related steps from core- to product-helper
Future HW will not support cache reservation uniquely for the whole
platform. Implementation of some functions may vary between products.

Related-To: NEO-10158
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com>
2024-08-12 09:27:04 +02:00
Szymon Morek d4c1631ac7 performance: don't wait for paging fence on user thread
Related-To: NEO-12197

Currently, for new resources, the user thread must wait before submitting
the actual workload. With this commit, instead of waiting on the user thread,
a request is sent to the background ULLS controller thread and an additional
semaphore is programmed. The ULLS controller performs the actual wait
and signals the semaphore when the paging fence reaches the required value.

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-08-07 08:30:51 +02:00
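
The hand-off described in commit d4c1631ac7 above can be illustrated with a minimal sketch. This is not the driver's implementation: `PagingFence`, `Controller`, and `request_wait` are hypothetical names, a `threading.Event` stands in for the programmed semaphore, and a host-side counter stands in for the paging fence.

```python
import queue
import threading


class PagingFence:
    """Monotonic counter standing in for the paging fence (hypothetical)."""

    def __init__(self):
        self._value = 0
        self._cond = threading.Condition()

    def advance(self, value):
        # Move the fence forward and wake anyone waiting on it.
        with self._cond:
            self._value = max(self._value, value)
            self._cond.notify_all()

    def wait_for(self, value):
        # Block until the fence has reached at least `value`.
        with self._cond:
            self._cond.wait_for(lambda: self._value >= value)


class Controller:
    """Background thread that waits on the fence on behalf of user threads."""

    def __init__(self, fence):
        self._fence = fence
        self._requests = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def request_wait(self, required_value):
        # Returns immediately: the caller does not block on the fence.
        # The returned Event plays the role of the additional semaphore.
        semaphore = threading.Event()
        self._requests.put((required_value, semaphore))
        return semaphore

    def _run(self):
        while True:
            required_value, semaphore = self._requests.get()
            self._fence.wait_for(required_value)  # the actual wait
            semaphore.set()                       # signal the semaphore
```

In this sketch the user thread calls `request_wait(value)` and continues submitting; whatever consumes the workload waits on the returned event instead.
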
Bartosz Dunajski 24cfd203ab fix: don't download tbx allocations on heapless first device submission
Related-To: HSD-18039476929

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-06 14:03:42 +02:00
Michal Mrozek d52ca080bd Revert "performance: improve pool handling"
This reverts commit a3c3b6533a.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-08-06 13:04:02 +02:00
Fabian Zwoliński 674c4a15ad fix: use correct gpu address when bindless heaps helper is enabled
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-08-05 15:09:57 +02:00
Compute-Runtime-Validation e27efd701f Revert "fix: correct calculating max subslice space"
This reverts commit 67f2500c03.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-02 12:28:13 +02:00
Mateusz Jablonski 67f2500c03 fix: correct calculating max subslice space
computeMaxNeededSubSliceSpace is no longer needed as getHighestEnabledSubSlice
already determines maximum index from all enabled subslices

Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-08-01 16:38:24 +02:00
Bartosz Dunajski 5ccfd6f2be fix: add missing AlignLocalMemoryVaTo2MB flag support
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-08-01 13:54:51 +02:00
Compute-Runtime-Validation 2d1b263e9a Revert "refactor: remove redundant function computeMaxNeededSubSliceSpace"
This reverts commit c0b96dcd6e.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-08-01 03:54:56 +02:00
Mateusz Jablonski c0b96dcd6e refactor: remove redundant function computeMaxNeededSubSliceSpace
use GfxCoreHelper::getHighestEnabledDualSubSlice instead

Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-31 14:50:13 +02:00
Mateusz Jablonski aa32f7420b fix: correct getting max enabled dual subslice
when no dss info provided, assume max dss per slice are enabled

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-31 11:15:23 +02:00
Mateusz Jablonski 838894e074 fix: correct getting max enabled subslice
when no subslice info provided, assume max subslices per slice are enabled

Related-To: NEO-12073

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-31 10:42:57 +02:00
Mateusz Jablonski 1fa7e14d0b fix: query engine info prior to query topology
don't setup l3 bank count in default gt sys info

Related-To: NEO-12073
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-25 16:17:44 +02:00
Maciej Plewka afee8814ef refactor: get ioh alignment from static function
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-07-24 14:43:31 +02:00
Bartosz Dunajski c3312f21f7 feature: don't initialize in-order TS nodes
Related-To: NEO-11925

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-23 13:31:23 +02:00
Bartosz Dunajski 3758e99cbf feature: initial support for standalone CB Events Timestamps allocator
Related-To: NEO-11925

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-07-22 10:13:55 +02:00
Mateusz Hoppe b6299b8a21 feature: add support for HP copy engine context
- add support for context group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-19 12:23:03 +02:00
Compute-Runtime-Validation 7ad15639fc Revert "feature: add support for HP copy engine context"
This reverts commit 3fbcbcaef2.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-07-18 21:02:14 +02:00
Mateusz Hoppe 3fbcbcaef2 feature: add support for HP copy engine context
- add support for context group with HP copy engine
- choose HP copy engine when available

Related-To: NEO-11983

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-07-18 16:07:07 +02:00
Filip Hazubski 46f9133bf2 fix: Correct logic to select internal BCS engine
When BCS3 is not available, use last available copy engine as internal.

Related-To: HSD-18039263936

Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>
2024-07-18 12:02:50 +02:00
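
The fallback rule stated in commit 46f9133bf2 above can be sketched as follows. The function name, the engine labels, the assumption that the list is ordered by engine index, and the `None` result for an empty list are all illustrative choices, not taken from the driver.

```python
def select_internal_copy_engine(available_engines, preferred="BCS3"):
    """Pick the copy engine to use for internal operations.

    `available_engines` is assumed to be ordered by engine index.
    """
    if not available_engines:
        return None  # assumption: no copy engine means no internal engine
    if preferred in available_engines:
        return preferred
    # Preferred engine is missing: fall back to the last available one.
    return available_engines[-1]
```

For example, with only `["BCS", "BCS1"]` available, the sketch selects `"BCS1"`.
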
Mateusz Jablonski 778645c11e feature: add initial support for Xe2 platforms
Related-To: NEO-8188, NEO-10774
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-09 10:30:27 +02:00
Mateusz Jablonski f171a22c0a refactor: include reg_configs_common using full path
add missing register offset value

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-07-03 13:30:46 +02:00
Szymon Morek 3dd051c3ee performance: adjust compression handling
Related-To: NEO-11882

Signed-off-by: Szymon Morek <szymon.morek@intel.com>
2024-07-03 09:37:11 +02:00
Morek, Szymon aed96cede4 fix: flush barrier when profiling enabled
Related-To: NEO-10615

Signed-off-by: Morek, Szymon <szymon.morek@intel.com>
2024-07-02 13:25:07 +02:00
Dominik Dabek 76e8be5c39 fix(ocl): handle gl sharing displayable textures
Displayable textures always need dc flush.

Related-To: NEO-11694

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-06-27 13:43:20 +02:00
Mateusz Jablonski 85289f6658 refactor: extract common code for populating ftr and wa table
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-27 13:00:15 +02:00
Mateusz Jablonski d51997be4f fix: setup L3 cache config when available from release helper
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-26 16:55:51 +02:00
Mateusz Jablonski 66d4d141e1 fix: extract common logic for filling default gt system info
gt system info should be queried from KMD

Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-25 20:21:57 +02:00
Michal Mrozek a3c3b6533a performance: improve pool handling
Related-To: NEO-11731
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-06-25 17:04:17 +02:00
Compute-Runtime-Validation 7136dfbd38 Revert "performance: improve pool handling"
This reverts commit 5f0b9efd2b.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-06-24 12:47:19 +02:00
Kamil Kopryk d97dccc409 refactor: rename MaximumNumberOfThreads debug key
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-19 13:06:42 +02:00
Ratajewski, Andrzej f7888fac0d feature: Enable support for cl_intel_subgroup_matrix_multiply_accumulate_tf32
This change enables support for
`cl_intel_subgroup_matrix_multiply_accumulate_tf32` extension for PVC B0
and later.

Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-06-19 11:43:03 +02:00
Kamil Kopryk f05ff9b156 refactor: rename OverDispatchControl debug key
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-18 18:59:43 +02:00
Kamil Kopryk c40f0f2421 refactor: rename ComputeOverdispatchDisable debug key
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com>
2024-06-17 13:18:29 +02:00
Pawel Szymichowski 521add6ca7 feature: Enable support for cl_intel_subgroup_buffer_prefetch
This change enables support for cl_intel_subgroup_buffer_prefetch extension for
PVC and later.

Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-06-14 20:44:52 +02:00
Mrozek, Michal 5f0b9efd2b performance: improve pool handling
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com>
Resolves: NEO-11731
2024-06-14 12:02:34 +02:00
Bartosz Dunajski da9c009b88 feature: assign unique interrupt to queue
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-06-07 10:06:31 +02:00
Mateusz Jablonski 01c36a38c6 feature: add support for generic id acronyms in ocloc
Related-To: NEO-11568
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-06-06 10:06:44 +02:00
Mateusz Jablonski d7c7cb203c fix: add missing support for acronyms without dashes
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-29 17:07:06 +02:00
Aleksander Czerwionka 683eb97d9e refactor: [ocloc] remove dead code in gen12lp backward compatibility
Related-To: NEO-10148

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-29 16:26:31 +02:00
Chodor, Jaroslaw ad155da67a feature: Enabling platforms in ocloc by default
This feature enables supported platforms in ocloc even
if they are not enabled for the driver.
It allows sharing a single ocloc instance across multiple driver-platform
configurations.

Related-To: NEO-10531

Signed-off-by: Chodor, Jaroslaw <jaroslaw.chodor@intel.com>
2024-05-28 12:09:47 +02:00
Compute-Runtime-Validation 9d312995e2 Revert "feature: split cache_policy_dg2_and_later.inl file"
This reverts commit 5df982a85d.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-28 07:03:47 +02:00
Aleksander Czerwionka 251463d77a feature: [Ocloc] provide backward compatibility for gen12lp family merge
Related-To: NEO-10148

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-27 17:35:31 +02:00
Jitendra Sharma 5df982a85d feature: split cache_policy_dg2_and_later.inl file
Move the getCachingPolicyOptions method from the existing
cache_policy_dg2_and_later.inl into a new
get_caching_policy_options.inl file.
This allows reusing the getCachingPolicyOptions
method in any newly created cache_policy_* file.

Related-To: NEO-8306

Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com>
2024-05-27 12:34:14 +02:00
Aravind Gopalakrishnan b39e99a17a feature: Prepare hw info to report CXL type
Related-To: NEO-11362

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-24 07:09:42 +02:00
Fabian Zwoliński 78bd3da078 fix: ocloc - add bindless mode options if bindless is enabled
If release helper is not present,
do not add bindless mode options.

Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-23 14:10:28 +02:00
Mateusz Jablonski 2577f1123e refactor: remove not needed code
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2024-05-22 14:01:21 +02:00
Fabian Zwoliński 59829aeb11 fix(ocl): Force rebuild if binary contains bindless kernel
Related-To: NEO-7063
Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>
2024-05-20 14:15:17 +02:00
Maciej Plewka 90df4b298b fix: cache flush dependency for queue blocked
Related-to: NEO-9872, HSD-18038461954
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-20 11:55:24 +02:00
Young Jin Yoon e204d27190 fix: print to stdout for disable scratch page
Modified to print out error messages to stdout when disable scratch page
is used.

Related-To: GSD-7611
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2024-05-16 15:05:07 +02:00
Aravind Gopalakrishnan 12ab4e5446 feature: Add 3 channel 8/16/32 bit formats
Related-To: NEO-11292

Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>
2024-05-16 13:28:04 +02:00
Dominik Dabek b4d839fe29 performance(usm): l0, add usm host memory pooling
Disabled by default.

Related-To: NEO-11356

Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2024-05-15 15:20:51 +02:00
Ratajewski, Andrzej 0b0f7bd4cd feature: Enable support for cl_intel_subgroup_2d_block_io
This change enables support for cl_intel_subgroup_2d_block_io extension for
PVC and later.

Related-To: GSD-7825
Signed-off-by: Ratajewski, Andrzej <andrzej.ratajewski@intel.com>
2024-05-14 15:16:49 +02:00
Maciej Plewka e39893485c fix: add cache flush as dependency for bcs ccs synchronization
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-05-09 13:43:39 +02:00
Mateusz Hoppe 76af952fcb feature: Bindless sampled image support
Related-To: NEO-10352

Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2024-05-08 17:50:37 +02:00
Aleksander Czerwionka 4667f55ea2 fix: add missing checks to return proper OpenCL error from API
changes affect clEnqueueNDCountKernelINTEL and
clGetKernelMaxConcurrentWorkGroupCountINTEL

Related-To: NEO-9688

Signed-off-by: Aleksander Czerwionka <aleksander.czerwionka@intel.com>
2024-05-07 20:32:36 +02:00
Bartosz Dunajski e5882e0d31 feature: pass GraphicsAllocation to fence wait
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-07 17:59:28 +02:00
Andrzej Koska e44c7f2627 fix: BCS MMIO programming
When remap is enabled, we must set different base offset for copy engines.
Copy engines must use BCS0 base.

Related-To: NEO-10678

Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>
2024-05-07 14:08:28 +02:00
Katarzyna Cencelewska e9f7df6ae6 refactor: create helper for maxPtssIndex
Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>
2024-05-07 12:47:52 +02:00
Bartosz Dunajski ce36812f8d feature: obtain GraphicsAllocation from external CB event
Related-To: NEO-8179

Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-05-06 22:52:36 +02:00
Compute-Runtime-Validation 8342c0ae2f Revert "fix: add cache flush as dependency for bcs ccs synchronization"
This reverts commit 5e57bb2a32.

Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2024-05-01 03:05:47 +02:00
Maciej Plewka 5e57bb2a32 fix: add cache flush as dependency for bcs ccs synchronization
Related-to: NEO-9872
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>
2024-04-30 11:57:49 +02:00
Bartosz Dunajski 2a2596c13b refactor: pass additional data to ioctl helper
Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>
2024-04-26 14:53:14 +02:00
John Falkowski f1c2ea0b69 fix: kernel access to multiple stateful virtual regions
Related-to: NEO-8350

Signed-off-by: John Falkowski <john.falkowski@intel.com>
2024-04-26 11:18:24 +02:00
Michal Mrozek 4aa7c6c99e performance: optimize counter based waiting schemes
- store latest waited counter value.
- do not wait on values that are already completed.
- disable mechanism when counter overflows.

Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2024-04-25 17:20:29 +02:00
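
The three bullet points of commit 4aa7c6c99e above can be read as the following minimal sketch. `WaitTracker`, `wait_fn`, and `notify_counter_overflow` are hypothetical names; the sketch also assumes counter values start at 1 and that the caller reports when the counter overflows, neither of which is stated in the commit message.

```python
class WaitTracker:
    """Skips waits on counter values that are already known to be complete."""

    def __init__(self, wait_fn):
        self._wait_fn = wait_fn      # performs the real (expensive) wait
        self._latest_waited = 0      # latest counter value already waited on
        self._enabled = True

    def notify_counter_overflow(self):
        # After a wrap the stored value can no longer be compared safely,
        # so the optimization is switched off.
        self._enabled = False

    def wait(self, value):
        """Wait for `value`; return False if the wait was skipped."""
        if self._enabled and value <= self._latest_waited:
            return False             # already completed, nothing to do
        self._wait_fn(value)
        if self._enabled:
            self._latest_waited = max(self._latest_waited, value)
        return True
```

Once `wait(5)` has returned, later calls with values up to 5 return immediately without invoking `wait_fn`, until an overflow is reported.
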