Dunajski, Bartosz
c3ca3ff119
refactor: use qword memcpy for Events with qword packet size
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-20 07:29:45 +02:00
Neil R Spruit
8efd8853fb
feature: L0 Device Hierarchy support for Composite & Flat modes
...
Related-To: LOCI-4615
- Added support for users to set ZE_FLAT_DEVICE_HIERARCHY to either FLAT
or COMPOSITE to change how devices are returned by zeDeviceGet and
clGetDeviceIDs.
- COMPOSITE is the default and matches the behavior that exists today.
- FLAT makes zeDeviceGet return all sub devices that have no sub devices
and all root devices that have no sub devices, i.e. with all devices
flattened out in order.
- Added zeDeviceGetRootDevice to retrieve the Root Device for
any SubDevice.
Signed-off-by: Neil R Spruit <neil.r.spruit@intel.com>
2023-07-20 01:00:54 +02:00
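The FLAT versus COMPOSITE enumeration described in commit 8efd8853fb can be pictured with a small stand-alone model. This is only a sketch under stated assumptions: `Device`, `Hierarchy`, `linkRoots`, `enumerateDevices` and `rootOf` are hypothetical names invented for illustration, not driver or Level Zero types, and returning nullptr for a root device is a modelling choice rather than documented behavior.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Hypothetical model of a device tree; not the driver's real types.
struct Device {
    explicit Device(std::string n = {}) : name(std::move(n)) {}
    std::string name;
    const Device *root = nullptr;    // parent root device, nullptr for a root
    std::vector<Device> subDevices;  // empty when the device has no sub devices
};

enum class Hierarchy { Composite, Flat };

// Call once the topology is final so the stored pointers stay valid.
void linkRoots(std::vector<Device> &roots) {
    for (auto &root : roots) {
        for (auto &sub : root.subDevices) {
            sub.root = &root;
        }
    }
}

// Composite: only root devices are listed.
// Flat: roots without sub devices and all leaf sub devices, in order.
std::vector<const Device *> enumerateDevices(const std::vector<Device> &roots, Hierarchy mode) {
    std::vector<const Device *> out;
    for (const auto &root : roots) {
        if (mode == Hierarchy::Composite || root.subDevices.empty()) {
            out.push_back(&root);
        } else {
            for (const auto &sub : root.subDevices) {
                out.push_back(&sub);
            }
        }
    }
    return out;
}

// Assumption of this sketch: a root device reports nullptr as its root.
const Device *rootOf(const Device &device) { return device.root; }
```

With two roots, one of which has two sub devices, the Composite listing in this model has two entries and the Flat listing has three, and `rootOf` maps each listed sub device back to its parent.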
Zbigniew Zdanowicz
0374a66d97
performance: enable command list primary batch buffer on gen platforms
...
Related-To: NEO-7807
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-07-19 14:25:18 +02:00
Dominik Dabek
622a3ed89c
performance(ocl): flag to not dcFlush on no event
...
If no outEvent is passed to waitForBarrier, do the
dcFlush on the next synchronize call instead.
Related-To: NEO-8147
Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>
2023-07-18 15:38:54 +02:00
Lukasz Jobczyk
2407658376
performance: Flush constant cache only if any ULLS enabled
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-07-17 16:13:55 +02:00
Dunajski, Bartosz
ec9afc5450
feature: ZE_COMMAND_QUEUE_FLAG_IN_ORDER flag support
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-17 12:02:56 +02:00
Zbigniew Zdanowicz
ceed88d39e
performance: add debug support to immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-07-14 16:10:50 +02:00
Luzynski, Sebastian Jozef
c141fbaf74
fix: Adjust zeVirtualMem functions to store size provided by user
...
Related-To: LOCI-3397
Signed-off-by: Luzynski, Sebastian Jozef <sebastian.jozef.luzynski@intel.com>
2023-07-13 14:17:21 +02:00
Mateusz Hoppe
9fd7f9cf05
fix: set ImplicitArgs size to size of defined fields
...
Resolves: NEO-8169
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-12 21:30:32 +02:00
Kacper Nowak
04291be379
refactor: Enhance L0 builtins ULTs
...
- Removed redundant code
- Corrected naming
- Improved code coverage
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com>
2023-07-12 16:52:19 +02:00
Lukasz Jobczyk
83bd33befc
refactor: Add flag to control BCS split for pageable memory
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-07-11 15:12:40 +02:00
Dunajski, Bartosz
37fdb5998e
feature: appendMemoryFill support for in-order cmd lists
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-11 13:05:27 +02:00
Dunajski, Bartosz
241d9d447a
feature: optimize programming wait events from the same in-order cmdlist
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-11 12:59:34 +02:00
Zbigniew Zdanowicz
3f7269d401
fix: make sip state programming once for all level zero command queues
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-07-11 11:34:21 +02:00
Cencelewska, Katarzyna
aa0beb8191
fix: Unify logic calculating threads per work group part 4
...
- also use the helper when checking for simd1, to keep the same flow
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-07-07 15:34:59 +02:00
Mateusz Hoppe
4aba0f0340
feature: global bindless surface state base support
...
- program global bindless ssba when external allocator used
(UseExternalAllocatorForSshAndDsh)
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-07-06 18:31:49 +02:00
Spruit, Neil R
11c96559a9
fix: Use Symbols map to report exported symbols vs external functions
...
Related-To: LOCI-4578
- Report all symbols in the Symbols Map for a Module as the Exported
symbols instead of using the External Functions Program Info.
- Resolves the issue of reporting symbols for platforms that don't have
ZEBIN binaries by default.
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
2023-07-06 17:41:45 +02:00
Dunajski, Bartosz
9adfa4b355
feature: improve in-order Events chaining
...
- Clear TS event before chaining
- Don't chain non-TS events
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-05 14:21:36 +02:00
Mateusz Jablonski
66c733eac7
fix: add missing zero-initialization of variable
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-07-05 10:39:48 +02:00
Zbigniew Zdanowicz
866e3c37ba
performance: add implementation of cmdlist immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-07-05 10:23:16 +02:00
Dunajski, Bartosz
c18198ebd8
feature: change in-order allocation type to uint32_t
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-05 09:56:32 +02:00
Dunajski, Bartosz
5b9d45f3bc
feature: handle in-order allocation overflow
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-07-04 15:58:11 +02:00
Cencelewska, Katarzyna
61f701aba5
fix: Unify logic calculating threads per work group part 3
...
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna
2e17c21728
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have the same flow and proper values of threads per work group
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation
39740da9d1
Revert "fix: Unify logic calculating threads per work group part 2"
...
This reverts commit 1e8a53bd53.
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>
2023-07-02 07:09:14 +02:00
Dunajski, Bartosz
f9a58b5544
feature: program in-order barrier as non stalling instruction
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-30 17:17:53 +02:00
Young Jin Yoon
c5d675570a
feature: support for zeDriverGetLastErrorDescription
...
Added setErrorDescription() and getErrorDescription() in DriverHandle
to record and retrieve the custom string for errors.
Related-To: LOCI-4619
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com>
2023-06-30 17:12:32 +02:00
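The record-and-retrieve pattern named in commit c5d675570a might look roughly like the sketch below. It is not the driver's DriverHandle: the class name `DriverHandleSketch`, the mutex, and the by-value return type are assumptions made for illustration; only the method names setErrorDescription() and getErrorDescription() come from the commit message.

```cpp
#include <cassert>
#include <mutex>
#include <string>

// Hypothetical stand-in for a driver handle that remembers the last
// custom error string. Storage and locking strategy are assumptions.
class DriverHandleSketch {
  public:
    // Record a custom description for the most recent error.
    void setErrorDescription(const std::string &description) {
        std::lock_guard<std::mutex> lock(mutex);
        lastErrorDescription = description;
    }

    // Retrieve the last recorded description (empty if none was set).
    std::string getErrorDescription() const {
        std::lock_guard<std::mutex> lock(mutex);
        return lastErrorDescription;
    }

  private:
    mutable std::mutex mutex;
    std::string lastErrorDescription;
};
```

In this model a later call to setErrorDescription() simply overwrites the earlier string, so a caller only ever sees the most recent description.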
Cencelewska, Katarzyna
1e8a53bd53
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have the same flow and proper values of threads per work group
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-30 14:16:08 +02:00
Michal Mrozek
5eadedc36e
refactor: Remove not used logic.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com>
2023-06-30 10:58:35 +02:00
Zbigniew Zdanowicz
21823af419
performance: add skeleton method to cmdlist immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-06-30 10:46:20 +02:00
Dunajski, Bartosz
9a8c38fa94
feature: unregister csr client on cmd list host sync
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-29 13:52:08 +02:00
Artur Harasimiuk
cf73ab0df3
refactor: remove not used code
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>
2023-06-29 13:04:23 +02:00
Cencelewska, Katarzyna
0d7aefe66b
fix: Unify logic calculating threads per work group part 1
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-29 10:43:22 +02:00
Jaroslaw Chodor
023fe38448
fix: Use correct dimensions for UV plane
...
For an image view mapped directly to the UV plane,
the dimensions should be 2 times smaller than the
dimensions of the source image
(1 raw UV pair maps to a 2x2 block of the original image).
Related-To: NEO-7936
Signed-off-by: Jaroslaw Chodor <jaroslaw.chodor@intel.com>
2023-06-28 23:34:50 +02:00
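The sizing rule from commit 023fe38448 (one UV pair covers a 2x2 block of the source image) works out to halving each dimension. The snippet below is a minimal arithmetic sketch, assuming even source dimensions; `ImageExtent` and `uvPlaneExtent` are hypothetical names and do not appear in the driver.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical extent type used only for this illustration.
struct ImageExtent {
    std::size_t width;
    std::size_t height;
};

// One UV pair covers a 2x2 block of source pixels, so a view mapped
// directly onto the UV plane has half the width and half the height.
// Assumes the source dimensions are even.
constexpr ImageExtent uvPlaneExtent(ImageExtent source) {
    return ImageExtent{source.width / 2, source.height / 2};
}
```

For example, a 1920x1080 source image gives a 960x540 UV-plane view under this rule.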
Dunajski, Bartosz
ecb415bf62
feature: reenable RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-28 12:20:17 +02:00
Dunajski, Bartosz
2383896dbb
fix: stall RelaxedOrdering scheduler when programming Semaphores
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-28 11:46:04 +02:00
Zbigniew Zdanowicz
1067167637
test: adding testing debug flag overriding driver version
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-06-27 17:58:03 +02:00
Dunajski, Bartosz
7ac825e74b
refactor: add debug flag to synchronize Event before reset
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-26 17:38:37 +02:00
Cencelewska, Katarzyna
68d81c82a7
fix: Use proper value about hw local id generations
...
- remove the unused flag ForceNumberOfThreadsInGpgpuThreadGroup
- add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup"
to restore the old path without restrictions on the number of threads in
a thread group
- fix forwarding of information about hw local id generation to
calculate numOfThreadsInThreadGroup correctly
Related-To: NEO-7952, NEO-7982
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com>
2023-06-26 16:35:42 +02:00
Dunajski, Bartosz
aea5f435db
feature: unregister CSR client on Event host synchronize
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-26 12:02:14 +02:00
Dunajski, Bartosz
61fb19caab
feature: bring back counter based in-order tracking
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-26 10:01:18 +02:00
Lukasz Jobczyk
bc0a3a7eb5
fix: Consider slm size in suggest work group cache
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-06-26 09:12:54 +02:00
Joshua Santosh Ranjan
97b4d8bab5
feature: add initial support for host mapped timestamps
...
Related-To: LOCI-4171
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com>
2023-06-26 08:29:58 +02:00
Lukasz Jobczyk
0bc5eead84
fix: Remove not needed BCS split helper
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-06-22 14:36:22 +02:00
Zbigniew Zdanowicz
ddffb8a67f
fix: add missing unrecoverable macro
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>
2023-06-22 10:47:18 +02:00
Dunajski, Bartosz
eda13e675a
feature: enable dynamic post sync allocation layout
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-22 08:37:40 +02:00
Lukasz Jobczyk
a61b39c47b
fix: Invalidate constant cache when ULLS enabled
...
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>
2023-06-21 15:17:14 +02:00
Mateusz Hoppe
111b112729
feature: add assertBufferPtr to ImplicitArgs
...
Related-To: NEO-5753, NEO-8078
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>
2023-06-20 20:43:57 +02:00
Mateusz Jablonski
1be48965ef
fix: parse all extensions passed to zeMemAllocShared
...
reuse common pNext parser
allow to create raytracing allocation with relaxed size
Related-To: NEO-8075
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>
2023-06-20 14:32:16 +02:00
Dunajski, Bartosz
46e8c3f5dd
fix: reenable RelaxedOrdering for OCL
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com>
2023-06-20 13:05:25 +02:00