Lukasz Jobczyk
83bd33befc
refactor: Add flag to control BCS split for pageable memory
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-07-11 15:12:40 +02:00
Dunajski, Bartosz
37fdb5998e
feature: appendMemoryFill support for in-order cmd lists
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-11 13:05:27 +02:00
Dunajski, Bartosz
241d9d447a
feature: optimize programming wait events from the same in-order cmdlist
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-11 12:59:34 +02:00
Zbigniew Zdanowicz
3f7269d401
fix: make sip state programing once for all level zero command queues
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-11 11:34:21 +02:00
Cencelewska, Katarzyna
aa0beb8191
fix: Unify logic calculating threads per work group part 4
...
- also use helper when checking that is simd1 to have same flow
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-07 15:34:59 +02:00
Mateusz Hoppe
4aba0f0340
feature: global bindless surface state base support
...
- program global bindless ssba when external allocator used (
UseExternalAllocatorForSshAndDsh)
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-07-06 18:31:49 +02:00
Spruit, Neil R
11c96559a9
fix: Use Symbols map to report exported symbols vs external functions
...
Related-To: LOCI-4578
- Report all symbols in the Symbols Map for a Module as the Exported
symbols instead of using the External Functions Program Info.
- Resolves the issue of reporting symbols for platforms that don't have
ZEBIN binaries by default.
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com >
2023-07-06 17:41:45 +02:00
Dunajski, Bartosz
9adfa4b355
feature: improve in-order Events chaining
...
- Clear TS event before chaining
- Dont chain non-TS events
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-05 14:21:36 +02:00
Mateusz Jablonski
66c733eac7
fix: add missing zero-initialization of variable
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-07-05 10:39:48 +02:00
Zbigniew Zdanowicz
866e3c37ba
performance: add implementation of cmdlist immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-07-05 10:23:16 +02:00
Dunajski, Bartosz
c18198ebd8
feature: change in-order allocation type to uint32_t
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-05 09:56:32 +02:00
Dunajski, Bartosz
5b9d45f3bc
feature: handle in-order allocation overflow
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-07-04 15:58:11 +02:00
Cencelewska, Katarzyna
61f701aba5
fix: Unify logic calculating threads per work group part 3
...
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 15:27:44 +02:00
Cencelewska, Katarzyna
2e17c21728
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-07-04 10:34:02 +02:00
Compute-Runtime-Validation
39740da9d1
Revert "fix: Unify logic calculating threads per work group part 2"
...
This reverts commit 1e8a53bd53 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-07-02 07:09:14 +02:00
Dunajski, Bartosz
f9a58b5544
feature: program in-order barrier as non stalling instruction
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-30 17:17:53 +02:00
Young Jin Yoon
c5d675570a
feature: support for zeDriverGetLastErrorDescription
...
Added setErrorDescription() and getErrorDescription() in DriverHandle
to record and retrieve the custom string for errors.
Related-To: LOCI-4619
Signed-off-by: Young Jin Yoon <young.jin.yoon@intel.com >
2023-06-30 17:12:32 +02:00
Cencelewska, Katarzyna
1e8a53bd53
fix: Unify logic calculating threads per work group part 2
...
- use calculateNumThreadsPerThreadGroup instead of getThreadsPerWG to
have same flow and proper values of threads per work groups
Related-To: NEO-8087
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-30 14:16:08 +02:00
Michal Mrozek
5eadedc36e
refactor: Remove not used logic.
...
Signed-off-by: Michal Mrozek <michal.mrozek@intel.com >
2023-06-30 10:58:35 +02:00
Zbigniew Zdanowicz
21823af419
performance: add skeleton method to cmdlist immediate flush task
...
Related-To: NEO-7808
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-30 10:46:20 +02:00
Dunajski, Bartosz
9a8c38fa94
feature: unregister csr client on cmd list host sync
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-29 13:52:08 +02:00
Artur Harasimiuk
cf73ab0df3
refactor: remove not used code
...
Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com >
2023-06-29 13:04:23 +02:00
Cencelewska, Katarzyna
0d7aefe66b
fix: Unify logic calculating threads per work group part 1
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-29 10:43:22 +02:00
Jaroslaw Chodor
023fe38448
fix: Use correct dimensions for UV plane
...
For image view mapped directly to UV plane,
the dimensions should 2 times smaller than
dimensions of the source image.
(1 raw UV pair maps to 2x2 block of original image)
Related-To: NEO-7936
Signed-off-by: Jaroslaw Chodor <jaroslaw.chodor@intel.com >
2023-06-28 23:34:50 +02:00
Dunajski, Bartosz
ecb415bf62
feature: reenable RelaxedOrdering
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-28 12:20:17 +02:00
Dunajski, Bartosz
2383896dbb
fix: stall RelaxedOrdering scheduler when programming Semaphores
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-28 11:46:04 +02:00
Zbigniew Zdanowicz
1067167637
test: adding testing debug flag overriding driver version
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-27 17:58:03 +02:00
Dunajski, Bartosz
7ac825e74b
refactor: add debug flag to synchronize Event before reset
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 17:38:37 +02:00
Cencelewska, Katarzyna
68d81c82a7
fix: Use proper value about hw local id generations
...
- remove useless flag ForceNumberOfThreadsInGpgpuThreadGroup
- add new flag "RemoveRestrictionsOnNumberOfThreadsInGpgpuThreadGroup"
to restore old path without restrictions about number of threads in
thread group
- fix forwarding information about hw local ids generations to
calculate numOfThreadsInThreadGroup correctly
Related-To: NEO-7952, NEO-7982
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-26 16:35:42 +02:00
Dunajski, Bartosz
aea5f435db
feature: unregister CSR client on Event host synchronize
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 12:02:14 +02:00
Dunajski, Bartosz
61fb19caab
feature: bring back counter based in-order tracking
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-26 10:01:18 +02:00
Lukasz Jobczyk
bc0a3a7eb5
fix: Consider slm size in suggest work group cache
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-26 09:12:54 +02:00
Joshua Santosh Ranjan
97b4d8bab5
feature: add initial support for host mapped timestamps
...
Related-To: LOCI-4171
Signed-off-by: Joshua Santosh Ranjan <joshua.santosh.ranjan@intel.com >
2023-06-26 08:29:58 +02:00
Lukasz Jobczyk
0bc5eead84
fix: Remove not needed BCS split helper
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-22 14:36:22 +02:00
Zbigniew Zdanowicz
ddffb8a67f
fix: add missing unrecoverable macro
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-22 10:47:18 +02:00
Dunajski, Bartosz
eda13e675a
feature: enable dynamic post sync allocation layout
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-22 08:37:40 +02:00
Lukasz Jobczyk
a61b39c47b
fix: Invalidate constant cache when ULLS enabled
...
Related-To: NEO-8067
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-21 15:17:14 +02:00
Mateusz Hoppe
111b112729
feature: add assertBufferPtr to ImplicitArgs
...
Related-To: NEO-5753, NEO-8078
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-20 20:43:57 +02:00
Mateusz Jablonski
1be48965ef
fix: parse all extensions passed to zeMemAllocShared
...
reuse common pNext parser
allow to create raytracing allocation with relaxed size
Related-To: NEO-8075
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-20 14:32:16 +02:00
Dunajski, Bartosz
46e8c3f5dd
fix: reenable RelaxedOrdering for OCL
...
Related-To: NEO-7458
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-20 13:05:25 +02:00
Lukasz Jobczyk
c5f61cce85
fix: Ensure flat ring buffer disabled on regular cmd lists
...
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-19 16:27:57 +02:00
Mateusz Hoppe
313fb84fda
feature: bindless addressing mode support
...
- allow bindless kernels to execute
- bindless addressing kernels are using private heaps mode
- do not differentiate bindful and bindless surface state base addresses
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-06-19 12:41:03 +02:00
Zbigniew Zdanowicz
1a80ab2589
performance: store values instead calling virtual methods in command list
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-06-16 16:45:16 +02:00
Lukasz Jobczyk
0cf975605b
performance: Cache suggest group size
...
Resolves: NEO-7968
Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com >
2023-06-16 13:26:55 +02:00
Spruit, Neil R
907f41529c
feature: zeModuleInspectLinkageExt to read Linkage of modules
...
Related-To: LOCI-2914
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com >
2023-06-15 18:13:43 +02:00
Zhang, Winston
44a8610ebb
feature: Implement support for zeDeviceGetStatus
...
Added support for zeDeviceGetStatus API.
Added ULTs for zeDeviceGetStatus API.
Related-To: LOCI-1558
Signed-off-by: Zhang, Winston <winston.zhang@intel.com >
2023-06-15 06:36:14 +02:00
Cencelewska, Katarzyna
7cb3278eb3
fix: add function to calculate number of threads per tg
...
Signed-off-by: Cencelewska, Katarzyna <katarzyna.cencelewska@intel.com >
2023-06-13 14:02:24 +02:00
Mateusz Jablonski
4f72835b7d
fix: create dedicated class for root device indices to store unique values
...
remove method to removing duplicates from StackVec as the method
implicitly sorted the vector
Related-To: GSD-4692
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-06-12 22:24:06 +02:00
Kamil Kopryk
6a0f7afd64
feature: verify stateful information only when binary is generated by IGC
...
Signed-off-by: Kamil Kopryk <kamil.kopryk@intel.com >
Related-To: NEO-6075
Ngen binaries contain stateful information, however they are
not used in isa on Pvc. Therefore, we can just ignore them.
2023-06-12 11:45:41 +02:00
Dunajski, Bartosz
37f04dc670
feature: in-order CmdList support for BCS split
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-06-09 15:15:45 +02:00