Dunajski, Bartosz
42496ac96d
feature: initial support for patching regular in-order CmdList
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-21 14:20:50 +02:00
Maciej Bielski
97e7cda912
feature: Optimize intra-module kernel ISA allocations
...
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.
Improve the situation by reusing the parent allocation (owned by the
module instance) for modules, which kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.
Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2023-09-21 13:55:45 +02:00
Dunajski, Bartosz
b94f58abaa
feature: debug flag to enable in-order events
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-21 11:22:48 +02:00
Kulkarni, Ashwin Kumar
3a5b038c45
fix(sysman): bugfix for return of correct uuid with zesInit flow
...
Related-To: LOCI-4907
Signed-off-by: Kulkarni, Ashwin Kumar <ashwin.kumar.kulkarni@intel.com >
2023-09-21 09:41:49 +02:00
Latif, Raiyan
9bb559d468
feature: Ray Tracing Acceleration Structure (RTAS) Support
...
Related-To: LOCI-3419
Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com >
2023-09-20 19:57:56 +02:00
Dunajski, Bartosz
020822fe29
feature: re-enable in-order semaphores for regular CmdLists
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-20 15:34:05 +02:00
Mateusz Jablonski
b1808f7830
fix: correct suggested number of work groups for concurrent kernels on PVC
...
value depends on CCS count:
- single CCS mode (default) - 50% available
- two CCS mode - 25% available
- four CCS mode - 12.5% available
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-20 13:40:22 +02:00
Mateusz Hoppe
69f5ca6345
feature: bindless addressing - flush state cache after reusing SS slot
...
- when Surface State is reused for new resource, State Cache needs to be
invalidated
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-20 12:53:32 +02:00
Zbigniew Zdanowicz
09ab0ca1d2
performance: remove not needed validation loop
...
- validation of arguments must be moved to validation layer
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-20 12:35:18 +02:00
Zbigniew Zdanowicz
40e169f8e2
refactor: add batch buffer attribute for explicit monitor fence dispatch
...
Related-To: NEO-8395
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-20 12:18:32 +02:00
Zbigniew Zdanowicz
713511eea9
refactor: inline simple level zero queue method
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-20 12:12:56 +02:00
Zbigniew Zdanowicz
453d01efb2
performance: remove extra loop processing command lists in execute call
...
Related-To: NEO-7828
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-19 18:01:16 +02:00
Dunajski, Bartosz
9da60d8e07
feature: regular CmdList submission counter
...
This is prework for in-order cmd lists
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-19 16:26:38 +02:00
Dunajski, Bartosz
d3d5da1f72
feature: initial 64b in-order CmdList support
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-19 15:23:37 +02:00
Mateusz Jablonski
8da3879944
refactor: remove not needed code
...
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-19 14:40:24 +02:00
Compute-Runtime-Validation
913a926fd4
Revert "feature: Optimize intra-module kernel ISA allocations"
...
This reverts commit c348831470 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-19 14:16:05 +02:00
Maciej Bielski
c348831470
feature: Optimize intra-module kernel ISA allocations
...
So far, there is a separate page allocated for each kernel's ISA within
`KernelImmutableData::initialize()`. Apparently the ISA blocks are often
much smaller than a 64k page, which leads to poor memory utilization and
was even observed to cause the device OOM error if a single module has
several keys.
Improve the situation by reusing the parent allocation (owned by the
module instance) for modules, which kernel ISAs can fit together within
a single 64k page. This improves the memory utilization on a single
module level.
Related-To: NEO-7788
Signed-off-by: Maciej Bielski <maciej.bielski@intel.com >
2023-09-19 12:05:09 +02:00
Dunajski, Bartosz
e08d46085b
feature: enable in-order sync allocation for regular cmd lists 3
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-19 10:07:57 +02:00
Maciej Plewka
ee21f7c717
fix: Use cmdlist residency container for reused private allocs
...
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-09-18 13:50:17 +02:00
Jitendra Sharma
8f7b3e6699
feature: Report 128 GRF count with zetDebugGetRegisterSetProperties
...
Related-to: NEO-6788
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2023-09-18 11:03:59 +02:00
Mateusz Hoppe
371412a826
refactor: remove useLocalMem from allocate32BitGraphicsMemoryImpl
...
- this method allocates System Memory
- argument is not needed - ExternalHeap is selected inside this function
- remove unneeded ults
- allocate memory in Device Pool for external heap allocation in
OsAgnosticMemoryManager
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-18 10:08:09 +02:00
Brandon Yates
7c1f5585d7
feature(debugger): online page fault event handling
...
Sync to
https://github.com/intel-gpu/drm-uapi-helper/releases/tag/v2.0-rc20
Related-to: LOCI-2052
Signed-off-by: Brandon Yates <brandon.yates@intel.com >
2023-09-18 00:15:45 +02:00
Aravind Gopalakrishnan
4e7f7b294c
feature(sysman): Add boilerplate for RAS extensions
...
Related-To: LOCI-4877
Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com >
2023-09-17 00:20:48 +02:00
Compute-Runtime-Validation
47dac196d9
Revert "feature: Ray Tracing Acceleration Structure (RTAS) Support"
...
This reverts commit 179abf00de .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-16 04:37:55 +02:00
Spruit, Neil R
f901b26222
fix: Return Invalid Argument given the wrong module used in static link
...
Related-To: LOCI-4939
Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com >
2023-09-15 23:20:10 +02:00
Latif, Raiyan
179abf00de
feature: Ray Tracing Acceleration Structure (RTAS) Support
...
Related-To: LOCI-3419
Signed-off-by: Latif, Raiyan <raiyan.latif@intel.com >
2023-09-15 19:05:20 +02:00
Mateusz Hoppe
fb211a921d
feature: bindless addressing support for image views
...
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-15 15:25:47 +02:00
Compute-Runtime-Validation
73731d3be5
Revert "fix: correct suggested number of work groups for concurrent kernels o...
...
This reverts commit 6fc673b0fe .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-15 04:21:58 +02:00
Mateusz Jablonski
6fc673b0fe
fix: correct suggested number of work groups for concurrent kernels on PVC
...
value depends on CCS count:
- single CCS mode (default) - no limitations
- two CCS mode - 25% available
- four CCS mode - 12.5% available
Related-To: NEO-8377
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-14 16:13:54 +02:00
Brandon Yates
513045bd90
fix(debugger): auto-ack vmbind on tile for module from other tiles
...
Related-to: NEO-8371
Signed-off-by: Brandon Yates <brandon.yates@intel.com >
2023-09-14 15:33:58 +02:00
Compute-Runtime-Validation
c1274cd84f
Revert "feature: enable in-order sync allocation for regular cmd lists 2"
...
This reverts commit efddaa1251 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-14 15:30:40 +02:00
Compute-Runtime-Validation
e489c16529
Revert "feature: Report 128 GRF count with zetDebugGetRegisterSetProperties"
...
This reverts commit 6596ce5097 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-14 14:34:55 +02:00
Baj, Tomasz
e10f39017d
fix: Add ImageInfo to createGraphicsAllocation on Linux
...
Related-To: NEO-6757
Signed-off-by: Baj, Tomasz <tomasz.baj@intel.com >
2023-09-14 12:58:59 +02:00
Jitendra Sharma
6596ce5097
feature: Report 128 GRF count with zetDebugGetRegisterSetProperties
...
Related-to: NEO-6788
Signed-off-by: Jitendra Sharma <jitendra.sharma@intel.com >
2023-09-13 14:12:38 +02:00
Dunajski, Bartosz
efddaa1251
feature: enable in-order sync allocation for regular cmd lists 2
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-13 11:27:48 +02:00
Dunajski, Bartosz
7562842a58
refactor: remove LogicalStateHelper
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-13 10:29:53 +02:00
Compute-Runtime-Validation
413365a7bf
Revert "fix: Correct logic for SIMD1"
...
This reverts commit fc099ead2e .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-13 08:23:59 +02:00
Kacper Nowak
fc099ead2e
fix: Correct logic for SIMD1
...
- For calculating number of threads per workgroup, treat simd 1 as it
was simd 32
- Correct logic of calculating space for per thread data for simd 1
- Minor: unit tests refactor
- Corrected naming
Related-To: NEO-8261
Signed-off-by: Kacper Nowak <kacper.nowak@intel.com >
2023-09-13 07:03:12 +02:00
Compute-Runtime-Validation
b053e9348e
Revert "feature: enable in-order sync allocation for regular cmd lists"
...
This reverts commit c8a3d7d268 .
Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com >
2023-09-13 02:44:40 +02:00
Fabian Zwolinski
0dd40b8616
refactor: Split compiler interface between OSs
...
Create new file:
`os_handle.h` which contains OS specific Handle type.
Add `shared/source/os_interface/<windows/linux>`
include directory to level_zero.
Related-To: NEO-8092
Signed-off-by: Fabian Zwolinski <fabian.zwolinski@intel.com >
2023-09-12 15:56:53 +02:00
Mrozek, Michal
d9f938f3db
refactor: remove not needed code
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 14:25:04 +02:00
Dunajski, Bartosz
c8a3d7d268
feature: enable in-order sync allocation for regular cmd lists
...
Related-To: NEO-7966
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 13:20:59 +02:00
Dunajski, Bartosz
6648065703
feature: add indirect semaphore mode
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 13:15:51 +02:00
Dunajski, Bartosz
2a6be2fccd
feature: update conditional bb start to use qword data
...
Related-To: NEO-8242
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 11:24:28 +02:00
Dunajski, Bartosz
def3f2e9ad
refactor: improve semaphore programming
...
Signed-off-by: Dunajski, Bartosz <bartosz.dunajski@intel.com >
2023-09-12 11:24:11 +02:00
Mrozek, Michal
451c48fc2f
refactor: remove not needed code.
...
Signed-off-by: Mrozek, Michal <michal.mrozek@intel.com >
2023-09-12 10:51:35 +02:00
Zbigniew Zdanowicz
2f43ff28c3
fix: noop close call on immediate command lists
...
Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com >
2023-09-08 16:06:16 +02:00
Maciej Plewka
09c1d474c9
fix: adjust depth limitations for images
...
Related-To: NEO-8239, HSD-14019991752
Signed-off-by: Maciej Plewka <maciej.plewka@intel.com >
2023-09-08 13:05:49 +02:00
Mateusz Jablonski
4f68822a78
fix: don't use static global in global destructor
...
https://github.com/intel/compute-runtime/issues/675
Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com >
2023-09-08 12:51:39 +02:00
Mateusz Hoppe
93469eaf5d
feature: bindless addressing for buffers with offset
...
- allocate SurfaceStates on kernel's heap for offsetted buffers
Related-To: NEO-7063
Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com >
2023-09-08 12:03:23 +02:00