compute-runtime

mirror of https://github.com/intel/compute-runtime.git synced 2026-01-03 23:03:02 +08:00

Author	SHA1	Message	Date
Jaroslaw Warchulski	df07897144	fix: forbid compression for pre-xe2 platforms Related-To: NEO-9465 Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>	2025-04-05 00:15:16 +02:00
Compute-Runtime-Validation	f332571d96	Revert "performance: Do not create global fence allocation on integrated" This reverts commit `ecf8a07d26`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2025-04-04 16:26:19 +02:00
Dominik Dabek	bd516b3552	fix: usm reuse, clean from largest When trimming old allocations in usm reuse start from largest allocations. This will reduce memory usage more quickly once max hold time is hit. Related-To: NEO-6893, NEO-14429 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2025-04-04 14:57:15 +02:00
Zbigniew Zdanowicz	58fe89e116	fix: remove doubled memory prefetch operation when executing command list Related-To: NEO-10356 Signed-off-by: Zbigniew Zdanowicz <zbigniew.zdanowicz@intel.com>	2025-04-04 13:55:16 +02:00
Lukasz Jobczyk	ecf8a07d26	performance: Do not create global fence allocation on integrated Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2025-04-04 11:45:22 +02:00
Dominik Dabek	3703ff550c	fix: use real size when putting into usm reuse Real allocation size should be used to properly apply limits and allow more usm reuse hits. Related-To: NEO-6893 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2025-04-04 09:44:32 +02:00
Chandio, Bibrak Qamar	f344eb9bca	test: ULT for makeResidentResources Related-To: NEO-14056 Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>	2025-04-04 05:55:08 +02:00
Bartosz Dunajski	f99870e716	fix: improve media handling 2 Related-To: NEO-14462 Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2025-04-03 19:11:53 +02:00
Mateusz Jablonski	bb518adf34	fix: patching payload arguments in inline data in case of indirect kernel Related-To: NEO-14532 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2025-04-03 17:21:28 +02:00
Szymon Morek	95e0244f70	fix: properly pass info about 3D image Related-To: NEO-14538 It's valid for 3D image to copy 2D region. Current checks for mip map do not consider that. This change correctly checks for mip mapped 3D image. Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-04-03 16:33:57 +02:00
Slawomir Milczarek	7e7e0a000f	refactor: Add ioctl helper for context destruction Related-To: NEO-11817 Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>	2025-04-03 16:08:53 +02:00
Dominik Dabek	be27367020	performance: usm reuse, avoid looking up svmData Save svmData on putting into reuse, instead of searching each time. Change UNRECOVERABLE_IF to DEBUG_BREAK_IF. Related-To: NEO-6893 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2025-04-03 15:50:49 +02:00
Jaroslaw Warchulski	c010d17842	fix: respect compression flag in capability table Related-To: NEO-9465 Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>	2025-04-03 15:36:55 +02:00
Szymon Morek	6ea83f322d	fix: do not override user data beyond slice region Related-To: NEO-14538 If user passes slice pitch which is larger than region to copy, do not override memory beyond region but within that slice pitch. Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-04-03 12:37:35 +02:00
Jaroslaw Warchulski	62baf28316	fix: remove unnecesarry WA for DG2 compression Related-To: NEO-9465 Signed-off-by: Jaroslaw Warchulski <jaroslaw.warchulski@intel.com>	2025-04-03 08:04:19 +02:00
Filip Hazubski	bc87b1cff0	test: Minor test improvements Related-To: NEO-14526 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2025-04-03 07:53:18 +02:00
Brandon Yates	4651e72b0b	fix: Fail device init if kernel debugging is misconfigured Also print error to stderr Related-to: GSD-10780 Signed-off-by: Brandon Yates <brandon.yates@intel.com>	2025-04-02 21:06:30 +02:00
Filip Hazubski	504440fc4d	feature: Add ftrHeaplessMode flag Pass hwInfo to isHeaplessModeEnabled and isForceBindlessRequired functions. Related-To: NEO-14526 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2025-04-02 21:06:05 +02:00
Lukasz Jobczyk	deca36fd32	fix: Stop ULLS light when evict resource Related-To: NEO-14406 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2025-04-02 16:37:43 +02:00
Bartosz Dunajski	bb3927531e	refactor: reduce HWTEST2_F usage in test_in_order_cmdlist_1.cpp Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2025-04-02 14:29:12 +02:00
Fabian Zwoliński	7ef3880793	feature: implement pool allocator for gpuTimestampDeviceBuffer The patch applies to Level Zero. Only allocations < 2MB will be fetched from the pool. Allocations are shared and reused within a given device. Additionally, I added a new debug flag to control the allocator: EnableTimestampPoolAllocator Related-To: NEO-12287 Signed-off-by: Fabian Zwoliński <fabian.zwolinski@intel.com>	2025-04-02 14:28:56 +02:00
Szymon Morek	8836f6df0b	fix: forward mip map level for 3D images Related-To: NEO-14539 1D and 2D images have already mip map level set correctly. Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-04-02 11:46:41 +02:00
Aravind Gopalakrishnan	3a7d7e022c	fix: Add platform support for reservation on svm heap Related-To: GSD-10816 Signed-off-by: Aravind Gopalakrishnan <aravind.gopalakrishnan@intel.com>	2025-04-02 02:46:30 +02:00
Brandon Yates	a48d66ad75	feature: Add programExceptions stub to CSR Related-to: NEO-12967 Signed-off-by: Brandon Yates <brandon.yates@intel.com>	2025-04-01 18:33:40 +02:00
Szymon Morek	3010af596e	performance: add infrastructure for staging with 3D images Related-To: NEO-14026 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-04-01 15:30:30 +02:00
Mateusz Jablonski	ed37a1e7ef	build: remove not needed flag for builtins compilation Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2025-04-01 14:18:49 +02:00
Mateusz Jablonski	744ff08454	test: correct verifying programmed GPU addresses use memcmp instead of comparing dereferenced pointer when address is programmed within Walker's inline data the memory location address is 4B aligned and is not 8B aligned Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2025-04-01 13:18:14 +02:00
Vysochyn, Illia	70af2bc20b	refactor: Adjust size to preferred SLM values array Related-To: NEO-14479 Signed-off-by: Vysochyn, Illia <illia.vysochyn@intel.com>	2025-04-01 11:56:50 +02:00
Chandio, Bibrak Qamar	2ba2970492	performance: Waiting on make resident Windows Related-To: NEO-14056 No need to explicitly wait on Windows KMD during make resident as it has a while loop that does it nevertheless. The KMD wait affects the API overhead of zeCommandQueueExecuteCommandLists some platforms (MTL, and ARL). Signed-off-by: Chandio, Bibrak Qamar <bibrak.qamar.chandio@intel.com>	2025-04-01 00:12:45 +02:00
Lukasz Jobczyk	0a11a96a53	refactor: Add dedicated method to check if any ULLS light enabled Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2025-03-31 16:36:20 +02:00
Andrzej Koska	e3e01e94a0	Revert "performance: enable Direct Submission on LNL Linux" This reverts commit `cb3b4d326d`. Related-To: NEO-14517, NEO-9004 Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>	2025-03-31 15:22:29 +02:00
Szymon Morek	62964a0b08	fix: invalidate caches when heap is placed into reuse list Related-To: NEO-9004 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-03-31 12:30:29 +02:00
Bartosz Dunajski	831b488685	fix: improve media engine handling Related-To: NEO-14462 Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2025-03-31 10:40:21 +02:00
Slawomir Milczarek	3560b016bd	test: Add errno check to SysCalls wrapper for mkfifo Related-To: NEO-11817 Signed-off-by: Slawomir Milczarek <slawomir.milczarek@intel.com>	2025-03-28 17:07:10 +01:00
Maciej Plewka	a5e19330e9	fix: lock csr before locking residency controller in trim to budget path Lock on csr is needed before lock on residency controller to prevent incorrect lock order. Csr may be locked in waitOnCpu called from trimToBudget, which may lead to deadlocks Signed-off-by: Maciej Plewka <maciej.plewka@intel.com>	2025-03-28 16:18:20 +01:00
Szymon Morek	3fff3dd77b	fix: set misaligned source memory 1-way coherent Related-To: NEO-14443 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-03-28 14:16:45 +01:00
Filip Hazubski	3d9fc8968e	fix: Add BMG device id Add device ID: 0xE211 Signed-off-by: Filip Hazubski <filip.hazubski@intel.com>	2025-03-28 13:40:30 +01:00
Katarzyna Cencelewska	92e40afc49	feature: update debug flag DirectSubmissionPrintSemaphoreUsage instead of printf use makro that make flush after printf Related-To: HSD-14024170600 Signed-off-by: Katarzyna Cencelewska <katarzyna.cencelewska@intel.com>	2025-03-28 13:36:15 +01:00
Compute-Runtime-Validation	88a48f1c5b	Revert "performance: Improve ULLS light residency management" This reverts commit `35eae3f977`. Signed-off-by: Compute-Runtime-Validation <compute-runtime-validation@intel.com>	2025-03-28 11:21:27 +01:00
Mateusz Hoppe	c105c77930	fix: calculation of os context count Related-To: NEO-12952 Signed-off-by: Mateusz Hoppe <mateusz.hoppe@intel.com>	2025-03-28 09:55:17 +01:00
Andrzej Koska	cb3b4d326d	performance: enable Direct Submission on LNL Linux Related-To: NEO-9004 Signed-off-by: Andrzej Koska <andrzej.koska@intel.com>	2025-03-27 17:09:54 +01:00
Lukasz Jobczyk	b43b23b6ed	fix: Init wait utils after hwInfo init for both OS Resolves: HSD-18041922513 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2025-03-27 16:45:22 +01:00
Szymon Morek	ead0842763	feature: add L0 API to query kernel argument info Related-To: NEO-14358 Signed-off-by: Szymon Morek <szymon.morek@intel.com>	2025-03-27 16:43:33 +01:00
Bartosz Dunajski	85f2734ca4	fix: correct gt_id to tile_id engine mapping Signed-off-by: Bartosz Dunajski <bartosz.dunajski@intel.com>	2025-03-27 15:57:57 +01:00
Dominik Dabek	c76edaba4e	fix: enable usm reuse limit based on memory usage Related-To: NEO-14160, NEO-6893 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2025-03-27 15:14:08 +01:00
Dominik Dabek	915d657420	fix: flag to limit usm reuse based on memory usage Host usm and device usm for igfx checks system memory usage. Device usm for dgfx checks local memory usage. If used memory is above limit threshold: - no new allocations will be saved for reuse - cleaner will use shorter hold time of 2 seconds - cleaner will free all eligible allocations, regardless of async deleter thread having work Motivation: in case of gfx memory being full, making resident new allocations will require evictions which leads to massive slowdown on enqueue calls. This change aims to minimize cases where extra memory usage from usm reuse mechanism leads to above situation. Related-To: NEO-6893, NEO-14160 Signed-off-by: Dominik Dabek <dominik.dabek@intel.com>	2025-03-27 10:25:19 +01:00
Jack Myers	0aa2c4f0cb	feature: allow removal of heapful code paths Related-To: NEO-13007 Signed-off-by: Jack Myers <jack.myers@intel.com>	2025-03-27 01:34:35 +01:00
Damian Tomczak	0243004907	feature: additional checkers to enable feature Resolves: NEO-13973 Signed-off-by: Damian Tomczak <damian.tomczak@intel.com>	2025-03-26 18:06:20 +01:00
Mateusz Jablonski	4bc13fa0dc	fix: correct MetricsLibraryGenId for Xe3 Signed-off-by: Mateusz Jablonski <mateusz.jablonski@intel.com>	2025-03-26 16:35:10 +01:00
Lukasz Jobczyk	60b551758c	performance: Adjust waitpkg threshold for discrete devices Related-To: NEO-14336 Signed-off-by: Lukasz Jobczyk <lukasz.jobczyk@intel.com>	2025-03-26 14:59:19 +01:00

... 3 4 5 6 7 ...

8068 Commits