compute-runtime

mirror of https://github.com/intel/compute-runtime.git synced 2025-06-28 17:58:30 +08:00

Author	SHA1	Message	Date
Hoppe, Mateusz	55a045ebe1	Refactor graphics memory allocation scheme - replace createGraphicsAllocationWithRequiredBitness with more general methodallocateGraphicsMemoryInPreferredPool based on passed AllocationData - proper flags for allocation selected based on AllocationType - remove allocateGraphicsMemory(size_t size, size_t alignment) and use allocateGraphicsMemory(size_t size) instead where default alignment is sufficient, otherwise use full options version: allocateGraphicsMemory(size_t size, size_t alignment, bool forcePin, bool uncacheable) Change-Id: I2da891f372ee181253cb840568a61b33c0d71fc9	2018-07-11 15:48:05 +02:00
Hoppe, Mateusz	684b1d75ba	Refactor GraphicsAllocation::AllocationType and allocationType enums - change GraphicsAllocatoin::AllocationType to scoped enumeration so that ALLOCATION_TYPE_ prefix in every enum value can be removed - all accesses are typed (example AllocationType::IMAGE) - Rename allocationType to AllocationUsage to eliminate confusion with multiple AllocationType enums / types Change-Id: I16003297ecfcb0aaa5779ad00706c5d983914bbe	2018-07-06 13:00:08 +02:00
Milczarek, Slawomir	eb1b5ded9c	Add support for AUB subcapture (filter and toggle modes) This commit adds a capability to selectively enable/disable AUB capture, i.e. by toggling the registry key from the outside or specifying the filter with a kernel name and/or kernel start index and kernel end index. Change-Id: Ib5d39c21863fbc4a95aa73c949b9779ff993de0f	2018-06-15 13:02:27 +02:00
Artur Harasimiuk	75ab0c6fe1	Switch clang-format to 6.0 Change-Id: Id96d1f47fb3d479d10d1022f1259dc030a148192 Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>	2018-06-14 09:45:00 +02:00
Zdunowski, Piotr	0cc10e47cc	Use device instead of context when programing surface state. Change-Id: I67615036d373cf905762a43a92562bf3d84854a5	2018-06-11 17:20:11 +02:00
Woloszyn, Wojciech	8a488ad52f	Fix reported row/slicePitch for mip-maps - use information from gmm correctly - modify computation on gen8 Change-Id: Iaefcc20ce9436ef70cd2f4bc36654932c4b5af49	2018-05-22 10:36:54 +02:00
Zdanowicz, Zbigniew	33fee15711	Change interface to pass Interface Descriptor Data as pointer Change-Id: I0f33109b800a7607206954bb1e5cb0826290e6f3	2018-05-11 14:35:53 +02:00
Mrozek, Michal	34ff5852eb	Add capability to csr to allow N:1 aggregation when ooq is created. - This allows applications to force the N:1 aggregation by creating out of order queue. - That switches csr to N:1 submission model where commands from multiple command streams may be aggregated. - That forces scenarios returning an event to be aggregated as well. Change-Id: I8fd8d7f88bb2665234ee90870133120b206710a8	2018-04-26 15:41:20 +02:00
Mrozek, Michal	8d2df3c332	Move indirect heaps from command queues to csr. -This is required to enable N:1 submission model. -If heaps are coming from different command queues that always mean that STATE_BASE_ADDRESS needs to be reloaded -In order to not emit any non pipelined state in CSR, this change moves the ownership of IndirectHeap to one centralized place which is CommandStreamReceiver -This way when there are submissions from multiple command queues then they reuse the same heaps, therefore preventing SBA reload Change-Id: I5caf5dc5cb05d7a2d8766883d9bc51c29062e980	2018-04-26 14:05:40 +02:00
Pawel Wilma	a0c044e6d2	Extend batch buffer flattening in AubCSR to BatchedDispatch mode - batch buffer flatening in batched mode - added MI_USER_INTERRUPT command - added GUC Work Queue Item Change-Id: I35142da34b30d3006bb4ffc1521db7f6ebe68ebc	2018-04-26 12:45:02 +02:00
Zdanowicz, Zbigniew	d94b853c32	Separate HW commands from class declaration and interface Change-Id: Ia49098171f4b7814c42a35686354713a322c9df7	2018-04-25 14:03:00 +02:00
Mrozek, Michal	ce8c44cae3	Add check for local work group size in clEnqueueNDRangeKernel call. - Incoming local work group size cannot exceed device capabilities. Change-Id: I89a7503155c71443e3ebc630debb5d5b466c6cb5	2018-04-20 08:16:16 +02:00
Hoppe, Mateusz	83160213f0	Fix problems in thkWrapper and SharingHandler - ThkWrapper had uninitialized mFunc member, setting it to nullptr - D3DSurface could dereference null image pointer, adding validateUpdateData method in SharingHandler that may return CL_INVALID_MEM_OBJECT if memObject is invalid Change-Id: Iaa4499bcea47baca156c9d28be4c93ba4f0e1ebb	2018-04-19 15:04:38 +02:00
Artur Harasimiuk	75d497a9a9	separate BuiltinDispatchInfoBuilder from built_ins.h We don't need BuiltinDispatchInfoBuilder in every place where built ins are used. specifically in .cpp files generated from kernel binary. Change-Id: Ie739951cdc93873993f78ad14cee656122af51fd Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>	2018-04-19 12:32:13 +02:00
Mrozek, Michal	d900bdffc6	[33/n] Internal 4GB allocator. - Move indirect heap to internal allocator domain. - Add logic in getIndirectHeap to allocate with proper API depending on heap type - Add State base Address programming, reflecting that now Indirect Object Heap is placed in 4GB domain. - For AddPatchInfoCommentsForAUBDump mode , keep all heaps in non 4GB mode. Change-Id: I6862f6a249e444d0d6cfe7e499a10d43f284553e	2018-04-19 08:13:48 +02:00
Mrozek, Michal	8583c68c8c	[29/n] Internal 4GB allocator. - Internal allocations may now coexists with non internal on reusable list. - Caller now specifies if internal allocation is needed. - If criteria are not met , then allocation is not returned. Change-Id: I7da3a4f944768b7c8a873e44fd47248f1d76bf9e	2018-04-17 06:42:56 +02:00
Mrozek, Michal	87b8b6e261	[28/n] Internal 4GB allocator. Avoid default parameter to getIndirectHeap. Change-Id: I105ceaa4b5e9b23ce8dc96631410b9535e5a44e0	2018-04-16 17:56:49 +02:00
Artur Harasimiuk	cb064abb04	fix mapImage for 1D_ARRAY There are differences in qPitch programming between Gen8 vs Gen9+ devices and this requires special operation when image is zero-copy. For Gen8 qPitch is distance in rows while Gen9+ it is in pixels. Minimum value of qPitch is 4 and this causes slicePitch = 4*rowPitch on Gen8. To allow zero-copy we have to tell what is correct value rowPitch which should equal to slicePitch. Change-Id: I58dea004e3c7f9f4dfabd154d02749c15b6b0246 Signed-off-by: Artur Harasimiuk <artur.harasimiuk@intel.com>	2018-04-16 16:13:51 +02:00
Zdanowicz, Zbigniew	e51cb6bd0b	Separate struct EnqueueOperation declaration and implementation Change-Id: I537660867a1c98f957280237c14b7a1554fce3db	2018-04-10 16:36:48 +02:00
Chodor, Jaroslaw	6bf4135def	Fix for externally synchronized events When inheriting task count from parent events, don't take into account externally synchronized events Change-Id: I52d861e482669a18e2aca499c813716bb4951b74	2018-04-09 12:12:58 +02:00
Mrozek, Michal	ffa9b097f5	[26/n] Internal 4GB allocator. - change the way we handle blocked commands. - instead of allocating CPU pointer and populating it with commands, create real IndirectHeap that may be later submitted to the GPU - that removes a lot of copy operations that were happening on submit time - for device enqueue, this requires dsh & shh to be passed directly to the underlying commands, in that scenario device queue buffers are not used Change-Id: I1124a8edbb46777ea7f7d3a5946f302e7fdf9665	2018-04-09 10:47:37 +02:00
Chodor, Jaroslaw	0a97dfbb2f	[1/n] Mipmap support * adding support for map/unmap * adding support for origin/region validation with mipmaps * fixing slices returned in map/unmap * removing ambiguity around mipLevel naming * enabling cl_khr_mipmap_image in current shape * enabling cl_khr_mipmap_image_writes in current shape * fixing CompileProgramWithReraFlag test Change-Id: I0c9d83028c5c376f638e45151755fd2c7d0fb0ab	2018-04-05 01:09:27 +02:00
Zdanowicz, Zbigniew	b6b92ae808	Create GpgpuWalkerHelper class Change-Id: Ia9aa7b816356aff57234b46ea3509b6bd9b7f14b	2018-04-04 16:42:16 +02:00
Mrozek, Michal	cbcf77ae49	Fix out of bound problem while estimating Indirect Object Heap. - While estimating the required size of Indirect Object Heap we were not handling properly the lack of local ids case - In such case we should allocate one GRF per HW thread that will be unused Change-Id: Ibcd359e431e3ffd9d55628ac7cf7eeefad72e7ba	2018-04-03 17:45:28 +02:00
Hoppe, Mateusz	4703417813	Use correct virtual addresses in TBX CSR makeCoherent method - cpu virtual address was used instead of gpu va - this caused incorrect behaviour of TBX server when special heap allocator assigning GPU addresses was used Change-Id: I2328cf2441be797311fd6a3c7b331b0fff79d4fc	2018-04-03 15:54:07 +02:00
Milczarek, Slawomir	b56289a507	User space AUBs capable of memory re-dumps on CPU-side memory modifications. Any CPU related updates such as clEnqueueMapBuffer or similar need to trigger a re-dump of memory prior to the next clEnqueue call. Change-Id: I7b31e559278e92ff55b6ebab8ef4190caef1ebc0	2018-04-03 15:40:29 +02:00
Mrozek, Michal	e4c25f11de	[25/n] Internal 4GB allocator. - Do not obtain pattern allocation from reusable pool. - This is due to the fact that it may contain allocations from internal heap, which cannot be used for arguments declared as kernel argument. Change-Id: I6c73445c409edc4ce25f8d8eba966f512dfd6cc9	2018-03-30 14:59:11 +02:00
Mrozek, Michal	5dc0a7c731	Remove default value for dispatchWalker parameter. Change-Id: I0676a353a4364339664edc416e36da37a345a4f6	2018-03-30 12:57:42 +02:00
Mrozek, Michal	de315db953	[24/n] Internal 4GB allocator. - Refactor tests for better maintenance - Remove duplicated code. Change-Id: I154cad43610497d2e1cabf99217820735d3868cd	2018-03-30 09:12:08 +02:00
Mrozek, Michal	7f3c4d3d70	[22/n] Internal 4GB allocator. - Finalize Instruction Heap removal. Change-Id: Idd7df94a228238a5157c3251180fc3c8d3a189df	2018-03-29 08:17:32 +02:00
Mrozek, Michal	2be5934096	[21/n] Remove Instruction Heap from enqueue path. - This removes Instruction Heap allocation from enqueue path - Blocked path is handled as well - Heap is no longer allocated on demand it is bind to kernelInfo. Change-Id: I54545beceed3404ee0330a8bac2b0934944cac30	2018-03-28 20:15:55 +02:00
Mrozek, Michal	9bdf01468e	[20/n] Internal 4GB allocator. - Switch to internal heap for kernel ISA allocations. - remove IH from various functions - remove IHState from CSR , IH is never dirty - ISA is no longer copied on enqueue calls. Change-Id: I0099cf2a9ebab6192ea03a74dd35f7da963fd5a5	2018-03-28 16:07:26 +02:00
Milczarek, Slawomir	a02c3cb781	KM DAF AubCapture to recapture fill pattern allocations The commit introduces a recapture of fill pattern allocations on every submit. Change-Id: I634af075348dbc59c7809f58b8495326cab804e1	2018-03-27 16:38:41 +02:00
Mrozek, Michal	09923fcb39	[17/n] Internal 4GB allocator. - Make sure that blocks ISA is made resident - both blocked & non blocked path - fix a bug where private surface was not made resident in blocked path. Change-Id: Ie564595b176b94ecc7c79d7efeae20598c5874fb	2018-03-27 10:33:22 +02:00
Milczarek, Slawomir	32825e203e	KM DAF AubCapture to recapture command streams and heap allocations This commit introduces a recapture of CS and Heap resources on every submit. Change-Id: I2a5a763e8988de804da1a6c2c8042154b0786b2e	2018-03-26 18:27:20 +02:00
Hoppe, Mateusz	7f32eb06d1	Kernel Source Level debugger support 4/n - adding DebugSurface allocation and setup - unit tests refactors: - mock kernel with kernel debug option - separating fixtures to headers - added helper for getting internal-options kernels filenames Change-Id: I7b6f4d46e2ab7cff0da8d5212483f44ae0d4be31	2018-03-26 15:02:42 +02:00
Mateusz Jablonski	575d1bf381	Cmake refactor part 20 set global properties with runtime sources, libult sources and os interface tests Change-Id: I9a84edf2f021b4581a16c19c7dbb0b2f94c33f4d	2018-03-23 15:51:12 +01:00
Pawel Wilma	ff1d2361f3	Add patch info comments to AUB dump Collect patching information and add as comments to AUB dump. Change-Id: Ib7c903a2589d68b6e3e614c1774c7cd5a000c29f	2018-03-23 13:08:54 +01:00
Mrozek, Michal	d7fe01454b	Make sure that gtpin callbacks are not executed in enqueue path. -This is to make sure those functions are not called when gtpin is not used -This preserves CPU instruction cache pollution. -Our enqueue path needs to be as thin as possible, even with this small change there is visible gain in ULT execution time. Change-Id: I44cc2144754cda95ca1fe058184cd8a151b8d35c	2018-03-23 12:54:17 +01:00
Dunajski, Bartosz	516082e7c5	Kmd notify improvements [1/n]: Quick KMD sleep optimization - KmdNotifyProperties struct for CapabilityTable that can be extended by incoming KmdNotify related optimizations - Quick KMD sleep optimization that is called from async events handler - Optimization makes a taskCount check in busy loop with much smaller delay than basic version of KMD Notify optimization Change-Id: I60c851c59895f0cf9de1e1f21e755a8b4c2fe900	2018-03-21 20:41:33 +01:00
Woloszyn, Wojciech	ce2f1468b7	Implement cl_khr_mipmap_image [2/n] - Add mipmap handling for clEnqueueCopyImage - Add mipmap handling for clEnqueueCopyImageToBuffer - Add mipmap handling for clEnqueueCopyBufferToImage - Fix typos Change-Id: Ie1a23b1699135afa17fe11bcba3c1e8bdf9a3dd9	2018-03-21 17:04:12 +01:00
Woloszyn, Wojciech	0ad81024b7	Implement cl_khr_mipmap_image [1/n] - Add mipmap image handling for clEnqueueReadImage, clEnqueueWriteImage - Fix mipmap image handling for clCreateImage Change-Id: I42938a330b55c7e69a16c26dce3ab5d66f8a8938	2018-03-21 10:51:13 +01:00
Dunajski, Bartosz	94033a1c51	Update FlushStamp for output Event on CPU data transfer operations - Previously only taskCount was updated - This improves KMD notify usage for Events handled asynchronously Change-Id: I283982890579254033557de0e1cef2239c0035e2	2018-03-20 15:29:35 +01:00
Mrozek, Michal	7644209288	Add debug flag to dump dispatch parameters. - Also refactor debug manager tests , they now check for default value in igdrcl.config file - There is no need to write dedicated tests now , so I remove them. Change-Id: Ib338ca05b6059302c29469c673239e7886dc4b9b	2018-03-16 11:13:35 +01:00
Hoppe, Mateusz	a1a20a3b34	Service read only memory passed as host_ptr - read only memory cannot be used for allocation, Oses cannot create graphics alocation for such memory - if memory allocation fails for host_ptr passed to enqueueWrite calls, then try doing new allocation and copy host_ptr on cpu Change-Id: I415a4673ae1319ea8f77e53bd8fba7489fe85218	2018-03-14 13:16:36 +01:00
Mateusz Jablonski	c6441eba9b	Cmake refactor part 10 Cleanup cmake files in runtime dir Add missing files to solution Change-Id: I5d0cf8b658039f7bdf21681ac6e3750a5699d311	2018-03-08 15:35:25 +01:00
Mrozek, Michal	1602fa5a88	[7/n] Internal 4GB allocator - rename getBase to getCpuBase - change some test names accordingly. Change-Id: I6fb2e4714298250147ea7766a916d7f5d62edc54	2018-03-05 22:16:14 +01:00
Dunajski, Bartosz	1fce275542	Remove forced DC flush and disabled out of order execution for shared objects Change-Id: I0de86c3d5af488a347e83858f5dddbac2ef53c17	2018-03-05 09:45:18 +01:00
Zdanowicz, Zbigniew	533afe472a	Program preemption mode in Interface Descriptor Data Change-Id: I7fce731d71dd0b6dc8505ebfe45d24c65898a08b	2018-03-05 09:36:53 +01:00
Mateusz Jablonski	d1aa5f978d	Cmake refactor part 6 Add macro to add all subdirectories Add macro to create project source tree based on target sources Small cleanup runtime/CMakeLists.txt Change-Id: I9b99145c544f648c4c3fe7421752d0c5d9504edf	2018-03-02 00:39:41 +01:00

1 2 3

106 Commits