intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 23:49:22 +08:00

Author	SHA1	Message	Date
Alexander Yermolovich	a4610c7182	[BOLT][DWARF] Add support for DW_IDX_parent (#85285 ) This adds support for DW_IDX_parent. If DIE has a parent then DW_IDX_parent in Entry will point to Entry for that parent DIE. Otherwise it will have DW_FORM_flag_present in abbrev. Which takes zero space in Entry. This came from https://discourse.llvm.org/t/rfc-improve-dwarf-5-debug-names-type-lookup-parsing-speed/74151	2024-03-15 13:52:45 -07:00
Amir Ayupov	b431546d41	[BOLT] Check BF state in stale matching (#85339 ) Only apply stale matching if the binary function is in CFG state, i.e. has basic blocks. Test Plan: Updated bolt/test/X86/reader-stale-yaml.test	2024-03-15 10:55:53 -07:00
Maksim Panchenko	49b8a99a0f	[BOLT] Add createCondBranch() and createLongUncondBranch() (#85315 ) Add MCPlusBuilder interface for creating two new branch types.	2024-03-14 15:28:22 -07:00
Maksim Panchenko	bba790db47	[BOLT] Refactor instruction creation interface. NFCI (#85292 ) Refactor MCPlusBuilder's create{Instruction}() functions that used to return bool. We almost never check the return value as we rely on llvm_unreachable() to detect unimplemented functionality. There were a couple of cases that checked the return value, but they would hit the unreachable condition first (at least in debug builds) before the return value gets checked.	2024-03-14 13:17:17 -07:00
Maksim Panchenko	59ab86bb2f	[BOLT] Clear operands when creating new instructions. NFCI (#85191 ) Reset operand list whenever we create a new instruction via a parameter passed by reference. Most functions were already doing this, but there are several places missing the reset. Potentially, if we don not clear the list it could lead to invalid instruction operands. But the existing code is unaffected.	2024-03-14 11:00:08 -07:00
Maksim Panchenko	fd32e744a5	[BOLT] Add support for Linux kernel PCI fixup section (#84982 ) .pci_fixup section contains a table with entries allowing to invoke a fixup hook whenever a problem is encountered with a PCI device. The hookup code typically points to the start of a function. As we are not relocating functions in the kernel (at least not yet), verify this assumption while reading the table and ignore any functions with a fixup code in the middle.	2024-03-12 15:52:27 -07:00
Alexander Yermolovich	6d4aa9d70e	[BOLT][DWWARF] Fix foreign TU index with local TUs (#84594 ) The foreign TU list immediately follows the local TU list and they both use the same index, so that if there are N local TU entries, the index for the first foreign TU is N. Changed so that the size of local TU is accounted for when setting foreign TU index.	2024-03-11 12:20:25 -07:00
Maksim Panchenko	a9b0d7590b	[BOLT] Properly propagate Cursor errors (#84378 ) Handle out-of-bounds reading errors correctly in LinuxKernelRewriter.	2024-03-07 15:29:38 -08:00
Maksim Panchenko	143afb405a	[BOLT] Add reading support for Linux kernel .altinstructions section (#84283 ) Read .altinstructions and annotate instructions that have alternative sequences with "AltInst" annotation. Note that some instructions may have more than one alternatives, in which case they will have multiple annotations in the form "AltInst", "AltInst2", "AltInst3", etc.	2024-03-07 13:04:02 -08:00
Maksim Panchenko	02629793a4	[BOLT] Add reading support for Linux kernel __bug_table section (#84082 ) Read __bug_table section and annotate ud2 instructions with a corresponding bug entry ID.	2024-03-06 23:34:03 -08:00
Fangrui Song	50bdc6f3ec	[BOLT,test] Remove -relax-relocations The option is always true (see `2aedfdd9b8`).	2024-03-06 22:37:18 -08:00
Mehdi Amini	716042a63f	Rename llvm::ThreadPool -> llvm::DefaultThreadPool (NFC) (#83702 ) The base class llvm::ThreadPoolInterface will be renamed llvm::ThreadPool in a subsequent commit. This is a breaking change: clients who use to create a ThreadPool must now create a DefaultThreadPool instead.	2024-03-05 18:00:46 -08:00
Maksim Panchenko	f51ade25b9	[BOLT] Add reading support for Linux kernel .parainstructions section (#83965 ) Read .parainstruction section and mark call instructions with ParaSite annotations.	2024-03-05 13:57:55 -08:00
Mehdi Amini	4a4fb930a5	Use the new ThreadPoolInterface base class instead of the concrete implementation (NFC) (#84056 )	2024-03-05 12:37:11 -08:00
Maksim Panchenko	ccf0c8da1a	[BOLT] Add reading support for Linux kernel exception table (#83100 ) Read Linux exception table and ignore functions with exceptions for now. Proper support requires an introduction of new control flow since some instructions with memory access can cause a control flow change. Hence looking at disassembly or CFG with exceptions annotations is valuable for code analysis, delay marking functions with exceptions as non-simple until immediately before emitting the code.	2024-03-04 17:24:16 -08:00
sinan	71c2a132b2	[BOLT] support AArch64 JUMP26 createRelocation (#83531 ) Add R_AARCH64_JUMP26 implementation for createRelocation, which could significantly reduce the number of failed scan-refs cases if we perform bolt on a selective range of functions.	2024-03-04 17:11:47 +08:00
Mehdi Amini	6594f428de	Split the llvm::ThreadPool into an abstract base class and an implementation (#82094 ) This decouples the public API used to enqueue tasks and wait for completion from the actual implementation, and opens up the possibility for clients to set their own thread pool implementation for the pool. https://discourse.llvm.org/t/construct-threadpool-from-vector-of-existing-threads/76883	2024-03-02 19:10:50 -08:00
Maksim Panchenko	d7d564b2fc	[BOLT] Add BinaryFunction::registerBranch(). NFC (#83337 ) Add an external interface to register a branch in a function that is in disassembled state. Allows to make custom modifications to the disassembler. E.g., a pre-CFG pass can add an instruction and register a branch that will later be used during the CFG construction.	2024-02-28 20:04:28 -08:00
Maksim Panchenko	3f2a9e5910	[BOLT] Sort TakenBranches immediately before use. NFCI (#83333 ) Move code that sorts TakenBranches right before the branches are used. We can populate TakenBranches in pre-CFG post-processing and hence have to postpone the sorting to a later point in the processing pipeline. Will add such a pass later. For now it's NFC.	2024-02-28 19:51:44 -08:00
Maksim Panchenko	7c206c7812	[BOLT] Refactor interface for instruction labels. NFCI (#83209 ) To avoid accidentally setting the label twice for the same instruction, which can lead to a "lost" label, introduce getOrSetInstLabel() function. Rename existing functions to getInstLabel()/setInstLabel() to make it explicit that they operate on instruction labels. Add an assertion in setInstLabel() that the instruction did not have a prior label set.	2024-02-27 18:44:28 -08:00
Maksim Panchenko	0e84e2748b	[BOLT] Move test under X86 target. NFCI (#83202 ) instrument-wrong-target.s test requires X86 host. Move it under runtime/X86.	2024-02-27 15:38:31 -08:00
Elvina Yakubova	b98e6a5ced	[BOLT][AArch64] Skip BBs only instead of functions (#81989 ) After [this ](`846eb76761`) commit we noticed that the size of fdata file decreased a lot. That's why the better and more precise way will be to skip basic blocks with exclusive instructions only instead of the whole function	2024-02-27 19:19:47 +03:00
Alexander Yermolovich	6de5fcc746	[BOLT][DWARF] Add support for .debug_names (#81062 ) DWARF5 spec supports the .debug_names acceleration table. This is the formalized version of combination of gdb-index/pubnames/types. Added implementation of it to BOLT. It supports both monolothic and split dwarf, with and without Type Units. It does not include parent indices. This will be in followup PR. Unlike LLVM output this will put all the CUs and TUs into one Module.	2024-02-26 14:00:31 -08:00
Alexander Yermolovich	841a4168ad	[BOLT] Fix runtime/instrument-wrong-target.s test (#82858 ) Test was failing when only X86 was specified for LLVM_TARGETS_TO_BUILD. Changed so that it will now report unsupporeted. For "X86;AArch64" it still passes. For "X86" reports UNSUPPORTED: BOLT :: runtime/instrument-wrong-target.s (1 of 1)	2024-02-26 13:43:39 -08:00
Alexander Yermolovich	004c1972b4	[BOLT][DWARF][NFC] Expose DebugStrOffsetsWriter::clear (#82548 ) Refactored cod that clears data-structures in DebugStrOffsetsWriter into clear() function and made initialize() public. This is for https://github.com/llvm/llvm-project/pull/81062.	2024-02-21 16:48:02 -08:00
Alexander Yermolovich	640e781dc8	[BOLT][DWARF][NFC] Use SkeletonCU in place of IsDWO check (#82540 ) Changed isDWO to a function that checks Skeleton CU that is passed in. This is for preparation for https://github.com/llvm/llvm-project/pull/81062.	2024-02-21 16:18:18 -08:00
Maksim Panchenko	5daf2001a1	[BOLT] Fix memory leak in BinarySection (#82520 ) The change in #80950 exposed a memory leak in BinarySection. Let BinarySection manage memory passed via updateContents() unless a valid SectionID is set indicating that the contents are managed by JITLink.	2024-02-21 11:54:34 -08:00
Mehdi Amini	744616b3ae	Rename `ThreadPool::getThreadCount()` to `getMaxConcurrency()` (NFC) (#82296 ) This is addressing a long-time TODO to rename this misleading API. The old one is preserved for now but marked deprecated.	2024-02-19 18:07:12 -08:00
Maksim Panchenko	0ce0171243	[BOLT][NFC] Switch logging in LinuxKernelRewriter (#82195 ) Use journaling streams introduced in #81524 for LinuxKernelRewriter.	2024-02-19 03:24:04 +00:00
Maksim Panchenko	2646dccaa3	[BOLT] Add support for Linux kernel static calls table (#82072 ) Static calls are calls that are getting patched during runtime. Hence, for every such call the kernel runtime needs the location of the call or jmp instruction that will be patched. Instruction locations together with a corresponding key are stored in the static call site table. As BOLT rewrites these instructions it needs to update the table.	2024-02-18 17:20:25 -08:00
Alexander Yermolovich	f81f7a5766	[BOLT][DWARF] Remove redundant code (#82118 ) Removed some redundant code. Should be NFC change.	2024-02-17 12:37:07 -08:00
Maksim Panchenko	5a82daafc1	[BOLT][NFC] Remove redundant assertion (#82056 ) processLKSections() used to be a member of RewriteInstance. Since now it is part of the LinuxKernelRewriter, the assertion is no longer needed.	2024-02-16 15:37:54 -08:00
Maksim Panchenko	5a29887145	[BOLT] Add writing support for Linux kernel ORC (#80950 ) Update ORC information based on the new code layout and emit corresponding ORC sections for the Linux kernel. We rewrite ORC sections in place, which puts a limit on the size of new section contents. Since ORC info changes for the new code layout and the number of ORC entries can become larger, we free up space in the tables by removing redundant ORC terminators. As a result, we effectively emit fewer entries and have to add duplicate terminators at the end to match the original section sizes. Ideally, we need to update ORC boundaries to reflect the reduced size and optimize runtime lookup, but we will need relocations for this, and the benefits will be marginal, if any.	2024-02-16 14:25:59 -08:00
Alexander Yermolovich	5ff8b30327	[BOLT][DWARF] Do not emit zero low_pc address arange (#81955 ) According to DWARF spec zero entires indicate end of arange. Changed so that BOLT does not emit zero low_pc arange.	2024-02-16 11:23:28 -08:00
Amir Ayupov	340b1ab9dc	[BOLT] Add missing include Address the comment in https://github.com/llvm/llvm-project/pull/76906#issuecomment-1947335336	2024-02-15 15:01:33 -08:00
Amir Ayupov	d2c9a19dd8	[BOLT][NFC] Pass BF/BB hashes to BAT Test Plan: NFC Reviewers: dcci, rafaelauler, maksfb, ayermolo Reviewed By: rafaelauler Pull Request: https://github.com/llvm/llvm-project/pull/76906	2024-02-15 12:49:43 -08:00
Alexander Yermolovich	82ca752393	[BOLT][DWARF] Add test for DW_AT_ranges input without function output (#81794 ) Added a test that relies on -fbasic-block-sections=all and --gc-sections that exercises a code path that previously printed a warning.	2024-02-14 15:43:39 -08:00
Alexander Yermolovich	c9e8e91aca	[BOLT][DWARF] Fix out of order rangelists/loclists (#81645 ) GCC can generate rangelists/loclists that are out of order. Fixed so that we don't assert, and instead generate partially optimized list. Through most code paths we do sort rnglists/loclists, but not for loclist for a path where BOLT does not modify a function. Although it's nice to have lists sorted, this implementation shouldn't rely on it. This also fixes an issue if we partially capture a list we would write out *end_of_list in helper function. So tools won't see the rest of the addresses being written out.	2024-02-14 11:23:57 -08:00
Amir Ayupov	52cf07116b	[BOLT][NFC] Log through JournalingStreams (#81524 ) Make core BOLT functionality more friendly to being used as a library instead of in our standalone driver llvm-bolt. To accomplish this, we augment BinaryContext with journaling streams that are to be used by most BOLT code whenever something needs to be logged to the screen. Users of the library can decide if logs should be printed to a file, no file or to the screen, as before. To illustrate this, this patch adds a new option `--log-file` that allows the user to redirect BOLT logging to a file on disk or completely hide it by using `--log-file=/dev/null`. Future BOLT code should now use `BinaryContext::outs()` for printing important messages instead of `llvm::outs()`. A new test log.test enforces this by verifying that no strings are print to screen once the `--log-file` option is used. In previous patches we also added a new BOLTError class to report common and fatal errors, so code shouldn't call exit(1) now. To easily handle problems as before (by quitting with exit(1)), callers can now use `BinaryContext::logBOLTErrorsAndQuitOnFatal(Error)` whenever code needs to deal with BOLT errors. To test this, we have fatal.s that checks we are correctly quitting and printing a fatal error to the screen. Because this is a significant change by itself, not all code was yet ported. Code from Profiler libs (DataAggregator and friends) still print errors directly to screen. Co-authored-by: Rafael Auler <rafaelauler@fb.com> Test Plan: NFC	2024-02-12 14:53:53 -08:00
Amir Ayupov	13d60ce2f2	[BOLT][NFC] Propagate BOLTErrors from Core, RewriteInstance, and passes (2/2) (#81523 ) As part of the effort to refactor old error handling code that would directly call exit(1), in this patch continue the migration on libCore, libRewrite and libPasses to use the new BOLTError class whenever a failure occurs. Test Plan: NFC Co-authored-by: Rafael Auler <rafaelauler@fb.com>	2024-02-12 14:51:15 -08:00
Amir Ayupov	fa7dd4919a	[BOLT][NFC] Add BOLTError and return it from passes (1/2) (#81522 ) As part of the effort to refactor old error handling code that would directly call exit(1), in this patch we add a new class BOLTError and auxiliary functions `createFatalBOLTError()` and `createNonFatalBOLTError()` that allow BOLT code to bubble up the problem to the caller by using the Error class as a return type (or Expected). Also changes passes to use these. Co-authored-by: Rafael Auler <rafaelauler@fb.com> Test Plan: NFC	2024-02-12 14:39:59 -08:00
Amir Ayupov	a5f3d1a803	[BOLT][NFC] Return Error from BinaryFunctionPass::runOnFunctions (#81521 ) As part of the effort to refactor old error handling code that would directly call exit(1), in this patch we change the interface to `BinaryFunctionPass` to return an Error on `runOnFunctions()`. This gives passes the ability to report a serious problem to the caller (RewriteInstance class), so the caller may decide how to best handle the exceptional situation. Co-authored-by: Rafael Auler <rafaelauler@fb.com> Test Plan: NFC	2024-02-12 14:36:12 -08:00
Maksim Panchenko	7fe97f0420	[BOLT] Always run CheckLargeFunctions in non-relocation mode (#80922 ) We run CheckLargeFunctions pass in non-relocation mode to prevent the emission of functions that later could not be written to the output due to their large size. The main reason behind the pass is to prevent the emission of metadata for such functions since this metadata becomes incorrect if the function is left unmodified. Currently, the pass is enabled in non-relocation mode only when debug info output is also enabled. As we emit increasingly more kinds of metadata, e.g. for the Linux kernel, it becomes more challenging to track metadata that needs to be fixed. Hence, I'm enabling the pass to always run in non-relocation mode.	2024-02-08 14:21:49 -08:00
Job Noorman	e7c0e59bbc	[BOLT] Fix crash for relocs in data sections against ABS symbols (#76026 ) Fixes #75771	2024-02-07 07:53:02 +00:00
Maksim Panchenko	8ea7f1d20a	[BOLT][NFCI] Keep instruction annotations (#80382 ) We used to delete most instruction annotations before code emission. It was done to release memory taken by annotations and to reduce overall memory consumption. However, since the implementation of annotations has moved to using existing instruction operands, the memory overhead associated with them has reduced drastically. I measured that savings are less than 0.5% on large binaries and processing time is just slightly reduced if we keep them. Additionally, I plan to use annotations in pre-emission passes for the Linux kernel rewriter.	2024-02-06 19:59:53 -08:00
Jon Roelofs	b98db441f0	[BOLT] Make ifunc test not statically-resolvable. NFC This fixes a breakage caused by `e976385415`	2024-02-06 15:15:11 -08:00
Maksim Panchenko	8075f0db16	[BOLT] Use new contents when emitting sections with relocations (#80782 ) We can use BinarySection::updateContents() to change section contents. However, if we also add relocations for new contents, then the original data (i.e. not updated) is going to be used. Fix that. A follow-up diff will use the update interface and will include a test case.	2024-02-06 14:38:21 -08:00
Maksim Panchenko	082fe9a5dd	[BOLT] Remove duplicate expression (#80380 ) Reported by cpp check static analyzer in #80111. Fixes #80111.	2024-02-01 19:05:11 -08:00
Maksim Panchenko	a693ae5306	[BOLT] Enable re-writing of Linux kernel binary (#80228 ) Write modified Linux kernel binary to disk. The output is not supposed to be functional at the moment, but it will allow for future patches to test the output binary.	2024-02-01 12:11:26 -08:00
Maksim Panchenko	116e801a15	[BOLT] Adjust section sizes based on file offsets (#80226 ) When we adjust section sizes while rewriting a binary, we should be using section offsets and not addresses to determine if section overlap. NFC for existing binaries.	2024-02-01 12:08:41 -08:00

1 2 3 4 5 ...

2026 Commits