intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 06:19:46 +08:00

Author	SHA1	Message	Date
Shunsuke Watanabe	c9900015a9	[flang] Add -fcomplex-arithmetic= option and select complex division algorithm (#146641 ) This patch adds an option to select the method for computing complex number division. It uses `LoweringOptions` to determine whether to lower complex division to a runtime function call or to MLIR's `complex.div`, and `CodeGenOptions` to select the computation algorithm for `complex.div`. The available option values and their corresponding algorithms are as follows: - `full`: Lower to a runtime function call. (Default behavior) - `improved`: Lower to `complex.div` and expand to Smith's algorithm. - `basic`: Lower to `complex.div` and expand to the algebraic algorithm. See also the discussion in the following discourse post: https://discourse.llvm.org/t/optimization-of-complex-number-division/83468 --------- Co-authored-by: Tarun Prabhu <tarunprabhu@gmail.com>	2025-07-09 13:43:54 +09:00
Valentin Clement (バレンタインクレメン)	46caad52ac	[flang][cuda] Do not produce data transfer in offloaded do concurrent (#147435 ) If a `do concurrent` loop is offloaded then there should be no CUDA data transfer in it. Update the semantic and lowering to take that into account. `AssignmentChecker` has to be put into a separate pass because the checkers in `SemanticsVisitor` cannot have the same `Enter/Leave` functions. The `DoForallChecker` already has `Eneter/Leave` functions for the `DoConstruct`.	2025-07-08 10:52:15 -07:00
jeanPerier	faefe7cf7d	[flang] add option to generate runtime type info as external (#146071 ) Reland #145901 with a fix for shared library builds. So far flang generates runtime derived type info global definitions (as opposed to declarations) for all the types used in the current compilation unit even when the derived types are defined in other compilation units. It is using linkonce_odr to achieve derived type descriptor address "uniqueness" aspect needed to match two derived type inside the runtime. This comes at a big compile time cost because of all the extra globals and their definitions in apps with many and complex derived types. This patch adds and experimental option to only generate the rtti definition for the types defined in the current compilation unit and to only generate external declaration for the derived type descriptor object of types defined elsewhere. Note that objects compiled with this option are not compatible with object files compiled without because files compiled without it may drop the rtti for type they defined if it is not used in the compilation unit because of the linkonce_odr aspect. I am adding the option so that we can better measure the extra cost of the current approach on apps and allow speeding up some compilation where devirtualization does not matter (and the build config links to all module file object anyway).	2025-06-30 09:58:00 +02:00
Valentin Clement (バレンタインクレメン)	9992668404	[flang][cuda] Add runtime check for passing device arrays (#144003 )	2025-06-12 20:47:58 -07:00
Valentin Clement (バレンタインクレメン)	a3201ce9e1	[flang][cuda] Add option to disable warp function in semantic (#143640 ) These functions are not available in some lower compute capabilities. Add option in the language feature to enforce the semantic check on these.	2025-06-10 22:10:26 -07:00
Slava Zakharin	3f6ae3f0a8	[flang] Added driver options for arrays repacking. (#134002 ) Added options: * -f[no-]repack-arrays * -f[no-]stack-repack-arrays * -frepack-arrays-contiguity=whole/innermost	2025-04-03 10:43:28 -07:00
Tom Eccles	9b2fd1a6ec	[flang][OpenMP] Bump default OpenMP version to 3.1 (#133745 ) Precise OpenMP standards support information is being documented in #132707 Flang now has good support for OpenMP Version 3.1 and earlier.	2025-04-02 10:43:48 +01:00
Kareem Ergawy	5d364481e3	[flang][OpenMP] Upstream first part of `do concurrent` mapping (#126026 ) This PR starts the effort to upstream AMD's internal implementation of `do concurrent` to OpenMP mapping. This replaces #77285 since we extended this WIP quite a bit on our fork over the past year. An important part of this PR is a document that describes the current status downstream, the upstreaming status, and next steps to make this pass much more useful. In addition to this document, this PR also contains the skeleton of the pass (no useful transformations are done yet) and some testing for the added command line options. This looks like a huge PR but a lot of the added stuff is documentation. It is also worth noting that the downstream pass has been validated on https://github.com/BerkeleyLab/fiats. For the CPU mapping, this achived performance speed-ups that match pure OpenMP, for GPU mapping we are still working on extending our support for implicit memory mapping and locality specifiers. PR stack: - https://github.com/llvm/llvm-project/pull/126026 (this PR) - https://github.com/llvm/llvm-project/pull/127595 - https://github.com/llvm/llvm-project/pull/127633 - https://github.com/llvm/llvm-project/pull/127634 - https://github.com/llvm/llvm-project/pull/127635	2025-04-02 09:24:38 +02:00
Slava Zakharin	fd0e20a64b	[flang] Generate fir.pack/unpack_array in Lowering. (#131704 ) Basic generation of array repacking operations in Lowering.	2025-03-18 21:26:33 -07:00
Nikita Popov	f137c3d592	[TargetRegistry] Accept Triple in createTargetMachine() (NFC) (#130940 ) This avoids doing a Triple -> std::string -> Triple round trip in lots of places, now that the Module stores a Triple.	2025-03-12 17:35:09 +01:00
Michael Kruse	b815a3942a	[Flang] Move non-common headers to FortranSupport (#124416 ) Move non-common files from FortranCommon to FortranSupport (analogous to LLVMSupport) such that * declarations and definitions that are only used by the Flang compiler, but not by the runtime, are moved to FortranSupport * declarations and definitions that are used by both ("common"), the compiler and the runtime, remain in FortranCommon * generic STL-like/ADT/utility classes and algorithms remain in FortranCommon This allows a for cleaner separation between compiler and runtime components, which are compiled differently. For instance, runtime sources must not use STL's `<optional>` which causes problems with CUDA support. Instead, the surrogate header `flang/Common/optional.h` must be used. This PR fixes this for `fast-int-sel.h`. Declarations in include/Runtime are also used by both, but are header-only. `ISO_Fortran_binding_wrapper.h`, a header used by compiler and runtime, is also moved into FortranCommon.	2025-02-06 15:29:10 +01:00
Kiran Chandramohan	ce32625966	Reland "[Flang][Driver] Add a flag to control zero initialization" (#123606 ) Reverts llvm/llvm-project#123330	2025-01-21 07:57:44 +00:00
Kiran Chandramohan	8a229f595a	Revert "Revert "Revert "[Flang][Driver] Add a flag to control zero initializa…" (#123330 ) Reverts llvm/llvm-project#123097 Reverting due to buildbot failure https://lab.llvm.org/buildbot/#/builders/89/builds/14577.	2025-01-17 12:27:58 +00:00
Kiran Chandramohan	8c63648117	Revert "Revert "[Flang][Driver] Add a flag to control zero initializa… (#123097 ) …tion of global v…" (#123067)" This reverts commit `44ba43aa2b`. Adds the flag to bbc as well.	2025-01-17 12:14:20 +00:00
Slava Zakharin	9d33874936	[flang] Support -f[no-]realloc-lhs. (#120165 ) -frealloc-lhs is the default. If -fno-realloc-lhs is specified, then an allocatable on the left side of an intrinsic assignment is not implicitly (re)allocated to conform with the right hand side. Fortran runtime will issue an error if there is a mismatch in shape/type/allocation-status.	2024-12-17 09:06:05 -08:00
Yusuke MINATO	a88677edc0	Reland "[flang] Integrate the option -flang-experimental-integer-overflow into -fno-wrapv" (#118933 ) This relands #110063. The performance issue on 503.bwaves_r is found not to be related to the patch, and is resolved by `fbd89bcc` when LTO is enabled.	2024-12-10 16:26:53 +09:00
Ivan R. Ivanov	e7e5541616	[flang] Lower omp.workshare to other omp constructs (#101446 ) Add a new pass that lowers an `omp.workshare` with its binding `omp.workshare.loop_wrapper` loop nests into other OpenMP constructs that can be lowered to LLVM. More specifically, in order to preserve the sequential execution semantics of the code contained, it wraps portions that needs to be executed on a single thread in `omp.single` blocks, converts code that must be parallelized into `omp.wsloop` nests and inserts the appropriate synchronization.	2024-11-19 17:02:16 +09:00
Yusuke MINATO	bd6ab32e6e	Revert "[flang] Integrate the option -flang-experimental-integer-overflow into -fno-wrapv" (#113901 ) Reverts llvm/llvm-project#110063 due to the performance regression on 503.bwaves_r in SPEC2017.	2024-10-28 14:19:20 +00:00
Yusuke MINATO	96bb375f5c	[flang] Integrate the option -flang-experimental-integer-overflow into -fno-wrapv (#110063 ) nsw is now added to do-variable increment when -fno-wrapv is enabled as GFortran seems to do. That means the option introduced by #91579 isn't necessary any more. Note that the feature of -flang-experimental-integer-overflow is enabled by default.	2024-10-25 15:20:23 +09:00
Tarun Prabhu	839344f025	[clang][flang][mlir] Reapply "Support -frecord-command-line option (#102975 )" The underlying issue was caused by a file included in two different places which resulted in duplicate definition errors when linking individual shared libraries. This was fixed in `c3201ddaea` [#109874].	2024-10-14 08:44:24 -06:00
Yusuke MINATO	b91a25ef58	[flang] add nsw to operations in subscripts (#110060 ) This patch adds nsw to operations when lowering subscripts. See also the discussion in the following discourse post. https://discourse.llvm.org/t/rfc-add-nsw-flags-to-arithmetic-integer-operations-using-the-option-fno-wrapv/77584/9	2024-10-03 10:56:01 +09:00
Krzysztof Parzyszek	3b20a833cb	[flang][Semantics] Add LangOptions to SemanticsContext (#110013 ) The motivation for this is to make OpenMP settings visible in the semantic checks (OpenMP version in particular).	2024-09-26 08:56:22 -05:00
Tarun Prabhu	c3201ddaea	[flang][NFC] Refactor to remove .inc file containing shared code (#109874 ) Remove flang/include/flang/Tools/CLOptions.inc - which was included as is in - several places. Move the code in it to header and source files which are used used in the "standard" way. Some minor cleanup such as removing trailing whitespace and excessive newlines and reordering entries alphabetically for files that were modified along the way. Update the documentation that referenced CLOptions.inc.	2024-09-25 14:04:38 -06:00
David Spickett	737c414e1d	Revert "[clang][flang][mlir] Support -frecord-command-line option (#102975 )" This reverts commit `b3533a156d`. It caused test failures in shared library builds: https://lab.llvm.org/buildbot/#/builders/80/builds/3854	2024-09-20 11:30:50 +00:00
Tarun Prabhu	b3533a156d	[clang][flang][mlir] Support -frecord-command-line option (#102975 ) Add support for the -frecord-command-line option that will produce the llvm.commandline metadata which will eventually be saved in the object file. This behavior is also supported in clang. Some refactoring of the code in flang to handle these command line options was carried out. The corresponding -grecord-command-line option which saves the command line in the debug information has not yet been enabled for flang.	2024-09-19 18:28:50 -06:00
Renaud Kauffmann	697bc748f9	Allow disabling of types from the command line (#107126 ) Adding hidden options to disable types through the `TargetCharacteristics`. I am seeing issues when I do this programmatically and would like, for anyone, to have the ability to reproduce them for development and testing purposes. I am planning to file a couple of issues following this patch.	2024-09-04 08:59:55 -07:00
Sergio Afonso	9dadb1f62b	[Flang][OpenMP] Add frontend support for -fopenmp-targets (#100155 ) This patch adds support for the `-fopenmp-targets` option to the `bbc` and `flang -fc1` tools. It adds an `OMPTargetTriples` property to the `LangOptions` structure, which is filled with the triples represented by the compiler option. This is used to initialize the `omp.target_triples` module attribute for later use by lowering stages.	2024-08-02 10:54:15 +01:00
Alexis Perry-Holby	f1d3fe7aae	Add basic -mtune support (#98517 ) Initial implementation for the -mtune flag in Flang. This PR is a clean version of PR #96688, which is a re-land of PR #95043	2024-07-16 16:48:24 +01:00
Ramkumar Ramachandra	db791b278a	mlir/LogicalResult: move into llvm (#97309 ) This patch is part of a project to move the Presburger library into LLVM.	2024-07-02 10:42:33 +01:00
Tarun Prabhu	8dd9494056	Revert "[flang] Add basic -mtune support" (#96678 ) Reverts llvm/llvm-project#95043	2024-06-25 13:25:39 -06:00
Alexis Perry-Holby	a790279bf2	[flang] Add basic -mtune support (#95043 ) This PR adds -mtune as a valid flang flag and passes the information through to LLVM IR as an attribute on all functions. No specific architecture optimizations are added at this time.	2024-06-25 18:39:35 +01:00
Sergio Afonso	b9549261e2	[Flang][OpenMP] Add -fopenmp-force-usm option to flang (#94359 ) This patch enables the `-fopenmp-force-usm` option to be passed to the flang driver, which forwards it to the compiler frontend. This flag, when set, results in the introduction of the `unified_shared_memory` bit to the `omp.requires` attribute of the top-level module operation. This is later combined with any other target device-related REQUIRES clauses that may have been explicitly set in the compilation unit.	2024-06-05 14:43:58 +01:00
Yusuke MINATO	526553b251	[flang] Add nsw flag to do-variable increment with a new option (#91579 ) This patch adds nsw flag to the increment of do-variables when a new option is enabled. NOTE 11.10 in the Fortran 2018 standard says they never overflow. See also the discussion in #74709 and the following discourse post. https://discourse.llvm.org/t/rfc-add-nsw-flags-to-arithmetic-integer-operations-using-the-option-fno-wrapv/77584/5	2024-05-16 13:16:07 +09:00
Valentin Clement (バレンタインクレメン)	e8eb52d167	[flang][cuda] Extends matching distance computation (#91810 ) Extends the computation of the matching distance in the generic resolution to support options described in the table: https://docs.nvidia.com/hpc-sdk/archive/24.3/compilers/cuda-fortran-prog-guide/index.html#cfref-var-attr-unified-data Options are added as language features in the `SemanticsContext` and a flag is added in bbc for testing purpose.	2024-05-13 08:34:36 -07:00
Vijay Kandiah	86ab10c4d9	[flang] Added extension point callbacks to default FIR optimizer pipeline. (#90674 ) This change inserts a few extension point callbacks in the DefaultFIROptimizerPassPipeline. As an example usage of callbacks in the FIR optimizer pipeline, the FIRInlinerCallback is now used to register the default MLIR inliner pass in flang-new, tco, and bbc compilation flows. Other compilation flows can use these callbacks to add extra passes at different points of the pass pipeline. --------- Co-authored-by: Vijay Kandiah <vkandiah@sky6.pgi.net>	2024-05-02 13:15:37 -05:00
Peter Klausler	f2e808932c	[flang] Intermix messages from parser and semantic analysis (#90654 ) When there are one or more fatal error messages produced by the parser, semantic analysis is not performed. But when there are messages produced by the parser and none of them are fatal, those messages are emitted to the user before compilation continues with semantic analysis, and any messages produced by semantics are emitted after the messages from parsing. This can be confusing for the user, as the messages may no longer all be in source file location order. It also makes it difficult to write tests that check for both non-fatal messages from parsing as well as messages from semantics using inline CHECK: or other expected messages in test source code. This patch ensures that if semantic analysis is performed, and non-fatal messages were produced by the parser, that all the messages will be combined and emitted in source file order.	2024-05-01 13:49:33 -07:00
jeanPerier	d0829fbded	[flang] Enable polymorphic lowering by default (#83285 ) Polymorphic entity lowering status is good. The main remaining TODO is to allow lowering of vector subscripted polymorphic entity, but this does not deserve blocking all application using polymorphism. Remove experimental option and enable lowering of polymorphic entity by default.	2024-03-19 11:45:31 +01:00
Krzysztof Parzyszek	47aee8b56d	[flang][OpenMP] Set OpenMP attributes in MLIR module in bbc before lo… (#82774 ) …wering Right now attributes like OpenMP version or target attributes for offload are set after lowering in bbc. The flang frontend sets them before lowering, making them available in the lowering process. This change sets them before lowering in bbc as well.	2024-02-23 15:27:47 -06:00
Krzysztof Parzyszek	a24421fef7	[flang][bbc] Fix dangling reference to `envDefaults` (#82800 ) The lowering bridge stores the evvironment defaults (passed to the constructor) as a reference. In the call to the constructor in bbc, the defaults were passed as `{}`, which creates a temporary whose lifetime ends immediately after the call. The flang driver passes a member of the compilation instance to the constructor, which presumably remains alive long enough, so storing the reference in the bridge is justified. To avoid the dangling reference, create an actual object `envDefaults` in bbc.	2024-02-23 12:51:26 -06:00
agozillon	ec1fcb381d	[Flang][bbc] Prevent bbc -emit-fir command invoking OpenMP passes twice (#80927 ) Currently when the bbc tool is invoked with the emit-fir command the pass pipeline will be invoked twice for verification causing the previously added OpenMP pass pipeline to be invoked multiple times. This change seeks to prevent that from occurring by using a seperate pass manager and run command immediately when it is necessary for the OpenMP passes to be executed.	2024-02-08 14:03:39 +01:00
Vijay Kandiah	369b822184	[flang] Introducing a method to dynamically and conditionally register dialect interfaces. (#80881 ) This change introduces the `addFIRExtensions` method to dynamically and conditionally register dialect interfaces. As a use case of `addFIRExtensions`, this change moves the static registration of `FIRInlinerInterface` out of the constructor of `FIROpsDialect` to be dynamically registered while loading the necessary MLIR dialects required by Flang. This registration of `FIRInlinerInterface` is also guarded by a boolean `addFIRInlinerInterface` which defaults to true. --------- Co-authored-by: Vijay Kandiah <vkandiah@nvidia.com>	2024-02-07 12:39:44 -08:00
Sergio Afonso	837bff11cb	[Flang][Lower] Attach target_cpu and target_features attributes to MLIR functions (#78289 ) This patch forwards the target CPU and features information from the Flang frontend to MLIR func.func operation attributes, which are later used to populate the target_cpu and target_features llvm.func attributes. This is achieved in two stages: 1. Introduce the `fir.target_cpu` and `fir.target_features` module attributes with information from the target machine immediately after the initial creation of the MLIR module in the lowering bridge. 2. Update the target rewrite flang pass to get this information from the module and pass it along to all func.func MLIR operations, respectively as attributes named `target_cpu` and `target_features`. These attributes will be automatically picked up during Func to LLVM dialect lowering and used to initialize the corresponding llvm.func named attributes. The target rewrite and FIR to LLVM lowering passes are updated with the ability to override these module attributes, and the `CodeGenSpecifics` optimizer class is augmented to make this information available to target-specific MLIR transformations. This completes a full flow by which target CPU and features make it all the way from compiler options to LLVM IR function attributes.	2024-01-30 13:45:56 +00:00
Dominik Adamski	f443fbc49b	[Flang][OpenMP][MLIR] Add support for -nogpulib option (#71045 ) If -nogpulib option is passed by the user, then the OpenMP device runtime is not used and we should not emit globals to configure debugging at compile-time for the device runtime. Link to -nogpulib flag implementation for Clang: https://reviews.llvm.org/D125314	2024-01-10 09:38:58 +01:00
jeanPerier	e59e848805	[flang] Updating drivers to create data layout before semantics (#73301 ) Preliminary patch to change lowering/code generation to use llvm::DataLayout information instead of generating "sizeof" GEP (see https://github.com/llvm/llvm-project/issues/71507). Fortran Semantic analysis needs to know about the target type size and alignment to deal with common blocks, and intrinsics like C_SIZEOF/TRANSFER. This information should be obtained from the llvm::DataLayout so that it is consistent during the whole compilation flow. This change is changing flang-new and bbc drivers to: 1. Create the llvm::TargetMachine so that the data layout of the target can be obtained before semantics. 2. Sharing bbc/flang-new set-up of the SemanticConstext.targetCharateristics from the llvm::TargetMachine. For now, the actual part that set-up the Fortran type size and alignment from the llvm::DataLayout is left TODO so that this change is mostly an NFC impacting the drivers. 3. Let the lowering bridge set-up the mlir::Module datalayout attributes since it is doing it for the target attribute, and that allows the llvm data layout information to be available during lowering. For flang-new, the changes are code shuffling: the `llvm::TargetMachine` instance is moved to `CompilerInvocation` class so that it can be used to set-up the semantic contexts. `setMLIRDataLayout` is moved to `flang/Optimizer/Support/DataLayout.h` (it will need to be used from codegen pass for fir-opt target independent testing.)), and the code setting-up semantics targetCharacteristics is moved to `Tools/TargetSetup.h` so that it can be shared with bbc. As a consequence, LLVM targets must be registered when running semantics, and it is not possible to run semantics for a target that is not registered with the -triple option (hence the power pc specific modules can only be built if the PowerPC target is available.	2023-12-06 14:20:06 +01:00
jeanPerier	aa8af04d0c	[flang] Switch lowering to use the HLFIR step by default (#72090 ) Patch 3/3 of the transition step 1 described in https://discourse.llvm.org/t/rfc-enabling-the-hlfir-lowering-by-default/72778/7 This patch changes bbc and flang-new driver to use HLFIR lowering by default. `-hlfir=false` can be used with bbc and `-flang-deprecated-no-hlfir` with flang-new to get the previous default lowering behavior, but these options will only be available for a limited period of time. If any user needs these options to workaround bugs, they should open an issue against flang in llvm github repo so that the regression can be fixed in HLFIR.	2023-11-15 10:00:34 +01:00
Peter Klausler	1c91d9bdea	[flang] Ensure that portability warnings are conditional (#71857 ) Before emitting a warning message, code should check that the usage in question should be diagnosed by calling ShouldWarn(). A fair number of sites in the code do not, and can emit portability warnings unconditionally, which can confuse a user that hasn't asked for them (-pedantic) and isn't terribly concerned about portability to other compilers. Add calls to ShouldWarn() or IsEnabled() around messages that need them, and add -pedantic to tests that now require it to test their portability messages, and add more expected message lines to those tests when -pedantic causes other diagnostics to fire.	2023-11-13 16:13:50 -08:00
Valentin Clement (バレンタインクレメン)	f0812480d2	[flang][openacc] Support fixed form sentinel format (#69970 ) The OpenACC standard mentions directive format for fixed form source files. The following sentinels are accepted: ``` !$acc directive-name [clause-list] c$acc directive-name [clause-list] *$acc directive-name [clause-list] ``` Update the parser to accepts these. A new option is added to `bbc` so the change can be tested.	2023-10-23 14:27:39 -07:00
Mats Petersson	7006b90a06	[flang][NFCI]Use config structure for MLIR to LLVM pass creation (#67792 ) The CreateMLIRToLLVMPassPipeline function has quite a few arguments, all of which has default values. Create a struct, with a constructor for the default values, and pass that struct instead. Re-arrange a few include files to make everything available. No functional change intended.	2023-10-03 14:01:50 +01:00
Sergio Afonso	fb4bdf361f	[Flang][OpenMP] Run Flang-specific OpenMP MLIR passes in bbc This patch moves the group of OpenMP MLIR passes using after lowering of Fortran to MLIR into a pipeline to be shared by `flang-new` and `bbc`. Currently, the `bbc` tool does not produce the expected FIR for offloading- enabled OpenMP codes due to not running these passes. Unit tests exercising these passes are updated to check `bbc` output as well.	2023-09-18 14:10:04 +01:00
Kelvin Li	00769d69fb	[flang] Add -fppc-native-vector-element-order option to control the element order in PowerPC vector types This patch also adds a LIT test for the vec_cvf intrinsic that can be affected by the option. Co-authored-by: Mark Danial <Mark.Danial@ibm.com> Co-authored-by: Daniel Chen <cdchen@ca.ibm.com> Differential Revision: https://reviews.llvm.org/D155852	2023-08-04 17:11:30 -04:00

1 2

93 Commits