intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-22 07:01:03 +08:00

Author	SHA1	Message	Date
serge-sans-paille	afe8b93ffd	[clang] Avoid memcopy for small structure with padding under -ftrivial-auto-var-init (#71677 ) Recommit of `0d2860b795` with extra test cases fixed.	2023-11-25 00:11:20 +01:00
Raymond Chang	85b2e9c022	[Clang][OpenMP] Emit unsupported directive error (#70233 ) Hello! This PR fixes #63871. Clang should no longer crash and instead emits an error message. Below is an example of the new error message: ``` ~/dev/fork-llvm-project omp_dispatch_unimpl ❯ ./install/bin/clang -fopenmp -c -emit-llvm -Xclang -disable-llvm-passes test.c test.c:6:5: error: cannot compile this OpenMP dispatch directive yet 6 \| #pragma omp dispatch \| ^~~~~~~~~~~~~~~~~~~~ 1 error generated. ```	2023-11-24 16:28:32 -05:00
Florian Hahn	419a4e41fc	Revert "[clang] Avoid memcopy for small structure with padding under -ftrivial-auto-var-init (#71677 )" This reverts commit `fe5c360a9a`. The commit causes the tests below to fail on many buildbots, e.g. https://lab.llvm.org/buildbot/#/builders/245/builds/17047 Clang :: CodeGen/aapcs-align.cpp Clang :: CodeGen/aapcs64-align.cpp	2023-11-23 20:18:55 +00:00
serge-sans-paille	fe5c360a9a	[clang] Avoid memcopy for small structure with padding under -ftrivial-auto-var-init (#71677 ) Recommit of `0d2860b795` with extra test cases fixed.	2023-11-23 17:37:03 +01:00
Shraiysh	8840eb3fb5	[flang][OpenMP] Add semantic check for declare target (#72770 )	2023-11-22 16:13:14 -06:00
Joseph Huber	52204a29ab	[Offload] Initial support for registering offloading entries on COFF targets (#72697 ) Summary: This patch provides the initial support to allow handling the new driver's offloading entries. Normally, the ELF target can emit varibles at C-identifier named sections and the linker will provide a pointer to the section. For COFF target, instead the linker merges sections containing a `$` in alphabetical order. We thus can emit these variables at sections and then emit two variables that are guaranteed to be sorted before and after the others to traverse it. Previous patches consolidated the handling of offloading entries so that this patch more easily can handle mapping them to the appropriate section. Ideally, the only remaining step to allow the new driver to run on Windows targets is to accurately map the following `ld.lld` arguments to their `llvm-link` equivalents. These are used inside the linker-wrapper, so we should simply need to remap the arguments to the same functionality if possible. ``` -o, -output -l, --library -L, --library-path -v, --version -rpath -whole-archive, -no-whole-archive ``` I have not tested this at runtime as I do not have access to a windows machine. This patch was adapted from some initial efforts in https://reviews.llvm.org/D137470.	2023-11-21 06:48:34 -06:00
Krzysztof Parzyszek	ddfed815c9	Revert "[OpenMP] atomic compare fail : Parser & AST support" This reverts commit `edd675ac28`. This breaks clang build where every component is a shared library. The file clang/lib/Basic/OpenMPKinds.cpp, which is a part of libclangBasic.so, uses `getOpenMPClauseName` which isn't: /usr/bin/ld: CMakeFiles/obj.clangBasic.dir/OpenMPKinds.cpp.o: in functio n `clang ::getOpenMPSimpleClauseTypeName(llvm::omp::Clause, unsigned int )': OpenMPKinds.cpp:(.text._ZN5clang29getOpenMPSimpleClauseTypeNameEN4llvm3o mp6ClauseEj+0x9b): undefined reference to `llvm::omp::getOpenMPClauseNam e(llvm::omp::Clause)'	2023-11-20 10:48:06 -06:00
Aaron Ballman	8bd06d5b65	[C23] Complete support for WG14 N2508 (#71398 ) In Clang 16, we implemented the ability to add a label at the end of a compound statement. These changes complete the implementation by allowing a label to be followed by a declaration in C. Note, this seems to have fixed an issue with some OpenMP stand-alone directives not being properly diagnosed as per: https://www.openmp.org/spec-html/5.1/openmpsu19.html#x34-330002.1.3 (The same requirement exists in OpenMP 5.2 as well.)	2023-11-20 10:52:11 -05:00
Sunil Kuravinakop	edd675ac28	[OpenMP] atomic compare fail : Parser & AST support Diff Revision: https://reviews.llvm.org/D123235	2023-11-20 03:05:31 -06:00
Antonio Frighetto	970bf07d0b	[clang][CodeGen] Ensure consistent `mustprogress` attribute emission Emission of `mustprogress` attribute previously occurred only within `EmitFunctionBody`, after generating the function body. Other routines for function body creation may lack the attribute, potentially leading to suboptimal optimizations later in the pipeline. Attribute emission is now anticipated prior to generating the function body. Fixes: https://github.com/llvm/llvm-project/issues/69833.	2023-11-11 09:43:03 +01:00
Johannes Doerfert	7318fe6334	[OpenMP][FIX] Ensure device reduction geps work for multi-var reductions If we have more than one reduction variable we need to be consistent wrt. indexing. In `3de645efe3` we broke this as the buffer type was reduced to a singleton but the index computation was not adjusted to account for that offset. This fixes it by interleaving the reduction variables properly in a array-of-struct style. We can revert it back to struct-of-array in a follow up if turns out to be a problem. I doubt it since half the accesses should benefit from the locallity this layout offers and only the other half were consecutive before.	2023-11-10 14:34:46 -08:00
Joseph Huber	237adfca4e	[OpenMP] Rework handling of global ctor/dtors in OpenMP (#71739 ) Summary: This patch reworks how we handle global constructors in OpenMP. Previously, we emitted individual kernels that were all registered and called individually. In order to provide more generic support, this patch moves all handling of this to the target backend and the runtime plugin. This has the benefit of supporting the GNU extensions for constructors an destructors, removing a class of failures related to shared library destruction order, and allows targets other than OpenMP to use the same support without needing to change the frontend. This is primarily done by calling kernels that the backend emits to iterate a list of ctor / dtor functions. For x64, this is automatic and we get it for free with the standard `dlopen` handling. For AMDGPU, we emit `amdgcn.device.init` and `amdgcn.device.fini` functions which handle everything atuomatically and simply need to be called. For NVPTX, a patch https://github.com/llvm/llvm-project/pull/71549 provides the kernels to call, but the runtime needs to set up the array manually by pulling out all the known constructor / destructor functions. One concession that this patch requires is the change that for GPU targets in OpenMP offloading we will use `llvm.global_dtors` instead of using `atexit`. This is because `atexit` is a separate runtime function that does not mesh well with the handling we're trying to do here. This should be equivalent in all cases except for cases where we would need to destruct manually such as: ``` struct S { ~S() { foo(); } }; void foo() { static S s; } ``` However this is broken in many other ways on the GPU, so it is not regressing any support, simply increasing the scope of what we can handle. This changes the handling of ctors / dtors. This patch now outputs a information message regarding the deprecation if the old format is used. This will be completely removed in a later release. Depends on: https://github.com/llvm/llvm-project/pull/71549	2023-11-10 14:53:53 -06:00
Baodi	df2725f3d5	[Clang][OpenMP] Return empty QualType when a negative array was created (#71552 ) Fix #69198	2023-11-09 20:42:59 -05:00
Mitch Phillips	a141a9fa97	Revert "[OpenMP] atomic compare fail : Parser & AST support" This reverts commit `086b65340c`. Reason: Broke under -Werror. More details in https://reviews.llvm.org/D123235	2023-11-08 11:20:17 +01:00
Sunil Kuravinakop	086b65340c	[OpenMP] atomic compare fail : Parser & AST support This is a support for " #pragma omp atomic compare fail ". It has Parser & AST support for now. Reviewed By: tianshilei1992, ABataev Differential Revision: https://reviews.llvm.org/D123235	2023-11-07 16:57:50 -06:00
Johannes Doerfert	3de645efe3	[OpenMP][NFC] Split the reduction buffer size into two components Before we tracked the size of the teams reduction buffer in order to allocate it at runtime per kernel launch. This patch splits the number into two parts, the size of the reduction data (=all reduction variables) and the (maximal) length of the buffer. This will allow us to allocate less if we need less, e.g., if we have less teams than the maximal length. It also allows us to move code from clangs codegen into the runtime as we now know how large the reduction data is.	2023-11-06 11:50:41 -08:00
Johannes Doerfert	921bd29913	[OpenMP] Remove alignment for global <-> local reduction functions The alignment did likely not help much but increases the memory requirement. Note that half of the affected accesses are all performed by a single thread in each block. The reads are by consecutive threads in a single block.	2023-11-06 11:50:41 -08:00
Johannes Doerfert	d3e7a48cbd	[OpenMP][NFC] Remove a no-op function	2023-11-03 10:28:36 -07:00
Dominik Adamski	2b1948c2be	[NFC][OpenMP][Clang]Update OpenMP clang tests Replace hardcoded constants by regular expressions	2023-11-03 05:13:06 -05:00
Johannes Doerfert	f9a89e6b9c	[OpenMP][FIX] Allocate per launch memory for GPU team reductions (#70752 ) We used to perform team reduction on global memory allocated in the runtime and by clang. This was racy as multiple instances of a kernel, or different kernels with team reductions, would use the same locations. Since we now have the kernel launch environment, we can allocate dynamic memory per-launch, allowing us to move all the state into a non-racy place. Fixes: https://github.com/llvm/llvm-project/issues/70249	2023-11-01 11:11:48 -07:00
Johannes Doerfert	b8cbc5c02c	[OpenMP] Introduce the KernelLaunchEnvironment as implicit argument (#70401 ) The KernelEnvironment is for compile time information about a kernel. It allows the compiler to feed information to the runtime. The KernelLaunchEnvironment is for dynamic information per kernel launch. It allows the rutime to feed information to the kernel that is not shared with other invocations of the kernel. The first use case is to replace the globals that synchronize teams reductions with per-launch versions. This allows concurrent teams reductions. More uses cases will follow, e.g., per launch memory pools. Fixes: https://github.com/llvm/llvm-project/issues/70249	2023-10-31 19:38:43 -07:00
Nathan Sidwell	7b2e0095bc	[clang] Robustify openmp test (#69739 ) If the source path contains 'alias' this would spuriously fail. Be more specific about not wanting global aliases	2023-10-30 07:39:45 -04:00
Johannes Doerfert	d346c82435	[OpenMP] Associate the KernelEnvironment with the GenericKernelTy (#70383 ) By associating the kernel environment with the generic kernel we can access middle-end information easily, including the launch bounds ranges that are acceptable. By constraining the number of threads accordingly, we now obey the user-provided bounds that were passed via attributes.	2023-10-29 11:35:34 -07:00
Johannes Doerfert	31b91213bd	[OpenMP] Unify the min/max thread/teams pathways We used to pass the min/max threads/teams values through different paths from the frontend to the middle end. This simplifies the situation by passing the values once, only when we will create the KernelEnvironment, which contains the values. At that point we also manifest the metadata, as appropriate. Some footguns have also been removed, e.g., our target check is now triple-based, not calling convention-based, as the latter is dependent on the ordering of operations. The types of the values have been unified to int32_t.	2023-10-29 10:53:20 -07:00
Amara Emerson	1a2e77cf9e	Revert "Revert "Inlining: Run the legacy AlwaysInliner before the regular inliner."" This reverts commit `86bfeb906e`. This is a long time coming re-application that was originally reverted due to regressions, unrelated to the actual inlining change. These regressions have since been fixed due to another long-in-the-making change of `a66051c6` landing. Original commit message for reference: --- We have several situations where it's beneficial for code size to ensure that every call to always-inline functions are inlined before normal inlining decisions are made. While the normal inliner runs in a "MandatoryOnly" mode to try to do this, it only does it on a per-SCC basis, rather than the whole module. Ensuring that all mandatory inlinings are done before any heuristic based decisions are made just makes sense. Despite being referred to the "legacy" AlwaysInliner pass, it's already necessary for -O0 because the CGSCC inliner is too expensive in compile time to run at -O0. This also fixes an exponential compile time blow up in https://github.com/llvm/llvm-project/issues/59126 Differential Revision: https://reviews.llvm.org/D143624 ---	2023-10-28 23:21:11 -07:00
Chi Chun Chen	391181062f	Revert "[OpenMP] Patch for Support to loop bind clause : Checking Parent Region" This reverts commit `85f6b2fac9`.	2023-10-26 16:57:36 -05:00
Johannes Doerfert	0ba57c8bba	[OpenMP] Pass min/max thread and team count to the OMPIRBuilder (#70247 ) We now provide the information about the min/max thread and team count from to the OMPIRBuilder, no matter what the source was. That means we unify `thread_limit`, `num_teams`, `num_threads` handling with the target specific attriutes (`__launch_bounds__` and `amdgpu_flat_work_group_size`). This is in preparation to pass the values to the runtime, and to allow the middle-end (OpenMP-opt) to tighten the values if it seems appropriate. There is no "real" change after this commit.	2023-10-26 14:45:07 -07:00
Johannes Doerfert	57cebc709d	[OpenMP][NFC] Fix test (remove wrong autogen header)	2023-10-26 14:38:24 -07:00
Johannes Doerfert	289a0f255d	[OpenMP] Remove SPMD specific handling during globalization Globalization and SPMD are different things that used to be conflated. Some leftover crossover interactions remain, trying to remove them now.	2023-10-26 14:38:23 -07:00
Johannes Doerfert	331085b469	[OpenMP][NFC] Clang format some tests	2023-10-26 14:38:23 -07:00
David Pagan	52315f9b75	[clang][OpenMP] Fix target data if/logical expression assert fail (#70268 ) Fixed assertion failure Basic Block in function 'main' does not have terminator! label %land.end caused by premature setting of CodeGenIP upon entry to emitTargetDataCalls, where subsequent evaluation of logical expression created new basic blocks, leaving CodeGenIP pointing to the wrong basic block. CodeGenIP is now set near the end of the function, just prior to generating a comparison of the logical expression result (from the if clause) which uses CodeGenIP to insert new IR.	2023-10-26 13:19:37 -07:00
Jessica Clarke	f9ead46931	[AST] Only dump desugared type when visibly different (#65214 ) These are an artifact of how types are structured but serve little purpose, merely showing that the type is sugared in some way. For example, ElaboratedType's existence means struct S gets printed as 'struct S':'struct S' in the AST, which is unnecessary visual clutter. Note that skipping the second print when the types have the same string matches what we do for diagnostics, where the aka will be skipped.	2023-10-26 19:28:28 +01:00
Sunil Kuravinakop	85f6b2fac9	[OpenMP] Patch for Support to loop bind clause : Checking Parent Region Differential revision: https://reviews.llvm.org/D158266	2023-10-26 05:08:41 -05:00
Fazlay Rabbi	9237ce4613	[OpenMP 5.2] Deprecate old syntax of linear clause (#70152 ) The syntax of the linear clause that specifies its argument and linear-modifier as linear-modifier(list) was deprecated since OpenMP 5.2 and the step modifier was added for specifying the linear step. Reference: OpenMP 5.2 Spec, Page 627, Line 15	2023-10-25 15:36:36 -07:00
Fazlay Rabbi	567a660a25	[OpenMP 5.2] Initial parsing and semantic analysis suppport for 'step' modifier on 'linear' clause Reference: (1) OpenMP 5.2 Specification - Seciton 5.4.6 Differential revision: https://reviews.llvm.org/D159546	2023-10-24 15:04:23 -07:00
Fazlay Rabbi	86b4388c90	[OpenMP 5.2] Deprecate syntax of map modifiers without comma separators (#69534 ) The syntax of modifiers without comma separators in the map clause was deprecated in OpenMP 5.2. Reference: OpenMP 5.2 Spec, page 627, line 19	2023-10-24 10:17:52 -07:00
Aaron Ballman	84a3aadf0f	Diagnose use of VLAs in C++ by default Reapplication of `7339c0f782` with a fix for a crash involving arrays without a size expression. Clang supports VLAs in C++ as an extension, but we currently only warn on their use when you pass -Wvla, -Wvla-extension, or -pedantic. However, VLAs as they're expressed in C have been considered by WG21 and rejected, are easy to use accidentally to the surprise of users (e.g., https://ddanilov.me/default-non-standard-features/), and they have potential security implications beyond constant-size arrays (https://wiki.sei.cmu.edu/confluence/display/c/ARR32-C.+Ensure+size+arguments+for+variable+length+arrays+are+in+a+valid+range). C++ users should strongly consider using other functionality such as std::vector instead. This seems like sufficiently compelling evidence to warn users about VLA use by default in C++ modes. This patch enables the -Wvla-extension diagnostic group in C++ language modes by default, and adds the warning group to -Wall in GNU++ language modes. The warning is still opt-in in C language modes, where support for VLAs is somewhat less surprising to users. RFC: https://discourse.llvm.org/t/rfc-diagnosing-use-of-vlas-in-c/73109 Fixes https://github.com/llvm/llvm-project/issues/62836 Differential Revision: https://reviews.llvm.org/D156565	2023-10-20 13:10:03 -04:00
Aaron Ballman	f5043f46c0	Revert "Diagnose use of VLAs in C++ by default" This reverts commit `7339c0f782`. Breaks bots: https://lab.llvm.org/buildbot/#/builders/139/builds/51875 https://lab.llvm.org/buildbot/#/builders/164/builds/45262	2023-10-20 10:00:18 -04:00
Aaron Ballman	7339c0f782	Diagnose use of VLAs in C++ by default Clang supports VLAs in C++ as an extension, but we currently only warn on their use when you pass -Wvla, -Wvla-extension, or -pedantic. However, VLAs as they're expressed in C have been considered by WG21 and rejected, are easy to use accidentally to the surprise of users (e.g., https://ddanilov.me/default-non-standard-features/), and they have potential security implications beyond constant-size arrays (https://wiki.sei.cmu.edu/confluence/display/c/ARR32-C.+Ensure+size+arguments+for+variable+length+arrays+are+in+a+valid+range). C++ users should strongly consider using other functionality such as std::vector instead. This seems like sufficiently compelling evidence to warn users about VLA use by default in C++ modes. This patch enables the -Wvla-extension diagnostic group in C++ language modes by default, and adds the warning group to -Wall in GNU++ language modes. The warning is still opt-in in C language modes, where support for VLAs is somewhat less surprising to users. RFC: https://discourse.llvm.org/t/rfc-diagnosing-use-of-vlas-in-c/73109 Fixes https://github.com/llvm/llvm-project/issues/62836 Differential Revision: https://reviews.llvm.org/D156565	2023-10-20 09:50:21 -04:00
Joseph Huber	85feb9347f	[OpenMP] Fix setting visibility on declare target variables Summary: A previous patch changed the logic to force external visibliity on declare target variables. This is because they need to be exported in the dynamic symbol table to be usable as the standard depicts. However, the logic was always setting the visibility to `protected`, which would override some symbols. For example, when calling `libc` functions for CPU offloading. This patch changes the logic to only fire if the variable has hidden visibliity to start with.	2023-10-09 07:56:43 -05:00
Joseph Huber	1d959f9327	[OpenMP] Prevent AMDGPU from overriding visibility on DT_nohost variables (#68264 ) Summary: There's some logic in the AMDGPU target that manually resets the requested visibility of certain variables. This was triggering when we set a constant variable in OpenMP. However, we shouldn't do this for OpenMP when the variable has the `nohost` type. That implies that the variable is not visible to the host and therefore does not need to be visible, so we should respect the original value of it.	2023-10-05 17:10:03 -05:00
Shilei Tian	d6254e1b2e	Introduce the initial support for OpenMP kernel language (#66844 ) This patch starts the support for OpenMP kernel language, basically to write OpenMP target region in SIMT style, similar to kernel languages such as CUDA. What included in this first patch is the `ompx_bare` clause for `target teams` directive. When `ompx_bare` exists, globalization is disabled such that local variables will not be globalized. The runtime init/deinit function calls will not be emitted. That being said, almost all OpenMP executable directives are not supported in the region, such as parallel, task. This patch doesn't include the Sema checks for that, so the use of them is UB. Simple directives, such as atomic, can be used. We provide a set of APIs (for C, they are prefix with `ompx_`; for C++, they are in `ompx` namespace) to get thread id, block id, etc. Please refer to https://tianshilei.me/wp-content/uploads/llvm-hpc-2023.pdf for more details.	2023-10-05 17:38:06 -04:00
JP Lehr	1bff5f6d0b	Revert "[OpenMP] Introduce the initial support for OpenMP kernel language (#66844 )" This reverts commit `e997dca333`.	2023-09-29 15:35:10 -05:00
JP Lehr	27770a7de1	Revert "[NFC][Clang][OpenMP] Fix the test issue of incompatible pointer size" This reverts commit `7a80a5d3b7`.	2023-09-29 15:21:32 -05:00
Shilei Tian	7a80a5d3b7	[NFC][Clang][OpenMP] Fix the test issue of incompatible pointer size	2023-09-29 14:38:34 -04:00
Shilei Tian	e997dca333	[OpenMP] Introduce the initial support for OpenMP kernel language (#66844 ) This patch starts the support for OpenMP kernel language, basically to write OpenMP target region in SIMT style, similar to kernel languages such as CUDA. What included in this first patch is the `ompx_bare` clause for `target teams` directive. When `ompx_bare` exists, globalization is disabled such that local variables will not be globalized. The runtime init/deinit function calls will not be emitted. That being said, almost all OpenMP executable directives are not supported in the region, such as parallel, task. This patch doesn't include the Sema checks for that, so the use of them is UB. Simple directives, such as atomic, can be used. We provide a set of APIs (for C, they are prefix with `ompx_`; for C++, they are in `ompx` namespace) to get thread id, block id, etc. For more details, you can refer to https://tianshilei.me/wp-content/uploads/llvm-hpc-2023.pdf.	2023-09-29 13:11:09 -04:00
Richard Smith	9408500662	Add -fclang-abi-compat=latest to a bunch of tests for manglings that changed since v17. Per discussion on https://reviews.llvm.org/D147655, various vendors use a different default.	2023-09-25 16:34:38 -07:00
Richard Smith	4b163e343c	Implement mangling rules for C++20 concepts and requires-expressions. This implements proposals from: - https://github.com/itanium-cxx-abi/cxx-abi/issues/24: mangling for constraints, requires-clauses, requires-expressions. - https://github.com/itanium-cxx-abi/cxx-abi/issues/31: requires-clauses and template parameters in a lambda expression are mangled into the <lambda-sig>. - https://github.com/itanium-cxx-abi/cxx-abi/issues/47 (STEP 3): mangling for template argument is prefixed by mangling of template parameter declaration if it's not "obvious", for example because the template parameter is constrained (we already implemented STEP 1 and STEP 2). This changes the manglings for a few cases: - Functions and function templates with constraints. - Function templates with template parameters with deduced types: `typename<auto N> void f();` - Function templates with template template parameters where the argument has a different template-head: `template<template<typename...T>> void f(); f<std::vector>();` In each case where a mangling changed, the change fixes a mangling collision. Note that only function templates are affected, not class templates or variable templates, and only new constructs (template parameters with deduced types, constrained templates) and esoteric constructs (templates with template template parameters with non-matching template template arguments, most of which Clang still does not accept by default due to `-frelaxed-template-template-args` not being enabled by default), so the risk to ABI stability from this change is relatively low. Nonetheless, `-fclang-abi-compat=17` can be used to restore the old manglings for cases which we could successfully but incorrectly mangle before. Fixes #48216, #49884, #61273 Reviewed By: erichkeane, #libc_abi Differential Revision: https://reviews.llvm.org/D147655	2023-09-20 12:38:15 -07:00
Aaron Jarmusch	131ba0ae01	Revert "[Clang][OpenMP] Clang adding the addrSpace according to DataLayout fix (#65483 )" This reverts commit `e831a32c93`.	2023-09-12 22:46:09 +00:00
Aaron Jarmusch	e831a32c93	[Clang][OpenMP] Clang adding the addrSpace according to DataLayout fix (#65483 ) Fix for an issue where clang was not adding the address space according to the data layout, instead was using the default which resulted in a crash at times. The fix includes changes to the cases of LargeCapMemAlloc and CGroupMemAlloc where we are setting the AddrSpace according to the DataLayout.	2023-09-12 15:44:39 -04:00

1 2 3 4 5 ...

2221 Commits