intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-01-25 10:55:58 +08:00

Author	SHA1	Message	Date
Joseph Huber	e537c83975	[libc] Add basic support for calling host functions from the GPU This patch adds the `rpc_host_call` function as a GPU extension. This is exported from the `libc` project to use the RPC interface to call a function pointer via RPC and copying the arguments by-value. The interface can only support a single void pointer argument much like pthreads. The function call here is the bare-bones version of what's required for OpenMP reverse offloading. Full support will require interfacing with the mapping table, nowait support, etc. I decided to test this interface in `libomptarget` as that will be the primary consumer and it would be more difficult to make a test in `libc` due to the testing infrastructure not really having a concept of the "host" as it runs directly on the GPU as if it were a CPU target. Reviewed By: jplehr Differential Revision: https://reviews.llvm.org/D155003	2023-07-19 10:11:46 -05:00
Johannes Doerfert	f914208c43	[OpenMP][NFCI] Avoid storing non-constant values in ICV If we store a constant in an ICV it is easier for the optimizer to propagate it. Since we often use the full block for the thread limit and the parallel team size, we can instead replace that dynamic value with a constant that otherwise cannot occur, here 0.	2023-07-18 16:50:50 -07:00
Johannes Doerfert	88a68de14c	[OpenMP][NFCI] Split assertion message from assertion expression We ended up with `llvm.assume(icmp ne ptr as(4) null, as(4) @str)` because the string in address space 4 was not known to be non-null. There is no need to create these assumes.	2023-07-18 16:50:50 -07:00
Matt Arsenault	e9725628ba	libomptarget: Try to fix dependency tracking for llvm tools	2023-07-18 06:21:33 -04:00
Jay Foad	92542f2a40	[AMDGPU] Add targets gfx1150 and gfx1151 This is the target definition only. Currently they are treated the same as GFX 11.0.x. Differential Revision: https://reviews.llvm.org/D155429	2023-07-17 13:06:12 +01:00
Joseph Huber	2dbc532672	[OMPT] Fix use of 'DEBUG_PREFIX' in the OMPT headers This is the only place that defines this prefix in a header file and was thus overriding and redefining other users of it. If we must use it in a header file, at least respect its old values. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D155316	2023-07-14 15:58:24 -05:00
Adrian Munera	739164c024	[OpenMP] Build device runtimes for sm_87 Summary: These were missing from the list of all architectures. Differential Revision: https://reviews.llvm.org/D155287	2023-07-14 13:49:27 -05:00
Joseph Huber	48da62617e	[OpenMP] Add documentation on using the `libc` in OpenMP This points users to the `libc` documentation and explains the basics of how it's used inside the runtime. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D155318	2023-07-14 13:28:29 -05:00
Carlos Eduardo Seo	7d0df44d39	[OpenMP] Disable veccopy tests for AArch64 Like for x86_64-linux-gnu, these need to be disabled for aarch64-linux-gnu. Differential Revision: https://reviews.llvm.org/D155109	2023-07-12 23:53:14 +00:00
Joseph Huber	1776dc8124	[Libomptarget][Obvious] Fix uninitialized pointer Summary: This pointer was not initialized to null which meant that it would be erroneously deleted by plugins that were not in use.	2023-07-11 15:41:46 -05:00
Joseph Huber	8a0763f19c	[Libomptarget] Remove RPCHandleTy indirection The 'RPCHandleTy' was intended to capture the intention that a specific device owns its slot in the RPC server. However, this required creating a temporary store to hold these pointers. This was causing really weird spurious failure due to undefined behaviour in the order of library teardown. For example, the x64 plugin would be torn down, set this to some invalid memory, and then the CUDA plugin would crash. Rather than spend the time to fully diagnose this problem I found it pertinent to simply remove the failure mode. This patch removes this indirection so now the usage of the RPC server must always be done with the intended device. This just requires some extra handling for the AMDGPU indirection where we need to store a reference to the device. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D154971	2023-07-11 10:54:40 -05:00
Joachim Jenke	81bc7cf609	[OpenMP][NFC] lit: Allow setting default environment variables for test Add CHECK_OPENMP_ENV environment variable which will be passed to environment variables for test (make check-* target). This provides a handy way to exercise various openmp code with different settings during development. For example, to change default barrier pattern: ``` $ env CHECK_OPENMP_ENV="KMP_FORKJOIN_BARRIER_PATTERN=hier,hier \ KMP_PLAIN_BARRIER_PATTERN=hier,hier \ KMP_REDUCTION_BARRIER_PATTERN=hier,hier" \ ninja check-openmp ``` Even with this, each test can set appropriate environment variables if needed as before. Also, this commit adds missing documentation about how to run tests in README. Patch provided by t-msn Differential Revision: https://reviews.llvm.org/D122645	2023-07-11 15:00:40 +02:00
Michael Halkenhaeuser	142faf56f5	[OpenMP] [OMPT] [amdgpu] [5/8] Implemented device init/fini/load callbacks Added support in the generic plugin to invoke registered callbacks. Depends on D124070 Patch from John Mellor-Crummey <johnmc@rice.edu> (With contributions from Dhruva Chakrabarti <Dhruva.Chakrabarti@amd.com>) Differential Revision: https://reviews.llvm.org/D124652	2023-07-11 07:13:22 -04:00
Carlos Eduardo Seo	d9a2b83dcd	[OpenMP] Fix note section type notation for AArch64 Like Arm, AArch64 also uses "%" instead of "@" for note section types. Differential Revision: https://reviews.llvm.org/D154859	2023-07-10 17:21:54 +00:00
Shao-Ce SUN	048423702d	[OpenMP] Fix build warnings ``` llvm-project/openmp/libomptarget/src/private.h:260:9: warning: 'DEBUG_PREFIX' macro redefined [-Wmacro-redefined] #define DEBUG_PREFIX GETNAME(TARGET_NAME) ^ llvm-project/openmp/libomptarget/include/ompt_device_callbacks.h:22:9: note: previous definition is here #define DEBUG_PREFIX "OMPT" ^ 1 warning generated. ``` ``` llvm-project/openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.cpp:458:14: warning: moving a local object in a return statement prevents copy elision [-Wpessimizing-move] return std::move(Err); ^ llvm-project/openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.cpp:458:14: note: remove std::move call here return std::move(Err); ^~~~~~~~~~ ~ llvm-project/openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.cpp:552:12: warning: moving a local object in a return statement prevents copy elision [-Wpessimizing-move] return std::move(Err); ^ llvm-project/openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.cpp:552:12: note: remove std::move call here return std::move(Err); ^~~~~~~~~~ ~ 2 warnings generated. ``` Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D154787	2023-07-09 22:12:23 +08:00
Elliot Goodrich	a11efd4926	Add missing StringExtras.h includes In preparation for removing the `#include "llvm/ADT/StringExtras.h"` from the header to source file of `llvm/Support/Error.h`, first add in all the missing includes that were previously included transitively through this header. This is fixing all files missed in `b0abd4893f` and `39d8e6e22c`. Differential Revision: https://reviews.llvm.org/D154763	2023-07-08 20:06:21 +01:00
Joachim Jenke	820be30ad9	[OpenMP][OMPT] Introduce VERBOSE_INIT in ompt-multiplex.h OpenMP 5.1 added OMP_TOOL_VERBOSE_INIT. This env variable is extremely helpful to understand the issue when loading a tool fails unexpectedly (e.g., errors from dlopen, when the libc available at runtime is older than libc used at compile time of the tool -> missed to load the right gcc module). This patch replicates the verbose init code from libomp watching out for a different env variable. Similar to CLIENT_TOOL_LIBRARIES_VAR, a tool can define the name of the env var by defining CLIENT_TOOL_VERBOSE_INIT_VAR before including ompt-multiplex.h. Alternatively, a tool can define OMPT_MULTIPLEX_TOOL_NAME to specify the tool name which will be the prefix for both _TOOL_LIBRARIES and _VERBOSE_INIT var. Finally, if none of the two macros is defined, the header will print a compiler warning and look at OMP_TOOL_VERBOSE_INIT. Patch prepared by Semih Burak Differential Revision: https://reviews.llvm.org/D112809	2023-07-08 17:09:57 +02:00
Joseph Huber	e526a7fc15	[Libomptarget][NFC] Clean up warnings and format	2023-07-07 18:59:26 -05:00
Joseph Huber	b83e29027c	[Libomptarget] Fix tests only including the LTO variant Summary: These were overriding rather than appending. Fix that.	2023-07-07 16:24:27 -05:00
Martin Storsjö	f105c1dc58	[OpenMP] Remove the workaround of passing "-x assembler-with-cpp" manually By building the assembly with language ASM now (since `4072c8aee4` and `cbaa3597aa`), this shouldn't be needed any longer. Differential Revision: https://reviews.llvm.org/D150701	2023-07-07 23:32:27 +03:00
Joseph Huber	338c80516b	[Libomptarget] Refine logic for determining if we support RPC Summary: Add a requirement for the GPU libc to only be on if it's enabled explicitly. Fix the logic around the pythonification of the variable.	2023-07-07 14:06:58 -05:00
Joseph Huber	d3748d942a	[Libomptarget] Fix test logic for optionally adding the libcgpu.a Summary: This was not operating as expected and was causing the build to fail on non-configured systems.	2023-07-07 12:49:50 -05:00
Joseph Huber	691dc2d10d	[Libomptarget] Begin implementing support for RPC services This patch adds the initial support for running an RPC server in libomptarget to handle host services. We interface with the library provided by the `libc` project to stand up a basic server. We introduce a new type that is controlled by the plugin and has each device initialize its interface. We then run a basic server to check the RPC buffer. This patch does not fully implement the interface. In the future each plugin will want to define special handlers via the interface to support things like malloc or H2D copies coming from RPC. We will also want to allow the plugin to specify the number of ports. This is currently capped in the implementation but will be adjusted soon. Right now running the server is handled by whatever thread ends up doing the waiting. This is probably not a completely sound solution but I am not overly familiar with the behaviour of OpenMP tasks and what would be required here. This works okay with synchronous regions, and somewhat fine with `nowait` regions, but I've observed some weird behavior when one of those regions calls `exit`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D154312	2023-07-07 12:36:46 -05:00
Joachim Jenke	124d36e093	[OpenMP][OMPT] Change OMPT kind for OpenMP test lock functions The OpenMP specification mentions that omp_test_lock and omp_test_nest_lock dispatch OMPT callbacks with ompt_mutex_test_lock and ompt_mutex_test_nest_lock for their kind respectively. Previously, the values ompt_mutex_lock and ompt_mutex_nest_lock were used. This could cause issues in application relying on the kind to correctly determine lock states. This commit changes the kind to the expected ones. Also update callback.h and OMPT tests to reflect this change. Patch prepared by Thyre Differential Review: https://reviews.llvm.org/D153028 Differential Review: https://reviews.llvm.org/D153031 Differential Review: https://reviews.llvm.org/D153032	2023-07-07 14:49:47 +02:00
Joachim Jenke	d679c904c2	[OpenMP][OMPT] Rename callback master to masked in ompt-multiplex.h OpenMP 5.1 replaced callback ompt_callback_master_t by ompt_callback_masked_t. In order to stick to the standard, the implementation is updated accordingly. Patch prepared by Semih Burak Differential Revision: https://reviews.llvm.org/D112798	2023-07-07 14:01:40 +02:00
Joachim Jenke	94ec997521	[OpenMP][OMPT] Add two missing nullpointer checks in ompt-multiplex.h In the functions ompt_multiplex_get_own_ompt_data and ompt_multiplex_get_client_ompt_data in addition to data being NULL, also the void pointer field "ptr" of "data" could be NULL, leading to a subsequent segfault. This patch adds the corresponding checks. Patch prepared by Semih Burak Differential Revision: https://reviews.llvm.org/D112806	2023-07-07 14:01:39 +02:00
Joachim Jenke	73d411d1b2	[OpenMP][Tools] Add omp_all_memory support for Archer The semantic of depend(out:omp_all_memory) is quite similar to taskwait in that it separates all tasks (with dependency) created before an all_memory-task from all tasks (with dependency) created after an all_memory-task. Only a single of such tasks can execute at a time. Similar to taskwait, we have a CV (AllMemory[1]) in the generating task to express the dependency sink semantic of an all_memory-task. In addition, AllMemory[0] describes the dependency source semantic of an all_memory-task. All tasks with dependency create an HB-arc towards the sink and terminate an HB-arc from the source. Since we expect that not many applications will use such dependency, the support for handling the synchronization semantic is off by default and can be turned on using ARCHER_OPTION="all_memory=1". The most costly part is the precautionary posting of an HB-arc towards the sink, which represents a potentially contentious write from all concurrently executing sibling tasks. A warning is printed at runtime, when the option is off while such dependency is observed. In most cases the lazy activation will still lead to false alerts. Differential Revision: https://reviews.llvm.org/D111895	2023-07-07 13:55:46 +02:00
Joachim Jenke	6ef16f2618	[OpenMP] Add OMPT support for omp_all_memory task dependence omp_all_memory currently has no representation in OMPT. Adding new dependency flags as suggested by omp-lang issue #3007. Differential Revision: https://reviews.llvm.org/D111788	2023-07-07 13:44:53 +02:00
Jonathan Peyton	05e2bc25e8	[OpenMP] Ensure socket layer is not first in CPUID topology detection * Return 0 length topology if socket layer is detected first * Fix DEBUG ASSERT	2023-07-06 12:35:34 -05:00
Jonathan Peyton	2d02988f74	[OpenMP] Remove gcc-12 warnings from libomp	2023-07-06 11:47:45 -05:00
Joseph Huber	b420e0ed27	[Libomptarget] Disable the 'mapping/prelock.cpp' test on AMDGPU Summary: This test was not functional on the new plugins, now that the old ones have been deleted it doesn't work. Disable until we get a fix.	2023-07-06 11:45:18 -05:00
Joseph Huber	071c8a41cc	[Libomptarget] Fix tests after deleting the next-gen plugins The next-gen plugins didn't correctly configure tests and were never actually being run. Since deleting the old plugin we stopped getting `libomptarget` tests. This patch fixes the issue and allows the targets to be built Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D154619	2023-07-06 10:44:50 -05:00
Joseph Huber	e90ab9148b	[OpenMP] Delete old plugins It's time to remove the old plugins as the next-gen has already been set to default in LLVM 16. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D142820	2023-07-05 17:39:47 -05:00
Joseph Huber	70c08dbcfb	[Libomptarget] Remove the remote and ve plugins from libomptarget These plugins are unmaintained and are not in a workable state. The VE plugin has not been touched for years and has never had any running tests. The remote plugin is in an unfinished state and is not production ready upstream. These will need to be ported to the new nextgen interface in the future if they are needed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D154548	2023-07-05 17:39:46 -05:00
Nawrin Sultana	50a95e3e6b	[OpenMP] Minor improvement in error msg and fixes few coverity reported issues Differential Revision: https://reviews.llvm.org/D152289	2023-07-05 12:07:51 -05:00
Joseph Huber	33859fb962	[Libomptarget][Obvious] Missing comma on enum	2023-07-04 22:01:03 -05:00
Joseph Huber	ec39b35178	[Libomptarget] Add missing HSA agent info enumeration Summary: This was not added to dynamic_hsa.h	2023-07-04 21:55:49 -05:00
Joseph Huber	6764301a6b	[Libomptarget] Correctly implement `getWTime` on AMDGPU AMDGPU provides a fixed frequency clock since some generations back. However, the frequency is variable by card and must be looked up at runtime. This patch adds a new device environment line for the clock frequency so that we can use it in the same way as NVPTX. This is the correct implementation and the version in ASO should be replaced. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D154456	2023-07-04 21:50:43 -05:00
Joseph Huber	18a6ccea3a	[Libomptarget] Fix misused macro name preventing printing of library name Summary: This code used `LIBOMPTARGET_DEBUG` which is not the macro name, but the environment variable. This caused this portion to always be disabled. In the long run we should aim for this to always be available as it's useful for other diagnostic messages.	2023-07-04 08:00:27 -05:00
Joel E. Denny	6e127c6f29	[OpenMP] libomptarget: Don't map alignment padding to host In the case of partially mapped structs, libomptarget sometimes adds padding to device allocations to ensure they are aligned properly. However, without this patch, it considers that padding to be mapped to the host, which can cause presence checks (e.g., `omp_target_is_present` or a `present` modifier) to misbehave for unmapped parts of the struct. This patch keeps the padding but treats it as unmapped. See the new test case for examples. Reviewed By: grokos, jdoerfert Differential Revision: https://reviews.llvm.org/D149685	2023-07-03 10:23:38 -04:00
Dhruva Chakrabarti	6a1d1f7eef	[OpenMP] Added memory scope to atomic::inc API and used the device scope in reduction. With https://reviews.llvm.org/D137524, memory scope and ordering attributes are being used to generate the required instructions for atomic inc/dec on AMDGPU. This patch adds the memory scope attribute to the atomic::inc API and uses the device scope in reduction. Without the device scope in atomic_inc, the default system scope leads to unnecessary L2 write-backs/invalidates. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D154172	2023-06-30 15:05:01 -04:00
Joseph Huber	968f65ae03	[OpenMP] Adjust using the NVPTX architecture detection tool A previous patch by @arsenm adjusted these to find the `amdgpu-arch` tool correctly if we do a `LLVM_ENABLE_PROJECTS` build. This patch applies the same to `nvptx-arch` tool to keep it consistent. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D154107	2023-06-29 12:14:44 -05:00
Ethan Luis McDonough	341c3cf78c	[flang][openmp] Fortran offloading test Flang currently supports offloading for AMD GPUs. This patch establishes a test structure for Fortran offloading tests in libomptarget. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D148778	2023-06-28 15:15:32 -05:00
Matt Arsenault	17f564f305	OpenMP: Revert accidental cmake change to make amdgpu-arch errors fatal I still think this should be done but should be done separately.	2023-06-28 07:33:27 -04:00
Matt Arsenault	7c3fa755f1	OpenMP/cmake: Use TARGET instead of looking for amdgpu-arch Not sure if the standalone build case is supposed to be a supported path. Should probably rely on find_package and imported targets anyway.	2023-06-28 06:55:15 -04:00
Job Noorman	8de9f2b558	Move SubtargetFeature.h from MC to TargetParser SubtargetFeature.h is currently part of MC while it doesn't depend on anything in MC. Since some LLVM components might have the need to work with target features without necessarily needing MC, it might be worthwhile to move SubtargetFeature.h to a different location. This will reduce the dependencies of said components. Note that I chose TargetParser as the destination because that's where Triple lives and SubtargetFeatures feels related to that. This issue came up during a JITLink review (D149522). JITLink would like to avoid a dependency on MC while still needing to store target features. Reviewed By: MaskRay, arsenm Differential Revision: https://reviews.llvm.org/D150549	2023-06-26 11:20:08 +02:00
Shao-Ce SUN	f042890521	[openmp] remove initializeRewriteSymbolsLegacyPassPass Fix build error caused by D153679 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D153704	2023-06-25 00:35:01 +08:00
Matt Arsenault	6e94a9bf54	Revert "OpenMP/cmake: Use list append instead of repeating variable name" This reverts commit `e429fdd036`.	2023-06-23 15:44:05 -04:00
Matt Arsenault	a2f5bcc766	OpenMP/cmake: Use DEPFILE instead of IMPLICIT_DEPENDS IMPLICIT_DEPENDS doesn't actually work with ninja and this does.	2023-06-23 15:25:10 -04:00
Matt Arsenault	e429fdd036	OpenMP/cmake: Use list append instead of repeating variable name	2023-06-23 15:25:10 -04:00

2862 Commits