intel/llvm - llvm - Gitea: Git with a cup of tea

intel/llvm

mirror of https://github.com/intel/llvm.git synced 2026-02-04 20:00:11 +08:00

Author	SHA1	Message	Date
Joseph Huber	b1af3c0857	[libc][FIXME] Disable math tests to make the GPU bots green (#71603 ) Summary: This is a quick hack to disable affected GPU math tests so the bots will be green again. The offending commit is `d2361b2048`. If that is reverted along with this patch the tests also pass.	2023-11-07 17:43:21 -06:00
michaelrj-google	009ba779c4	[libc][NFC] Remove libcpp include from atanf_test (#71449 ) The test for atanf used <initializer_list> to simplify iterating through an array. This caused issues with the new features.h change by creating a libcpp dependency in the test. This change moves the list to an array variable, removing the need for that dependency.	2023-11-07 10:35:09 -08:00
lntue	d2361b2048	[libc][math] Add min/max/min_denorm/max_denorm constants to FPBits and clean up its constants return types. (#71298 )	2023-11-06 18:22:34 -05:00
lntue	bc7a3bd864	[libc][math] Implement powf function correctly rounded to all rounding modes. (#71188 ) We compute `pow(x, y)` using the formula ``` pow(x, y) = x^y = 2^(y * log2(x)) ``` We follow similar steps as in `log2f(x)` and `exp2f(x)`, by breaking down into `hi + mid + lo` parts, in which `hi` parts are computed using the exponent field directly, `mid` parts will use look-up tables, and `lo` parts are approximated by polynomials. We add some speedup for common use-cases: ``` pow(2, y) = exp2(y) pow(10, y) = exp10(y) pow(x, 2) = x * x pow(x, 1/2) = sqrt(x) pow(x, -1/2) = rsqrt(x) - to be added ```	2023-11-06 16:54:25 -05:00
michaelrj-google	73e974c00a	[libc] Cleanup ErrnoSetterMatcher target (#71240 ) The ErrnoSetterMatcher target was renamed in a previous patch, but not all uses were caught. This patch fixes those that remain.	2023-11-03 17:00:08 -07:00
michaelrj-google	8ca565cd3b	[libc] Fix printf long double truncation bound (#70705 ) The calculation for if a number being printed is truncated and should be rounded up assumed a double for one of its constants, causing occassional misrounding. This fixes that by making the constant based on the mantissa width.	2023-10-30 14:04:00 -07:00
michaelrj-google	6e863c4073	[libc] Fix incorrect printing for alt mode ints (#70252 ) Previously, our printf would incorrectly handle conversions like ("%#x",0) and ("%#o",0). This patch corrects the behavior to match what is described in the standard.	2023-10-27 11:04:11 -07:00
michaelrj-google	2282af26ea	[libc] Disable -NaN test on float128 systems (#70146 ) Some float128 systems (specifically the ones used for aarch64 buildbots) don't respect signs for long double NaNs. This patch disables the printf test that was failing due to this.	2023-10-24 16:45:54 -07:00
michaelrj-google	b4e552999d	[libc] Fix printf long double inf, bitcast in msan (#70067 ) These bugs were found with the new printf long double fuzzing. The long double inf vs nan bug was introduced when we changed to get_explicit_exponent. The bitcast msan issue hadn't come up previously, but isn't a real bug, just a poisoning confusion.	2023-10-24 15:41:54 -07:00
lntue	6d53fdeab4	[libc][NFC] Attempt to deflake gettimeofday_test. (#69719 ) Only check if gettimeofday call succeeds.	2023-10-20 11:08:01 -04:00
lntue	ec10c36b07	[libc][NFC] Forcing data type in gettimeofday_test when comparing the diff. (#69652 )	2023-10-19 19:49:59 -04:00
Joseph Huber	630037ede4	[libc] Partially implement 'rand' for the GPU (#66167 ) Summary: This patch partially implements the `rand` function on the GPU. This is partial because the GPU currently doesn't support thread local storage or static initializers. To implement this on the GPU. I use 1/8th of the local / shared memory quota to treak the shared memory as thread local storage. This is done by simply allocating enough storage for each thread in the block and indexing into this based off of the thread id. The downside to this is that it does not initialize `srand` correctly to be `1` as the standard says, it is also wasteful. In the future we should figure out a way to support TLS on the GPU so that this can be completely common and less resource intensive.	2023-10-19 17:01:43 -04:00
lntue	3fd5113cba	[libc][math][NFC] Remove global scope constants declaration in math tests (#69558 ) Clean up usage of `DECLARE_SPECIAL_CONSTANTS` in global scope.	2023-10-19 10:30:11 -04:00
alfredfo	d404130134	[libc] Fix accidental LIBC_NAMESPACE_syscall definition (#69548 ) Building helloworld.c currently errors with "undefined symbol: __llvm_libc_syscall" See: https://github.com/llvm/llvm-project/pull/67032	2023-10-19 11:22:16 +02:00
Joseph Huber	ddc30ff802	[libc] Implement the 'ungetc' function on the GPU (#69248 ) Summary: This function follows closely with the pattern of all the other functions. That is, making a new opcode and forwarding the call to the host. However, this also required modifying the test somewhat. It seems that not all `libc` implementations follow the same error rules as are tested here, and it is not explicit in the standard, so we simply disable these EOF checks when targeting the GPU.	2023-10-17 13:02:31 -05:00
michaelrj-google	8a47ad4b67	[libc] Add simple long double to printf float fuzz (#68449 ) Recent testing has uncovered some hard-to-find bugs in printf's long double support. This patch adds an extra long double path to the fuzzer with minimal extra effort. While a more thorough long double fuzzer would be useful, it would need to handle the non-standard cases of 80 bit long doubles such as unnormal and pseudo-denormal numbers. For that reason, a standalone long double fuzzer is left for future development.	2023-10-16 13:32:34 -07:00
Joseph Huber	4cb6c1c7cb	[libc] Enable missing memory tests on the GPU (#68111 ) Summary: There were a few tests that weren't enabled on the GPU. This is because the logic caused them to be skipped as we don't use CPU featured on the host. This also disables the logic making multiple versions of the memory functions.	2023-10-06 08:27:36 -05:00
michaelrj-google	bfcfc2a6d4	[libc] Fix typo in long double negative block (#68243 ) The long double version of float to string's get_negative_block had a bug in table mode. In table mode, one of the tables is named "MIN_BLOCK_2" and it stores the number of blocks that are all zeroes before the digits start for a given index. The check for long doubles was incorrectly "block_index <= MIN_BLOCK_2[idx]" when it should be "block_index < MIN_BLOCK_2[idx]" (without the equal sign). This bug caused an off-by-one error for some long double values. This patch fixes the bug and adds tests to ensure it doesn't regress.	2023-10-04 13:00:48 -07:00
Mikhail R. Gadelha	714b4c82bb	[libc][NFC] Fix -Wdangling-else when compiling libc with gcc >= 7 (#67833 ) Explicit braces were added to fix the "suggest explicit braces to avoid ambiguous ‘else’" warning since the current solution (switch (0) case 0: default:) doesn't work since gcc 7 (see https://github.com/google/googletest/issues/1119) gcc 13 generates about 5000 of these warnings when building libc without this patch.	2023-10-04 11:44:42 -04:00
Mikhail R. Gadelha	824b1677a4	[libc][NFC] Fix missing field 'tm_isdst' initializer warning (#67837 ) This patch fixes several warnings thrown by clang about an uninitialized member of struct tm, tm_isdst. Weirdly, gcc doesn't complain about it, probably this member is never read in the tests.	2023-10-02 19:32:55 -04:00
Mikhail R. Gadelha	8fc87f54a8	[libc][NFC] Couple of small warning fixes (#67847 ) This patch fixes a couple of warnings when compiling with gcc 13: * CPP/type_traits_test.cpp: 'apply' overrides a member function but is not marked 'override' * UnitTest/LibcTest.cpp:98: control reaches end of non-void function * MPFRWrapper/MPFRUtils.cpp:75: control reaches end of non-void function * smoke/FrexpTest.h:92: backslash-newline at end of file * __support/float_to_string.h:118: comparison of unsigned expression in ‘>= 0’ is always true * test/src/__support/CPP/bitset_test.cpp:197: comparison of unsigned expression in ‘>= 0’ is always true --------- Signed-off-by: Mikhail R. Gadelha <mikhail@igalia.com>	2023-10-02 19:29:26 -04:00
Mikhail R. Gadelha	dbceb1d936	[libc] Fix unused variable in fputc test (#67830 ) This is probably a copy-and-paste error and the variable 'more' was left unused.	2023-09-29 12:31:40 -04:00
lntue	da28593d71	[libc][math] Implement double precision expm1 function correctly rounded for all rounding modes. (#67048 ) Implementing expm1 function for double precision based on exp function algorithm: - Reduced x = log2(e) * (hi + mid1 + mid2) + lo, where: * hi is an integer * mid1 * 2^-6 is an integer * mid2 * 2^-12 is an integer * \|lo\| < 2^-13 + 2^-30 - Then exp(x) - 1 = 2^hi * 2^mid1 * 2^mid2 * exp(lo) - 1 ~ 2^hi * (2^mid1 * 2^mid2 * (1 + lo * P(lo)) - 2^(-hi) ) - We evaluate fast pass with P(lo) is a degree-3 Taylor polynomial of (e^lo - 1) / lo in double precision - If the Ziv accuracy test fails, we use degree-6 Taylor polynomial of (e^lo - 1) / lo in double double precision - If the Ziv accuracy test still fails, we re-evaluate everything in 128-bit precision.	2023-09-28 16:43:15 -04:00
Siva Chandra	425defd810	[libc][Obvious] Remove the previous ErrnoSetterMatcher target. (#67469 ) A target still depending on the old target has been updated.	2023-09-26 11:01:21 -07:00
Mikhail R. Gadelha	e3087c4b8c	[libc] Start to refactor riscv platform abstraction to support both 32 and 64 bits versions This patch enables the compilation of libc for rv32 by unifying the current rv64 and rv32 implementation into a single rv implementation. We updated the cmake file to match the new riscv32 arch and force LIBC_TARGET_ARCHITECTURE to be "riscv" whenever we find "riscv32" or "riscv64". This is required as LIBC_TARGET_ARCHITECTURE is used in the path for several platform specific implementations. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D148797	2023-09-26 12:32:25 -03:00
Siva Chandra	aecb58005c	[libc][NFC] Remove an inappropriate -ffreestanding arg to memory_utils test. (#67435 )	2023-09-26 08:04:08 -07:00
Guillaume Chatelet	b6bc9d72f6	[libc] Mass replace enclosing namespace (#67032 ) This is step 4 of https://discourse.llvm.org/t/rfc-customizable-namespace-to-allow-testing-the-libc-when-the-system-libc-is-also-llvms-libc/73079	2023-09-26 11:45:04 +02:00
Joseph Huber	6f4ed39b4a	[libc] Enable hermetic tests for the stdio test suite (#67339 ) Summary: There are several tests here that are not yet using the `add_libc_test`. Rather than do this individually we should just update these all at once. These all pass on my x64 build so I'm assuming it should be fine.	2023-09-25 11:14:17 -05:00
michaelrj-google	a5a008ff4f	[libc] Refactor scanf reader to match printf (#66023 ) In a previous patch, the printf writer was rewritten to use a single writer class with a buffer and a callback hook. This patch refactors scanf's reader to match conceptually.	2023-09-22 12:50:02 -07:00
Siva Chandra	62a3d84f5c	[libc][NFC] Extend ErrnoSetterMatcher to test expected inequalities. (#67153 ) Before this change, ErrnoSetterMatcher only allowed testing for equality of the expected return and errno values. This change extends it to allow testing for expected inequalities of the return and errno values. The test libc.test.src.stdio.fileop_test has been updated to use the ErrnoSetterMatcher with tests for inequalities.	2023-09-22 08:59:10 -07:00
Joseph Huber	e0be78be42	[libc] Template the printf / scanf parser class (#66277 ) Summary: The parser class for stdio currently accepts different argument providers. In-tree this is only used for a fuzzer test, however, the proposed implementation of the GPU handling of printf / scanf will require custom argument handlers. This makes the current approach of using a preprocessor macro messier. This path proposed folding this logic into a template instantiation. The downside to this is that because the implementation of the parser class is placed into an implementation file we need to manually instantiate the needed templates which will slightly bloat binary size. Alternatively we could remove the implementation file, or key off of the `libc` external packaging macro so it is not present in the installed version.	2023-09-21 17:02:26 -05:00
michaelrj-google	5bd34e0a55	[libc] Fix Off By One Errors In Printf Long Double (#66957 ) Two major off-by-one errors are fixed in this patch. The first is in float_to_string.h with length_for_num, which wasn't accounting for the implicit leading bit when calculating the length of a number, causing a missing digit on 80 bit float max. The other off-by-one is the ryu_long_double_constants.h (a.k.a the Mega Table) not having any entries for the last POW10_OFFSET in POW10_SPLIT. This was also found on 80 bit float max. Finally, the integer calculation mode was using a slightly too short integer, again on 80 bit float max, not accounting for the mantissa width. All of these are fixed in this patch.	2023-09-21 11:43:29 -07:00
Joseph Huber	59896c168a	[libc] Remove the 'rpc_reset' routine from the RPC implementation (#66700 ) Summary: This patch removes the `rpc_reset` function. This was previously used to initialize the RPC client on the device by setting up the pointers to communicate with the server. The purpose of this was to make it easier to initialize the device for testing. However, this prevented us from enforcing an invariant that the buffers are all read-only from the client side. The expected way to initialize the server is now to copy it from the host runtime. This will allow us to maintain that the RPC client is in the constant address space on the GPU, potentially through inference, and improving caching behaviour.	2023-09-21 11:07:09 -05:00
michaelrj-google	d37496e75a	[libc] Fix printf config not working (#66834 ) The list of printf copts available in config.json wasn't working because the printf_core subdirectory was included before the printf_copts variable was defined, making it effectively nothing for the printf internals. Additionally, the tests weren't respecting the flags so they would cause the tests to fail. This patch reorders the cmake in src and adds flag handling in test.	2023-09-19 15:36:14 -07:00
Tue Ly	84c899b235	[libc][math] Extract non-MPFR math tests into libc-math-smoke-tests. Extract non-MPFR math tests into libc-math-smoke-tests. Reviewed By: sivachandra, jhuber6 Differential Revision: https://reviews.llvm.org/D159477	2023-09-19 12:10:21 -04:00
Guillaume Chatelet	2dbdc9fc85	[libc] Add invoke / invoke_result type traits (#65750 )	2023-09-15 11:15:41 +02:00
Joseph Huber	a1be5d69df	[libc] Implement more input functions on the GPU (#66288 ) Summary: This patch implements the `fgets`, `getc`, `fgetc`, and `getchar` functions on the GPU. Their implementations are straightforward enough. One thing worth noting is that the implementation of `fgets` will be extremely slow due to the high latency to read a single char. A faster solution would be to make a new RPC call to call `fgets` (due to the special rule that newline or null breaks the stream). But this is left out because performance isn't the primary concern here.	2023-09-14 15:39:29 -05:00
Alex Brachet	2ad7a06cb1	[libc] Fix some warnings (#66366 ) Some compilers will warn about dangling else and missleading lack of parentheses.	2023-09-14 08:47:21 -04:00
Joseph Huber	ef169f5707	[libc] Improve the implementation of the rand() function (#66131 ) Summary: This patch improves the implementation of the standard `rand()` function by implementing it in terms of the xorshift64star pRNG as described in https://en.wikipedia.org/wiki/Xorshift#xorshift*. This is a good, general purpose random number generator that is sufficient for most applications that do not require an extremely long period. This patch also correctly initializes the seed to be `1` as described by the standard. We also increase the `RAND_MAX` value to be `INT_MAX` as the standard only specifies that it can be larger than 32768.	2023-09-12 16:52:20 -05:00
Guillaume Chatelet	7329816285	[libc] Add is_object (#65749 ) Add the is_object type traits. Implementation comes from https://en.cppreference.com/w/cpp/types/is_object	2023-09-12 10:35:22 +02:00
Guillaume Chatelet	a1f5a495e0	[libc] Add type_traits tests (#65956 ) This is not exhaustive for now but it provides a placeholder for `invoke_result` test mentioned in #65750.	2023-09-11 14:15:12 +00:00
Joseph Huber	60c0d303d6	[libc] Implement stdio writing functions for the GPU port (#65809 ) Summary: This patch implements fwrite, putc, putchar, and fputc on the GPU. These are very straightforward, the main difference for the GPU implementation is that we are currently ignoring `errno`. This patch also introduces a minimal smoke test for `putc` that is an exact copy of the `puts` test except we print the string char by char. This also modifies the `fopen` test to use `fwrite` to mirror its use of `fread` so that it is tested as well.	2023-09-09 13:27:07 -05:00
Michael Jones	dd51ae81d8	[libc] Fix printf %p format The %p format wasn't correctly passing along flags and modifiers to the integer conversion behind the scenes. This patch fixes that behavior, as well as changing the nullptr behavior to be a string conversion behind the scenes. Reviewed By: lntue, jhuber6 Differential Revision: https://reviews.llvm.org/D159458	2023-09-07 14:13:35 -07:00
Tue Ly	f0d05bb699	[libc][math] Fix signed zeros for acosf, acoshf, and atanf in FE_DOWNWARD mode. Fix signed zeros for acosf, acoshf, and atanf in FE_DOWNWARD mode. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D159476	2023-09-07 15:21:33 +00:00
Mikhail R. Gadelha	4151859770	[libc] Fix gmtime test on systems with sizeof(time_t) == 4 (#65388 ) This test creates a time_t variable and assigns 0xfffffffffe1d7b01 which overflows the maximum time_t value for 64-bit time_t, then checks if the syscall fails and errno was set. In systems with sizeof(time_t) == 4, the value is narrowed down to 0xfe1d7b01 and doesn't overflow, causing the test to fail. This patch then disables the test on systems with 32 bits long time_t.	2023-09-07 09:28:47 -04:00
Mikhail R. Gadelha	6f387135ae	[libc] Fix failing mktime test case in 32-bit systems (#65390 ) Previously, these tests expected that calling mktime with a struct tm that caused overlow to succeed with return -1 (TimeConstants::OUT_OF_RANGE_RETURN_VALUE), however, the Succeeds call expects the errno to be zero (no failure). This patch fixes the expected calls to fail with EOVERFLOW. These tests are only enabled to 32-bit systems, and are probably not being tested on the arm32 buildbot, that's why this was not a problem before.	2023-09-06 14:29:31 -04:00
Mikhail R. Gadelha	10fb71bdff	[libc] Fix test passing negative value in timespec passed to nanosleep (#65346 ) This test was setting tv_nsec to a negative value, which as per the standard this is an EINVAL: The value in the tv_nsec field was not in the range [0, 999999999] or tv_sec was negative. https://man7.org/linux/man-pages/man2/nanosleep.2.html	2023-09-06 14:28:31 -04:00
Mikhail R. Gadelha	ce3bade0cf	[libc] Fix call to clock_gettime (#65166 ) The calls were missing the __llvm_libc:: namespace, which can allow the test case to be linked to glibc's clock_gettime.	2023-09-06 14:26:20 -04:00
Fangrui Song	678e3ee123	[lldb] Fix duplicate word typos; NFC Those fixes were taken from https://reviews.llvm.org/D137338	2023-09-01 21:32:24 -07:00
Jon Chesterfield	1143da2245	[libc][gpu] Thread divergence fix on volta The inbox/outbox loads are performed by the current warp, not a single thread. The outbox load indicates whether a port has been successfully opened. If some lanes in the warp think it has and others think the port open failed, as the warp happened to be diverged when the load occurred, all the subsequent control flow will be incorrect. The inbox load indicates whether the machine on the other side of the RPC channel has progressed. If lanes in the warp have different ideas about that, some will try to progress their state transition while others won't. As far as the RPC layer is concerned this is a performance problem and not a correctness one - none of the lanes can start the transition early, only miss it and start late - but in practice the calls layered on top of RPC do not have the interface required to detect this event and retry the load on the stalled lanes, so the calls layered on top will be broken. None of this is broken on amdgpu, but it's likely that the readfirstlane will have beneficial performance properties there. Possible significant enough that it's worth landing this ahead of fixing gpu::broadcast_value on volta. Essentially volta wasn't adequately considered when writing this part of the protocol. It's a bug present in the initial prototype and propagated thus far, because none of the test cases push volta into a warp diverged state in the middle of the RPC sequence. We should have some test cases for volta where port_open and equivalent are called from diverged warps. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D159276	2023-08-31 14:34:02 +01:00

1 2 3 4 5 ...

893 Commits