freebsd

mirror of https://git.FreeBSD.org/src.git synced 2024-12-19 10:53:58 +00:00

Author	SHA1	Message	Date
Bruce Evans	3b46e988e7	Use a better algorithm for reducing the error in __kernel_cos[f](). This supersedes the fix for the old algorithm in rev.1.8 of k_cosf.c. I want this change mainly because it is an optimization. It helps make software cos[f](x) and sin[f](x) faster than the i387 hardware versions for small x. It is also a simplification, and reduces the maximum relative error for cosf() and sinf() on machines like amd64 from about 0.87 ulps to about 0.80 ulps. It was validated for cosf() and sinf() by exhaustive testing. Exhaustive testing is not possible for cos() and sin(), but ucbtest reports a similar reduction for the worst case found by non-exhaustive testing. ucbtest's non-exhaustive testing seems to be good enough to find problems in algorithms but not maximum relative errors when there are spikes. E.g., short runs of it find only 3 ulp error where the i387 hardware cos() has an error of about 2**40 ulps near pi/2.	2005-10-26 12:36:18 +00:00
Bruce Evans	a92cb60b4e	More fixes for arg reduction near pi/2 on systems with broken assignment to floats (mainly i386's). All errors of more than 1 ulp for float precision trig functions were supposed to have been fixed; however, compiling with gcc -O2 uncovered 18250 more such errors for cosf(), with a maximum error of 1.409 ulps. Use essentially the same fix as in rev.1.8 of k_rem_pio2f.c (access a non-volatile variable as a volatile). Here the -O1 case apparently worked because the variable is in a 2-element array and it takes -O2 to mess up such a variable by putting it in a register. The maximum error for cosf() on i386 with gcc -O2 is now 0.5467 (it is still 0.5650 with gcc -O1). This shows that -O2 still causes some extra precision, but the extra precision is now good. Extra precision is harmful mainly for implementing extra precision in software. We want to represent x+y as w+r where both "+" operations are in infinite precision and r is tiny compared with w. There is a standard algorithm for this (Knuth (1981) 4.2.2 Theorem C), and fdlibm uses this routinely, but the algorithm requires w and r to have the same precision as x and y. w is just x+y (calculated in the same finite precision as x and y), and r is a tiny correction term. The i386 gcc bugs tend to give extra precision in w, and then using this extra precision in the calculation of r results in the correction mostly staying in w and being missing from r. There still tends to be no problem if the result is a simple expression involving w and r -- modulo spills, w keeps its extra precision and r remains the right correction for this wrong w. However, here we want to pass w and r to extern functions. Extra precision is not retained in function args, so w gets fixed up, but the change to the tiny r is tinier, so r almost remains as a wrong correction for the right w.	2005-10-25 12:13:37 +00:00
Bruce Evans	4339c67c48	Moved the optimization for tiny x from __kernel_{cos,sin}[f](x) to {cos_sin}[f](x) so that x doesn't need to be reclassified in the "kernel" functions to determine if it is tiny (it still needs to be reclassified in the cosine case for other reasons that will go away). This optimization is quite large for exponentially distributed x, since x is tiny for almost half of the domain, but it is a pessimization for uniformally distributed x since it takes a little time for all cases but rarely applies. Arg reduction on exponentially distributed x rarely gives a tiny x unless the reduction is null, so it is best to only do the optimization if the initial x is tiny, which is what this commit arranges. The imediate result is an average optimization of 1.4% relative to the previous version in a case that doesn't favour the optimization (double cos(x) on all float x) and a large pessimization for the relatively unimportant cases of lgamma[f][_r](x) on tiny, negative, exponentially distributed x. The optimization should be recovered for lgamma() as part of fixing lgamma()'s low-quality arg reduction. Fixed various wrong constants for the cutoff for "tiny". For cosine, the cutoff is when x2/2! == {FLT or DBL}_EPSILON/2. We round down to an integral power of 2 (and for cos() reduce the power by another 1) because the exact cutoff doesn't matter and would take more work to determine. For sine, the exact cutoff is larger due to the ration of terms being x2/3! instead of x2/2!, but we use the same cutoff as for cosine. We now use a cutoff of 2-27 for double precision and 2-12 for single precision. 2-27 was used in all cases but was misspelled 2**27 in comments. Wrong and sloppy cutoffs just cause missed optimizations (provided the rounding mode is to nearest -- other modes just aren't supported).	2005-10-24 14:08:36 +00:00
Bruce Evans	74bbe8ed42	Fixed range reduction for large multiples of pi/2 on systems with broken assignment to floats (e.g., i386 with gcc -O, but not amd64 or ia64; i386 with gcc -O0 worked accidentally). Use an unnamed volatile temporary variable to trick gcc -O into clipping extra precision on assignment. It's surprising that only 1 place needed to be changed. For tanf() on i386 with gcc -O, the bug caused errors > 1 ulp with a density of 2.3% for args larger in magnitude than 128pi/2, with a maximum error of 1.624 ulps. After this fix, exhaustive testing shows that range reduction for floats works as intended assuming that it is in within a factor of about 2^16 of working as intended for doubles. It provides >= 8 extra bits of precision for all ranges. On i386: range max error in double/single ulps extra precision ----- ------------------------------- --------------- 0 to 3pi/4 0x000d3132 / 0.0016 9+ bits 3pi/4 to 128pi/2 0x00160445 / 0.0027 8+ 128pi/2 to +Inf 0x00000030 / 0.00000009 23+ 128pi/2 up, -O0 before fix 0x00000030 / 0.00000009 23+ 128*pi/2 up, -O1 before fix 0x10000000 / 0.5 1 The 23+ bits of extra precision for large multiples corresponds to almost perfect reduction to a pair of floats (24 extra would be perfect). After this fix, the maximum relative error (relative to the corresponding fdlibm double precision function) is < 1 ulp for all basic trig functions on all 2^32 float args on all machines tested: amd64 ia64 i386-O0 i386-O1 ------ ------ ------ ------ cosf: 0.8681 0.8681 0.7927 0.5650 sinf: 0.8733 0.8610 0.7849 0.5651 tanf: 0.9708 0.9329 0.9329 0.7035	2005-10-11 07:56:05 +00:00
Bruce Evans	59b8fc1535	Fixed range reduction near (but not very near) medium-sized multiples of pi/2 (1 line) and expand a comment about related magic (many lines). The bug was essentially the same as for the +-pi/2 case (a mistranslated mask), but was smaller so it only significantly affected multiples starting near +-13*pi/2. At least on amd64, for cosf() on all 2^32 float args, the bug caused 128 errors of >= 1 ulp, with a maximum error of 1.2393 ulps.	2005-10-10 20:02:02 +00:00
Bruce Evans	11cba99f67	Fix numerous errors of >= 1 ulp for cosf(x) and sinf(x) (1 line) and add a comment about related magic (many lines)). __kernel_cos[f]() needs a trick to reduce the error to below 1 ulp when \|x\| >= 0.3 for the range-reduced x. Modulo other bugs, naive code that doesn't use the trick would have an error of >= 1 ulp in about 0.00006% of cases when \|x\| >= 0.3 for the unreduced x, with a maximum relative error of about 1.03 ulps. Mistransation of the trick from the double precision case resulted in errors in about 0.2% of cases, with a maximum relative error of about 1.3 ulps. The mistranslation involved not doing implicit masking of the 32-bit float word corresponding to to implicit masking of the lower 32-bit double word by clearing it. sinf() uses __kernel_cosf() for half of all cases so its errors from this bug are similar. tanf() is not affected. The error bounds in the above and in my other recent commit messages are for amd64. Extra precision for floats on i386's accidentally masks this bug, but only if k_cosf.c is compiled with -O. Although the extra precision helps here, this is accidental and depends on longstanding gcc precision bugs (not clipping extra precision on assignment...), and the gcc bugs are mostly avoided by compiling without -O. I now develop libm mainly on amd64 systems to simplify error detection and debugging.	2005-10-09 21:07:23 +00:00
Bruce Evans	a0e34da09f	Oops, the last-minute optimization in rev.1.8 wasn't a good idea. The 17+17+24 bit pi/2 must only be used when subtraction of the first 2 terms in it from the arg is exact. This happens iff the the arg in bits is one of the 2**17[-1] values on each side of (float)(pi/2). Revert to the algorithm in rev.1.7 and only fix its threshold for using the 3-term pi/2. Use the threshold that maximizes the number of values for which the 3-term pi/2 is used, subject to not changing the algorithm for comparing with the threshold. The 3-term pi/2 ends up being used for about half of its usable range (about 64K values on each side).	2005-10-09 04:29:08 +00:00
Bruce Evans	cd604283af	Fixed syntax error (a missing brace) in previous commit.	2005-10-08 22:55:36 +00:00
Bruce Evans	a7b8acac04	Fixed range reduction near (but not very near) +-pi/2. A bug caused a maximum error of 2.905 ulps for cosf(), but the algorithm for cosf() is good for < 1 ulps and happens to give perfect rounding (< 0.5 ulps) near +-pi/2 except for the bug. The extra relative errors for tanf() were similar (slightly larger). The bug didn't affect sinf() since sinf'(+-pi/2) is 0. For range reduction in ~[-3pi/4, -pi/4] and ~[pi/4, 3pi/4] we must subtract +-pi/2 and the only complication is that this must be done in extra precision. We have handy 17+24-bit and 17+17+24-bit approximations to pi/2. If we always used the former then we would lose up to 24 bits of accuracy due to cancelation of leading bits, but we need to keep at least 24 bits plus a guard digit or 2, and should keep as many guard bits as efficiency permits. So we used the less-precise pi/2 not very near +-pi/2 and switched to using the more-precise pi/2 very near +-pi/2. However, we got the threshold for the switch wrong by allowing 19 bits to cancel, so we ended up with only 21 or 22 bits of accuracy in some cases, which is even worse than naively subtracting pi/2 would have done. Exhaustive checking shows that allowing only 17 bits to cancel (min. accuracy ~24 bits) is sufficient to reduce the maximum error for cosf() near +-pi/2 to 0.726 ulps, but allowing only 6 bits to cancel (min. accuracy ~35-bits) happens to give perfect rounding for cosf() at little extra cost so we prefer that. We actually (in effect) allow 0 bits to cancel and always use the 17+17+24-bit pi/2 (min. accuracy ~41 bits). This is simpler and probably always more efficient too. Classifying args to avoid using this pi/2 when it is not needed takes several extra integer operations and a branch, but just using it takes only 1 FP operation. The patch also fixes misspelling of 17 as 24 in many comments. For the double-precision version, the magic numbers include 33+53 bits for the less-precise pi/2 and (53-32-1 = 20) bits being allowed to cancel, so there are ~33-20 = 13 guard bits. This is sufficient except probably for perfect rounding. The more-precise pi/2 has 33+33+53 bits and we still waste time classifying args to avoid using it. The bug is apparently from mistranslation of the magic 32 in 53-32-1. The number of bits allowed to cancel is not critical and we use 32 for double precision because it allows efficient classification using a 32-bit comparison. For float precision, we must use an explicit mask, and there are fewer bits so there is less margin for error in their allocation. The 32 got reduced to 4 but should have been reduced almost in proportion to the reduction of mantissa bits.	2005-10-08 22:43:55 +00:00
Bruce Evans	0b42281ee9	Fixed aliasing bugs in TRUNC() by using the fdlibm macros for access to doubles as bits. fdlibm-1.1 had similar aliasing bugs, but these were fixed by NetBSD or Cygnus before a modified version of fdlibm was imported in 1994. TRUNC() is only used by tgamma() and some implementation-detail functions. The aliasing bugs were detected by compiling with gcc -O2 but don't seem to have broken tgamma() on i386's or amd64's. They broke my modified version of tgamma(). Moved the definition of TRUNC() to mathimpl.h so that it can be fixed in one place, although the general version is even slower than necessary because it has to operate on pointers to volatiles to handle its arg sometimes being volatile. Inefficiency of the fdlibm macros slows down libm generally, and tgamma() is a relatively unimportant part of libm. The macros act as if on 32-bit words in memory, so they are hard to optimize to direct actions on 64-bit double registers for (non-i386) machines where this is possible. The optimization is too hard for gcc on amd64's, and declaring variables as volatile makes it impossible.	2005-09-19 11:28:19 +00:00
David Schultz	26bd283f2a	Add a missing ldexpf() alias for amd64. Noticed by: bz@, tjr@	2005-09-12 20:54:00 +00:00
Ken Smith	a84020c2b9	Bump the shared library version number of all libraries that have not been bumped since RELENG_5. Reviewed by: ru Approved by: re (not needed for commit check but in principle...)	2005-07-22 17:19:05 +00:00
Ruslan Ermilov	01293bdb90	Markup nit. Approved by: re (blanket)	2005-06-16 21:56:03 +00:00
Ruslan Ermilov	70db9cd000	Fixed compile warning. Approved by: re (blanket)	2005-06-16 21:55:45 +00:00
Ruslan Ermilov	f789cb8293	Assorted markup fixes. Approved by: re	2005-06-15 19:04:04 +00:00
Daniel Eischen	7f8fa2cf47	Prevent these functions from using stack outside of their frame. Reported by: Marc Olzheim <marcolz at stack dot nl> OK'd by: das	2005-05-06 15:44:20 +00:00
Stefan Farfeleder	66116c07a7	Revert the last change, the conversion from long double to double can raise unwanted underflow exceptions. Pointed out by: das	2005-04-28 19:45:55 +00:00
Stefan Farfeleder	8f58ab910f	Use double additions to raise the inexact exception to work around problems with long double addition on sparc64.	2005-04-22 09:57:55 +00:00
Stefan Farfeleder	9eb30792de	Fix raising the inexact exception (FE_INEXACT) if the result differs from the argument. Noticed by: das	2005-04-22 08:30:33 +00:00
Andrey A. Chernov	db7354df52	Fix truncl.3 MLINKS	2005-04-17 19:57:52 +00:00
David Schultz	a4ca7ca8ac	More optimized math functions.	2005-04-16 21:12:55 +00:00
David Schultz	2f2ee27de4	Implement truncl() based on floorl().	2005-04-16 21:12:47 +00:00
David Schultz	07f3bc5b9c	Add roundl(), lroundl(), and llroundl().	2005-04-08 01:24:08 +00:00
David Schultz	4bb190a74b	These files should include s_lround.c instead of s_lrint.c. This only matters for efficiency, not for correctness.	2005-04-08 00:52:27 +00:00
David Schultz	fc87986708	Fix a (coincidentally harmless) bug.	2005-04-08 00:52:16 +00:00
David Schultz	46691dfbe7	Fix a long-standing bug in k_rem_pio2(), which led to large errors when tanf() was called with big arguments close to multiples of pi/2. Reported by: ucbtest via bde	2005-04-05 23:27:47 +00:00
David Schultz	d06a0070af	Build exp2(), exp2f(), and related documentation.	2005-04-05 02:57:39 +00:00
David Schultz	90232fdf16	Document exp2() and exp2f(), and make other minor tweaks and updates.	2005-04-05 02:57:28 +00:00
David Schultz	f8d6ede6b5	Implement exp2() and exp2f().	2005-04-05 02:57:15 +00:00
David Schultz	3b9141ee91	Implement and document remquo() and remquof().	2005-03-25 04:40:44 +00:00
David Schultz	2c2435825a	Fix the double rounding problem with subnormals, and remove the XXX comments, which no longer apply.	2005-03-18 02:27:59 +00:00
David Schultz	21122bea01	Add missing prototypes for fma() and fmaf(), and remove an inaccurate comment.	2005-03-18 01:47:42 +00:00
David Schultz	9233b45ad9	Make the fenv.h routines work for programs that use SSE for floating-point arithmetic on i386. Now I'm going to make excuses for why this code is kinda scary: - To avoid breaking the ABI with 5.3-RELEASE, we can't change sizeof(fenv_t). I stuck the saved mxcsr in some discontiguous reserved bits in the existing structure. - Attempting to access the mxcsr on older processors results in an illegal instruction exception, so support for SSE must be detected at runtime. (The extra baggage is optimized away if either the application or libm is compiled with -msse{,2}.) I didn't run tests to ensure that this doesn't SIGILL on older 486's lacking the cpuid instruction or on other processors lacking SSE. Results from running the fenv regression test on these processors would be appreciated. (You'll need to compile the test with -DNO_STRICT_DFL_ENV.) If you have an 80386, or if your processor supports SSE but the kernel didn't enable it, then you're probably out of luck. Also, I un-inlined some of the functions that grew larger as a result of this change, moving them from fenv.h to fenv.c.	2005-03-17 22:21:46 +00:00
David Schultz	56ad27535a	Spell 'fedisableexcept' correctly.	2005-03-16 22:34:14 +00:00
David Schultz	2e5fb44003	Document feenableexcept(), fedisableexcept(), and fegetexcept().	2005-03-16 19:04:28 +00:00
David Schultz	10b01832c3	Replace fegetmask() and fesetmask() with feenableexcept(), fedisableexcept(), and fegetexcept(). These two sets of routines provide the same functionality. I implemented the former as an undocumented internal interface to make the regression test easier to write. However, fe(enable\|disable\|get)except() is already part of glibc, and I would like to avoid gratuitous differences. The only major flaw in the glibc API is that there's no good way to report errors on processors that don't support all the unmasked exceptions.	2005-03-16 19:03:46 +00:00
David Schultz	3d266bde6d	Replace strong references with weak references. There's no particularly good reason to do this, except that __strong_reference does type checking, whereas __weak_reference does not. On Alpha, the compiler won't accept a 'long double' parameter in place of a 'double' parameter even thought the two types are identical.	2005-03-07 21:27:37 +00:00
Stefan Farfeleder	3ddc6e9440	Remove an obsolete sentence from a comment.	2005-03-07 20:28:26 +00:00
David Schultz	c8642491d5	- If z is 0, one of x or y is 0, and the other is infinite, raise an invalid exception and return an NaN. - If a long double has 113 bits of precision, implement fma in terms of simple long double arithmetic instead of complicated double arithmetic. - If a long double is the same as a double, alias fma as fmal.	2005-03-07 05:02:09 +00:00
David Schultz	388bf3b630	Document scalbnl and scalblnl.	2005-03-07 05:00:44 +00:00
David Schultz	6af2c5a60c	Document nextafterl and nexttoward{,f,l}.	2005-03-07 05:00:29 +00:00
David Schultz	15a53f77fd	Add nexttoward to the list of implemented functions, and explicitly list the four that are still missing.	2005-03-07 04:59:53 +00:00
David Schultz	66d672d8cb	Document fmal.	2005-03-07 04:59:43 +00:00
David Schultz	94e03502dc	Remove ldexp and ldexpf. The former is in libc, and the latter is identical to scalbnf, which is now aliased as ldexpf. Note that the old implementations made the mistake of setting errno and were the only libm routines to do so.	2005-03-07 04:59:30 +00:00
David Schultz	aeb5e711f3	- Remove s_ldexpf.c (now aliased to scalbn.) - Add nexttoward{,f,l} and nextafterl. On all platforms, nexttowardl is an alias for nextafterl. - Add fmal. - Add man pages for new routines: fmal, nextafterl, nexttoward{,f,l}, scalb{,l}nl. Note that on platforms where long double is the same as double, we generally just alias the double versions of the routines, since doing so avoids extra work on the source code level and redundant code in the binary. In particular: ldbl53 ldbl64/113 fmal s_fma.c s_fmal.c ldexpl s_scalbn.c s_scalbnl.c nextafterl s_nextafter.c s_nextafterl.c nexttoward s_nextafter.c s_nexttoward.c nexttowardf s_nexttowardf.c s_nexttowardf.c nexttowardl s_nextafter.c s_nextafterl.c scalbnl s_scalbn.c s_scalbnl.c	2005-03-07 04:59:11 +00:00
David Schultz	228ad57d05	- Define FP_FAST_FMA for sparc64, since fma() is now implemented using sparc64's 128-bit long doubles. - Define FP_FAST_FMAL for ia64. - Prototypes for fmal, frexpl, ldexpl, nextafterl, nexttoward{,f,l}, scalblnl, and scalbnl.	2005-03-07 04:58:43 +00:00
David Schultz	beed720c37	Alias scalbn as ldexpl and scalbnl on platforms where long double is the same as double.	2005-03-07 04:58:03 +00:00
David Schultz	7b6a19039d	- Implement scalblnl. - In scalbln and scalblnf, check the bounds of the second argument. This is probably unnecessary, but strictly speaking, we should report an error if someone tries to compute scalbln(x, INT_MAX + 1ll).	2005-03-07 04:57:50 +00:00
David Schultz	caacab9b5f	Implement nexttowardf. This is used on both platforms with 11-bit exponents and platforms with 15-bit exponents for long doubles.	2005-03-07 04:57:38 +00:00
David Schultz	ef94de735a	Implement nexttoward and nextafterl; the latter is also known as nexttowardl. These are not needed on machines where long doubles look like IEEE-754 doubles, so the implementation only supports the usual long double formats with 15-bit exponents. Anything bizarre, such as machines where floating-point and integer data have different endianness, will cause problems. This is the case with big endian ia64 according to libc/ia64/_fpmath.h. Please contact me if you managed to get a machine running this way.	2005-03-07 04:56:46 +00:00
David Schultz	a506506a1c	- Try harder to trick gcc into not optimizing away statements that are intended to raise underflow and inexact exceptions. - On systems where long double is the same as double, nextafter should be aliased as nexttoward, nexttowardl, and nextafterl.	2005-03-07 04:55:58 +00:00
David Schultz	e0fe8e4440	Implement frexpl.	2005-03-07 04:54:51 +00:00
David Schultz	f8a40fca14	Alias frexp as frexpl on platforms where a long double is the same as a double.	2005-03-07 04:54:39 +00:00
David Schultz	65e60ab108	Implement fmal.	2005-03-07 04:54:20 +00:00
David Schultz	b1f37dcef4	- Define the LDBL_PREC to be the number of significant bits in a long double's mantissa. - Add an assembly version of fmal.	2005-03-07 04:54:02 +00:00
David Schultz	99401fa2e9	- Define the LDBL_PREC to be the number of significant bits in a long double's mantissa. - Add an assembly version of scalbnl.	2005-03-07 04:53:48 +00:00
David Schultz	4be31f0664	Define the LDBL_PREC to be the number of significant bits in a long double's mantissa.	2005-03-07 04:53:36 +00:00
David Schultz	4442891961	Add an assembly version of fmal.	2005-03-07 04:53:11 +00:00
David Schultz	cd7d05b5a2	Add scalbnl, also known as as ldexpl.	2005-03-07 04:52:58 +00:00
David Schultz	4b2011300b	Alias scalbnf as ldexpf. The two are identical in binary floating-point formats.	2005-03-07 04:52:43 +00:00
David Schultz	1b32579f23	Fix a mistake in the exponent range.	2005-03-06 19:08:18 +00:00
David Schultz	f4a5643005	Work around a gcc bug. This fixes feholdexcept() et al. at -O1. Symptoms of the problem included assembler warnings and nondeterministic runtime behavior when a fe*() call that affects the fpsr is closely followed by a float point op. The bug (at least, I think it's a bug) is that gcc does not insert a break between a volatile asm and a dependent instruction if the volatile asm came from an inlined function. Volatile asms seem to be fine in other circumstances, even without -mvolatile-asm-stop, so perhaps the compiler adds the stop bits before inlining takes place. The problem does not occur at -O0 because inlining is disabled, and it doesn't happen at -O2 because -fschedule-insns2 knows better.	2005-03-05 20:34:45 +00:00
David Schultz	57276bb6ea	Un-document the non-extant exp10() and exp10f() functions. exp10() was a casualty of the transition away from the VAX.	2005-02-26 08:54:45 +00:00
David Schultz	aa28340df9	Revert rev 1.8, which causes small (e.g. 2 ulp) errors for some inputs. The trouble with replacing two floats with a double is that the latter has 6 extra bits of precision, which actually hurts accuracy in many cases. All of the constants are optimal when float arithmetic is used, and would need to be recomputed to do this right. Noticed by: bde (ucbtest)	2005-02-24 06:32:13 +00:00
David Schultz	adec44c08b	Use hardware instructions for sqrt() and sqrtf().	2005-02-21 18:27:57 +00:00
David Schultz	96efaf6c36	Use double arithmetic instead of simulating it with two floats. This results in a performance gain on the order of 10% for amd64 (sledge), ia64 (pluto1), i386+SSE (Pentium 4), and sparc64 (panther), and a negligible improvement for i386 without SSE. (The i386 port still uses the hardware instruction, though.)	2005-02-21 17:44:57 +00:00
David Schultz	f674c13c78	Remove the i387 versions of atan(), atan2(), and atan2f(). They are slower than the MI routines on modern hardware, except for degenerate cases such as the Pentium 4. PR: 67469	2005-02-21 16:04:23 +00:00
David Schultz	c4691a5da9	Remove i387 versions of asin() and acos(). Although the hardware instruction was faster on the 486, it's slower than our MD version on modern processors. Determined by: bde PR: 67469	2005-02-20 22:51:08 +00:00
David Schultz	dab1571b90	Remove the float versions of the i387 trig functions obtained from NetBSD. They're buggy, giving particularly for inputs larger in magnitude than 2**63. Noticed by: bde PR: 67469	2005-02-20 22:50:40 +00:00
David Schultz	e02846ce13	Fix a small scripting snafu in the previous revision.	2005-02-04 20:05:39 +00:00
David Schultz	b21154f677	Remove another vestige of support for a non-IEEE libm.	2005-02-04 18:32:13 +00:00
David Schultz	3f70824172	Reduce diffs against vendor source (Sun fdlibm 5.3).	2005-02-04 18:26:06 +00:00
David Schultz	79b990338f	Move machine-dependent crud to its own makefile.	2005-02-04 14:33:39 +00:00
David Schultz	e1b61b5b93	Remove wrappers and other cruft intended to support SVID, mistakes in C90, and other arcana. Most of these features were never fully supported or enabled by default. Ok: bde, stefanf	2005-02-04 14:08:32 +00:00
Ruslan Ermilov	1f8ee0e102	Typo.	2005-01-28 21:14:16 +00:00
Ruslan Ermilov	d7a604cc33	Properly terminate sentence.	2005-01-28 21:13:34 +00:00
David Schultz	29bf6af890	- Move the functions presently described in in ieee(3) to their own manpages. They are not very related, so separating them makes it easier to add meaningful cross-references and extend some of the descriptions. - Move the part of math(3) that discusses IEEE 754 to the ieee(3) manpage.	2005-01-27 05:46:17 +00:00
Olivier Houchard	15d3b4db61	Define FE_TONEAREST, FE_TOWARDZERO, FE_UPWARD, FE_DOWNWARD and _ROUND_MASK to unbreak the build for arm.	2005-01-24 00:35:02 +00:00
David Schultz	cb2d2321cd	Update comment to reflect the code change in the previous revision. Noticed by: ceri	2005-01-23 22:56:08 +00:00
David Schultz	52611c608e	Many changes, including the following major ones: - Rearrange the list of functions into categories. - Remove the ulps column. It was appropriate for only some of the functions in the list, and correct for even fewer of them. - Add some new paragraphs, and remove some old ones about NaNs that may do more harm than good. - Document precisions other than double-precision.	2005-01-23 22:05:33 +00:00
David Schultz	3c4d0a0973	If x == y, return y, not x. C99 (though not IEEE 754) requires that nextafter(+0.0, -0.0) returns -0.0 and nextafter(-0.0, +0.0) returns +0.0.	2005-01-23 15:46:22 +00:00
David Schultz	d5580d091a	Add fma() and fmaf(), which implement a fused multiply-add operation.	2005-01-22 09:53:18 +00:00
Ruslan Ermilov	24a0682c64	Sort sections.	2005-01-20 09:17:07 +00:00
Ruslan Ermilov	5391441c05	Use the \*(If string provided by mdoc(7), to represent infinity.	2005-01-16 16:49:10 +00:00
Ruslan Ermilov	1fbb01b7f0	Removed redundant .br call.	2005-01-16 16:46:14 +00:00
David Schultz	cd3cc47033	amd64 assembly versions of sqrt(), lrint(), and llrint() using SSE2.	2005-01-15 03:32:28 +00:00
David Schultz	b6e65225a6	Most libm routines depend on the rounding mode and/or set exception flags, so they are not pure. Remove the __pure2 annotation from them. I believe that the following routines and their float and long double counterparts are the only ones here that can be __pure2: copysign is* fabs finite fmax fmin fpclassify ilogb nan signbit When gcc supports FENV_ACCESS, perhaps there will be a new annotation that allows the other functions to be considered pure when FENV_ACCESS is off. Discussed with: bde	2005-01-15 02:55:10 +00:00
David Schultz	71936f351e	Braino. Revert rev 1.50. Pointy hat to: das	2005-01-15 00:37:31 +00:00
David Schultz	8e26469445	Remove numerous references to VAX floating-point and the setting of errno, replacing them with a discussion of IEEE exceptions where appropriate. Cross-reference fenv(3) whenever exceptions are mentioned.	2005-01-14 23:28:28 +00:00
David Schultz	ce4e53c460	Set math_errhandling to MATH_ERREXCEPT. Now that we have fenv.h, we basically support this, subject to gcc's lack of FENV_ACCESS support. In any case, the previous setting of math_errhandling to 0 is not allowed by POSIX.	2005-01-14 22:03:27 +00:00
David Schultz	c165c4b9aa	Remove some #if 0'd code.	2005-01-14 21:51:46 +00:00
Ruslan Ermilov	e880667b92	Tiny markup nits.	2005-01-14 09:12:05 +00:00
David Schultz	f365db00e5	Mark all inline asms that read the floating-point control or status registers as volatile. Instructions that wrote to FP state were already marked volatile, but apparently gcc has license to move non-volatile asms past volatile asms. This broke amd64's feupdateenv at -O2 due to a WAR conflict between fnstsw and fldenv there.	2005-01-14 07:09:23 +00:00
Stefan Farfeleder	749f5f532e	Fixed too many of "the", and enclose multi-word argument in double quotes. Obtained from: ru	2005-01-13 20:33:42 +00:00
David Schultz	fe69257da2	Import the subset of J.T. Conklin's single-precision x86-optimized math routines that appear to be (a) correct and (b) faster than their MI counterparts on my Pentium 4. Obtained from: NetBSD	2005-01-13 18:58:25 +00:00
David Schultz	0d8f9eca28	The isnormal() in rev 1.2 should have been isfinite() so subnormals round correctly. Noticed by: stefanf	2005-01-13 15:43:41 +00:00
David Schultz	3cdb8115d7	Things that are broken, unneeded, and unused since 1997 belong in the attic.	2005-01-13 15:43:22 +00:00
Ruslan Ermilov	83e0359d53	Markup nits.	2005-01-13 10:43:01 +00:00
Ruslan Ermilov	113ed1bb1d	Fixed too many of "the", and enclose multi-word argument in double quotes.	2005-01-13 09:35:47 +00:00
Stefan Farfeleder	43295fac79	Implement and document ceill().	2005-01-13 09:11:41 +00:00
Stefan Farfeleder	4067ee86a5	Bump .Dd for the last commit.	2005-01-13 09:08:16 +00:00
Stefan Farfeleder	7e2ee1f065	Hook up and document floorl().	2005-01-12 22:16:26 +00:00
Stefan Farfeleder	17f418f9f4	Implement floorl().	2005-01-12 22:10:46 +00:00
Stefan Farfeleder	a7d82b7150	Whitespace nit.	2005-01-12 22:05:41 +00:00
David Schultz	10c9ffa425	Add MI implementations of [l]lrint[f]() and [l]lround[f](). Discussed with: bde	2005-01-11 23:12:55 +00:00
David Schultz	2aac156d2e	Document [l]lrint[f]() and [l]lround[f]().	2005-01-11 23:12:17 +00:00
David Schultz	439e59cf85	Faster lrint() and llrint() implementations for x86.	2005-01-11 23:10:53 +00:00
David Schultz	c1b70ced4f	Mark inline stmxcsr instructions as volatile, since this appears to be the only way to convince gcc that they read the MXCSR. The volatile annotation may be needed elsewhere as well.	2005-01-11 22:10:43 +00:00
Ruslan Ermilov	2d82ac3110	Scheduled mdoc(7) sweep.	2005-01-11 20:50:51 +00:00
Ruslan Ermilov	4e05ab77a8	Sanitize the markup, as prompted.	2005-01-11 20:16:03 +00:00
David Schultz	527055d12f	GC unused declaration	2004-12-16 20:40:49 +00:00
David Schultz	17519e9b79	Cosmetic changes only: - style - remove unused variables - de-support VAX Inspired by: bin/42388	2004-12-16 20:40:37 +00:00
David Schultz	dbc8f2b5ce	More updates for math(3): - Make some minor rearrangements in the introduction. - Mention the problem with argument reduction on i386. - Add recently-implemented functions to the table. - Un-document the error bounds that only apply to the old 4BSD math library, and fill in the correct values where I know them. No attempt has been made to document bounds lower than 1 ulp, although smaller bounds are usually achievable in round-to-nearest mode.	2004-10-11 20:13:52 +00:00
Stefan Farfeleder	2fd3a32ee1	Add and document ilogbl(), a long double version of ilogb().	2004-10-11 18:13:52 +00:00
Stefan Farfeleder	552ebda9dd	Use the FP_ILOG macros from <math.h> rather than hardcoded return values. Also be prepared for FP_ILOGBNAN != INT_MAX. Reviewed by: md5	2004-10-09 17:14:28 +00:00
Ken Smith	85a8b887df	Bump the library version numbers for the following libraries: /lib/{libm,libreadline} /usr/lib/{libhistory,libopie,libpcap} in preparation for doing the same thing to RELENG_5. HUGE amounts of help for determining what to bump provided by kris. Discussed on: freebsd-current Approved by: re (not required for commit but something like this should be)	2004-10-01 15:38:07 +00:00
David Schultz	d622ef6993	Further refine some #ifs: - Simplify the logic by using __GNUC_PREREQ__. Suggested by stefanf. - Make math.h compile with old (pre-8.0) versions of icc. Submitted by sf [sic].	2004-09-17 05:15:33 +00:00
Stefan Farfeleder	bef5493789	Add man pages for the cimag(), conj() and creal() functions.	2004-08-07 23:03:36 +00:00
Olivier Houchard	60b22cf1c2	Only use rfs and wfs if ARM_HARD_FLOAT is defined, and use stubs if it is not, in order to unbreak arm make world. The right way to do it with soft floats will be figured out later. Discussed with: das	2004-08-05 14:07:24 +00:00
David Schultz	2208ce0a06	Replace s_isnan.c and s_isnanf.c with the more compact s_isnan.c from libc. The externally-visible effect of this is to add __isnanl() to libm, which means that libm.so.2 can once again link against libc.so.4 when LD_BIND_NOW is set. This was broken by the addition of fdiml(), which calls __isnanl().	2004-08-05 01:46:11 +00:00
David Schultz	8dc56b6821	Use isnormal() instead of fpclassify() to avoid dependency on libc.so.5.	2004-08-05 01:44:55 +00:00
Alexander Kabaev	dd86691ec8	Work around known GCC 3.4.x problem and use ANSI prototype for dremf().	2004-07-28 05:53:18 +00:00
David Schultz	ec79bc0da9	Fix two bugs in the signbit() macro, which was implemented last year: - It was added to libc instead of libm. Hopefully no programs rely on this mistake. - It didn't work properly on large long doubles because its argument was converted to type double, resulting in undefined behavior.	2004-07-19 08:16:10 +00:00
Stefan Farfeleder	9979bae3e7	Fix minor namespace pollution: The prototypes for f{dim,max,min}(), nearbyint(), round() and trunc() shouldn't be visible when compiling with -D_XOPEN_SOURCE=500.	2004-07-17 15:03:52 +00:00
David Schultz	205d3300b8	Tweak the conditions under which certain gcc builtins are used: - Unlike the builtin relational operators, builtin floating-point constants were not available until gcc 3.3, so account for this.[1] - Apparently some versions of the Intel C Compiler fallaciously define __GNUC__ without actually being compatible with the claimed gcc version. Account for this, too.[2] [1] Noticed by: Christian Hiris <4711@chello.at> [2] Submitted by: Alexander Leidinger <Alexander@Leidinger.net>	2004-07-16 06:21:56 +00:00
David Schultz	9fc5c45bad	Remove the declaration of isnan() from this file. It is no longer needed as of math.h v1.40, and its prototype is incorrect here.	2004-07-09 10:01:10 +00:00
David Schultz	240dbabfa8	Implement the classification macros isfinite(), isinf(), isnan(), and isnormal() the hard way, rather than relying on fpclassify(). This is a lose in the sense that we need a total of 12 functions, but it is necessary for binary compatibility because we have never bumped libm's major version number. In particular, isinf(), isnan(), and isnanf() were BSD libc functions before they were C99 macros, so we can't reimplement them in terms of fpclassify() without adding a dependency on libc.so.5. I have tried to arrange things so that programs that could be compiled in FreeBSD 4.X will generate the same external references when compiled in 5.X. At the same time, the new macros should remain C99-compliant. The isinf() and isnan() functions remain in libc for historical reasons; however, I have moved the functions that implement the macros isfinite() and isnormal() to libm where they belong. Moreover, half a dozen MD versions of isinf() and isnan() have been replaced with MI versions that work equally well. Prodded by: kris	2004-07-09 03:32:40 +00:00
David Schultz	b2d5d0b376	Define the following macros in terms of [gi]cc builtins when the builtins are available: HUGE_VAL, HUGE_VALF, HUGE_VALL, INFINITY, and NAN. These macros now expand to floating-point constant expressions rather than external references, as required by C99. Other compilers will retain the historical behavior. Note that it is not possible say, e.g. #define HUGE_VAL 1.0e9999 because the above may result in diagnostics at translation time and spurious exceptions at runtime. Hence the need for compiler support for these features. Also use builtins to implement the macros isgreater(), isgreaterequal(), isless(), islessequal(), islessgreater(), and isunordered() when such builtins are available. Although the old macros are correct, the builtin versions are much faster, and they avoid double-expansion problems.	2004-07-09 03:31:09 +00:00
David Schultz	9428e108c9	Add C99's nearbyint{,f}() functions as wrappers around rint(). These trivial implementations are about 25 times slower than rint{,f}() on x86 due to the FP environment save/restore. They should eventually be redone in terms of fegetround() and bit fiddling.	2004-07-06 04:46:08 +00:00
Ruslan Ermilov	30950a21e1	Eliminate double whitespace.	2004-07-03 22:30:10 +00:00
Ruslan Ermilov	1a0a934547	Mechanically kill hard sentence breaks.	2004-07-02 23:52:20 +00:00
Ruslan Ermilov	862b46f607	Markup, grammar, punctuation.	2004-07-01 18:20:57 +00:00
David Schultz	4f82cb46c4	Implement and document fdim{,f,l}, fmax{,f,l}, and fmin{,f,l}.	2004-06-30 07:04:01 +00:00
Marcel Moolenaar	c987479dd0	s/ARCH/ARCH_SUBDIR/g -- This reduces the chance of possible conflicts with the user's environment. Wondered why his cross-builds kept failing: marcel	2004-06-24 00:02:32 +00:00
Stefan Farfeleder	c8764bba5a	Completely remove s_ilogb.S as the assembler implementation gives very little speed improvement to none at all over the MI version. Submitted by: bde	2004-06-20 10:42:23 +00:00
David Schultz	f7748f6e01	Uncomment some functions that we now support.	2004-06-20 10:39:09 +00:00
David Schultz	a9a0bf07f3	Cross-reference round(3) and trunc(3) as appropriate.	2004-06-20 09:27:17 +00:00
David Schultz	209547598d	Connect scalbln(), trunc(), and the associated documentation to the build.	2004-06-20 09:27:03 +00:00
David Schultz	62247e9034	Declare scalbln(), scalblnf(), trunc(), and truncf().	2004-06-20 09:26:41 +00:00
David Schultz	7ffaea8021	Implement trunc() and truncf().	2004-06-20 09:25:43 +00:00
David Schultz	2f90a15e14	Add trivial implementations of scalbln() and scalblnf(). These routines are specified in C99 for the sake of architectures where an int isn't big enough to represent the full range of floating-point exponents. However, even the 128-bit long double format has an exponent smaller than 15 bits, so for all practical purposes, scalbln() and scalblnf() are aliases for scalbn() and scalbnf(), respectively.	2004-06-20 09:25:27 +00:00
Stefan Farfeleder	32ef5abfe3	Document ilogb()'s return values in terms of the FP_ILOGB* macros.	2004-06-19 09:33:29 +00:00
Stefan Farfeleder	b6161bb16a	Return the same result as the MI version for 0.0, INFINITY and NaN. Reviewed by: standards@	2004-06-19 09:30:00 +00:00
Stefan Farfeleder	83bc89312c	Our MI implementation of ilogb() returns -INT_MAX for the argument 0.0 rather than INT_MIN, so adjust FP_ILOGB0 to reflect this. Use <machine/_limits.h> for INT_MAX's value while there. Reviewed by: standards@	2004-06-19 09:25:21 +00:00
David Schultz	2a6bf1fadb	Memory's free, but all the world ain't a VAX anymore. Bring math.3 kicking and screaming into the 1980's. This change converts most of the markup from man(7) to mdoc(7) format, and I believe it removes or updates everything that was flat out wrong. However, much work is still needed to sanitize the markup, improve coverage, and reduce overlap with other manpages. Some of the sections would better belong in a philosophy_of_w_kahan.3 manpage, but they are informative and remain at least as reminders of topics to cover. Reviewed by: doc@, trhodes@	2004-06-19 03:25:28 +00:00
David Schultz	9772caa388	The references to scalbn and scalbnf should be scalb and scalbf. (The former are actually useful, and ieee_test(3) only documents functions that aren't.) Add a sentence describing the domain of scalb() and scalbf().	2004-06-12 04:40:47 +00:00
David Schultz	16919a6cf7	Shift the FPSR contents by the correct amount so feupdateenv() raises the correct exceptions from the old environment.	2004-06-11 02:35:30 +00:00
David Schultz	0d2354c6fd	Insert a missing '~' in feholdexcept(), so that it correctly clears the exception flags in the mxcsr as well as the x87 FPU.	2004-06-11 02:35:19 +00:00
David Schultz	c4da2324a3	Fix a bug where rintf() rounded the wrong way in round-to-nearest mode on all inputs of the form x.75, where x is an even integer and log2(x) = 21. A similar problem occurred when rounding upward. The bug involves the following snippet copied from rint(): i>>=1; if((i0&i)!=0) i0 = (i0&(~i))\|((0x100000)>>j0); The constant 0x100000 should be 0x200000. Apparently this case was never tested. It turns out that the bit manipulation is completely superfluous anyway, so remove it. (It tries to simulate 90% of the rounding process that the FPU does anyway.) Also, the special case of +-0 is handled twice (in different ways), so remove the second instance. Throw in some related simplifications from bde: - Work around a bug where gcc fails to clip to float precision by declaring two float variables as volatile. Previously, we tricked gcc into generating correct code by declaring some float constants as doubles. - Remove additional superfluous bit manipulation. - Minor reorganization. - Include <sys/types.h> explicitly. Note that some of the equivalent lines in rint() also appear to be unnecessary, but I'll defer to the numerical analysts who wrote it, since I can't test all 2^64 cases. Discussed with: bde	2004-06-09 21:24:52 +00:00
David Schultz	207bc1d79b	Include <sys/cdefs.h> earlier to get the various visibility constants. Previously, we were relying on <sys/_types.h> to include it implicitly.	2004-06-09 10:32:05 +00:00
David Schultz	d0f1363370	Add round(3) and roundf(3) and the associated documentation. PR: 59797 Submitted by: "Steven G. Kargl" <kargl@troutmask.apl.washington.edu> Reviewed by: bde (earlier version, last year)	2004-06-07 08:05:36 +00:00
David Schultz	54dd6976a8	Add fenv.h, fenv.c, and the associated documentation to the libm build. To facilitate this, add ${.CURDIR}/${ARCH} to make's search path unconditionally. Reviewed by: standards@	2004-06-06 10:06:57 +00:00
David Schultz	07235cc8f7	Add documentation for: - fenv(3) - feclearexcept(3), fegetexceptflag(3), feraiseexcept(3), fesetexceptflag(3), fetestexcept(3) - fegetround(3), fesetround(3) - fegetenv(3), feholdexcept(3), fesetenv(3), feupdateenv(3) Reviewed by: standards@	2004-06-06 10:06:26 +00:00
David Schultz	7ab6d2aa74	Add an fenv.h implementation for the sparc64 port. Reviewed by: standards@	2004-06-06 10:05:57 +00:00
David Schultz	122e138072	Add an fenv.h implementation for the powerpc port. Reviewed by: standards@	2004-06-06 10:05:10 +00:00
David Schultz	50c4f20324	Add an fenv.h implementation for the ia64 port. Reviewed by: standards@	2004-06-06 10:04:43 +00:00
David Schultz	0b71a226d1	Add an fenv.h implementation for the i386 port. Reviewed by: standards@	2004-06-06 10:04:17 +00:00
David Schultz	19220bc13f	Add an fenv.h implementation for the arm port. It does not appear to be possible to cross-build arm from i386 at the moment, and I have no ARM hardware anyway. Thus, I'm sure there are bugs. I will gladly fix these when the arm port is more mature. Reviewed by: standards@	2004-06-06 10:03:59 +00:00
David Schultz	fc27daefcd	Add an fenv.h implementation for the amd64 port. Reviewed by: standards@	2004-06-06 10:03:25 +00:00
David Schultz	7993050251	Add an fenv.h implementation for the alpha port. All of the standard features appear to work, subject to the caveat that you tell gcc you want standard rather than recklessly fast behavior (-mieee-with-inexact -mfp-rounding-mode=d). The non-standard feature of delivering a SIGFPE when an application raises an unmasked exception does not work, presumably due to a kernel bug. This isn't so bad given that floating-point exceptions on the Alpha architecture are not precise, so making them useful in userland requires a significant amount of wizardry. Reviewed by: standards@	2004-06-06 09:58:55 +00:00
Bruce Evans	4f8f819975	Fixed lots of 1 ULP errors caused by a broken approximation for pi/2. We approximate pi with more than float precision using pi_hi+pi_lo in the usual way (pi_hi is actually spelled pi in the source code), and expect (float)0.5pi_lo to give the low part of the corresponding approximation for pi/2. However, the high part for pi/2 (pi_o_2) is rounded to nearest, which happens to round up, while the high part for pi was rounded down. Thus pi_o_2+(float)0.5pi (in infinite precision) was a very bad approximation for pi/2 -- the low term has the wrong sign and increases the error drom less than half an ULP to a full ULP. This fix rounds up instead of down for pi_hi. Consistently rounding down instead of up should work, and is the method used in e_acosf.c and e_asinf.c. The reason for the difference is that we sometimes want to return precisely pi/2 in e_atan2f.c, so it is convenient to have a correctly rounded (to nearest) value for pi/2 in a variable. a_acosf.c and e_asinf.c also differ in directly approximating pi/2 instead pi; they multiply by 2.0 instead of dividing by 0.5 to convert the approximation. These complications are not directly visible in the double precision versions because rounding to nearest happens to round down.	2004-06-02 17:09:05 +00:00
David Schultz	73fbb89dd6	Port a bugfix from FDLIBM 5.3. The bug really only applies to tan() and not tanf() because float type can't represent numbers large enough to trigger the problem. However, there seems to be a precedent that the float versions of the fdlibm routines should mirror their double counterparts. Also update to the FDLIBM 5.3 license. Obtained from: FDLIBM Reviewed by: exhaustive comparison	2004-06-02 04:39:44 +00:00
David Schultz	21d39caaee	Merge a bugfix from FDLIBM 5.3 to ensure that the error in tan() is always less than 1 ulp. Also update to the 5.3 license. Obtained from: FDLIBM	2004-06-02 04:39:29 +00:00
Bruce Evans	f88a48cc43	Merged from double precision case (e_pow.c 1.10: sign fixes).	2004-06-01 19:33:30 +00:00
Bruce Evans	f083533b68	Fixed the sign of the result in some overflow and underflow cases (ones where the exponent is an odd integer and the base is negative). Obtained from: fdlibm-5.3 Sun finally released a new version of fdlibm just a coupe of weeks ago. It only fixes 3 bugs (this one, another one in pow() that we already have (rev.1.9), and one in tan(). I've learned too much about powf() lately, so this fix was easy to merge. The patch is not verbatim, because our base version has many differences for portability and I didn't like global renaming of an unrelated variable to keep it separate from the sign variable. This patch uses a new variable named sn for the sign.	2004-06-01 19:28:38 +00:00
Bruce Evans	5f20e5ce7f	Fixed another precision bug in powf(). This one is in the computation [t=p_l+p_h High]. We multiply t by lg2_h, and want the result to be exact. For the bogus float case of the high-low decomposition trick, we normally discard the lowest 12 bits of the fraction for the high part, keeping 12 bits of precision. That was used for t here, but it doesnt't work because for some reason we only discard the lowest 9 bits in the fraction for lg2_h. Discard another 3 bits of the fraction for t to compensate. This bug gave wrong results like: powf(0.9999999, -2.9999995) = 1.0000002 (should be 1.0000001) hex values: 3F7FFFFF C03FFFFE 3F800002 3F800001 As explained in the log for the previous commit, the bug is normally masked by doing float calculations in extra precision on i386's, but is easily detected by ucbtest on systems that don't have accidental extra precision. This completes fixing all the bugs in powf() that were routinely found by ucbtest.	2004-06-01 19:03:31 +00:00
Bruce Evans	12be4e0d5a	Fixed 2 bugs in the computation /* t_h=ax+bp[k] High */. (1) The bit for the 1.0 part of bp[k] was right shifted by 4. This seems to have been caused by a typo in converting e_pow.c to e_powf.c. (2) The lower 12 bits of ax+bp[k] were not discarded, so t_h was actually plain ax+bp[k]. This seems to have been caused by a logic error in the conversion. These bugs gave wrong results like: powf(-1.1, 101.0) = -15158.703 (should be -15158.707) hex values: BF8CCCCD 42CA0000 C66CDAD0 C66CDAD4 Fixing (1) gives a result wrong in the opposite direction (hex C66CDAD8), and fixing (2) gives the correct result. ucbtest has been reporting this particular wrong result on i386 systems with unpatched libraries for 9 years. I finally figured out the extent of the bugs. On i386's they are normally hidden by extra precision. We use the trick of representing floats as a sum of 2 floats (one much smaller) to get extra precision in intermediate calculations without explicitly using more than float precision. This trick is just a pessimization when extra precision is available naturally (as it always is when dealing with IEEE single precision, so the float precision part of the library is mostly misimplemented). (1) and (2) break the trick in different ways, except on i386's it turns out that the intermediate calculations are done in enough precision to mask both the bugs and the limited precision of the float variables (as far as ucbtest can check). ucbtest detects the bugs because it forces float precision, but this is not a normal mode of operation so the bug normally has little effect on i386's. On systems that do float arithmetic in float precision, e.g., amd64's, there is no accidental extra precision and the bugs just give wrong results.	2004-06-01 18:08:39 +00:00
Stefan Farfeleder	8b5cd5a662	Add implementations for cimag{,f,l}, creal{,f,l} and conj{,f,l}. They are needed for cases where GCC's builtin functions cannot be used and for compilers that don't know about them. Approved by: das (mentor)	2004-05-30 09:21:56 +00:00
David Schultz	6955d806c0	Remove some kludges designed to ensure that the compiler didn't round constants the wrong way on the VAX. Instead, use C99 hexadecimal floating-point constants, which are guaranteed to be exact on binary IEEE machines. (The correct hexadecimal values were already provided in the source, but not used.) Also, convert the constants to lowercase to work around a gcc bug that wasn't fixed until gcc 3.4.0. Prompted by: stefanf	2004-05-17 01:04:37 +00:00
Stefan Farfeleder	b60cb13f76	Add an implementation of copysignl(), a long double version of copysign(). Approved by: das (mentor)	2004-05-07 18:56:31 +00:00
Stefan Farfeleder	325152e8fb	Add an MLINK for fabsl(). Approved by: das (mentor)	2004-05-07 17:55:07 +00:00
Stefan Farfeleder	89c5bc6db4	The prototypes for cabs() and cabsf() are in <complex.h>. Fix their arguments' types and describe them briefly. Reviewed by: ru, bde Approved by: das (mentor)	2004-05-06 13:11:18 +00:00
David Schultz	8f3f7c66d0	Make sure that symbols are declared in math.h iff the appropriate namespaces are visible. Previously, math.h failed to hide some C99-, XSI-, and BSD-specific symbols in certain compilation environments. The referenced PR has a nice listing of the appropriate conditions for making symbols visible in math.h. The only non-stylistic difference between the patch in the PR and this commit is that I superfluously test for __BSD_VISIBLE in a few places to be more explicit about which symbols have historically been part of the FreeBSD environment. PR: 65939 Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at>	2004-04-25 02:35:42 +00:00
David Schultz	334c760eea	Remove a stale comment referring to values.h, which has never been part of FreeBSD. PR: 65939	2004-04-25 02:32:46 +00:00
Bruce Evans	6eb2d83e44	Initial support for C99's (or is it POSIX.1-2001's?) MATH_ERRNO, MATH_ERREXCEPTION and math_errhandling, so that C99 applications at least have the possibility of determining that errno is not set for math functions. Set math_errhandling to the non-standard-conforming value of 0 for now to indicate that we don't support either method of reporting errors. We intentionally don't support MATH_ERRNO because errno is a mistake, and we are missing support for MATH_ERREXCEPTION (<fenv.h>, compiler support for <fenv.h>, and actually setting the exception flags correctly).	2004-03-12 12:02:03 +00:00
David Schultz	7a773faadc	Fix a problem where libm compiled under 5.X would depend on features that are only in libc.so.5. This broke some 4.X applications linked to libm and run under 5.X. Background: In C99, isinf() and isnan() cannot be implemented as regular functions. We use macros that call libc functions in 5.X, but for libm-internal use, we need to use the old versions until the next time libm's major version number is bumped. Submitted by: bde Reported by: imp, kris	2003-10-27 01:28:07 +00:00
Dag-Erling Smørgrav	2a063d30f6	Better safe than clever. Submitted by: das	2003-10-25 19:53:28 +00:00
Dag-Erling Smørgrav	801517fd4e	Document fabsl(3). Submitted by: Stefan Farfeleder <stefan@fafoe.narf.at>	2003-10-25 13:45:11 +00:00
Dag-Erling Smørgrav	e334ea2edc	- fabsl.c should be named s_fabsl.c for consistency with libmsun's documented naming scheme (unfortunately the documentation isn't in the tree as far as I can tell); no repocopy is required as there is no history to preserve. - replace simple and almost-correct implementation with slightly hackish but definitely correct implementation (tested on i386, alpha, sparc64) which requires pulling in fpmath.h and the MD _fpmath.h from libc. - try not to make a mess of the Makefile in the process. - enterprising minds are encouraged to implement more C99 long double functions.	2003-10-25 09:32:18 +00:00
Dag-Erling Smørgrav	4318dce616	Connect fabsl.c to the build.	2003-10-23 08:23:51 +00:00
Dag-Erling Smørgrav	29bd23abf0	Add prototypes for all long double functions in C99. Leave them all #if 0'd out, except for fabsl(3) which I've implemented.	2003-10-23 08:23:38 +00:00
Dag-Erling Smørgrav	017e4316ae	Implement fabsl(3), allowing the world to build with -fno-builtin.	2003-10-23 08:20:47 +00:00
Gordon Tetlow	41d8423f71	Stage 3 of dynamic root support. Make all the libraries needed to run binaries in /bin and /sbin installed in /lib. Only the versioned files reside in /lib, the .so symlink continues to live /usr/lib so the toolchain doesn't need to be modified.	2003-08-17 08:28:46 +00:00
Bruce Evans	262e4c00bd	Fixed some style bugs (misplacement and misformatting of some commented-out code).	2003-07-23 09:24:44 +00:00
Peter Wemm	3819e84017	Only provide one copy of the math functions. If we provide a MD function, do not also provide a __generic_XXX version as well. This is how we used to runtime select the generic vs i387 versions on the i386 platform. This saves a pile of #defines in the src/math_private.h file to undo the __generic_XXX renames in some of the *.c files.	2003-07-23 04:53:47 +00:00
Peter Wemm	d48084b9e5	No longer need the internal __get_hw_float() function.	2003-07-23 04:25:04 +00:00
Peter Wemm	c3e6df78e1	Now that we do not need to do runtime detection for the broken default fp emulator, stop doing the runtime selection of hardware or emulated floating point operations on i386. Note that I have not suppressed the duplicate compiles yet. While here, fix the alpha. It has provided specific copysign/copysignf functions since the beginning of time, but they have never been used.	2003-07-23 04:23:36 +00:00
Mike Barcroft	6f9622a926	Fix two misuses of __BSD_VISIBLE. Submitted by: bde Approved by: re	2003-05-22 17:07:57 +00:00
Peter Wemm	8e80f8a438	AMD64 support (another IEEEFP platform)	2003-04-30 21:06:30 +00:00
David Schultz	6d3bd9530d	Fix braino in definition of isfinite(). Noticed by: marcus Pointy hat to: das	2003-04-04 13:27:47 +00:00
Ruslan Ermilov	3892c30012	mdoc(7) police: Nits.	2003-03-02 21:04:21 +00:00
Warner Losh	457f6cd2d6	- gamma_r, lgamma_r, gammaf_r, and lgammaf_r were protected by _REENTRANT in math.h; the consensus here was that __BSD_VISIBLE was correct instead. - gamma_r, lgamma_r, gammaf_r, and lgammaf_r had no documentation in the lgamma(3) manpage. Reviewed by: standards@ Submitted by: Ben Mesander	2003-02-26 13:12:03 +00:00
Mike Barcroft	5d62092f94	o Implement C99 classification macros isfinite(), isinf(), isnan(), isnormal(). The current isinf() and isnan() are perserved for binary compatibility with 5.0, but new programs will use the macros. o Implement C99 comparison macros isgreater(), isgreaterequal(), isless(), islessequal(), islessgreater(), isunordered(). Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU>	2003-02-12 20:03:41 +00:00
Mike Barcroft	8e9b28311e	Implement C99's signbit() macro.	2003-02-11 21:56:21 +00:00
Mike Barcroft	8cf5ed5125	Implement fpclassify(): o Add a MD header private to libc called _fpmath.h; this header contains bitfield layouts of MD floating-point types. o Add a MI header private to libc called fpmath.h; this header contains bitfield layouts of MI floating-point types. o Add private libc variables to lib/libc/$arch/gen/infinity.c for storing NaN values. o Add __double_t and __float_t to <machine/_types.h>, and provide double_t and float_t typedefs in <math.h>. o Add some C99 manifest constants (FP_ILOGB0, FP_ILOGBNAN, HUGE_VALF, HUGE_VALL, INFINITY, NAN, and return values for fpclassify()) to <math.h> and others (FLT_EVAL_METHOD, DECIMAL_DIG) to <float.h> via <machine/float.h>. o Add C99 macro fpclassify() which calls __fpclassify{d,f,l}() based on the size of its argument. __fpclassifyl() is never called on alpha because (sizeof(long double) == sizeof(double)), which is good since __fpclassifyl() can't deal with such a small `long double'. This was developed by David Schultz and myself with input from bde and fenner. PR: 23103 Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU> (significant portions) Reviewed by: bde, fenner (earlier versions)	2003-02-08 20:37:55 +00:00
Jens Schweikhardt	9d5abbddbf	Correct typos, mostly s/ a / an / where appropriate. Some whitespace cleanup, especially in troff files.	2003-01-01 18:49:04 +00:00
Jens Schweikhardt	57bd0fc6e8	english(4) police.	2002-12-27 12:15:40 +00:00
Archie Cobbs	83999f5a32	Re-apply the previously backed-out commit that fixes the problem where HUGE_VAL is not properly aligned on some architectures. The previous fix now works because the two versions of 'math.h' (include/math.h and lib/msun/src/math.h) have since been merged into one. PR: bin/43544	2002-10-31 23:05:20 +00:00
Mark Murray	bf2f52b5fa	Remove duplicate declaration.	2002-10-23 17:35:11 +00:00
Bruce Evans	54e9b36765	Fixed a last-minute editing error in previous commit. nfs and/or cvs replaced a 14-byte change in the middle of the file with 14 NULs at EOF despite or because of aborting the initial commit to pick up the change.	2002-10-01 11:44:35 +00:00

... 2 3 4 5 6 ...

460 Commits