freebsd

mirror of https://git.FreeBSD.org/src.git synced 2024-12-19 10:53:58 +00:00

Bruce Evans 4339c67c48 Moved the optimization for tiny x from __kernel_{cos,sin}[f](x) to {cos,sin}[f](x) so that x doesn't need to be reclassified in the "kernel" functions to determine if it is tiny (it still needs to be reclassified in the cosine case for other reasons that will go away). This optimization is quite large for exponentially distributed x, since x is tiny for almost half of the domain, but it is a pessimization for uniformly distributed x since it takes a little time for all cases but rarely applies. Arg reduction on exponentially distributed x rarely gives a tiny x unless the reduction is null, so it is best to only do the optimization if the initial x is tiny, which is what this commit arranges. The immediate result is an average optimization of 1.4% relative to the previous version in a case that doesn't favour the optimization (double cos(x) on all float x) and a large pessimization for the relatively unimportant cases of lgamma[f][_r](x) on tiny, negative, exponentially distributed x. The optimization should be recovered for lgamma() as part of fixing lgamma()'s low-quality arg reduction. Fixed various wrong constants for the cutoff for "tiny". For cosine, the cutoff is when x**2/2! == {FLT or DBL}_EPSILON/2. We round down to an integral power of 2 (and for cos() reduce the power by another 1) because the exact cutoff doesn't matter and would take more work to determine. For sine, the exact cutoff is larger due to the ratio of terms being x**2/3! instead of x**2/2!, but we use the same cutoff as for cosine. We now use a cutoff of 2**-27 for double precision and 2**-12 for single precision. 2**-27 was used in all cases but was misspelled 2**27 in comments. Wrong and sloppy cutoffs just cause missed optimizations (provided the rounding mode is to nearest -- other modes just aren't supported).		2005-10-24 14:08:36 +00:00
alpha	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
amd64	Add a missing ldexpf() alias for amd64.	2005-09-12 20:54:00 +00:00
arm	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
bsdsrc	Fixed aliasing bugs in TRUNC() by using the fdlibm macros for access	2005-09-19 11:28:19 +00:00
i387	Prevent these functions from using stack outside of their frame.	2005-05-06 15:44:20 +00:00
ia64	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
man	Markup nit.	2005-06-16 21:56:03 +00:00
powerpc	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
sparc64	Replace fegetmask() and fesetmask() with feenableexcept(),	2005-03-16 19:03:46 +00:00
src	Moved the optimization for tiny x from __kernel_{cos,sin}[f](x) to	2005-10-24 14:08:36 +00:00
Makefile	Bump the shared library version number of all libraries that have not	2005-07-22 17:19:05 +00:00