Accuracy of single precision functions #212

ViralBShah · 2020-08-28T21:25:01Z

@zimmermann6 has a publication comparing the accuracy of single precision functions across several libms.

https://members.loria.fr/PZimmermann/papers/accuracy.pdf

The tests are with openlibm 0.4.1, and it would be nice to be able to try the latest release. In general musl fares better.

kargl · 2020-08-30T17:40:37Z

While the max ULP is an important metric, two additional metrics may be more important: speed and distribution of ULP (if max ULP > 1). Consider j0f(x). It is well-known that polynomial approximations break down near zeros of a function. Exhaustive testing in the domain [1:4], which contains 1 zero, shows

% tlibm -DfEsP -x 1 -X 4 j0
Interval tested for j0f: [1,4]
1000000 calls, 0.090046 secs, 0.09005 usecs/call
       ulp <= 0.5: 49.102%   8237966 |  49.102%   8237966
0.5 <  ulp <  0.6:  7.534%   1263940 |  56.636%   9501906
0.6 <  ulp <  0.7:  6.538%   1096871 |  63.174%  10598777
0.7 <  ulp <  0.8:  5.690%    954682 |  68.864%  11553459
0.8 <  ulp <  0.9:  4.883%    819227 |  73.747%  12372686
0.9 <  ulp <  1.0:  4.015%    673617 |  77.762%  13046303
1.0 <  ulp <  1.5: 12.171%   2042032 |  89.933%  15088335
1.5 <  ulp <  2.0:  5.488%    920737 |  95.422%  16009072
2.0 <  ulp <  3.0:  3.391%    568873 |  98.812%  16577945
3.0 <  ulp <     :  1.188%    199271 | 100.000%  16777216
Max ulp: 900690.687500 at 2.40482545e+00

The reference function was j0(x). The zeros of the regular Bessel functions are simple zeros, so the polynomial approximation can take this into account. For example, j0f(x) = (z - z0) * P(x) with the zero z0 split into high and low parts with a total of 48 bits of precision. This gives

% ./tlibm -DfEsP -x 1 -X 4 j0
Interval tested for j0f: [1,4]
1000000 calls, 0.045775 secs, 0.04578 usecs/call
       ulp <= 0.5: 57.112%   9581798 |  57.112%   9581798
0.5 <  ulp <  0.6:  7.607%   1276254 |  64.719%  10858052
0.6 <  ulp <  0.7:  5.920%    993139 |  70.639%  11851191
0.7 <  ulp <  0.8:  4.656%    781177 |  75.295%  12632368
0.8 <  ulp <  0.9:  3.957%    663922 |  79.252%  13296290
0.9 <  ulp <  1.0:  3.321%    557242 |  82.573%  13853532
1.0 <  ulp <  1.5: 10.080%   1691123 |  92.653%  15544655
1.5 <  ulp <  2.0:  4.563%    765544 |  97.216%  16310199
2.0 <  ulp <  3.0:  2.529%    424342 |  99.746%  16734541
3.0 <  ulp <     :  0.254%     42675 | 100.000%  16777216
Max ulp: 5.246863 at 3.98081970e+00

A much more accurate approximation that is twice as fast!

Unfortunately, Bessel functions are not periodic so argument reduction cannot be used. Thus, one needs multiple polynomial approximations to cover a significant domain. J. Harrison [1] proposed an algorithm that uses multiple polynomial for x < 45 and an asymptotic expansion about the zeros for x > 45. I haven't had time to pursue this.

[1] (https://www.cl.cam.ac.uk/~jrh13/papers/bessel.html)

zimmermann6 · 2020-08-31T06:15:39Z

Dear Steve, on my side I am more interested by the max ULP. I know other people are more interested by speed. I guess the Intel Math Library implements the algorithm from John Harrison.

zimmermann6 · 2020-09-16T06:26:24Z

@zimmermann6 has a publication comparing the accuracy of single precision functions across several libms.

https://members.loria.fr/PZimmermann/papers/accuracy.pdf

The tests are with openlibm 0.4.1, and it would be nice to be able to try the latest release. In general musl fares better.

I have updated my results with OpenLibm 0.7.0. The results are almost the same, except for y1f whose largest error has improved. The url is still the same. Previous versions are available from https://members.loria.fr/PZimmermann/papers/

zimmermann6 · 2020-12-04T16:32:43Z

I have revised my note with double and quadruple precision, and bivariate functions like atan2, hypot, pow:

https://homepages.loria.fr/PZimmermann/papers/#accuracy

For double precision, the largest error I found for OpenLibm 0.7.0 is less than 4 ulps, except for the Bessel functions, and the lgamma and tgamma functions.

If something has changed in newer versions of OpenLibm, please tell me and I will update my note.

zimmermann6 · 2021-02-05T10:45:47Z

a new version of my note is now available, including the "long double" format:

https://homepages.loria.fr/PZimmermann/papers/#accuracy

I found a few issues with Openlibm, which I reported in separate new issues here.

ViralBShah · 2021-02-06T16:56:12Z

I wonder what the easiest way is to fix some of these. Maybe just pick the good implementations from other libm libraries and bring them in here?

zimmermann6 · 2021-08-03T08:09:31Z

I am about to release of new version of the accuracy comparison. It will be with Openlibm 0.7.5, the last release. For single precision, the large error for powf has been fixed. It remains large errors for the Bessel functions (j0f, j1f, y0f, y1f) and lgammaf. All other functions have a maximal error less than 3.17 ulps.

zimmermann6 · 2021-09-06T08:06:48Z

I said above that in OpenLibm 0.7.5, for single precision, the large error for powf has been fixed. But I have found a new pair of inputs giving a huge error: for x=0x1.ffffecp-1 and y=-0x1.000002p+27 OpenLibm 0.7.5 yields +Inf, whereas the correct result is 0x1.557a86p115 (rounding to nearest).

kargl · 2021-09-06T15:29:25Z

This patch fixes the issue reported by Paul Zimmerman.

--- /usr/src/lib/msun/src/e_powf.c	2021-02-21 03:29:00.956878000 -0800
+++ src/e_powf.c	2021-09-06 08:17:09.800008000 -0700
@@ -136,7 +136,7 @@
     /* |y| is huge */
 	if(iy>0x4d000000) { /* if |y| > 2**27 */
 	/* over/underflow if x is not close to one */
-	    if(ix<0x3f7ffff7) return (hy<0)? sn*huge*huge:sn*tiny*tiny;
+	    if(ix<0x3f7ffff6) return (hy<0)? sn*huge*huge:sn*tiny*tiny;
 	    if(ix>0x3f800007) return (hy>0)? sn*huge*huge:sn*tiny*tiny;
 	/* now |1-x| is tiny <= 2**-20, suffice to compute
 	   log(x) by x-x^2/2+x^3/3-x^4/4 */

Summary: From Steve Kargl: Paul Zimmermann has identified a bug in Openlibm's powf(), which is identical to FreeBSD's libm. Both derived from fdlibm. JuliaMath/openlibm#212. Consider % cat h.c int main(void) { float x, y, z; x = 0x1.ffffecp-1F; y = -0x1.000002p+27F; z = 0x1.557a86p115F; printf("%e %e %e <-- should be %e\n", x, y, powf(x,y), z); return 0; } % cc -o h -fno-builtin h.c -lm && ./h 9.999994e-01 -1.342177e+08 inf <-- should be 5.540807e+34 Reviewers: manu Subscribers: imp, andrew, emaste Differential Revision: https://reviews.freebsd.org/D31865

ViralBShah · 2021-09-07T03:37:57Z

Fix in a090d3e

Summary: From Steve Kargl: Paul Zimmermann has identified a bug in Openlibm's powf(), which is identical to FreeBSD's libm. Both derived from fdlibm. JuliaMath/openlibm#212. Consider % cat h.c int main(void) { float x, y, z; x = 0x1.ffffecp-1F; y = -0x1.000002p+27F; z = 0x1.557a86p115F; printf("%e %e %e <-- should be %e\n", x, y, powf(x,y), z); return 0; } % cc -o h -fno-builtin h.c -lm && ./h 9.999994e-01 -1.342177e+08 inf <-- should be 5.540807e+34 Reviewers: manu Subscribers: imp, andrew, emaste Differential Revision: https://reviews.freebsd.org/D31865

zimmermann6 · 2022-03-24T12:43:06Z

within the CORE-MATH project, we now have a tool to measure the efficiency of math library functions. Here is a comparison between the CORE-MATH routines and OpenLibm: https://sympa.inria.fr/sympa/arc/core-math/2022-03/msg00011.html. The CORE-MATH rountines are under MIT license, and should be easy to integrate in OpenLibm.

zimmermann6 · 2022-05-18T10:28:15Z

here are updated timings for the single precision functions, with revision d14da3d of CORE-MATH, on an Intel i5-4590:

zimmerma@oignon:~/svn/core-math$ LIBM=/tmp/libopenlibm.a CORE_MATH_PERF_MODE=rdtsc ./perf-all.sh 
GNU libc version: 2.33
GNU libc release: release
acoshf 25.235 34.838 57.576
acosf 30.597 33.853 28.818
asinhf 31.645 52.324 77.610
asinf 26.226 30.628 30.436
atan2f 41.272 92.957 83.223
atanhf 29.906 76.711 74.118
atanf 30.843 39.067 31.997
cbrtf 25.882 63.389 28.038
coshf 33.966 25.702 41.201
cosf 37.762 28.093 32.371
erfcf 58.559 104.513 111.192
erff 20.508 75.847 70.405
exp10f 34.963 13.060 13.053
exp2f 16.133 10.229 16.260
expm1f 18.394 54.059 42.504
expf 17.705 10.954 28.197
hypotf 14.142 17.062 56.209
log10f 17.930 24.270 29.549
log1pf 21.399 32.188 31.572
log2f 15.032 12.018 27.589
logf 16.051 12.016 26.305
powf 45.266 29.032 231.151
sinhf 30.979 82.966 65.106
sinf 29.070 26.443 31.827
tanhf 17.895 74.847 62.337
tanf 29.203 65.431 32.322

First column is CORE-MATH's reciprocal throughput, 2nd is for GNU libc, and 3rd for OpenLibm.

@kargl

ae2d916 Correctly round double precision sqrt (#256) 81d5e16 Add fmod assembly version (#255) 465ca0a Update README.md 428e7af Support for riscv64 architecture (#254) ed7aea3 Bump version to 0.8 (#248) 69bb280 Another Windows ARM64 fix (#253) 3d4a902 Fixes for Windows ARM64 (#251) a9568fb [Windows] install import library to libdir (#249) f88e52a CI (Windows): set `msys2 {0}` as the default shell for all Windows steps (#247) b48a2f7 CI (Linux and macOS): Remove the `arch` variable, which currently has no effect (#246) 2a47fa5 CI: A variety of miscellaneous tweaks (#244) d0ef09a prefix symbols with _ for 32-bit x86 Windows (#242) 60dec83 msys2 ci (#243) 6ea5fa2 Merge pull request #240 from JuliaMath/vs/msys 437c139 Update ci.yml e993267 Update ci.yml 4a36c50 Update ci.yml 24cec17 Update ci.yml d26ed98 Update ci.yml 7b96025 Update ci.yml 7af65db Update ci.yml a2e053e Revert "Update ci.yml" 4a52bb0 Update ci.yml fb10fcf Update ci.yml abf5aaa Update ci.yml 98dcc48 Update ci.yml ff822f3 Update ci.yml ab8d1ad Update ci.yml 4d97e2d Update ci.yml 72caeab Update ci.yml 9dd3049 Create ci.yml 15119bc Merge pull request #239 from JuliaMath/revert-238-patch-1 4bca0f2 Revert "prefix symbols with _ for 32-bit x86 Windows" 3b9454f Merge pull request #238 from jeremyd2019/patch-1 6ae6318 Update src/cdefs-compat.h 71a8fd1 Merge pull request #233 from lephe/more-long-double-aliases 7a3ef59 prefix symbols with _ for 32-bit x86 Windows a871457 Merge pull request #230 from PetteriAimonen/master a090d3e Fix powf: JuliaMath/openlibm#212 (comment) 57dd0f9 add missing weak references for long double functions 327b1bd Replace remaining __strong_alias uses f052f42 Merge pull request #228 from JuliaMath/aa/hypotl 711654e Fix incorrect results in `hypotl` near underflow aeab19f Fix for #211 Co-authored by: @kargl 5449705 Merge pull request #227 from JuliaMath/vs/powf 98f8713 Fix #211 Patched by importing latest msun version 6a85b33 Merge pull request #225 from JuliaMath/vs/strict_assign 40dac9d Restore STRICT_ASSIGN on FreeBSD as suggested in #215 2d10c90 Merge pull request #218 from jcestibariz/fix-wasm32 5d70ac5 Merge pull request #221 from maleadt/tb/static_fenv 63aa875 Make fenv methods static on additional platforms. 9152b0d Fix compilation errors on wasm32 3cb8045 Merge pull request #217 from epsilon-0/master c856101 Merge pull request #219 from maleadt/tb/dont_export_fenv be31bff Revert "Export `fenv` functions on all platforms (#213)" eb21e8a don't alter toolchain vars if already provided b34f107 Fix Apple Silicon build (#214) 5a27b4c Export `fenv` functions on all platforms (#213) 878948d Update list of libm libraries 508603d Update index.html 0276147 Merge pull request #209 from embeddedartistry/master f2a8b36 Update download links to point to releases 1d6befd Merge pull request #208 from cndesantana/patch-1 4f559d4 Replace a few remaining __strong_reference uses (#210) 0418324 Refactor: OLM_DLLEXPORT definition now lives in a standalone header. 861b2ad Fix small typo 5b0e7e9 Update FUNDING.yml 382b8e9 Add musl-libc math library to resources. f6ad75a update openlibm website. 5efed30 Bump SONAME as discussed in #200 e6ac7d7 Update README with new OS and arch support f731481 Merge pull request #199 from llucinat/wasm32-weakref f952e16 Fix weak reference macro redefinition in wasm32 target 97de1a4 Merge pull request #198 from gufe44/netbsd-fix-openlibm_weak_reference c4dca1e Add files via upload d4077aa Suggestions ea065f9 Update src/cdefs-compat.h 2080b23 NetBSD fix 14bf902 Merge pull request #195 from ode33/patch-1 3bb2215 Update README.md 33c8313 Create FUNDING.yml 72f33a3 wasm32 support (#192) f24b1bf Fix compilation of gcc when using openlibm as system libm (#190) 0f22aeb Create CNAME c68e7d2 Delete CNAME b524581 Rename doc -> docs ebbba43 Move website to doc/ on master instead of gh-pages branch 4e3d709 update ULPs for s390 (#187) 65d7406 Merge pull request #185 from sharkcz/s390x 2a1e568 s390(x) port cca41bc Merge branch 'master' of github.com:JuliaMath/openlibm 74b54c7 Add MIPS ce69bf1 Update references to JuliaLang to point to JuliaMath (#182) a96f074 Merge pull request mirage#130 from ginggs/enable-optimization-again c782ca2 Merge pull request #177 from JuliaMath/aa/windows 52df60b Update appveyor.yml ce33de1 Add Windows testing with AppVeyor 4971b56 Update Make.inc ca996ac Merge pull request #180 from JuliaLang/ginggs-0.5.6 73b3d88 Merge pull request #181 from CDLuminate/mipsport 3aa5c3b Merge pull request #174 from iniserve/master 4b4b41c Merge pull request #178 from JuliaLang/aa/upstream a4b3fde travis: Add mips, mipsel, mips64el build. ad9673e Makefile: clean mips/*.o 4dcc76e Using cdefs-compat.h and stdint.h instead <sys/types.h> fenv-softfloat.h file added SOFTFLOAT code parts are not tested. 4f5112e Support for mips architectures a24a5eb Enable optimization again for *int.c and *intf.c a40570b Bump version to 0.5.6 787652b msun: signed overflow in atan2 8d91ecb Add TOOLPREFIX git-subtree-dir: openlibm git-subtree-split: ae2d91698508701c83cab83714d42a1146dccf85

Summary: From Steve Kargl: Paul Zimmermann has identified a bug in Openlibm's powf(), which is identical to FreeBSD's libm. Both derived from fdlibm. JuliaMath/openlibm#212. Consider % cat h.c int main(void) { float x, y, z; x = 0x1.ffffecp-1F; y = -0x1.000002p+27F; z = 0x1.557a86p115F; printf("%e %e %e <-- should be %e\n", x, y, powf(x,y), z); return 0; } % cc -o h -fno-builtin h.c -lm && ./h 9.999994e-01 -1.342177e+08 inf <-- should be 5.540807e+34 Reviewers: manu Subscribers: imp, andrew, emaste Differential Revision: https://reviews.freebsd.org/D31865 (cherry picked from commit 292815e)

ViralBShah mentioned this issue Aug 28, 2020

Faster logl implementation available in FreeBSD msun #203

Open

ViralBShah added a commit that referenced this issue Sep 7, 2021

Fix powf: #212 (comment)

a090d3e

ViralBShah mentioned this issue May 18, 2022

Integrate CORE-MATH implementations into openlibm #260

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Accuracy of single precision functions #212

Accuracy of single precision functions #212

ViralBShah commented Aug 28, 2020

kargl commented Aug 30, 2020

zimmermann6 commented Aug 31, 2020

zimmermann6 commented Sep 16, 2020

zimmermann6 commented Dec 4, 2020

zimmermann6 commented Feb 5, 2021

ViralBShah commented Feb 6, 2021

zimmermann6 commented Aug 3, 2021

zimmermann6 commented Sep 6, 2021

kargl commented Sep 6, 2021 •

edited

Loading

ViralBShah commented Sep 7, 2021

zimmermann6 commented Mar 24, 2022

zimmermann6 commented May 18, 2022

Accuracy of single precision functions #212

Accuracy of single precision functions #212

Comments

ViralBShah commented Aug 28, 2020

kargl commented Aug 30, 2020

zimmermann6 commented Aug 31, 2020

zimmermann6 commented Sep 16, 2020

zimmermann6 commented Dec 4, 2020

zimmermann6 commented Feb 5, 2021

ViralBShah commented Feb 6, 2021

zimmermann6 commented Aug 3, 2021

zimmermann6 commented Sep 6, 2021

kargl commented Sep 6, 2021 • edited Loading

ViralBShah commented Sep 7, 2021

zimmermann6 commented Mar 24, 2022

zimmermann6 commented May 18, 2022

kargl commented Sep 6, 2021 •

edited

Loading