faster Float32 and Float16 pow #40236
Conversation
(force-pushed from d1cd58b to 7aab2dd)
I think this is ready to go. I haven't done fully rigorous tests on it, but every test case I've used has worked, and conceptually this should be just over 0.5 ULP.
I've tested a wide variety of random numbers and haven't found more than 0.5 ULP of error.
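For reference, accuracy claims like the one above can be spot-checked against a `BigFloat` result. This helper is ours, not from the PR: it expresses the difference between a `Float32` result and a high-precision reference in units of the result's own spacing.

```julia
# Rough ULP-error check (our helper, not from the PR): compare a Float32
# result against a BigFloat reference, measured in units of eps(val).
ulp_error(val::Float32, ref::BigFloat) = abs(Float64(val) - Float64(ref)) / eps(val)

# A result exactly one float-spacing above the true value is 1 ULP off:
ulp_error(1.0f0 + eps(1.0f0), big"1.0")  # → 1.0
```

Sweeping random `(x, y)` pairs through a helper like this is how "haven't found more than 0.5 ULP" style claims are usually checked.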
Bumping this. Can someone look at it and merge?
```diff
@@ -867,29 +867,35 @@ end
     z
 end
 @inline function ^(x::Float32, y::Float32)
-    z = ccall("llvm.pow.f32", llvmcall, Float32, (Float32, Float32), x, y)
+    z = Float32(exp2_fast(log2(Float64(x))*y))
```
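The idea behind the new line can be sketched with Base's public `exp2` standing in for the internal `exp2_fast` (an assumption on our part): do the `log2` and the multiply in `Float64`, whose extra 29 mantissa bits absorb the intermediate rounding error, then round back to `Float32` once at the end.

```julia
# Sketch of the approach (Base exp2 in place of the internal exp2_fast):
# x^y = 2^(y * log2(x)), with the intermediate computed in Float64 so the
# single final rounding to Float32 dominates the error.
pow32_sketch(x::Float32, y::Float32) = Float32(exp2(log2(Float64(x)) * y))

pow32_sketch(2.0f0, 10.0f0)  # → 1024.0f0
```

Note that `log2` of a negative `Float64` throws a `DomainError`, which is the failure mode reported further down in this thread.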
Why don't we want to trust LLVM (aka libm) here anymore?
Julia 1.6 introduced about a 3x regression on this, due to a switch in which libm gets loaded.
Should we fix openlibm?
Probably. That said, I generally think that we shouldn't be relying on external libraries, so I'm not that sad about replacing this anyway.
@oscardssmith, does this cause a problem when used with `@fastmath`?

```julia
julia> versioninfo()
Julia Version 1.7.0-DEV.1006
Commit 248c02f531* (2021-04-24 17:37 UTC)
Platform Info:
  OS: Windows (x86_64-w64-mingw32)
  CPU: Intel(R) Core(TM) i7-8565U CPU @ 1.80GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-11.0.1 (ORCJIT, skylake)

julia> Float16(-1)^2
Float16(1.0)

julia> @fastmath Float16(-1)^2 # used in Colors.jl
ERROR: DomainError with -1.0:
log2 will only return a complex result if called with a complex argument. Try log2(Complex(x)).
```

The previous commit is OK:

```julia
julia> versioninfo()
Julia Version 1.7.0-DEV.998
Commit ac7974acef* (2021-04-23 20:59 UTC)
Platform Info:
  OS: Windows (x86_64-w64-mingw32)
  CPU: Intel(R) Core(TM) i7-8565U CPU @ 1.80GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-11.0.1 (ORCJIT, skylake)

julia> @fastmath Float16(-1)^2
Float16(1.0)
```
Another case (found in ColorVectorSpace.jl):

```julia
julia> (-1.0f0)^2.0f0
ERROR: DomainError with -1.0:
log2 will only return a complex result if called with a complex argument. Try log2(Complex(x)).
```

cf. PkgEval: https://github.com/JuliaCI/NanosoldierReports/blob/master/pkgeval/by_date/2021-04/25/report.md
That's unfortunate. I'll revert tomorrow unless I can think of something clever to fix it without too much performance impact.
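One hypothetical way around the `DomainError` (our sketch, not the fix that landed in the PR) is to peel the sign off before taking `log2`: `x^y` is real for negative `x` only when `y` is an integer, so the sign can be handled separately from the magnitude.

```julia
# Hypothetical sign-aware wrapper (not the PR's code): take log2 of |x|
# and reattach the sign when the exponent is an integer.
function pow32_signed(x::Float32, y::Float32)
    m = Float32(exp2(log2(abs(Float64(x))) * y))
    x >= 0 && return m
    isinteger(y) || throw(DomainError(x, "negative base needs an integer exponent"))
    return isodd(Int(y)) ? -m : m  # (-|x|)^y = ±|x|^y by parity of y
end

pow32_signed(-1.0f0, 2.0f0)  # → 1.0f0, rather than the DomainError above
```

The extra branch is roughly what such a fix costs in the hot path, which is the performance trade-off being weighed here.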
This reverts commit 1474566.
Approximately .5 ULP, relatively fast. Update float^integer as well
Returns `Float32` pow speed to roughly the speed of 1.5. Also speeds up `Float16` pow. These new methods are 0.5 ULP (from limited testing). There is further room to optimize these, but this fixes the regression. At some point, I hope to have a `Float64` version, but that will be much harder, as it requires a `log2` function that gives extra bits of precision and an `exp2` function that takes in a `Double Double`.
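A "double-double" represents a value as an unevaluated `hi + lo` pair of `Float64`s, giving roughly 106 mantissa bits — the extra precision a correctly rounded `Float64` pow would need from its `log2`. A minimal building block for that representation is Knuth's two-sum, which computes the pair exactly for a single addition (this is a standard textbook routine, not code from the PR):

```julia
# Two-sum: return (hi, lo) with hi + lo == a + b exactly, where hi is the
# rounded Float64 sum and lo is the rounding error that hi dropped.
function two_sum(a::Float64, b::Float64)
    hi = a + b
    v  = hi - a
    lo = (a - (hi - v)) + (b - v)  # recovered rounding error of a + b
    return hi, lo
end

two_sum(1.0, 1e-20)  # → (1.0, 1.0e-20): lo preserves the bits hi lost
```

A double-double `exp2` would take such a `(hi, lo)` pair as its argument so that none of the extra `log2` precision is thrown away before the final rounding.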