ISOCPP std-proposals List: Re: [std-proposals] Faster pow for integral exponents?

From: Jonathan Wakely <cxx_at_[hidden]>
Date: Mon, 24 Mar 2025 11:35:59 +0000

On Mon, 24 Mar 2025 at 09:39, Robin Savonen Söderholm <
robinsavonensoderholm_at_[hidden]> wrote:

> Cool! So standardizing this should not be a problem? I read from the first
> link that powi(0.,<negative number>) gives Inf, my reference implementation
> really does an 1./0. (which should end up as NaN). Is there a problem that
> this differs? (NaN is more correct from a mathematics point of view.)
>

I think IEEE requires inf for x/0.

>
> On Mon, Mar 24, 2025, 10:31 Jonathan Wakely <cxx_at_[hidden]> wrote:
>
>>
>>
>> On Mon, 24 Mar 2025, 09:15 Robin Savonen Söderholm via Std-Proposals, <
>> std-proposals_at_[hidden]> wrote:
>>
>>> Hi!
>>> Unsure if this should be a proposal here or if it is more of a
>>> implementation detail, but from what I can read upon cppreference it seems
>>> to be correct to float it here.
>>>
>>> Background/Motivation:
>>> I have seen and experienced time when you want to do something like
>>> <double> ^ <some int larger than 2 but less than say 100>. And in general
>>> there are two different approaches here: use `std::pow(...)` or roll your
>>> own manual integer-pow function. The thing is that std::pow(...) can be
>>> slower than just doing a naive loop with multiplications for small
>>> integers, but the roll will quickly become slower as you approach higher
>>> exponents (I think my desktop showed signs of serious degradation at
>>> between 32 and 64 values in exponents).
>>> I propose a slightly different integer-exponent pow that has a
>>> O(log2(n)) complexity (in contrast to the naive loop that as O(n), where
>>> 'n' here is the exponent). Some quick benchmarks showed that this
>>> implementation seemed to be as fast as the naive solution at low exponents
>>> and did not seem to get worse than std::pow for large N. An implementation
>>> can be found here:
>>> dooc-template-library/lib/math/inc/dtl/lpow.hpp at main ·
>>> doocman/dooc-template-library · GitHub
>>> <https://github.com/doocman/dooc-template-library/blob/main/lib/math/inc/dtl/lpow.hpp>
>>> , the benchmarks was here:
>>> dooc-template-library/lib/math/benchmarks/lpow.bb.cpp at main ·
>>> doocman/dooc-template-library · GitHub
>>> <https://github.com/doocman/dooc-template-library/blob/main/lib/math/benchmarks/lpow.bb.cpp>
>>> and a godbolt link: https://godbolt.org/z/Khz9K385P
>>>
>>> In addition to speed, the new power-function has the advantage of being
>>> usable with any type that supports multiplication (and division in the
>>> negative exponent case).
>>>
>>> Note: I am sure there are certain corner-cases still left with the
>>> reference implementation, but I still think that it can be useful to better
>>> merge readability with performance. And there could be further
>>> optimisations to do to improve the speed further (looking into how branches
>>> are made in the assembly, vectorising operations (can probably only be made
>>> on certain types)).
>>>
>>> Proposal 1:
>>> The first version of the proposal is to add an alternative to std::pow
>>> that could be named like `std::lpow` or `std::powi` with the signature:
>>> ```c++
>>> constexpr auto lpow(/*multiplicable*/ auto b, std::integral auto e)
>>> requires(!std::is_signed_v<decltype(e)> || /*divisable<decltype(b)>) ->
>>> decltype(b) {...}
>>> ```
>>> . Then linters in user-code can identify places where an integer (or at
>>> least integer literal/compile time integer) is used in a std::pow and
>>> suggest using the new function.
>>>
>>>
>>> Proposal 2:
>>> The second version would be to simply overload the existing `std::pow`
>>> with the aforementioned signature and let the compiler automatically use
>>> the appropriate overload. I don't have this as the first version because
>>> there could be cornercases in behavious that I am missing, causing this to
>>> become a breaking change. For example, the return-type of std::pow(float,
>>> std::int_least64_t) seems to return double on both GCC and MSVC while
>>> current reference returns a float.
>>>
>>> Proposal 3:
>>> An alternative solution could be to incorporate both versions but with a
>>> change: the std::pow should have it's returntype unchanged from todays
>>> implementation.
>>>
>>> Ending
>>> Now I can't help but ending with a question: this 'power strategy' feels
>>> like it should not be a too novel approach, so is there a reason why we
>>> don't already have something like this?
>>>
>>
>> We do, just not in the standard.
>>
>> https://www.delorie.com/djgpp/doc/libc/libc_621.html
>>
>> https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#index-_005f_005fbuiltin_005fpowi
>>
>> Implementations should probably just optimise pow to powi when possible.
>>
>>
>>

Received on 2025-03-24 11:36:15