Use epsilon() for tolerances #142

certik · 2020-02-04T17:58:55Z

The original tolerances look arbitrary:

real(sp), parameter :: sptol = 1.2e-06_sp
real(dp), parameter :: dptol = 2.2e-15_dp

but when written using epsilon() they are almost exactly 10x the machine
epsilon for the particular precision kind:

real(sp), parameter :: sptol = 10.06633 * epsilon(1._sp)
real(dp), parameter :: dptol = 9.907919 * epsilon(1._dp)

so I just rounded it to 10:

real(sp), parameter :: sptol = 10 * epsilon(1._sp)
real(dp), parameter :: dptol = 10 * epsilon(1._dp)

which now shows the intent more clearly: the tolerance assumes we lose
one digit of accuracy.

The original tolerances look arbitrary: real(sp), parameter :: sptol = 1.2e-06_sp real(dp), parameter :: dptol = 2.2e-15_dp but when written using epsilon() they are almost exactly 10x the machine epsilon for the particular precision kind: real(sp), parameter :: sptol = 10.06633 * epsilon(1._sp) real(dp), parameter :: dptol = 9.907919 * epsilon(1._dp) so I just rounded it to 10: real(sp), parameter :: sptol = 10 * epsilon(1._sp) real(dp), parameter :: dptol = 10 * epsilon(1._dp) which now shows the intent more clearly: the tolerance assumes we lose one digit of accuracy.

jvdp1 · 2020-02-04T18:11:23Z

Both options are fine with me, if we keep it consistent across the files.
Here is an answer of @aradi when I proposed to use epsilon.

certik · 2020-02-04T18:50:03Z

Thanks @jvdp1. @aradi in that comment you gave epsilon(1.0_dp)**(7.0_dp / 8.0_dp) as an example. Rather, as in this PR, one should use 10 * epsilon(1.0_dp), and as shown in the commit description, the result is in fact equivalent to your original hard wired tolerance. (In some other cases, one might need to use something like 1e3_dp * epsilon(1._dp) if more tolerance is needed.)

aradi

Yes, I agree, your solution shows the intention much more, so let's got with that.

nncarlson · 2020-02-05T00:09:37Z

I feel like a drive-by sniper by commenting. But with a carefully chosen data set (e.g., integral values, power of 2 number of values) the calculations will be done without any round-off and tests can be for equality (bit-for-bit). One can dispense entirely with any notion of an appropriate epsilon, and the data easily hard-coded in the test itself without the extra complexity of reading from a file. The test would be far simpler. What exactly is being gained by using data that uses all the significant digits?

jvdp1 · 2020-02-05T08:04:24Z

I feel like a drive-by sniper by commenting. But with a carefully chosen data set (e.g., integral values, power of 2 number of values) the calculations will be done without any round-off and tests can be for equality (bit-for-bit). One can dispense entirely with any notion of an appropriate epsilon, and the data easily hard-coded in the test itself without the extra complexity of reading from a file. The test would be far simpler. What exactly is being gained by using data that uses all the significant digits?

I like this idea. But do you think it is possible for functions more complicate than mean, and with all algorithms?

aradi · 2020-02-05T08:41:12Z

@nncarlson Your comments are more than welcome! We do the PRs in public in order to get critical comments from skilled developers and improve so the quality of stdlib.

As for the bit by bit comparison: I agree, one could maybe use special numbers and check by bitwise comparison. However, we should keep in mind that

this may not work for more complicated functions and
this is not the typical use case.

Especially latter is important IMO, as the test suite should definitely include tests for the typical use cases (using all significant bits). However, adding the specially designed input values with bitwise check for the results could make up for very nice additional tests.

nncarlson · 2020-02-05T14:19:16Z

I like this idea. But do you think it is possible for functions more complicate than mean, and with all algorithms?

Unfortunately not. Only a subset of those things where something rational is being computed. But in cases where it is possible, it makes the tests much more straightforward, and I think that is preferable.

Especially latter [this is not the typical use case] is important IMO, as the test suite should definitely include tests for the typical use cases (using all significant bits).

It's certainly not the typical use case, but what exactly is being tested? I think it is that the correct elements from the input arrays are being summed and normalized by the correct value; i.e., the code of the mean function. For that, carefully chosen integral values (in this case) are sufficient to test that. I think the only thing that using general values (requiriing all significant bits) adds to the test is to test that the hardware does addition and division correctly. But I think that is something this and other tests can assume.

certik · 2020-02-05T21:38:30Z

We can further improve this to test things exactly, but I am also fine with just keeping this as is. This PR is effectively just a refactoring what is in master, so I merged it.

certik requested a review from aradi February 4, 2020 17:58

certik mentioned this pull request Feb 4, 2020

Added support and test for mean of complex arrays #140

Merged

aradi approved these changes Feb 4, 2020

View reviewed changes

certik merged commit 2508db6 into fortran-lang:master Feb 5, 2020

certik deleted the epsilon branch February 5, 2020 21:36

jvdp1 mentioned this pull request Feb 6, 2020

A function to compare two floating-point numbers #145

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use epsilon() for tolerances #142

Use epsilon() for tolerances #142

certik commented Feb 4, 2020

jvdp1 commented Feb 4, 2020

certik commented Feb 4, 2020

aradi left a comment

nncarlson commented Feb 5, 2020

jvdp1 commented Feb 5, 2020

aradi commented Feb 5, 2020

nncarlson commented Feb 5, 2020

certik commented Feb 5, 2020

Use epsilon() for tolerances #142

Use epsilon() for tolerances #142

Conversation

certik commented Feb 4, 2020

jvdp1 commented Feb 4, 2020

certik commented Feb 4, 2020

aradi left a comment

Choose a reason for hiding this comment

nncarlson commented Feb 5, 2020

jvdp1 commented Feb 5, 2020

aradi commented Feb 5, 2020

nncarlson commented Feb 5, 2020

certik commented Feb 5, 2020