IEEE-754

Roman Töngi · Aug 23, 2007

IEEE-754 Arithmetic:
Most real numbers can't be stored exactly on a computer, but one can
state the range within which the corresponding machine number lies.

For the following example, I assume double precision, that the rounding
mode in effect is 'round to nearest', and that the number lies within the
normalized range:

Definitions:
x := real number
round(x) := correctly rounded normalized number
eps := machine epsilon (2^(-52) for double precision)
abs(x) := absolute value of x

That is:

round(x) = x*(1 + delta)

with delta:

abs(delta) <= 1/2*eps (round to nearest)

i.e. abs(delta) <= 2^(-53) (double precision)

abs(delta) corresponds to the relative rounding error.

Now I can state the range including round(x):

-----------------------------------------
x*(1-2^(-53)) <= round(x) <= x*(1+2^(-53))
-----------------------------------------

Is this the correct range according to my assumptions?

Thanks a lot
Roman

Boudewijn Dijkstra · Aug 23, 2007

On Thu, 23 Aug 2007 12:45:52 +0200, Roman Töngi wrote:

Yes, but your assumptions are invalid. How did you arrive at a machine
epsilon of 2^(-52)?

Eric Sosman · Aug 23, 2007

Roman Töngi wrote on 08/23/07 06:45:

It looks right to me for x >= 0 (for x < 0 the
inequalities are backwards), and given suitable hand-
waving for abs(x) very small or very large. It might
be possible (I'm not sure) to sharpen the analysis a
tiny bit and change a `<=' to a `<', but whether that's
worth trying depends on your purpose in obtaining the
bound in the first place.

Note that the C language does not require IEEE
floating-point, nor does it require round-to-nearest,
nor does it specify the value of eps.

Roman Töngi · Aug 23, 2007

Boudewijn said:
On Thu, 23 Aug 2007 12:45:52 +0200, Roman Töngi wrote:

Yes, but your assumptions are invalid. How did you arrive at a machine
epsilon of 2^(-52)?

From the IEEE-specification for double format.

Boudewijn Dijkstra · Aug 24, 2007

On Thu, 23 Aug 2007 18:08:15 +0200, Roman Töngi wrote:

From the IEEE-specification for double format.

I asked how, not where. Unless it says something like: "the machine
epsilon is 2^(-52); this corresponds to the upper limit of the rounding
error."

cr88192 · Aug 24, 2007

Boudewijn Dijkstra said:
On Thu, 23 Aug 2007 18:08:15 +0200, Roman Töngi wrote:

I asked how, not where. Unless it says something like: "the machine
epsilon is 2^(-52); this corresponds to the upper limit of the rounding
error."

my guess (probably OT here, oh well):

it will be this, presumably, unless the machine computes using fewer bits
than the format (such as if the calculations were internally performed with
floats, or with 48 bit mantissa values, or such).

may be a little higher really, as presumably the exact values of the low
order bits will depend on the exact HW.

for example, calculations performed with doubles in SSE are often slightly
off from those performed in the FPU, given the FPU uses an internal 80 bit
representation (with a 64 bit mantissa).

now, if our basic value is 1, and things are properly normalized (I think
this is required, except in the edge case of very small values), then our
epsilon is about the same as the relative weight of our low order bits.

now, if the major value were something other than 1, then the epsilon would
differ, in step with the exponent.

or such...

Boudewijn Dijkstra · Aug 27, 2007

cr88192 said:
now, if our basic value is 1, and things are properly normalized (I think
this is required, except in the edge case of very small values), then our
epsilon is about the same as the relative weight of our low order bits.

now, if the major value were something other than 1, then the epsilon
would differ, in step with the exponent.

or such...

Exactly. The epsilon will be proportional to the exponent.

Peter J. Holzer · Aug 27, 2007

Exactly. The epsilon will be proportional to the exponent.

And now read the OP again.

hp

Boudewijn Dijkstra · Aug 28, 2007

On Mon, 27 Aug 2007 15:52:30 +0200, Peter J. Holzer wrote:

And now read the OP again.

You're beyond me now. The OP was talking about a constant epsilon for the
whole range of numbers within the normalized range. Or did you read
something else between lines?

CBFalconer · Aug 28, 2007

Boudewijn said:
You're beyond me now. The OP was talking about a constant epsilon
for the whole range of numbers within the normalized range. Or
did you read something else between lines?

You are the first I have noted to consider 'proportional' to denote
a constant.

Peter J. Holzer · Aug 28, 2007

On Mon, 27 Aug 2007 15:52:30 +0200, Peter J. Holzer wrote:

You're beyond me now. The OP was talking about a constant epsilon for the
whole range of numbers within the normalized range.

Yes, but that epsilon was always multiplied by the number:

| round(x) = x*(1 + delta)
             ^ here
|
| with delta:
|
| abs(delta) <= 1/2*eps (round to nearest)
|
| i.e. abs(delta) <= 2^(-53) (double precision)
|
| abs(delta) corresponds to the relative rounding error.
|
| Now I can state the range including round(x):
|
| -----------------------------------------
| x*(1-2^(-53)) <= round(x) <= x*(1+2^(-53))
  ^ here                       ^ here
| -----------------------------------------

This is afaik the normal use of eps. See for example
http://en.wikipedia.org/wiki/Machine_epsilon.

Or did you read something else between lines?

No, I just read the lines.

hp

Boudewijn Dijkstra · Aug 29, 2007

On Tue, 28 Aug 2007 13:59:18 +0200, CBFalconer wrote:

You are the first I have noted to consider 'proportional' to denote
a constant.

You could note that, but it'd be more correct to note that I wasn't
denoting a constant, but an entity identified by the OP as a constant.

Boudewijn Dijkstra · Aug 29, 2007

On Wed, 29 Aug 2007 00:45:30 +0200, Peter J. Holzer wrote:

Yes, but that epsilon was always multiplied by the number:

| round(x) = x*(1 + delta)
             ^ here

Yes, you're right. I was being incredibly thick (which doesn't usually
happen (for this long)).
