Floating Point

Overview

The IEEE floating-point standard defines an encoding for numbers of the form $(-1)^{s} \times M \times 2^{E}$, where $s$ is the sign bit, $M$ the significand, and $E$ the exponent. The binary representation of a floating-point number is divided into three fields: the sign bit, the exponent field, and the fraction field. How the exponent and fraction fields are interpreted depends on which of three classes the bit pattern falls into:

Normalized Form
- Here the exponent field is neither all 0s nor all 1s.
- The significand is $M = 1 + f$, where $f$ denotes the fractional part, $0 \le f < 1$.
- $E = e - \mathit{Bias}$, where $e$ is the unsigned interpretation of the exponent field.
Denormalized Form
- Here the exponent field is all 0s.
- The significand is $M = f$, where $f$ denotes the fractional part; there is no implied leading 1.
- $E = 1 - \mathit{Bias}$ rather than $-\mathit{Bias}$, which gives a smooth transition between denormalized and normalized values.
Special Values
- Here the exponent field is all 1s.
- If the fraction field is all 0s, the value is $+\infty$ or $-\infty$, depending on the sign bit.
- If the fraction field is not all 0s, the value is NaN (“not a number”).
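
The three classes can be told apart by looking at the exponent field alone, with the fraction field only needed to separate $\infty$ from NaN. The following is a minimal sketch, assuming a 32-bit `float` laid out as 1 sign bit, 8 exponent bits, and 23 fraction bits; the names `float_class`, `float_bits`, and `classify_bits` are hypothetical and chosen for illustration.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical names for the classes described above. */
typedef enum {
    CLASS_NORMALIZED,
    CLASS_DENORMALIZED,
    CLASS_INFINITY,
    CLASS_NAN
} float_class;

/* Copy the raw bit pattern of a float into an unsigned integer. */
uint32_t float_bits(float x) {
    uint32_t bits;
    /* Assumption: float is 32 bits wide on this platform. */
    assert(sizeof x == sizeof bits);
    memcpy(&bits, &x, sizeof bits);
    return bits;
}

/* Classify a 32-bit pattern by its exponent and fraction fields. */
float_class classify_bits(uint32_t bits) {
    uint32_t exp_field  = (bits >> 23) & 0xFFu;   /* 8 exponent bits  */
    uint32_t frac_field = bits & 0x7FFFFFu;       /* 23 fraction bits */

    if (exp_field == 0)                           /* all 0s */
        return CLASS_DENORMALIZED;
    if (exp_field == 0xFFu)                       /* all 1s */
        return frac_field == 0 ? CLASS_INFINITY : CLASS_NAN;
    return CLASS_NORMALIZED;
}
```

Under these assumptions, `classify_bits(float_bits(1.0f))` yields `CLASS_NORMALIZED`, while `classify_bits(float_bits(0.0f))` yields `CLASS_DENORMALIZED`, since zero has an all-0s exponent field.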

The $\mathit{Bias}$ in the first two forms is $2^{k - 1} - 1$, where $k$ is the number of bits in the exponent field. In C, on implementations that use the IEEE formats, the fields have the following widths, which gives a bias of 127 for `float` and 1023 for `double`:

| Declaration | Sign Bit | Exponent Field | Fraction Field |
| --- | --- | --- | --- |
| `float` | `1` | `8` | `23` |
| `double` | `1` | `11` | `52` |
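
As a sketch of how the fields and the bias combine into a value, the function below decodes a 32-bit pattern using the `float` widths from the table and the formula $(-1)^{s} \times M \times 2^{E}$. The names `power_of_two` and `decode_bits` are hypothetical, and special values (exponent field all 1s) are deliberately left out.

```c
#include <assert.h>
#include <stdint.h>

enum { EXP_BITS = 8, FRAC_BITS = 23 };   /* float widths from the table */

/* 2^n as a double, by repeated multiplication or division.
   Exact for the small exponents used here. */
double power_of_two(int n) {
    double r = 1.0;
    for (; n > 0; n--) r *= 2.0;
    for (; n < 0; n++) r /= 2.0;
    return r;
}

/* Decode a normalized or denormalized bit pattern into its value. */
double decode_bits(uint32_t bits) {
    int bias = (1 << (EXP_BITS - 1)) - 1;                 /* 2^(k-1) - 1 = 127 */
    uint32_t s    = bits >> 31;                           /* sign bit          */
    uint32_t e    = (bits >> FRAC_BITS) & ((1u << EXP_BITS) - 1);
    uint32_t frac = bits & ((1u << FRAC_BITS) - 1);

    /* Infinity and NaN are not handled in this sketch. */
    assert(e != (1u << EXP_BITS) - 1);

    double f = (double)frac * power_of_two(-FRAC_BITS);   /* 0 <= f < 1 */
    double M = (e == 0) ? f : 1.0 + f;                    /* significand */
    int    E = (e == 0) ? 1 - bias : (int)e - bias;       /* exponent    */

    double value = M * power_of_two(E);
    return s ? -value : value;
}
```

For example, the pattern `0xC0200000` has $s = 1$, $e = 128$, and $f = 0.25$, so it decodes to $-1.25 \times 2^{1} = -2.5$. The largest denormalized pattern `0x007FFFFF` and the smallest normalized pattern `0x00800000` differ by exactly $2^{-149}$, the same spacing as between neighboring denormalized values, which is the smooth transition that $E = 1 - \mathit{Bias}$ provides.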

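The precision of a floating-point type refers to the number of significant bits in the significand: the width of the fraction field plus the one implied leading bit of normalized values, which gives 24 bits for `float` and 53 bits for `double`.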
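
One visible consequence of limited precision is that not every integer fits in a `float`. With a 23-bit fraction field plus the implied leading 1, a `float` carries 24 significant bits, so integers above $2^{24}$ start to be skipped. The sketch below checks this with a round trip; the function name `survives_float_round_trip` is hypothetical, and the code assumes `float` is the IEEE single-precision format.

```c
#include <assert.h>
#include <float.h>
#include <stdint.h>

/* Returns 1 if n converts to float and back without changing value. */
int survives_float_round_trip(int32_t n) {
    /* Assumption: float has 24 significant bits (IEEE single precision). */
    assert(FLT_MANT_DIG == 24);

    /* volatile forces the value to be stored as a real float,
       so no extra precision is kept in a wider register. */
    volatile float f = (float)n;

    /* Compare in 64 bits: (float)INT32_MAX rounds up to 2^31,
       which would not fit back into an int32_t. */
    return (int64_t)f == (int64_t)n;
}
```

Here $2^{24} = 16777216$ survives the round trip, but $2^{24} + 1 = 16777217$ does not, because it would need 25 significant bits.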

Rounding

Because floating-point arithmetic can’t represent every real number, it must round results to the “nearest” representable number, however “nearest” is defined. The IEEE floating-point standard defines four rounding modes to influence this behavior:

Round-to-even, the default mode, rounds to the closest representable value. When a value lies exactly halfway between two representable values, it rounds to the one whose least significant digit is even.
Round-toward-zero rounds downward for positive values and upward for negative values.
Round-down always rounds downward, toward $-\infty$.
Round-up always rounds upward, toward $+\infty$.
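
To make the four modes concrete, the sketch below applies each of them when rounding a `double` to a whole number in software. It illustrates the definitions above and is not how hardware or the C library performs rounding; the names `round_mode` and `round_to_integer` are hypothetical, and the input is assumed to be finite and well within the range of `long long`.

```c
#include <assert.h>

/* Hypothetical names for the four IEEE rounding modes. */
typedef enum {
    ROUND_TO_EVEN,
    ROUND_TOWARD_ZERO,
    ROUND_DOWN,
    ROUND_UP
} round_mode;

/* Round x to a whole number according to the given mode. */
long long round_to_integer(double x, round_mode mode) {
    /* Assumption: x is finite and fits in a long long (this rejects NaN too). */
    assert(x > -9.0e18 && x < 9.0e18);

    long long t = (long long)x;       /* the cast truncates toward zero       */
    double diff = x - (double)t;      /* leftover fraction, same sign as x    */
    double mag  = diff < 0.0 ? -diff : diff;
    long long away = diff < 0.0 ? t - 1 : t + 1;   /* neighbor away from zero */

    switch (mode) {
    case ROUND_TOWARD_ZERO:
        return t;
    case ROUND_DOWN:
        return diff < 0.0 ? t - 1 : t;
    case ROUND_UP:
        return diff > 0.0 ? t + 1 : t;
    case ROUND_TO_EVEN:
        if (mag < 0.5) return t;
        if (mag > 0.5) return away;
        return (t % 2 == 0) ? t : away;   /* exact tie: pick the even neighbor */
    }
    return t;
}
```

With this sketch, 2.5 rounds to 2 under round-to-even, round-toward-zero, and round-down, and to 3 under round-up, while -1.5 rounds to -2, -1, -2, and -1 respectively. In standard C, the rounding mode actually used by floating-point arithmetic can be selected with `fesetround` from `<fenv.h>`, using the macros `FE_TONEAREST`, `FE_TOWARDZERO`, `FE_DOWNWARD`, and `FE_UPWARD` where the implementation supports them.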

Arithmetic

Bibliography

Bryant, Randal E., and David O’Hallaron. Computer Systems: A Programmer’s Perspective. Third edition, Global edition. Always Learning. Pearson, 2016.
“Scientific Notation.” In Wikipedia, March 6, 2024. https://en.wikipedia.org/w/index.php?title=Scientific_notation&oldid=1212169750.
