Skip to content

Commit aef1140

Browse files
committed
Auto merge of rust-lang#78618 - workingjubilee:ieee754-fmt, r=m-ou-se
Add IEEE 754 compliant fmt/parse of -0, infinity, NaN This pull request improves the Rust float formatting/parsing libraries to comply with IEEE 754's formatting expectations around certain special values, namely signed zero, the infinities, and NaN. It also adds IEEE 754 compliance tests that, while less stringent in certain places than many of the existing flt2dec/dec2flt capability tests, are intended to serve as the beginning of a roadmap to future compliance with the standard. Some relevant documentation is also adjusted with clarifying remarks. This PR follows from discussion in rust-lang/rfcs#1074, and closes rust-lang#24623. The most controversial change here is likely to be that -0 is now printed as -0. Allow me to explain: While there appears to be community support for an opt-in toggle of printing floats as if they exist in the naively expected domain of numbers, i.e. not the extended reals (where floats live), IEEE 754-2019 is clear that a float converted to a string should be capable of being transformed into the original floating point bit-pattern when it satisfies certain conditions (namely, when it is an actual numeric value i.e. not a NaN and the original and destination float width are the same). -0 is given special attention here as a value that should have its sign preserved. In addition, the vast majority of other programming languages not only output `-0` but output `-0.0` here. While IEEE 754 offers a broad leeway in how to handle producing what it calls a "decimal character sequence", it is clear that the operations a language provides should be capable of round tripping, and it is confusing to advertise the f32 and f64 types as binary32 and binary64 yet have the most basic way of producing a string and then reading it back into a floating point number be non-conformant with the standard. Further, existing documentation suggested that e.g. -0 would be printed with -0 regardless of the presence of the `+` fmt character, but it prints "+0" instead if given such (which was what led to the opening of rust-lang#24623). There are other parsing and formatting issues for floating point numbers which prevent Rust from complying with the standard, as well as other well-documented challenges on the arithmetic level, but I hope that this can be the beginning of motion towards solving those challenges.
2 parents feaac19 + e8dfbac commit aef1140

File tree

10 files changed

+275
-211
lines changed

10 files changed

+275
-211
lines changed

library/alloc/src/fmt.rs

+2-3
Original file line numberDiff line numberDiff line change
@@ -161,9 +161,8 @@
161161
//!
162162
//! * `+` - This is intended for numeric types and indicates that the sign
163163
//! should always be printed. Positive signs are never printed by
164-
//! default, and the negative sign is only printed by default for the
165-
//! `Signed` trait. This flag indicates that the correct sign (`+` or `-`)
166-
//! should always be printed.
164+
//! default, and the negative sign is only printed by default for signed values.
165+
//! This flag indicates that the correct sign (`+` or `-`) should always be printed.
167166
//! * `-` - Currently not used
168167
//! * `#` - This flag indicates that the "alternate" form of printing should
169168
//! be used. The alternate forms are:

library/alloc/tests/fmt.rs

+1-2
Original file line numberDiff line numberDiff line change
@@ -151,8 +151,7 @@ fn test_format_macro_interface() {
151151
t!(format!("{:+10.3e}", -1.2345e6f64), " -1.234e6");
152152

153153
// Float edge cases
154-
t!(format!("{}", -0.0), "0");
155-
t!(format!("{:?}", -0.0), "-0.0");
154+
t!(format!("{}", -0.0), "-0");
156155
t!(format!("{:?}", 0.0), "0.0");
157156

158157
// sign aware zero padding

library/core/src/fmt/float.rs

+6-13
Original file line numberDiff line numberDiff line change
@@ -54,21 +54,14 @@ where
5454
}
5555

5656
// Common code of floating point Debug and Display.
57-
fn float_to_decimal_common<T>(
58-
fmt: &mut Formatter<'_>,
59-
num: &T,
60-
negative_zero: bool,
61-
min_precision: usize,
62-
) -> Result
57+
fn float_to_decimal_common<T>(fmt: &mut Formatter<'_>, num: &T, min_precision: usize) -> Result
6358
where
6459
T: flt2dec::DecodableFloat,
6560
{
6661
let force_sign = fmt.sign_plus();
67-
let sign = match (force_sign, negative_zero) {
68-
(false, false) => flt2dec::Sign::Minus,
69-
(false, true) => flt2dec::Sign::MinusRaw,
70-
(true, false) => flt2dec::Sign::MinusPlus,
71-
(true, true) => flt2dec::Sign::MinusPlusRaw,
62+
let sign = match force_sign {
63+
false => flt2dec::Sign::Minus,
64+
true => flt2dec::Sign::MinusPlus,
7265
};
7366

7467
if let Some(precision) = fmt.precision {
@@ -156,14 +149,14 @@ macro_rules! floating {
156149
#[stable(feature = "rust1", since = "1.0.0")]
157150
impl Debug for $ty {
158151
fn fmt(&self, fmt: &mut Formatter<'_>) -> Result {
159-
float_to_decimal_common(fmt, self, true, 1)
152+
float_to_decimal_common(fmt, self, 1)
160153
}
161154
}
162155

163156
#[stable(feature = "rust1", since = "1.0.0")]
164157
impl Display for $ty {
165158
fn fmt(&self, fmt: &mut Formatter<'_>) -> Result {
166-
float_to_decimal_common(fmt, self, false, 0)
159+
float_to_decimal_common(fmt, self, 0)
167160
}
168161
}
169162

library/core/src/num/dec2flt/mod.rs

+7-5
Original file line numberDiff line numberDiff line change
@@ -239,13 +239,15 @@ fn dec2flt<T: RawFloat>(s: &str) -> Result<T, ParseFloatError> {
239239
ParseResult::Valid(decimal) => convert(decimal)?,
240240
ParseResult::ShortcutToInf => T::INFINITY,
241241
ParseResult::ShortcutToZero => T::ZERO,
242-
ParseResult::Invalid => match s {
243-
"inf" => T::INFINITY,
244-
"NaN" => T::NAN,
245-
_ => {
242+
ParseResult::Invalid => {
243+
if s.eq_ignore_ascii_case("nan") {
244+
T::NAN
245+
} else if s.eq_ignore_ascii_case("inf") || s.eq_ignore_ascii_case("infinity") {
246+
T::INFINITY
247+
} else {
246248
return Err(pfe_invalid());
247249
}
248-
},
250+
}
249251
};
250252

251253
match sign {

library/core/src/num/flt2dec/mod.rs

+6-26
Original file line numberDiff line numberDiff line change
@@ -399,45 +399,25 @@ fn digits_to_exp_str<'a>(
399399
/// Sign formatting options.
400400
#[derive(Copy, Clone, PartialEq, Eq, Debug)]
401401
pub enum Sign {
402-
/// Prints `-` only for the negative non-zero values.
403-
Minus, // -inf -1 0 0 1 inf nan
404-
/// Prints `-` only for any negative values (including the negative zero).
405-
MinusRaw, // -inf -1 -0 0 1 inf nan
406-
/// Prints `-` for the negative non-zero values, or `+` otherwise.
407-
MinusPlus, // -inf -1 +0 +0 +1 +inf nan
408-
/// Prints `-` for any negative values (including the negative zero), or `+` otherwise.
409-
MinusPlusRaw, // -inf -1 -0 +0 +1 +inf nan
402+
/// Prints `-` for any negative value.
403+
Minus, // -inf -1 -0 0 1 inf nan
404+
/// Prints `-` for any negative value, or `+` otherwise.
405+
MinusPlus, // -inf -1 -0 +0 +1 +inf nan
410406
}
411407

412408
/// Returns the static byte string corresponding to the sign to be formatted.
413409
/// It can be either `""`, `"+"` or `"-"`.
414410
fn determine_sign(sign: Sign, decoded: &FullDecoded, negative: bool) -> &'static str {
415411
match (*decoded, sign) {
416412
(FullDecoded::Nan, _) => "",
417-
(FullDecoded::Zero, Sign::Minus) => "",
418-
(FullDecoded::Zero, Sign::MinusRaw) => {
413+
(_, Sign::Minus) => {
419414
if negative {
420415
"-"
421416
} else {
422417
""
423418
}
424419
}
425-
(FullDecoded::Zero, Sign::MinusPlus) => "+",
426-
(FullDecoded::Zero, Sign::MinusPlusRaw) => {
427-
if negative {
428-
"-"
429-
} else {
430-
"+"
431-
}
432-
}
433-
(_, Sign::Minus | Sign::MinusRaw) => {
434-
if negative {
435-
"-"
436-
} else {
437-
""
438-
}
439-
}
440-
(_, Sign::MinusPlus | Sign::MinusPlusRaw) => {
420+
(_, Sign::MinusPlus) => {
441421
if negative {
442422
"-"
443423
} else {

0 commit comments

Comments
 (0)