Bug #12548
closedRounding modes inconsistency between round versus sprintf
Added by unclekiki (Kieran McCusker) over 8 years ago. Updated over 4 years ago.
Description
Hi
I've found this possible bug with sprintf
in our production code, but it
seems very odd that no one has reported it so I'm doubting myself.
I've tested a few versions (CRuby 2.3.1 and 2.1.9) and it was present in
both.
To reproduce in irb:
sprintf('%1.0f', 12.5)
Expected result 13 actual 12
In fact if you look at the number sequence 0.5 -> 99.5 you find the even
half will all round down and the odd half round up. 12.5.round produces the correct result (13)
If you do the same in jruby 2.2.1 you get the expected result of 13.
Many thanks
Kieran
Files
round_even.patch (4.43 KB) round_even.patch | noahgibbs (Noah Gibbs), 09/22/2016 04:40 AM |
Updated by shyouhei (Shyouhei Urabe) over 8 years ago
I doubt if this is a bug or not. The rounding mode you describe is so-called "round to the nearest even" which is described in ISO 31-0 as "generally preferable". Maybe the author of sprintf explicitly chose such mode (not sure).
It seems the problem here is whether should we specify sprintf's rounding mode or leave it implementation-defined.
Updated by unclekiki (Kieran McCusker) over 8 years ago
Having read a bit more I see that there are indeed multiple possible ways of handling tie-breaking i.e. when a value is n.5, but is it reasonable for two implementations in ruby core (sprintf and round) to use a different method? It's at least surprising (very in my case) and will catch out anyone coming from other programming backgrounds.
Updated by shyouhei (Shyouhei Urabe) over 8 years ago
- Subject changed from Ruby sprintf bug to Rounding modes inconsistency between round versus sprintf
I agree that can be a point. I changed the subject.
Updated by matz (Yukihiro Matsumoto) over 8 years ago
It should be consistent. Will be fixed.
Matz.
Updated by noahgibbs (Noah Gibbs) over 8 years ago
It looks like floats are rounded via numeric.c, using the flo_round function. That definitely does not use round-even. Maybe it should? It already implements a rounding function, so it shouldn't be too hard to implement round-even.
It looks like sprintf uses ruby_sprintf, through a series of calls to BSD_vfprintf in vsnprintf.c. At least on my machine. And BSD_vprintf winds up calling ruby_dtoa (#defined from BSD__dtoa) with mode 3. If you look for "round-even" in util.c, you'll see an extensive comment explaining why they used round-even mode.
So basically: right now, Numeric#round uses flo_round(), while sprintf uses util.c's round-even mode.
Presumably we should change flo_round to do round-even.
Updated by nobu (Nobuyoshi Nakada) over 8 years ago
I agree that round-even is preferable, but it introduces an incompatibility.
Some libraries, e.g., rexml, depend on the current behavior.
So I'm thinking an optional parameter to select rounding mode.
2.5.round #=> 3
2.5.round(:even) #=> 2
Updated by noahgibbs (Noah Gibbs) over 8 years ago
Nobu: but then, default will still be inconsistent.
Maybe add a parameter and require libraries to supply it if they care? Then later we could change the default from :even to :closest or :biased or whatever we call the common rounding behavior.
Updated by shyouhei (Shyouhei Urabe) over 8 years ago
I doubt if we need multiple rounding modes. I guess for every situation when a specific rounding mode is mandatory, what is needed is the round-even mode. I have never experienced the needs of other modes. Almost everybody don't care what mode they use.
Defaulting to that rounding mode should just suffice. Adding parameter solves a problem that doesn't exist.
Updated by noahgibbs (Noah Gibbs) over 8 years ago
Sounds good to me. Do we need to fix ReXML, if it depends on the current rounding mode?
Updated by noahgibbs (Noah Gibbs) over 8 years ago
Another reason not to add an argument: Float#round
already takes an argument for the number of digits to round to.
Updated by nobu (Nobuyoshi Nakada) over 8 years ago
Noah Gibbs wrote:
Another reason not to add an argument:
Float#round
already takes an argument for the number of digits to round to.
It's easy to distinguish a Fixnum
and a Symbol
.
Or keyword arguments will work well.
Updated by nobu (Nobuyoshi Nakada) over 8 years ago
Noah Gibbs wrote:
Sounds good to me. Do we need to fix ReXML, if it depends on the current rounding mode?
The rdoc of REXML::Functions.number
states:
a string that consists of optional whitespace followed by an optional minus
sign followed by a Number followed by whitespace is converted to the IEEE 754
number that is nearest (according to the IEEE 754 round-to-nearest rule) to
the mathematical value represented by the string; any other string is
converted to NaN
Updated by noahgibbs (Noah Gibbs) over 8 years ago
Okay, so sounds like just a documentation fix for ReXML. That makes sense. Thanks!
Updated by noahgibbs (Noah Gibbs) about 8 years ago
Ah - looks like the ReXML docs don't need to change. The round-even mode Ruby uses is still referred to as "round to nearest, ties to even" - https://en.wikipedia.org/wiki/IEEE_floating_point#Roundings_to_nearest
Updated by noahgibbs (Noah Gibbs) about 8 years ago
- File round_even.patch added
- ruby -v changed from 2.3.1 to 2.4.0
Here's a patch that makes floating-point rounding behavior match sprintf()'s round-to-even. It also adds a unit test for the new behavior. The change doesn't break any existing unit tests.
Updated by nobu (Nobuyoshi Nakada) about 8 years ago
Your patch breaks two tests.
[24/42] TestVector#test_round = 0.00 s
1) Failure:
TestVector#test_round [test/matrix/test_vector.rb:162]:
<Vector[1.23, 2.34, 3.4]> expected but was
<Vector[1.23, 2.35, 3.4]>.
[30/42] REXMLTests::FunctionsTester#test_floor_ceiling_round = 0.02 s
2) Failure:
REXMLTests::FunctionsTester#test_floor_ceiling_round [test/rexml/test_functions.rb:168]:
<[<b id='1'/>]> expected but was
<[]>.
Updated by noahgibbs (Noah Gibbs) about 8 years ago
- File round_even.patch added
You're right. I apologize. I had been running test-ruby instead of test-all without realizing it.
It appears both tests won't work if we correctly implement the round-even behavior, so the new patch fixes those.
New patch is attached.
Updated by noahgibbs (Noah Gibbs) about 8 years ago
- File deleted (
round_even.patch)
Updated by noahgibbs (Noah Gibbs) about 8 years ago
- File round_even.patch round_even.patch added
Updated by noahgibbs (Noah Gibbs) about 8 years ago
- File deleted (
round_even.patch)
Updated by nobu (Nobuyoshi Nakada) about 8 years ago
Noah Gibbs wrote:
It appears both tests won't work if we correctly implement the round-even behavior, so the new patch fixes those.
That is what I had mentioned.
Updated by nobu (Nobuyoshi Nakada) about 8 years ago
I have a patch too, but it breaks compatibilities in rubyspec.
1)
Float#round returns rounded values for big values FAILED
Expected 200000000000000000000
to have same value and type as 300000000000000000000
spec/rubyspec/core/float/round_spec.rb:70:in `block (2 levels) in <top (required)>'
spec/rubyspec/core/float/round_spec.rb:3:in `<top (required)>'
2)
Integer#round returns itself rounded if passed a negative value FAILED
Expected 200
to have same value and type as 300
spec/rubyspec/core/integer/round_spec.rb:22:in `block (2 levels) in <top (required)>'
spec/rubyspec/core/integer/round_spec.rb:4:in `<top (required)>'
3)
Rational#round with no arguments (precision = 0) returns the truncated value toward the nearest integer FAILED
Expected 0
to equal 1
spec/rubyspec/shared/rational/round.rb:17:in `block (3 levels) in <top (required)>'
spec/rubyspec/core/rational/round_spec.rb:3:in `<top (required)>'
4)
Time#strftime rounds an offset to the nearest second when formatting with %z FAILED
Expected "+01:01:04"
to equal "+01:01:05"
spec/rubyspec/core/time/strftime_spec.rb:50:in `block (2 levels) in <top (required)>'
spec/rubyspec/core/time/strftime_spec.rb:7:in `<top (required)>'
Updated by noahgibbs (Noah Gibbs) about 8 years ago
@nobu (Nobuyoshi Nakada): makes sense. I'll file an issue with RubySpec to let them know that at least some of these tests aren't compatible with IEEE 754 round-nearest-to-even mode. Examining the first test failure (spec/core/float/round_spec.rb line 70), it clearly isn't since it expects 2.5 to round to 3 at that decimal place.
I'll examine the other failures and try to make sure they're all equally clear.
Updated by noahgibbs (Noah Gibbs) about 8 years ago
Also: looks like your patch fixes it for integer rounding too, based on your RubySpec failures. Mine does not. Sprintf doesn't have an integer rounding mode, but your way is probably still better.
Updated by noahgibbs (Noah Gibbs) about 8 years ago
Opened an issue with RubySpec: https://github.com/ruby/spec/issues/294
Updated by noahgibbs (Noah Gibbs) about 8 years ago
Benoit Daloze says that RubySpec should test previous rounding behavior for Ruby before 2.4.0, and test round-even for 2.4.0+. That makes sense to me. Opened RubySpec PR #322 (https://github.com/ruby/spec/pull/322) to reflect this.
When/if we merge Nobu's branch for this bug, we should also ask RubySpec to merge PR #322 or something very similar.
Updated by noahgibbs (Noah Gibbs) about 8 years ago
Benoit says RubySpec PR #322 looks good. Just waiting for us to commit round-to-nearest-even mode to MRI so he can merge it. I've used Nobu's branch locally, including to create PR #322, and it works well for me. +1
Updated by nobu (Nobuyoshi Nakada) about 8 years ago
We had a discussion today, and the conclusions:
- add
half:
optional keyword argument toround
methods, - its value must be one of
:up
,:even
, andnil
, -
:up
is compatible with the existing behavior, -
:even
is the new behavior compatible withsprintf
, and - if
half:
option isn't given or isnil
, same as:even
.
Updated by nobu (Nobuyoshi Nakada) about 8 years ago
- Status changed from Open to Closed
Applied in changeset r56590.
numeric.c: round to nearest even
- numeric.c (flo_round, int_round): support round-to-nearest-even
semantics of IEEE 754 to match sprintf behavior, and addhalf:
optional keyword argument for the old behavior.
[ruby-core:76273] [Bug #12548]
Updated by Eregon (Benoit Daloze) about 8 years ago
- Related to Bug #12958: Breaking change in how `#round` works added
Updated by shyouhei (Shyouhei Urabe) almost 8 years ago
- Status changed from Closed to Open
Reopening.
The fix was reverted due to issue #12958. This issue is now back alive.
Updated by nobu (Nobuyoshi Nakada) almost 8 years ago
- Status changed from Open to Closed
Applied in changeset r57130.
numeric.c: allow nil as rounding mode option
- numeric.c (rb_num_get_rounding_option): allow nil same as the
default behavior, per [ruby-core:77961]. [Bug #12548]
Updated by nobu (Nobuyoshi Nakada) almost 8 years ago
- Description updated (diff)
- Status changed from Closed to Open
Probably, [Feature #10000] can add half
option to String#%
.
Updated by unclekiki (Kieran McCusker) almost 7 years ago
Hi again
After I raised this bug and saw the comment from matz I stopped watching assuming that sprintf would be "fixed" not that round would be changed instead. IEEE 754 is useful when you are taking them mean of a large number of rounded values and care about the mean. Rounding to even will give you a more accurate rounded mean in large sets of randomly distributed data. When rounding individual numbers it seems very unintuitive
This suggests that the typical use of sprintf is to take a large number of floating point values round them so they can be converted back from string to floats and then summed thus preserving their summed mean. Hummm, would it not be more likely to be used for formatting percentage values etc. where users would not expect 11.5 and 12.5 to be formatted to 12%.
Wouldn't it have been more sensible to make sprintf use Round half away from zero as round does (and JRuby's implementation did when I last looked) this is what I assumed would happen. There may well be a separate issue that round could be made aware of IEEE 754 for the fairly specialized case I first outlined.
As it stands in ruby 2.5 my original issue is still outstanding and the two uses are still inconsistent.
Thanks
Kieran
Updated by vor_lord (Brett Williams) over 6 years ago
I don't think that Float#round and sprintf should have different default behavior. Whichever rounding mode is considered default should apply to both. Is there any consensus on this issue?
Updated by shyouhei (Shyouhei Urabe) over 6 years ago
The current status is:
- We once have fixed this bug.
- That caused rails breakage.
- Everybody complained.
- So we reverted.
I personally believe this issue is a real bug but it seems I am a minority here. The consensus, if any, is that it should be kept as it is.
See also: https://bugs.ruby-lang.org/issues/12958
Updated by unclekiki (Kieran McCusker) over 6 years ago
It is a real bug, it's just that, as I have said before, the "fix" took the wrong direction. sprintf should behave like round (as it does, or at least did, in jruby with no apparent issues). Round even is a very specialised use case.
Imagine you take the height of 100 school children to the nearest half centimetre and then want to output frequencies using sprintf to the nearest whole centimetre. Do you really want the frequencies to spaced like (117), (117.5, 118, 118.5), (119), (119.5, 120, 120.5)
The real solution is to make sprintf perform like round.
As Matz said 2 years ago "it should be consistent. Will be fixed."
Updated by nobu (Nobuyoshi Nakada) over 6 years ago
unclekiki (Kieran McCusker) wrote:
Imagine you take the height of 100 school children to the nearest half centimetre and then want to output frequencies using sprintf to the nearest whole centimetre. Do you really want the frequencies to spaced like (117), (117.5, 118, 118.5), (119), (119.5, 120, 120.5)
Why grouping by sprintf
, instead of Float#round
?
It doesn't sound a reasonable use-case.
Updated by unclekiki (Kieran McCusker) over 6 years ago
Ok
How about listing them in the format
name ......... height (rounded to whole centimetre)
sprintf would seem a great choice for that but fails due to it's rounding choices. By the way, I noticed this in the first place because I was using sprintf to format frequencies on charts using code like sprintf('%1.0f%%', frequency) so it was a reasonable use case for me.
Could you give me a use case for sprintf where using round-even is a good enough choice for the decision to make it behave differently to round is justified? I know it wasn't really a conscious decision of course, but given that we have an implementation of sprintf in jruby that matches round and the sky does not appear to be falling down I find it difficult to understand the reasoning why sprintf rounding wasn't just brought in line with round.
Updated by mackuba (Kuba Suder) over 5 years ago
I came across this thread now, because I've noticed that sprintf's behavior has actually changed in Ruby 2.4 too, just... not the same way as round's did...
> echo "n = [5.005, 5.015, 5.025]; puts 'round: ' + n.map { |x| x.round(2) }.join(' '); puts 'sprintf: ' + n.map { |x| sprintf('%.2f', x) }.join(' ')" > /tmp/floats.rb
> rvm 2.3.7 do ruby /tmp/floats.rb
round: 5.01 5.02 5.03
sprintf: 5.00 5.01 5.03
> rvm 2.4.0 do ruby /tmp/floats.rb
round: 5.01 5.02 5.03
sprintf: 5.00 5.02 5.02
Updated by matz (Yukihiro Matsumoto) over 4 years ago
- Status changed from Open to Closed
It is inconsistent but fixing either way could cause problems. I vote for keeping as it is.
Matz.