Feature #14042
closedIO#puts: use writev if available
Description
Hi,
I've attached a patch to make IO#puts use writev if available. Currently, IO#puts calls write
twice: Once to write the string, and the second to write a newline (if the string doesn't end with one already). With this patch, those two calls are replaced with a single writev
call.
A test has been added that demonstrates the problem.
For a bit of background:
- A related issue: https://bugs.ruby-lang.org/issues/9420. (I couldn't figure out a way to 'attach' my patch to that issue, so I'm creating a new one)
- A blog post I wrote a while back about this: https://hackernoon.com/rubys-puts-is-not-atomic-889c57fc9a28
Command I used to run the test I added: make test-all TESTS='ruby/test_io.rb -n test_puts_parallel'
I'm a first time contributor, a bit confused as to where a changelog entry should be added. Is the NEWS
file the right place?
Regards,
Paul
Files
Updated by shyouhei (Shyouhei Urabe) about 7 years ago
- Related to Feature #9323: IO#writev added
Updated by nobu (Nobuyoshi Nakada) about 7 years ago
- Is duplicate of Feature #9420: warn and puts should be atomic added
Updated by nobu (Nobuyoshi Nakada) about 7 years ago
It breaks other tests which mock $stdout
and $stderr
.
Updated by shyouhei (Shyouhei Urabe) about 7 years ago
nobu (Nobuyoshi Nakada) wrote:
It breaks other tests which mock
$stdout
and$stderr
.
I strongly believe implementation details of IO#puts shall never be what a test should care about. I think it's okay to suppress them. @rohitpaulk (Paul Kuruvilla) can you also provide changes to our tests so that make check
should pass?
Updated by Eregon (Benoit Daloze) about 7 years ago
shyouhei (Shyouhei Urabe) wrote:
I strongly believe implementation details of IO#puts shall never be what a test should care about.
I agree.
For information, ruby/spec used to check write
calls from puts
(~1 year ago) but I changed it to only care about what's written, not individual calls.
FWIW, TruffleRuby has an atomic #puts, currently simply by adding \n
to the String before the write() call, if it does not already end with it.
Updated by mame (Yusuke Endoh) about 7 years ago
shyouhei (Shyouhei Urabe) wrote:
nobu (Nobuyoshi Nakada) wrote:
It breaks other tests which mock
$stdout
and$stderr
.I strongly believe implementation details of IO#puts shall never be what a test should care about.
I'm unsure if it is a spec or not, but at least, it is a well-known practice to assign to $stdout
a dummy object that implements a write
method:
- https://sketchucation.com/forums/viewtopic.php?f=180&t=31160
- http://d.hatena.ne.jp/rubikitch/20080421/1208758284
Matz has also showed a code using the practice:
http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-list/42076
I think we need to care the existing programs that uses the practice.
Updated by rohitpaulk (Paul Kuruvilla) about 7 years ago
- File ruby-changes.patch ruby-changes.patch added
Patch updated (attached) to fix the failures mentioned.
There were 10 failures, all within test/mkmf
. The mocked IO object's #write
method has been changed to support the interface change in https://bugs.ruby-lang.org/projects/ruby-trunk/repository/revisions/60343
- def write(s)
+ def write(*args)
if @out
- @buffer << s
+ @buffer << args.join
elsif @origin
- @origin << s
+ @origin << args.join
end
end
Updated by Eregon (Benoit Daloze) about 7 years ago
mame (Yusuke Endoh) wrote:
I'm unsure if it is a spec or not, but at least, it is a well-known practice to assign to
$stdout
a dummy object that implements awrite
method:
Good catch!
The patch calls #write, but with 2 arguments.
Probably one needs to check that the receiver #write can accept multiple arguments,
or do this logic only is $stdout.write is the original IO#write.
Updated by rohitpaulk (Paul Kuruvilla) about 7 years ago
- File deleted (
ruby-changes.patch)
Updated by rohitpaulk (Paul Kuruvilla) about 7 years ago
Probably one needs to check that the receiver #write can accept multiple arguments
I think that'd be a good approach to ensure that mocked objects in existing programs don't break.
Would it make sense to add a deprecation warning in this case? i.e. If the receiver doesn't accept multiple arguments, we emit a deprecation warning and make 2 calls instead of one. If the receiver accepts multiple arguments, we make a single call.
Updated by rohitpaulk (Paul Kuruvilla) about 7 years ago
Would it make sense to add a deprecation warning in this case? i.e. If the receiver doesn't accept multiple arguments, we emit a deprecation warning and make 2 calls instead of one. If the receiver accepts multiple arguments, we make a single call.
Something along the lines of...
rb_io_writev(VALUE io, int argc, VALUE *argv)
{
- return rb_funcallv(io, id_write, argc, argv);
+ if (rb_obj_method_arity(io, id_write) == -1) {
+ rb_funcallv(io, id_write, argc, argv);
+ }
+ else {
+ /**
+ * Previously, IO#write only accepted one argument. This was changed
+ * to use multiple arguments in revision #60343.
+ *
+ * To play well with programs that might've mocked an IO object and are
+ * only expecting a single argument - let's make multiple calls with
+ * a single argument each.
+ */
+ rb_warn("IO#write has been changed to accept multiple arguments. \
+You are seeing this warning because an object expected to implement the IO \
+interface has a #write method that doesn't accept multiple arguments. Please \
+change the implementation to accept multiple arguments.");
+ for (int i = 0; i < argc; i++) {
+ rb_io_write(io, argv[i]);
+ }
+ }
}
+ def test_puts_works_with_io_objects_that_only_accept_single_arg
+ klass = Class.new do
+ attr_reader :captured
+
+ def write(str)
+ (@captured ||= "") << str
+ end
+ end
+
+ mocked_io_obj = klass.new
+ old_stdout, $stdout = $stdout, mocked_io_obj
+ puts "hey" # Should write to the mocked class
+ assert_equal("hey\n", mocked_io_obj.captured)
+ ensure
+ $stdout = old_stdout
+ end
+
Updated by nobu (Nobuyoshi Nakada) about 7 years ago
- Status changed from Open to Closed
Applied in changeset trunk|r60417.
io.c: write a newline together
- io.c (rb_io_puts): write a newline together at once for each
argument. based on the patch by rohitpaulk (Rohit Kuruvilla) at
[ruby-core:83508]. [Feature #14042]
Updated by Eregon (Benoit Daloze) about 7 years ago
I think it would be safer to only do this if we know we are calling the original IO#write.
Seeing r60428 and how it breaks mspec too, the current approach sounds not compatible enough.
For instance, if write is defined as write(*args), there is no way to know whether that the arguments are just passed to IO#write or does some of its own parsing, which will very likely break with this change.
As an example (not realistic as only keeps the last call to #write),
require 'pathname'
$stdout = Pathname.new("log")
puts "foo"
raises an error with this change:
Traceback (most recent call last):
4: from test.rb:3:in `<main>'
3: from test.rb:3:in `puts'
2: from test.rb:3:in `puts'
1: from test.rb:3:in `write'
test.rb:3:in `write': no implicit conversion of String into Integer (TypeError)
Updated by Eregon (Benoit Daloze) about 7 years ago
Correction, it only breaks one spec, not mspec.
The spec is "IO.popen raises IOError when writing a read-only pipe" in popen_spec.rb.
I reduced the code and this behaves differently:
$ chruby trunk
$ ruby -e 'IO.popen(["ruby", "-e", "sleep 1; puts :a"], "r").close'
Traceback (most recent call last):
3: from -e:1:in `<main>'
2: from -e:1:in `puts'
1: from -e:1:in `puts'
-e:1:in `write': Broken pipe @ io_writev - <STDOUT> (Errno::EPIPE)
$ chruby 2.4.2
$ ruby -e 'IO.popen(["ruby", "-e", "sleep 1; puts :a"], "r").close'
I don't understand why though.
Updated by Eregon (Benoit Daloze) about 7 years ago
Nobu told me it is because of different buffering with writev, i.e. no buffering.
Nobu fixed it in r60535.
The spec was also racy (child #write+flush raced with parent #close), so I fixed it in r60567.